Search (7 results, page 1 of 1)

  • × author_ss:"Chen, H."
  1. Chen, H.; Chung, Y.-M.; Ramsey, M.; Yang, C.C.: ¬A smart itsy bitsy spider for the Web (1998) 0.05
    0.04549888 = product of:
      0.18199553 = sum of:
        0.18199553 = weight(_text_:java in 1871) [ClassicSimilarity], result of:
          0.18199553 = score(doc=1871,freq=2.0), product of:
            0.4674661 = queryWeight, product of:
              7.0475073 = idf(docFreq=104, maxDocs=44421)
              0.0663307 = queryNorm
            0.38932347 = fieldWeight in 1871, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              7.0475073 = idf(docFreq=104, maxDocs=44421)
              0.0390625 = fieldNorm(doc=1871)
      0.25 = coord(1/4)
    
    Abstract
    As part of the ongoing Illinois Digital Library Initiative project, this research proposes an intelligent agent approach to Web searching. In this experiment, we developed 2 Web personal spiders based on best first search and genetic algorithm techniques, respectively. These personal spiders can dynamically take a user's selected starting homepages and search for the most closely related homepages in the Web, based on the links and keyword indexing. A graphical, dynamic, Jav-based interface was developed and is available for Web access. A system architecture for implementing such an agent-spider is presented, followed by deteiled discussions of benchmark testing and user evaluation results. In benchmark testing, although the genetic algorithm spider did not outperform the best first search spider, we found both results to be comparable and complementary. In user evaluation, the genetic algorithm spider obtained significantly higher recall value than that of the best first search spider. However, their precision values were not statistically different. The mutation process introduced in genetic algorithms allows users to find other potential relevant homepages that cannot be explored via a conventional local search process. In addition, we found the Java-based interface to be a necessary component for design of a truly interactive and dynamic Web agent
  2. Chen, H.; Chung, W.; Qin, J.; Reid, E.; Sageman, M.; Weimann, G.: Uncovering the dark Web : a case study of Jihad on the Web (2008) 0.04
    0.03932612 = product of:
      0.15730448 = sum of:
        0.15730448 = weight(_text_:having in 2880) [ClassicSimilarity], result of:
          0.15730448 = score(doc=2880,freq=2.0), product of:
            0.39673427 = queryWeight, product of:
              5.981156 = idf(docFreq=304, maxDocs=44421)
              0.0663307 = queryNorm
            0.39649835 = fieldWeight in 2880, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.981156 = idf(docFreq=304, maxDocs=44421)
              0.046875 = fieldNorm(doc=2880)
      0.25 = coord(1/4)
    
    Abstract
    While the Web has become a worldwide platform for communication, terrorists share their ideology and communicate with members on the Dark Web - the reverse side of the Web used by terrorists. Currently, the problems of information overload and difficulty to obtain a comprehensive picture of terrorist activities hinder effective and efficient analysis of terrorist information on the Web. To improve understanding of terrorist activities, we have developed a novel methodology for collecting and analyzing Dark Web information. The methodology incorporates information collection, analysis, and visualization techniques, and exploits various Web information sources. We applied it to collecting and analyzing information of 39 Jihad Web sites and developed visualization of their site contents, relationships, and activity levels. An expert evaluation showed that the methodology is very useful and promising, having a high potential to assist in investigation and understanding of terrorist activities by producing results that could potentially help guide both policymaking and intelligence research.
  3. Zhu, B.; Chen, H.: Information visualization (2004) 0.02
    0.022940237 = product of:
      0.09176095 = sum of:
        0.09176095 = weight(_text_:having in 5276) [ClassicSimilarity], result of:
          0.09176095 = score(doc=5276,freq=2.0), product of:
            0.39673427 = queryWeight, product of:
              5.981156 = idf(docFreq=304, maxDocs=44421)
              0.0663307 = queryNorm
            0.2312907 = fieldWeight in 5276, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.981156 = idf(docFreq=304, maxDocs=44421)
              0.02734375 = fieldNorm(doc=5276)
      0.25 = coord(1/4)
    
    Abstract
    Advanced technology has resulted in the generation of about one million terabytes of information every year. Ninety-reine percent of this is available in digital format (Keim, 2001). More information will be generated in the next three years than was created during all of previous human history (Keim, 2001). Collecting information is no longer a problem, but extracting value from information collections has become progressively more difficult. Various search engines have been developed to make it easier to locate information of interest, but these work well only for a person who has a specific goal and who understands what and how information is stored. This usually is not the Gase. Visualization was commonly thought of in terms of representing human mental processes (MacEachren, 1991; Miller, 1984). The concept is now associated with the amplification of these mental processes (Card, Mackinlay, & Shneiderman, 1999). Human eyes can process visual cues rapidly, whereas advanced information analysis techniques transform the computer into a powerful means of managing digitized information. Visualization offers a link between these two potent systems, the human eye and the computer (Gershon, Eick, & Card, 1998), helping to identify patterns and to extract insights from large amounts of information. The identification of patterns is important because it may lead to a scientific discovery, an interpretation of clues to solve a crime, the prediction of catastrophic weather, a successful financial investment, or a better understanding of human behavior in a computermediated environment. Visualization technology shows considerable promise for increasing the value of large-scale collections of information, as evidenced by several commercial applications of TreeMap (e.g., http://www.smartmoney.com) and Hyperbolic tree (e.g., http://www.inxight.com) to visualize large-scale hierarchical structures. Although the proliferation of visualization technologies dates from the 1990s where sophisticated hardware and software made increasingly faster generation of graphical objects possible, the role of visual aids in facilitating the construction of mental images has a long history. Visualization has been used to communicate ideas, to monitor trends implicit in data, and to explore large volumes of data for hypothesis generation. Imagine traveling to a strange place without a map, having to memorize physical and chemical properties of an element without Mendeleyev's periodic table, trying to understand the stock market without statistical diagrams, or browsing a collection of documents without interactive visual aids. A collection of information can lose its value simply because of the effort required for exhaustive exploration. Such frustrations can be overcome by visualization.
  4. Chen, H.: Generating, integrating and activating thesauri for concept-based document retrieval (1993) 0.01
    0.01441993 = product of:
      0.05767972 = sum of:
        0.05767972 = weight(_text_:und in 7622) [ClassicSimilarity], result of:
          0.05767972 = score(doc=7622,freq=2.0), product of:
            0.1471148 = queryWeight, product of:
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.0663307 = queryNorm
            0.39207286 = fieldWeight in 7622, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.125 = fieldNorm(doc=7622)
      0.25 = coord(1/4)
    
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  5. Chen, H.; Ng, T.: ¬An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation) : symbolic branch-and-bound search versus connectionist Hopfield Net Activation (1995) 0.01
    0.0054074735 = product of:
      0.021629894 = sum of:
        0.021629894 = weight(_text_:und in 2271) [ClassicSimilarity], result of:
          0.021629894 = score(doc=2271,freq=2.0), product of:
            0.1471148 = queryWeight, product of:
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.0663307 = queryNorm
            0.14702731 = fieldWeight in 2271, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.046875 = fieldNorm(doc=2271)
      0.25 = coord(1/4)
    
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  6. Chen, H.; Martinez, J.; Kirchhoff, A.; Ng, T.D.; Schatz, B.R.: Alleviating search uncertainty through concept associations : automatic indexing, co-occurence analysis, and parallel computing (1998) 0.01
    0.0054074735 = product of:
      0.021629894 = sum of:
        0.021629894 = weight(_text_:und in 6202) [ClassicSimilarity], result of:
          0.021629894 = score(doc=6202,freq=2.0), product of:
            0.1471148 = queryWeight, product of:
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.0663307 = queryNorm
            0.14702731 = fieldWeight in 6202, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.046875 = fieldNorm(doc=6202)
      0.25 = coord(1/4)
    
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus
  7. Chen, H.; Yim, T.; Fye, D.: Automatic thesaurus generation for an electronic community system (1995) 0.00
    0.004506228 = product of:
      0.018024912 = sum of:
        0.018024912 = weight(_text_:und in 2986) [ClassicSimilarity], result of:
          0.018024912 = score(doc=2986,freq=2.0), product of:
            0.1471148 = queryWeight, product of:
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.0663307 = queryNorm
            0.12252277 = fieldWeight in 2986, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.217899 = idf(docFreq=13141, maxDocs=44421)
              0.0390625 = fieldNorm(doc=2986)
      0.25 = coord(1/4)
    
    Theme
    Konzeption und Anwendung des Prinzips Thesaurus