<?xml version="1.0" encoding="utf-8" ?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
<channel>
	<title>Articles&gt;Information Design&gt;Search</title>	<link>http://tc.eserver.org/dir/Articles/Information-Design/Search</link>
	<description>A listing of the most recently indexed works about Articles and Information Design and Search in the field of technical communication.</description>
	<language>en-us</language>
	<copyright>Copyright (c) 2005-08 by the EServer. All rights reserved.</copyright>
	<managingEditor>tclib-editorial@eserver.org (TC Library Editorial Board)</managingEditor>
	<webMaster>webmaster@eserver.org (Geoffrey Sauer)</webMaster>
	<image>
		<url>http://tc.eserver.org/images/newlogo.gif</url>
		<title>Articles&gt;Information Design&gt;Search</title>
		<link>http://tc.eserver.org/dir/Articles/Information-Design/Search</link>
	</image>
	<item>
		<title>What is Enough? Satisficing Information Needs</title>
		<link>http://tc.eserver.org/34961.html</link>
		<guid>http://tc.eserver.org/34961.html</guid>
		<description>This paper seeks to understand how users know when to stop searching for more information when the information space is so saturated that there is no certainty that the relevant information has been identified.</description>
	</item>
	<item>
		<title>Is Your Key Content Drowning in News?</title>
		<link>http://tc.eserver.org/34739.html</link>
		<guid>http://tc.eserver.org/34739.html</guid>
		<description>Many web editors spend a lot of their time writing news stories for the company web site. However, traffic analysis frequently reveals that this content is not very popular - and that users may in fact miss the key content they come to find (product data, addresses etc.) because it&apos;s practically drowning in news stories.</description>
	</item>
	<item>
		<title>Indexing the Web—It’s Not Just Google’s Business</title>
		<link>http://tc.eserver.org/34665.html</link>
		<guid>http://tc.eserver.org/34665.html</guid>
		<description>Web databases do much more than passively store information. Part of their power comes from indexing records efficiently. An index serves as a map, identifying the precise location of a small piece of data in a much larger pile. For example, when I search for “web development,” Google identifies two hundred million results and displays the first ten—in a quarter of a second. But Google isn’t loading every one of those pages and scanning their contents when I perform my search: they’ve analyzed the pages ahead of time and matched my search terms against an index that only references the original content.</description>
	</item>
	<item>
		<title>Designing for Faceted Search</title>
		<link>http://tc.eserver.org/34564.html</link>
		<guid>http://tc.eserver.org/34564.html</guid>
		<description>Faceted search, or guided navigation, has become the de facto standard for e-commerce and product-related websites, from big box stores to product review sites. But e-commerce sites aren’t the only ones joining the facets club. Other content-heavy sites such as media publishers (e.g. Financial Times: ft.com), libraries (e.g. NCSU Libraries: lib.ncsu.edu/), and even non-profits (e.g. Urban Land Institute: uli.org) are tapping into faceted search to make their often broad-range of content more findable. Essentially, faceted search has become so ubiquitous that users are not only getting used to it, they are coming to expect it.</description>
	</item>
	<item>
		<title>Web Object Retrieval</title>
		<link>http://tc.eserver.org/34238.html</link>
		<guid>http://tc.eserver.org/34238.html</guid>
		<description>The primary function of current Web search engines is essentially relevance ranking at the document level. However, myriad structured information about real-world objects is embedded in static Web pages and online Web databases. Document-level information retrieval can unfortunately lead to highly inaccurate relevance ranking in answering object-oriented queries. In this paper, we propose a paradigm shift to enable searching at the object level. In traditional information retrieval models, documents are taken as the retrieval units and the content of a document is considered reliable. However, this reliability assumption is no longer valid in the object retrieval context when multiple copies of information about the same object typically exist. These copies may be inconsistent because of diversity of Web site qualities and the limited performance of current information extraction techniques. If we simply combine the noisy and inaccurate attribute information extracted from different sources, we may not be able to achieve satisfactory retrieval performance. In this paper, we propose several language models for Web object retrieval, namely an unstructured object retrieval model, a structured object retrieval model, and a hybrid model with both structured and unstructured retrieval features. We test these models on a paper search engine and compare their performances. We conclude that the hybrid model is the superior by taking into account the extraction errors at varying levels.</description>
	</item>
	<item>
		<title>The Discoverability of the Web</title>
		<link>http://tc.eserver.org/34239.html</link>
		<guid>http://tc.eserver.org/34239.html</guid>
		<description>Previous studies have highlighted the high arrival rate of new content on the web. We study the extent to which this new content can be efficiently discovered by a crawler. Our study has two parts. First, we study the inherent difficulty of the discovery problem using a maximum cover formulation, under an assumption of perfect estimates of likely sources of links to new content. Second, we relax this assumption and study a more realistic setting in which algorithms must use historical statistics to estimate which pages are most likely to yield links to new content. We recommend a simple algorithm that performs comparably to all approaches we consider. We measure the overhead of discovering new content, de- ﬁned as the average number of fetches required to discover one new page. We show ﬁrst that with perfect foreknowledge of where to explore for links to new content, it is possible to discover 90% of all new content with under 3% overhead, and 100% of new content with 9% overhead. But actual algorithms, which do not have access to perfect foreknowl- edge, face a more difficult task: one quarter of new content is simply not amenable to efficient discovery. Of the re- maining three quarters, 80% of new content during a given week may be discovered with 160% overhead if content is recrawled fully on a monthly basis.</description>
	</item>
	<item>
		<title>Semantics Continues to Not be RDF, But Enrichment, Classification and Taxonomy</title>
		<link>http://tc.eserver.org/33632.html</link>
		<guid>http://tc.eserver.org/33632.html</guid>
		<description>Within the realm of computational semantics, there is still a fairly broad disconnect between triple pair semantics, the use of RDF (or turtle notation) to create atomic assertions, and the realm of semantics as reflected on the web. I do not expect this to change much in 2009, save perhaps that the gulf between the two will likely just get wider.</description>
	</item>
	<item>
		<title>Readability, Browsability, Searchability Plus Assistance</title>
		<link>http://tc.eserver.org/33115.html</link>
		<guid>http://tc.eserver.org/33115.html</guid>
		<description>Readability, browsability, and searchability do not have to be equally represented in every information system. As your collection of information increases, different aspects of these qualities take on greater significance. Thus, the amount of readability, browsability, and searchability your information system exhibits depends on the type and quality of your collected data, as well as the information needs of your clientele.</description>
	</item>
	<item>
		<title>Metadata and Search</title>
		<link>http://tc.eserver.org/33030.html</link>
		<guid>http://tc.eserver.org/33030.html</guid>
		<description>The 2003 Dublin Core Conference took as its basic premise that &quot;Metadata is fundamental to persons, organizations, machines, and an array of enterprises that are increasingly turning to the Web and electronic communication for disseminating and accessing information.&quot; One of the reasons metadata is receiving such attention is its role in facilitating information seeking.</description>
	</item>
	<item>
		<title>Web Search: How the Web Has Changed Information Retrieval</title>
		<link>http://tc.eserver.org/33038.html</link>
		<guid>http://tc.eserver.org/33038.html</guid>
		<description>Topical metadata have been used to indicate the subject of Web pages. They have been simultaneously hailed as building blocks of the semantic Web and derogated as spam. At this time major Web browsers avoid harvesting topical metadata. This paper suggests that the significance of the topical metadata controversy depends on the technological appropriateness of adding them to Web pages. This paper surveys Web technology with an eye on assessing the appropriateness of Web pages as hosts for topical metadata. The survey reveals Web pages to be both transient and volatile: poor hosts of topical metadata. The closed Web is considered to be a more supportive environment for the use of topical metadata. The closed Web is built on communities of trust where the structure and meaning of Web pages can be anticipated. The vast majority of Web pages, however, exist in the open Web, an environment that challenges the application of legacy information retrieval concepts and methods.</description>
	</item>
	<item>
		<title>Intelligent Site Structure for Better SEO!</title>
		<link>http://tc.eserver.org/32550.html</link>
		<guid>http://tc.eserver.org/32550.html</guid>
		<description>Search engines are one of the most important traffic drivers to sites these days, which is why Search Engine Optimization (SEO) is becoming more and more important.&#xD;&#xD;SEO is often thought to be just a set of some technical tricks, and as a professional SEO, I confess to spending a lot of time with clients fixing technical issues. A site&apos;s structure though, is just as important. Your site&apos;s structure determines whether a search engine understands what your site is about, and how easily it will find and index content relevant to your site&apos;s purpose and intent.&#xD;&#xD;By creating a good structure, you can use the content you&apos;ve written that has attracted links from others, and use your site&apos;s structure to spread some of that &quot;linkjuice&quot; to the other pages on your site.</description>
	</item>
	<item>
		<title>The Information User: Past, Present and Future</title>
		<link>http://tc.eserver.org/32303.html</link>
		<guid>http://tc.eserver.org/32303.html</guid>
		<description>The emergence of research on various aspects of `information behaviour&apos; is explored and its growth as a subject of academic research is documented. The origin of the field as a potential aid to the development of library and information services is noted, as is the transition from this status to that of a subject for research at PhD level and beyond. The development of the field has thus led to a division between the needs of academia for theoretically grounded work, and the needs of the field of practice for guidance for service development. There is, today, a disconnection between research and practice, to a significant extent: early research was undertaken by practitioners but today academic research dominates the scene. Suggestions are made as to how this disconnection can be repaired.</description>
	</item>
	<item>
		<title>On the History of Evaluation in Information Retrieval</title>
		<link>http://tc.eserver.org/32306.html</link>
		<guid>http://tc.eserver.org/32306.html</guid>
		<description>This paper is a personal take on the history of evaluation experiments in information retrieval. It describes some of the early experiments that were formative in our understanding, and goes on to discuss the current dominance of TREC (the Text REtrieval Conference) and to assess its impact.</description>
	</item>
	<item>
		<title>Novel Fuzzy Information Proximity Measures</title>
		<link>http://tc.eserver.org/32323.html</link>
		<guid>http://tc.eserver.org/32323.html</guid>
		<description>As a measure of information shared between two fuzzy pattern vectors, the fuzzy information proximity measure (FIPM) plays an important part in fuzzy pattern recognition, fuzzy clustering analysis and fuzzy approximate reasoning. In this paper, two novel FIPMs are set up. Firstly, an axiom theory about the FIPM is given, and different expressions of the FIPM are discussed. A new FIPM is then proposed based on the axiom theory of the FIPM and the concept of fuzzy subsethood function. Two concepts based on the idea of Shannon information entropy, fuzzy joint entropy (FJE) and fuzzy conditional entropy (FCE), are proposed and the basic properties of FJE and FCE are given and proved. Finally, classical similarity measures such as dissimilarity measure (DM) and similarity measure (SM) are studied, and two new measures, fuzzy absolute information measure (FAIM) and fuzzy relative information measure (FRIM), are set up, which can be used as measures of the proximity between fuzzy sets A and B.</description>
	</item>
	<item>
		<title>Judgments During Information Seeking: A Naturalistic Approach to Understanding the Assessment of Enough Information</title>
		<link>http://tc.eserver.org/32326.html</link>
		<guid>http://tc.eserver.org/32326.html</guid>
		<description>In this article, theories of human judgement and decision making are reviewed and their use by library and information science researchers examined. A different perspective on judgement and decision making is offered by the field of naturalistic decision making (NDM) and the implications of this approach are considered for an expanded understanding of how judgements and decisions are made during information seeking. This discussion is illustrated by a case from a recent empirical investigation into how judgements of enough information are made in the workplace. The article concludes with a critical evaluation of the NDM approach. It is argued that NDM, a recent development in decision theory, offers a new perspective from which to investigate judgements and decisions during information seeking.</description>
	</item>
	<item>
		<title>Strategies for Improving Enterprise Search</title>
		<link>http://tc.eserver.org/29676.html</link>
		<guid>http://tc.eserver.org/29676.html</guid>
		<description>Acquiring and installing a search engine is just the beginning of creating an effective enterprise search system. John Ferrara walks us through strategies for addressing critical aspects of the user experience often overlooked or ignored.</description>
	</item>
	<item>
		<title>Effective Search Engine Submission Strategies</title>
		<link>http://tc.eserver.org/29493.html</link>
		<guid>http://tc.eserver.org/29493.html</guid>
		<description>Now that you&apos;ve got a website it&apos;s time to start thinking about promoting it. Search engine listings are the number one way to generate traffic to your website.</description>
	</item>
	<item>
		<title>The Google Sandbox and How To Get Out</title>
		<link>http://tc.eserver.org/29491.html</link>
		<guid>http://tc.eserver.org/29491.html</guid>
		<description>The Google Sandbox is a filter that was put in place in about March of 2004. New websites with new domain names can take 6 to 12 months to get decent rankings on Google. Some are reporting stays of up to 18 months. The Sandbox seems to affect nearly all new websites placing them on probation. Similarly, websites that have made comprehensive redesigns have been caught up in this Sandbox. Does this Sandbox Really Exist, or is it just part of the Google algorithm? This has been a big controversy with many different opinions. Most now believe that this is an algorithm. In either case, the Sandbox functions to keep new sites from shooting to the top of Google in just a few weeks and overtaking quality sites that have been around for many years. This appears to be an initiation period for new websites.</description>
	</item>
	<item>
		<title>Google Search Engine Optimisation and their 80/20 Rule</title>
		<link>http://tc.eserver.org/29489.html</link>
		<guid>http://tc.eserver.org/29489.html</guid>
		<description>Google&apos;s increasing use of anti-spam features has meant that optimising websites for Google has become much harder and it&apos;s now not just a case of opening your websites source files in notepad, adding some keywords into your various HTML tags, uploading your files and waiting for the results. In fact in my opinion and I&apos;m sure others will agree with me, this type of optimisation, commonly referred to as onpage optimisation will only ever be 20% effective at achieving rankings for any keywords which are even mildly competitive. Those of us who aced maths in school will know this leaves us with 80% unaccounted for.</description>
	</item>
	<item>
		<title>Search Engine Optimization: Getting Started</title>
		<link>http://tc.eserver.org/29490.html</link>
		<guid>http://tc.eserver.org/29490.html</guid>
		<description>Explain some of the first steps to get your website not only optimized for the search engines, but to push your website up in the rankings war.</description>
	</item>
	<item>
		<title>SEO Outbound Link Relevance</title>
		<link>http://tc.eserver.org/29492.html</link>
		<guid>http://tc.eserver.org/29492.html</guid>
		<description>Outbound links&apos; anchor text affects a page&apos;s search engine ranking in much the same way that inbound links&apos; anchor text affects search engine ranking.</description>
	</item>
	<item>
		<title>SQL Cookbook: Advanced Searching</title>
		<link>http://tc.eserver.org/27059.html</link>
		<guid>http://tc.eserver.org/27059.html</guid>
		<description>Some types of searching operations stand apart from others in that they represent a different way of thinking about searching. Perhaps you&apos;re displaying a result set one page at a time. Half of that problem is to identify (search for) the entire set of records that you want to display. The other half of that problem is to repeatedly search for the next page to display as a user cycles through the records on a display. Your first thought may not be to think of pagination as a searching problem, but it can be thought of that way, and it can be solved that way; that is the type of searching solution this chapter is all about.</description>
	</item>
	<item>
		<title>Getting to &quot;the Right Stuff&quot;</title>
		<link>http://tc.eserver.org/26795.html</link>
		<guid>http://tc.eserver.org/26795.html</guid>
		<description>In a world awash with information, finding what you really want can be difficult. Any database or web index can deliver a set of results. But it&apos;s particularly difficult to highlight the most relevant &apos;stuff.&apos; Web search engines such as Google and Yahoo try their best to recommend some items over others, and now libraries are trying to do this for their holdings.</description>
	</item>
	<item>
		<title>Are You Using the Right Search Engine?</title>
		<link>http://tc.eserver.org/25937.html</link>
		<guid>http://tc.eserver.org/25937.html</guid>
		<description>What this all means is that when web users can&apos;t find what they want in Google, they should not automatically assume that they&apos;re at fault. At present, Google is heavily weighted.</description>
	</item>
	<item>
		<title>Using Search Profitably</title>
		<link>http://tc.eserver.org/25932.html</link>
		<guid>http://tc.eserver.org/25932.html</guid>
		<description>As an information source, the Internet is amazing. No matter what the topic, chances are you can find out all about it on the &apos;net. However, as the information haystack gets bigger, finding that crucial needle of data can become a frustrating challenge. Here are some simple tips for finding what you want. </description>
	</item>
	<item>
		<title>An Information Retrieval Using Conceptual Index Term For Technical Paper on Digital Library</title>
		<link>http://tc.eserver.org/25664.html</link>
		<guid>http://tc.eserver.org/25664.html</guid>
		<description>This paper presents a method for semantic Information Retrieval(IR) which is implemented on Digital Library. It is well known that Digital Library should have the IR system that user may automatically access every kind of media from anywhere. However, no improvement is made for the retrieval errors based on individual differences of user&apos;s request. This is one of the significant problem for the searching efficiency of IR. Our approach does not use the request itself but the concepts. This makes it possible to retrieve semantic information not merely to compare with the word strings of the request.</description>
	</item>
	<item>
		<title>Judgment Day: Google Strikes Back</title>
		<link>http://tc.eserver.org/25357.html</link>
		<guid>http://tc.eserver.org/25357.html</guid>
		<description>Google has never cared for search engine optimization and has always looked at it as an attempt to taint what they hold as precious. The question that must be asked is, &apos;Why?&apos; Why does Google see search engine optimization as the enemy and are they justified in their thinking?</description>
	</item>
	<item>
		<title>Trovabile</title>
		<link>http://tc.eserver.org/23255.html</link>
		<guid>http://tc.eserver.org/23255.html</guid>
		<description>Trovabile raccoglie risorse sui temi della trovabilità (Findability), con particolare riguardo alla teoria della classificazione e alla classificazione a faccette.</description>
	</item>
	<item>
		<title>Ambient Findability</title>
		<link>http://tc.eserver.org/23041.html</link>
		<guid>http://tc.eserver.org/23041.html</guid>
		<description>For an information architect with library roots, what&apos;s next is obvious: ambient findability. I want to be able to find anything, anywhere, anytime.</description>
	</item>
	<item>
		<title>The High Cost of Not Finding Information</title>
		<link>http://tc.eserver.org/23035.html</link>
		<guid>http://tc.eserver.org/23035.html</guid>
		<description>In an increasingly information-based world, we turn out complex products that are less tangible than they are knowledge-based. The very complexity of the decisions we make and the products we manufacture makes it impossible to check, test and retest them adequately enough to be sure that they will function properly in any circumstance. Information disasters are a growing threat, and one that few businesses can ignore.</description>
	</item>
	<item>
		<title>Perspectives on Information Retrieval</title>
		<link>http://tc.eserver.org/23037.html</link>
		<guid>http://tc.eserver.org/23037.html</guid>
		<description>This report provides a new look at the business and technology dynamics driving the move to a new generation of search in the enterprise.</description>
	</item>
	<atom:link href="http://tc.eserver.org/dir/Articles/Information-Design/Search.xml" rel="self" type="application/rss+xml"/>
</channel>
</rss>