<?xml version="1.0" encoding="utf-8" ?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
<channel>
	<title>Lindemann, Christoph and Lars Littig</title>	<link>http://tc.eserver.org/authors/Lindemann,_Christoph_and_Lars_Littig</link>
	<description>A bibliography of works by Lindemann, Christoph and Lars Littig in the field of technical communication.</description>
	<language>en-us</language>
	<copyright>Copyright (c) 2005-08 by the EServer. All rights reserved.</copyright>
	<managingEditor>tclib-editorial@eserver.org (TC Library Editorial Board)</managingEditor>
	<webMaster>webmaster@eserver.org (Geoffrey Sauer)</webMaster>
	<image>
		<url>http://tc.eserver.org/images/newlogo.gif</url>
		<title>Lindemann, Christoph and Lars Littig</title>
		<link>http://tc.eserver.org/dir/Lindemann,_Christoph_and_Lars_Littig</link>
	</image>
	<item>
		<title>Classifying Web Sites</title>
		<link>http://tc.eserver.org/34183.html</link>
		<guid>http://tc.eserver.org/34183.html</guid>
		<description>In this paper, we present a novel method for the classification of Web sites. This method exploits both structure and content of Web sites in order to discern their functionality. It allows for distinguishing between eight of the most relevant functional classes of Web sites. We show that a pre-classification of Web sites utilizing structural properties considerably improves a subsequent textual classification with standard techniques. We evaluate this approach on a dataset comprising more than 16,000 Web sites with about 20 million crawled and 100 million known Web pages. Our approach achieves an accuracy of 92% for the coarse-grained classification of these Web sites.</description>
	</item>
	<atom:link href="http://tc.eserver.org/authors/Lindemann,_Christoph_and_Lars_Littig.xml" rel="self" type="application/rss+xml"/>
</channel>
</rss>