<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SEO Hardcore &#187; robots.txt</title>
	<atom:link href="http://blog.seo-hardcore.com/category/robots-txt/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.seo-hardcore.com</link>
	<description>Organic SEO Programming</description>
	<lastBuildDate>Mon, 06 Feb 2012 05:02:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Preventing unwanted robots from crawling your site.</title>
		<link>http://blog.seo-hardcore.com/robots-txt/preventing-unwanted-robots-from-crawling-your-site#utm_source=feed&#038;utm_medium=feed&#038;utm_campaign=feed</link>
		<comments>http://blog.seo-hardcore.com/robots-txt/preventing-unwanted-robots-from-crawling-your-site#comments</comments>
		<pubDate>Tue, 27 May 2008 16:26:14 +0000</pubDate>
		<dc:creator>John Sisler</dc:creator>
				<category><![CDATA[robots.txt]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/?p=93</guid>
		<description><![CDATA[There are a lot of good reasons to use a robots.txt file but one of the increasingly important ones is to prevent unwanted visits from robots. If you notice in your logs that there are a lot of user-agents that you don&#8217;t recognize you may be getting visits from crawlers that add no value to [...]]]></description>
			<content:encoded><![CDATA[<p>There are a lot of good reasons to use a robots.txt file but one of the increasingly important ones is to prevent unwanted visits from robots. If you notice in your logs that there are a lot of user-agents that you don&#8217;t recognize you may be getting visits from crawlers that add no value to your site and simply digest bandwidth.</p>
<p>A <a title="comprehensive list of robots" href="http://www.siteware.ch/webresources/useragents/db.html">comprehensive list of robots</a> can help educate you about which crawlers are out there and which may bring you the most value.</p>
<p>Also you can fin a list of <a title="robots commands" href=" http://www.searchtools.com/robots/robots-txt.html#utm_source=feed&amp;utm_medium=feed&amp;utm_campaign=feed">robots commands</a> and a <a title="robots.txt file generator " href=" http://www.mcanerin.com/EN/search-engine/robots-txt.asp#utm_source=feed&amp;utm_medium=feed&amp;utm_campaign=feed">robots.txt file generator.</a></p>
<p>As an example the section below allows certain crawlers while shoo-ing away others:</p>
<blockquote><p># For domain: http://www.domain.com</p>
<p>User-agent: Googlebot</p>
<p>Disallow:</p>
<p>User-agent: Googlebot-Image</p>
<p>Disallow:</p>
<p>User-agent: MSNBot</p>
<p>Disallow:</p>
<p>User-agent: Slurp</p>
<p>Disallow:</p>
<p>User-agent: Teoma</p>
<p>Disallow:</p>
<p>User-agent: Gigabot</p>
<p>Disallow:</p>
<p>User-agent: Scrubby</p>
<p>Disallow:</p>
<p>User-agent: Robozilla</p>
<p>Disallow:</p>
<p>User-agent: Nutch</p>
<p>Disallow:</p>
<p>User-agent: ia_archiver</p>
<p>Disallow:</p>
<p>User-agent: baiduspider</p>
<p>Disallow:</p>
<p>User-agent: yahoo-mmcrawler</p>
<p>Disallow:</p>
<p>User-agent: psbot</p>
<p>Disallow:</p>
<p>User-agent: asterias</p>
<p>Disallow:</p>
<p>User-agent: yahoo-blogs/v3.9</p>
<p>Disallow:</p>
<p># Shoo</p>
<p>User-agent: *</p>
<p>Disallow: /</p>
<p>Disallow: /cgi-bin/</p>
<p># Disallow: /images/ &#8211; uncomment line with correct path for images</p>
<p># File exclusions</p>
<p>Disallow: /dir/Privacy-Policy</p>
<p>Disallow: /dir/Security</p>
<p># Sitemap declaration</p>
<p>sitemap: http://www.domain.com/sitemap.xml</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/robots-txt/preventing-unwanted-robots-from-crawling-your-site/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->
