<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>SEO Hardcore</title>
	<atom:link href="http://blog.seo-hardcore.com/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.seo-hardcore.com</link>
	<description>Search Engine Optimization</description>
	<pubDate>Tue, 03 Jun 2008 19:55:46 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5</generator>
	<language>en</language>
			<item>
		<title>SharePoint 2007 301 redirects</title>
		<link>http://blog.seo-hardcore.com/caching/sharepoint-2007-301-redirects</link>
		<comments>http://blog.seo-hardcore.com/caching/sharepoint-2007-301-redirects#comments</comments>
		<pubDate>Wed, 28 May 2008 19:42:31 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[MOSS]]></category>

		<category><![CDATA[caching]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/?p=95</guid>
		<description><![CDATA[Each time you request a Site Collection (http://domain/) or a Site (http://domain/foo/) of your Publishing Site you get redirected to the .aspx&#8221;&#62;http://domain/Pages/&#60;WelcomePage&#62;.aspx. SharePoint 2007 uses the 302 header (location temporarily moved) for this purpose. Surprisingly even WSS uses the 302 header to redirect a root url to the default.aspx. In comparison ASP.NET uses an internal [...]]]></description>
			<content:encoded><![CDATA[<p>Each time you request a Site Collection (<a href="http://domain/">http://domain/</a>) or a Site (<a href="http://domain/foo/">http://domain/foo/</a>) of your Publishing Site you get redirected to the <a href="http://domain/Pages/%3CWelcomePage%3E.aspx">.aspx&#8221;&gt;http://domain/Pages/&lt;WelcomePage&gt;.aspx</a>. SharePoint 2007 uses the 302 header (location temporarily moved) for this purpose. Surprisingly even WSS uses the 302 header to redirect a root url to the default.aspx. In comparison ASP.NET uses an internal redirect to render the default page when the root url requested: there is no redirect in this situation.</p>
<p>The whole issue about the 302 headers is that the redirected locations don&#8217;t get crawled by search spiders which don&#8217;t follow temporarily moved pages. While it&#8217;s not really an issue for intranet environments it has major impact on indexing the content of Internet-facing web sites and making them searchable using a search engine.</p>
<p>Looking for an answer I have researched the SharePoint runtime: SPHttpHandler, SPRequestModule and PublishingHttpModule classes. As none of these has given me a clear answer I have noticed that there are multiple references to the Redirect method present in the code which uses the 302 header as well.</p>
<p>To solve the issue I have designed a custom redirect HttpModule which uses 301 headers instead.</p>
<h2>The requirements</h2>
<p>The module must rewrite all request for a Site Collection or Site. Url&#8217;s of these request might but don&#8217;t have to contain trailing slash (/). Furthermore the module must distinct a WSS request from a Publishing Site / Publishing Web request. Also the module has to be aware of Variations if used by the Site Collection.</p>
<h2>The work</h2>
<p>Firs of all we create a new HttpModule. As we want the redirect to find place as soon as possible we will hook it up in the BeginRequest event. Furthermore we want the module to be the first one to interact with the request. As we use an external assembly we need to define it as the first element in the httpModules section of web.config.</p>
<blockquote>
<pre class="code"><span>namespace</span> Imtech.SharePoint.Enhancement.HttpModules
{
<span>public</span> <span>class</span> <span>RedirectModule</span> : <span>IHttpModule
</span>    {
<span>        #region</span> IHttpModule Members

<span>public</span> <span>void</span> Dispose()
{ }

<span>public</span> <span>void</span> Init(<span>HttpApplication</span> context)
{
context.BeginRequest +=
<span>new</span> <span>EventHandler</span>(context_BeginRequest);
}

<span>void</span> context_BeginRequest(<span>object</span> sender, <span>EventArgs</span> e)
{
<span>HttpApplication</span> app = (<span>HttpApplication</span>)sender;
<span>string</span> requestUrl = app.Request.Url.ToString();
}

<span>        #endregion
</span>    }
}</pre>
</blockquote>
<p>Because we will need the Request url later on in quite a few places I have decided to store it in a separate variable.</p>
<p>The first requirement states that the module should redirect only requests for Site Collections and Sites. If the requirement wouldn&#8217;t have say that the trailing slash is optional you could solve it using a simple if (requestUrl.EndsWith(&#8221;/&#8221;)). In our situation we will have to use a Regular Expression in order to figure out whether we need to rewrite the url or not.</p>
<blockquote>
<pre class="code"><span>Regex</span> regEx =
<span>new</span> <span>Regex</span>(<span>@&#8221;^https?://.*(?&lt;itemUrl&gt;/[^/]+\.[^/\.]+)$&#8221;</span>);
<span>if</span> (regEx.IsMatch(requestUrl))
<span>return</span>;

<span>if</span> (!requestUrl.EndsWith(<span>&#8220;/&#8221;</span>,
<span>StringComparison</span>.CurrentCulture))
requestUrl += <span>&#8220;/&#8221;</span>;</pre>
</blockquote>
<p>If the url matches the regular expression it means it&#8217;s a page request and should be passed on along the request pipeline unaltered. Later in the module we will combine the request url with the page url. As the trailing slash is optional I have decided to add it at the end if not present - just to be sure that combining the destination url of different parts will produce correct result.</p>
<p>The next requirement is distinction between WSS and Publishing Site requests.</p>
<blockquote>
<pre class="code"><span>string</span> destinationUrl = <span>String</span>.Empty;

<span>SPSecurity</span>.RunWithElevatedPrivileges(<span>delegate</span>()
{
<span>try
</span>    {
<span>using</span> (<span>SPSite</span> site = <span>new</span> <span>SPSite</span>(requestUrl))
{
<span>using</span> (<span>SPWeb</span> web = site.OpenWeb())
{
<span>if</span> (<span>PublishingWeb</span>.IsPublishingWeb(web))
destinationUrl = <span>String</span>.Concat(requestUrl,
publishingWeb.DefaultPage.Url);
<span>else
</span>                    destinationUrl = <span>String</span>.Concat(requestUrl,
<span>&#8220;default.aspx&#8221;</span>);
}
}
}
<span>catch</span> { }
});</pre>
</blockquote>
<p>Based on the request url we create a new instance of SPSite and then open the requested web. As we can fail at this point already (for example when passing a list url) I have decided to catch the thrown exception to avoid turning the request into an error message. The distinction itself is quite straight forward and makes use of the IsPublishingWeb method. One important thing: because we are very likely to use the module for anonymous users we need to run the code with elevated privileges: the IsPublishingWeb method requires some extra permission in order to run.</p>
<p>Our last requirement was making the redirect module aware of Variations if used by the Site Collection. Depending on the requirements defined by your customer you might need to implement the standard SharePoint Variation logic which chooses the variation basin on the User Agent language settings. Unfortunately most users are not aware of the existence and usage possibilities of the language settings most of our customers choose to load the Dutch variation by default. If your customer requires the standard SharePoint approach you would need to implement the logic from the VariationRootLanding User Control in the ControlTemplates directory. I will focus on the scenario we&#8217;re using.</p>
<blockquote>
<pre class="code"><span>PublishingWeb</span> publishingWeb = <span>PublishingWeb</span>.GetPublishingWeb(web);
<span>if</span> (publishingWeb.DefaultPage.Url.EndsWith(<span>&#8220;/VariationRoot.aspx&#8221;</span>,
<span>StringComparison</span>.CurrentCultureIgnoreCase))
{
<span>string</span> defaultPage = <span>String</span>.Empty;
<span>using</span> (<span>SPWeb</span> nlWeb = site.OpenWeb(<span>&#8220;nl&#8221;</span>))
{
defaultPage =
<span>PublishingWeb</span>.GetPublishingWeb(nlWeb).DefaultPage.Url;
}

destinationUrl = <span>String</span>.Concat(requestUrl, <span>&#8220;nl/&#8221;</span>, defaultPage);
}
<span>else
</span>    destinationUrl =
<span>String</span>.Concat(requestUrl, publishingWeb.DefaultPage.Url);</pre>
</blockquote>
<p>In most scenarios the variation redirect finds place at the Site Collection level. The default page of the root web is then set to Pages/VariationRoot.aspx. Knowing this we can check whether we need to use the variation redirect or not. The rest is quite straight-forward: we obtain the Dutch site and its Welcome Page.</p>
<p>The last part is the redirect itself using the 301 header:</p>
<blockquote>
<pre class="code"><span>if</span> (!<span>String</span>.IsNullOrEmpty(destinationUrl))
{
app.Response.AddHeader(<span>&#8220;Location&#8221;</span>, destinationUrl);
app.Response.StatusCode = 301;
}</pre>
</blockquote>
<p>The destination url might be empty if an exception has occurred during the request processing. We will therefore redirect only if a destination url has been set by our module.</p>
<p>Putting it all together:</p>
<blockquote>
<pre class="code"><span>namespace</span> Imtech.SharePoint.Enhancement.HttpModules
{
<span>public</span> <span>class</span> <span>RedirectModule</span> : <span>IHttpModule
</span>  {
<span>    #region</span> IHttpModule Members

<span>public</span> <span>void</span> Dispose()
{ }

<span>public</span> <span>void</span> Init(<span>HttpApplication</span> context)
{
context.BeginRequest +=
<span>new</span> <span>EventHandler</span>(context_BeginRequest);
}

<span>void</span> context_BeginRequest(<span>object</span> sender, <span>EventArgs</span> e)
{
<span>HttpApplication</span> app = (<span>HttpApplication</span>)sender;
<span>string</span> requestUrl = app.Request.Url.ToString();
<span>Regex</span> regEx =
<span>new</span> <span>Regex</span>(<span>@&#8221;^https?://.*(?&lt;itemUrl&gt;/[^/]+\.[^/\.]+)$&#8221;</span>);
<span>if</span> (regEx.IsMatch(requestUrl))
<span>return</span>;

<span>if</span> (!requestUrl.EndsWith(<span>&#8220;/&#8221;</span>,
<span>StringComparison</span>.CurrentCulture))
requestUrl += <span>&#8220;/&#8221;</span>;

<span>string</span> destinationUrl = <span>String</span>.Empty;

<span>SPSecurity</span>.RunWithElevatedPrivileges(<span>delegate</span>()
{
<span>try
</span>        {
<span>using</span> (<span>SPSite</span> site = <span>new</span> <span>SPSite</span>(requestUrl))
{
<span>using</span> (<span>SPWeb</span> web = site.OpenWeb())
{
<span>if</span> (<span>PublishingWeb</span>.IsPublishingWeb(web))
{
<span>PublishingWeb</span> publishingWeb =
<span>PublishingWeb</span>.GetPublishingWeb(web);
<span>if</span> (publishingWeb.DefaultPage.Url.
EndsWith(<span>&#8220;/VariationRoot.aspx&#8221;</span>,
<span>StringComparison</span>.CurrentCultureIgnoreCase))
{
<span>string</span> defaultPage = <span>String</span>.Empty;
<span>using</span> (<span>SPWeb</span> nlWeb = site.OpenWeb(<span>&#8220;nl&#8221;</span>))
{
defaultPage =
<span>                       PublishingWeb</span>.GetPublishingWeb(nlWeb)
.DefaultPage.Url;
}

destinationUrl = <span>String</span>.Concat(requestUrl,
<span>&#8220;nl/&#8221;</span>, defaultPage);
}
<span>else
</span>                  destinationUrl = <span>String</span>.Concat(requestUrl,
publishingWeb.DefaultPage.Url);
}
<span>else
</span>                destinationUrl = <span>String</span>.Concat(requestUrl, <span>&#8220;default.aspx&#8221;</span>);
}
}
}
<span>catch</span> { }
});

<span>if</span> (!<span>String</span>.IsNullOrEmpty(destinationUrl))
{
app.Response.AddHeader(<span>&#8220;Location&#8221;</span>, destinationUrl);
app.Response.StatusCode = 301;
}
}

<span>    #endregion
</span>  }
}</pre>
</blockquote>
<p>To see it working build the project, copy the assembly to the bin directory of your web application and add the following element to the httpModules section of the web.config:</p>
<blockquote>
<pre class="code"><span>&lt;</span><span>add</span><span> </span><span>name</span><span>=</span>&#8220;<span>ImtechRedirectModule</span>&#8220;<span style="color: #0000ff;">
</span><span>type</span><span>=</span>&#8220;<span>Imtech.SharePoint.Enhancement.HttpModules.RedirectModule</span>&#8220;<span> /&gt;</span></pre>
</blockquote>
<h2>Summary</h2>
<p>Redirects using the 302 header can form a serious issue on Internet-facing web sites as it comes to indexing the content of a web site. Using custom HttpModules to overrule the standard behavior of SharePoint is a flexible solution for this challenge.<br />
The example above should work good enough in most scenarios. Depending on the requirements of your customers you might need to extend it with some extra functionality like for example standard Variations logic support. Custom HttpModules prove the extensibility and flexibility of SharePoint 2007 and the way it can be made to fit various requirements and scenarios.</p>
<p>Published             <a href="http://www.sharepointblogs.com/tmt/archive/2008/01/21/sharepoint-2007-redirect-solved-using-301-instead-of-302-redirects.aspx">Jan 21 2008, 10:46 AM</a> by             <a href="http://www.sharepointblogs.com/members/Waldek-Mastykarz.aspx">Waldek Mastykarz</a> <span id="ctl00_ctl00_ctl00_bcr_bcr_bcr_ctl09_ctl01">Filed under: <a rel="tag" href="http://www.sharepointblogs.com/tmt/archive/tags/Web+Content+Management/default.aspx">Web Content Management</a>, <a rel="tag" href="http://www.sharepointblogs.com/tmt/archive/tags/SharePoint+customization/default.aspx">SharePoint customization</a>, <a rel="tag" href="http://www.sharepointblogs.com/tmt/archive/tags/Accessibility/default.aspx">Accessibility</a>, <a rel="tag" href="http://www.sharepointblogs.com/tmt/archive/tags/SharePoint+Best+Practices/default.aspx">SharePoint Best Practices</a></span></p>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects&amp;title=SharePoint+2007+301+redirects" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects&amp;title=SharePoint+2007+301+redirects" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects&amp;title=SharePoint+2007+301+redirects" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects&amp;title=SharePoint+2007+301+redirects" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects&amp;title=SharePoint+2007+301+redirects', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects&amp;title=SharePoint+2007+301+redirects" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fcaching%2Fsharepoint-2007-301-redirects&amp;title=SharePoint+2007+301+redirects" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/caching/sharepoint-2007-301-redirects/feed</wfw:commentRss>
		</item>
		<item>
		<title>Preventing unwanted robots from crawling your site.</title>
		<link>http://blog.seo-hardcore.com/robotstxt/preventing-unwanted-robots-from-crawling-your-site</link>
		<comments>http://blog.seo-hardcore.com/robotstxt/preventing-unwanted-robots-from-crawling-your-site#comments</comments>
		<pubDate>Tue, 27 May 2008 16:26:14 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[robots.txt]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/?p=93</guid>
		<description><![CDATA[There are a lot of good reasons to use a robots.txt file but one of the increasingly important ones is to prevent unwanted visits from robots. If you notice in your logs that there are a lot of user-agents that you don&#8217;t recognize you may be getting visits from crawlers that add no value to [...]]]></description>
			<content:encoded><![CDATA[<p>There are a lot of good reasons to use a robots.txt file but one of the increasingly important ones is to prevent unwanted visits from robots. If you notice in your logs that there are a lot of user-agents that you don&#8217;t recognize you may be getting visits from crawlers that add no value to your site and simply digest bandwidth.</p>
<p>A <a title="comprehensive list of robots" href="http://www.siteware.ch/webresources/useragents/db.html">comprehensive list of robots</a> can help educate you about which crawlers are out there and which may bring you the most value.</p>
<p>Also you can fin a list of <a title="robots commands" href=" http://www.searchtools.com/robots/robots-txt.html">robots commands</a> and a <a title="robots.txt file generator " href=" http://www.mcanerin.com/EN/search-engine/robots-txt.asp">robots.txt file generator.</a></p>
<p>As an example the section below allows certain crawlers while shoo-ing away others:</p>
<blockquote><p># For domain: http://www.domain.com</p>
<p>User-agent: Googlebot</p>
<p>Disallow:</p>
<p>User-agent: Googlebot-Image</p>
<p>Disallow:</p>
<p>User-agent: MSNBot</p>
<p>Disallow:</p>
<p>User-agent: Slurp</p>
<p>Disallow:</p>
<p>User-agent: Teoma</p>
<p>Disallow:</p>
<p>User-agent: Gigabot</p>
<p>Disallow:</p>
<p>User-agent: Scrubby</p>
<p>Disallow:</p>
<p>User-agent: Robozilla</p>
<p>Disallow:</p>
<p>User-agent: Nutch</p>
<p>Disallow:</p>
<p>User-agent: ia_archiver</p>
<p>Disallow:</p>
<p>User-agent: baiduspider</p>
<p>Disallow:</p>
<p>User-agent: yahoo-mmcrawler</p>
<p>Disallow:</p>
<p>User-agent: psbot</p>
<p>Disallow:</p>
<p>User-agent: asterias</p>
<p>Disallow:</p>
<p>User-agent: yahoo-blogs/v3.9</p>
<p>Disallow:</p>
<p># Shoo</p>
<p>User-agent: *</p>
<p>Disallow: /</p>
<p>Disallow: /cgi-bin/</p>
<p># Disallow: /images/ - uncomment line with correct path for images</p>
<p># File exclusions</p>
<p>Disallow: /dir/Privacy-Policy</p>
<p>Disallow: /dir/Security</p>
<p># Sitemap declaration</p>
<p>sitemap: http://www.domain.com/sitemap.xml</p></blockquote>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site&amp;title=Preventing+unwanted+robots+from+crawling+your+site." title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site&amp;title=Preventing+unwanted+robots+from+crawling+your+site." title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site&amp;title=Preventing+unwanted+robots+from+crawling+your+site." title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site&amp;title=Preventing+unwanted+robots+from+crawling+your+site." title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site&amp;title=Preventing+unwanted+robots+from+crawling+your+site.', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site&amp;title=Preventing+unwanted+robots+from+crawling+your+site." title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Frobotstxt%2Fpreventing-unwanted-robots-from-crawling-your-site&amp;title=Preventing+unwanted+robots+from+crawling+your+site." title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/robotstxt/preventing-unwanted-robots-from-crawling-your-site/feed</wfw:commentRss>
		</item>
		<item>
		<title>The top 10 ways to twist your ankle doing search.</title>
		<link>http://blog.seo-hardcore.com/search-fun/the-top-10-ways-to-twist-your-ankle-doing-search</link>
		<comments>http://blog.seo-hardcore.com/search-fun/the-top-10-ways-to-twist-your-ankle-doing-search#comments</comments>
		<pubDate>Tue, 06 May 2008 09:39:24 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[search fun]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/fun-with-search/the-top-10-ways-to-twist-your-ankle-doing-search</guid>
		<description><![CDATA[My counselsee had an accident last night and had to alert us that he would not be in today.  My counselor responded asking &#8220;How do you twist your ankle doing search?&#8221;
The truth is that there are several work related hazards to SEO and here are the top 10:
The top 10 reasons people twist their [...]]]></description>
			<content:encoded><![CDATA[<p>My counselsee had an accident last night and had to alert us that he would not be in today.  My counselor responded asking &#8220;How do you twist your ankle doing search?&#8221;</p>
<p>The truth is that there are several work related hazards to SEO and here are the top 10:</p>
<p class="MsoPlainText"><span style="font-family: ">The top 10 reasons people twist their ankle doing Search:</span></p>
<p class="MsoPlainText"><span style="font-family: ">10. Viscosity of link juice</span></p>
<p class="MsoPlainText"><span style="font-family: ">9.   Stumbled into a slow moving nofollow tag</span></p>
<p class="MsoPlainText"><span style="font-family: ">8.   Ankle refused NO Other Darn Position - noodp</span></p>
<p class="MsoPlainText"><span style="font-family: ">7.   Server Head = 302; Server ankle= 301</span></p>
<p class="MsoPlainText"><span style="font-family: ">6.   Tripped on hem in cloak through a doorway page</span></p>
<p class="MsoPlainText"><span style="font-family: ">5.   Landing page height mis-judged </span></p>
<p class="MsoPlainText"><span style="font-family: ">4.   Right foot out of synch while left observes latent semantic indexing </span></p>
<p class="MsoPlainText"><span style="font-family: ">3.   Ankle suffered http compression</span></p>
<p class="MsoPlainText"><span style="font-family: ">2.   No friendly URLs to protect ankle from hostile URLs</span></p>
<p class="MsoPlainText"><span style="font-family: ">1.   Linko de Mayo</span></p>
<p class="MsoPlainText">If you have experienced or know of some that i forgot please let me know.</p>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search&amp;title=The+top+10+ways+to+twist+your+ankle+doing+search." title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search&amp;title=The+top+10+ways+to+twist+your+ankle+doing+search." title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search&amp;title=The+top+10+ways+to+twist+your+ankle+doing+search." title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search&amp;title=The+top+10+ways+to+twist+your+ankle+doing+search." title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search&amp;title=The+top+10+ways+to+twist+your+ankle+doing+search.', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search&amp;title=The+top+10+ways+to+twist+your+ankle+doing+search." title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fsearch-fun%2Fthe-top-10-ways-to-twist-your-ankle-doing-search&amp;title=The+top+10+ways+to+twist+your+ankle+doing+search." title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/search-fun/the-top-10-ways-to-twist-your-ankle-doing-search/feed</wfw:commentRss>
		</item>
		<item>
		<title>.htaccess commands</title>
		<link>http://blog.seo-hardcore.com/spidering/htaccess-commands</link>
		<comments>http://blog.seo-hardcore.com/spidering/htaccess-commands#comments</comments>
		<pubDate>Fri, 04 Jan 2008 16:39:46 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[duplicate content]]></category>

		<category><![CDATA[htaccess]]></category>

		<category><![CDATA[spidering]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/spidering/htaccess-commands</guid>
		<description><![CDATA[1. Redirect to Files or Directories
Redirect permanent /oldfile.html http://www.domain.com/filename.html or
Redirect 301 /oldfile.html http://www.domain.com/filename.html
Redirect 301 /olddirectory/oldfile.html http://www.domain.com/newdirectory/newfile.html
Redirect 301 www.olddomain.com/
Also you can program your home page to always load the www.domain.com version of the URL regardless of your actualy homepage file or a user not typing www.
1a.To redirect the non-www version use:
Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^domain.com [NC]
RewriteRule [...]]]></description>
			<content:encoded><![CDATA[<p><strong>1. Redirect to Files or Directories</strong></p>
<p>Redirect permanent /oldfile.html http://www.domain.com/filename.html or<br />
Redirect 301 /oldfile.html http://www.domain.com/filename.html</p>
<p>Redirect 301 /olddirectory/oldfile.html http://www.domain.com/newdirectory/newfile.html</p>
<p>Redirect 301 www.olddomain.com/</p>
<p>Also you can program your home page to always load the www.domain.com version of the URL regardless of your actualy homepage file or a user not typing www.</p>
<p>1a.To redirect the non-www version use:</p>
<p>Options +FollowSymLinks<br />
RewriteEngine on<br />
RewriteCond %{HTTP_HOST} ^domain.com [NC]<br />
RewriteRule ^(.*)$ http://www.domain.com/$1 [L,R=301]</p>
<p>1b.To prevent  your default filename from loading use:</p>
<p>Options +FollowSymLinks<br />
RewriteEngine on<br />
# index.php to /<br />
RewriteCond %{THE_REQUEST} ^[A-Z]{3, 9}\ /.*index\.php\ HTTP/<br />
RewriteRule ^(.*)index\.php$ /$1 [R=301,L]</p>
<p><strong>2. Change the Default Directory Page</strong></p>
<p>Directory Index <em>yourfilename</em>.html</p>
<p><strong>3. Allow/Prevent Directory Browsing</strong></p>
<p>IndexIgnore */*</p>
<p><strong>4. Allow SSI in .html files</strong></p>
<p>AddType text/html .html AddHandler server-parsed .html AddHandler server-parsed .htm</p>
<p><strong>5. Keep Unwanted Users Out</strong></p>
<p>order allow,deny deny from 123.456.78.90 deny from 123.456.78 deny from .aol.com allow from all</p>
<p><strong>6. Prevent Linking to Your Images</strong></p>
<p>RewriteEngine on RewriteCond %{HTTP_REFERER} !^$ RewriteCond %{HTTP_REFERER} !^http://(www.)?domain.com/.*$ [NC] RewriteRule .(gif|jpg)$ - [F]</p>
<p><strong>7. Stop the Email Collectors</strong></p>
<p>RewriteCond %{HTTP_USER_AGENT} Wget [OR] RewriteCond %{HTTP_USER_AGENT} CherryPickerSE [OR] RewriteCond %{HTTP_USER_AGENT} CherryPickerElite [OR] RewriteCond %{HTTP_USER_AGENT} EmailCollector [OR] RewriteCond %{HTTP_USER_AGENT} EmailSiphon [OR] RewriteCond %{HTTP_USER_AGENT} EmailWolf [OR] RewriteCond %{HTTP_USER_AGENT} ExtractorPro RewriteRule ^.*$ X.html [L]</p>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands&amp;title=.htaccess+commands" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands&amp;title=.htaccess+commands" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands&amp;title=.htaccess+commands" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands&amp;title=.htaccess+commands" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands&amp;title=.htaccess+commands', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands&amp;title=.htaccess+commands" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fhtaccess-commands&amp;title=.htaccess+commands" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/spidering/htaccess-commands/feed</wfw:commentRss>
		</item>
		<item>
		<title>Search in 2007</title>
		<link>http://blog.seo-hardcore.com/spidering/search-in-2007</link>
		<comments>http://blog.seo-hardcore.com/spidering/search-in-2007#comments</comments>
		<pubDate>Wed, 02 Jan 2008 22:47:54 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[Local Search]]></category>

		<category><![CDATA[Universal search]]></category>

		<category><![CDATA[analytics]]></category>

		<category><![CDATA[google]]></category>

		<category><![CDATA[microsoft]]></category>

		<category><![CDATA[spidering]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/spidering/search-in-2007</guid>
		<description><![CDATA[The following was originally presented on SEW
Universal/Blended search

What Does Universal Search Mean for SEM?
Ask Launches Ask3D
Universal Search Coming to Microsoft&#8217;s Live Search
Yahoo Search Gets Blended, Helpful
The Real Impact of Blended Search

Google Universal search

Google Goes Universal, Adds Navigation
Will Universal Search Mean Universal Domination?
Will Universal Search Drive Growth of Google&#8217;s Vertical Search Properties?
The Impact of Universal Search
What [...]]]></description>
			<content:encoded><![CDATA[<p>The following was originally presented on SEW</p>
<h3>Universal/Blended search</h3>
<ul>
<li><a href="http://searchenginewatch.com/3625951">What Does Universal Search Mean for SEM?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070605-000001">Ask Launches Ask3D</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070925-094532">Universal Search Coming to Microsoft&#8217;s Live Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071002-084252">Yahoo Search Gets Blended, Helpful</a></li>
<li><a href="http://searchenginewatch.com/3627322">The Real Impact of Blended Search</a></li>
</ul>
<h3>Google Universal search</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070516-164804">Google Goes Universal, Adds Navigation</a></li>
<li><a href="http://searchenginewatch.com/3625897">Will Universal Search Mean Universal Domination?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070518-094406">Will Universal Search Drive Growth of Google&#8217;s Vertical Search Properties?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070525-114700">The Impact of Universal Search</a></li>
<li><a href="http://searchenginewatch.com/3625951">What Does Universal Search Mean for SEM?</a></li>
<li><a href="http://searchenginewatch.com/3625985">Universal Search for Small Business: What You Need to Know About Google&#8217;s New Search Results</a></li>
<li><a href="http://searchenginewatch.com/3626082">Distilling Universal Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070615-111019">The End of SEO, or a New Beginning?</a></li>
</ul>
<h3>Yahoo Panama</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070130-175340">Yahoo Site Explorer Gets Updates</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070212-103748">Yahoo Outlines Keys to PPC Success: Test, Test, Test</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070228-135132">Panama from a Big Agency Perspective</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070226-101007">comScore Defines Panama Effects</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070307-174902">More Early Yahoo Panama Results</a></li>
<li><a href="http://searchenginewatch.com/3625185">Early Returns Encouraging for Panama</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070306-163812">Time to Put the Pipe Wrench Away? Semel Says: &#8220;We Fixed the Plumbing&#8221;</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070316-100708">Panama Ad Ranking Algo Explained</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071031-132255">Yahoo Quietly Making Algorithms Changes</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071129-141850">Yahoo Fine Tuning Panama Tools, Launches Campaign Tune Up and More</a></li>
</ul>
<h3>Yahoo Reorgs</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070215-111653">Yahoo Reorg In Full Effect: Advertiser and Publisher Group Changes</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070212-112034">Yahoo Builds &#8220;Brickhouse&#8221;</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070315-091722">Yahoo Puts Addition on Brickhouse</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070516-104753">Yahoo&#8217;s New Mission</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070618-163330">Semel Ousted as Yahoo CEO</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070624-145933">Yahoo Combining Search And Display</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070626-125746">Yahoo Keeps Creating its Own Troubles</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070830-080836">More Exec Moves at Yahoo</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070910-151445">Overhaul, or Tune-Up at Yahoo?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071016-092938">Is Yahoo Gaining Momentum?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071019-171251">Yahoo CMO Resigns, Heading for Big Consumer Brand?</a></li>
</ul>
<h3>Ask 3D</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070417-225318">Ask CEO Explains &#8220;The Algorithm&#8221;</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070417-160247">The Algorithm Constantly Finds Jesus</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070425-000108">Ask to Launch Contextual Ad Network</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070503-202150">Ask Brings the Algorithm Mainstream</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070523-001236">What Should Ask.com Do?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070605-000001">Ask Launches Ask3D</a></li>
<li><a href="http://searchenginewatch.com/3626058">Ask.com Launches Major Updates</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070605-035441">Ask.com: The Other Search Engine?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070629-101652">WSJ&#8217;s Mossberg Likes Ask3D</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070814-112943">Ask Ads Shift Direction (for the Better)</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070926-152416">Ask To Launch New PPC Platform October 2</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071105-105415">IAC Splitting Into Five Separate Companies</a></li>
</ul>
<h3>Microsoft Live Search/AdCenter</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070126-113750">Microsoft Still Hopeful, But Not Happy With Search Share</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070411-093104">MSN adCenter Update Coming Soon</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070417-145141">Microsoft Announces Changes To adCenter</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070509-105504">Gates Plans to Spend Remaining Time Focusing on Search</a></li>
<li><a href="http://searchenginewatch.com/3626607">Microsoft adCenter Could Win the Search Battle</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070927-010101">Microsoft Updates Live Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070927-092553">Overview of Microsoft Searchification</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071029-145031">Microsoft adCenter Adds Immediate Editorial Updates, Daily Spend Limits</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071203-075942">Can Better Tools Make Up for Less Traffic for Microsoft?</a></li>
<li><a href="http://searchenginewatch.com/3627758">Microsoft Adds More Tools for Search Marketers</a></li>
</ul>
<h3>AOL Platform A</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070214-052916">AOL Turns Up Dial on FullView Marketing</a></li>
<li><a href="http://www.clickz.com/3627048">AOL Creates Platform A, New Ad Network Division</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071107-114156">AOL Adds Quigo to Platform A</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071112-111712">AOL Acquires Q&amp;A Platform Yedda</a></li>
</ul>
<h3>Social Media</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070206-103853">Linkbait Rules As Linkbait - Masterful</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070209-182832">Facebook on the Rise</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070214-111207">Where the Influencers Roam</a></li>
<li><a href="http://searchenginewatch.com/3625530">Why Are Search Marketers Getting Social?</a></li>
<li><a href="http://searchenginewatch.com/3625819">Social Media Marketing Do&#8217;s and Don&#8217;ts</a></li>
<li><a href="http://searchenginewatch.com/3626187">Can Social Media Work in Big Business?</a></li>
<li><a href="http://searchenginewatch.com/3626222">Finding Success in Social Media</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070907-143157">AOL announces changes to Netscape</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070913-120529">AOL Moves Social News from Netscape.com to Propeller.com</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071031-115541">Google to Launch OpenSocial APIs</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071102-103611">OpenSocial and Search Marketing</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071107-112118">Here Come the Social Network Ads</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071121-093318">Privacy Group Takes on Facebook</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071130-163226">Facebook Alters Beacon program</a></li>
</ul>
<h3>Personalization &amp; Customization of SERPs</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070202-233347">More Users to See Personalized Google Results</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070207-082028">What Personalization Means to SEOs</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070302-114355">Cutts: The End is Near for Black Hat SEO</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070302-163419">Personalization, SEO, and Web Marketing</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070501-132505">Google Gets Personal with iGoogle</a></li>
<li><a href="http://searchenginewatch.com/3626550">Yahoo Testing New Query Refinement Tools</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071217-090143">Google&#8217;s Sep Kamvar Discusses Personalization</a></li>
</ul>
<h3>Quality Score of Ads</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070123-192338">Yahoo To Add Quality Score To Panama Feb. 5th</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070202-114140">Yahoo Gives Tips To Improve Quality Score</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070205-095925">Yahoo Ad Ranking System Now Live</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070214-192520">Google to Update Ad Quality Scoring</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070604-211818">Yahoo Launches Quality-Based Pricing</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070609-124419">Google Quality Scoring: Changes And Insights</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070809-123204">Google Makes Changes to Ads Quality Calculation</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070809-133302">Implications of Google&#8217;s New Bid Calculations</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071023-121800">Google Sheds Light on Ad Quality Score</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071109-114909">Yahoo Shares Hints on Improving Quality Index</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071121-113928">BidHero Brings Analytics to PPC Bid Management</a></li>
</ul>
<h3>SEM/SEO 2.0</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070307-154052">The Future of Link Building</a></li>
<li><a href="http://searchenginewatch.com/3625530">Why Are Search Marketers Getting Social?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070615-111019">The End of SEO, or a New Beginning?</a></li>
<li><a href="http://searchenginewatch.com/showPage.html?page=3626453">Housebreaking the Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070727-080000">Can the Search Marketing Industry Grow Up Fast Enough?</a></li>
<li><a href="http://searchenginewatch.com/3626959">Everything You Assumed About Search Is About to Change</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070920-175033">Getting Vertical Key to Google&#8217;s Ad Success</a></li>
<li><a href="http://searchenginewatch.com/3627258">The Year Search Makes Contact With Everything</a></li>
<li><a href="http://searchenginewatch.com/3627570">The Future of Search: Strategy, Execution, Evolution</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071207-113007">Thoughts on Don Schultz&#8217;s Keynote at SES</a></li>
</ul>
<h3>Social Search</h3>
<ul>
<li><a href="http://searchenginewatch.com/3624837">A Look at the Next Generation of Search?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070206-095653">Eurekster Adds Community Features</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070309-160007">Wikia Search Plans Progressing</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070418-000638">The &#8220;Bon Mots&#8221; of Social Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070531-100604">Calacanis Launches Human-edited Search Engine</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070620-101131">Seth Godin Interview</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070710-093852">Mahalo Moves Forward</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070727-123000">Search Wikia Launches Open Source, Distributed Crawler</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071214-104308">&#8216;Knol&#8217; - Google Unit of Knowledge - Expert Authors Gathering</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071217-171124">Who&#8217;s Google Taking On with Knols?</a></li>
</ul>
<h3>Local Search</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/061226-192048">Vertical Strategies in Local Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070201-141420">SuperPages.com Combines Local Search with Social Networking</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070201-101300">A Look at Yahoo&#8217;s Local Ad Options</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070215-132648">Yahoo&#8217;s Local Ad Options</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070301-111715">Citysearch to Acquire Insider Pages</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070307-104442">Local Search, IYP Set to Grow</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070308-163958">Google Enhances its Local Business Center</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070314-140534">The Webification of SMBs</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070328-235249">Local Video Advertising on the Move</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070426-124939">Superpages.com Revises Ad Algorithm</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070627-103036">Marchex Launches Content-Rich Direct-Navigation Strategy</a></li>
<li><a href="http://searchenginewatch.com/showPage.html?page=3626457">Getting Started in Local Search</a></li>
<li><a href="http://searchenginewatch.com/showPage.html?page=3626655">Yellow Pages and Search</a></li>
<li><a href="http://searchenginewatch.com/3627000">The Local Search Landscape</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070927-171018">Marchex Connects Its Local-Vertical Network</a></li>
</ul>
<h3>Mobile Search</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070213-045125">Yahoo, Microsoft Get Mobile</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070320-000107">Yahoo Increases Reach of OneSearch</a></li>
<li><a href="http://searchenginewatch.com/showPage.html?page=3625413">Is This the Year for Mobile Search?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070911-132349">Google Mobile to Start Running AdWords</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070911-001853">The Year for Mobile Ads is Here&#8230;Really</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070910-090000">Gary Price on Mobile Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070918-100518">Google AdSense for Mobile</a></li>
</ul>
<h3>Paid Links</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070416-020746">Google Goes to War on Paid Text Links</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070122-123055">Wikipedia External Links Now &#8220;Nofollow&#8221;</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070126-113345">Googlebombs Defused</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070430-110549">Adam Lasnik on Paid Links</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070508-111607">More Search Marketers Weigh in on Paid Links</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070514-153234">Matt Cutts&#8217; Paid Links Update</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070613-120419">Google Webmaster Tools Adds Paid Links Reporting</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070823-224513">Paid Links Discussions at San Jose</a></li>
<li><a href="http://searchenginewatch.com/3626870">The Great Link Buying Debate</a></li>
<li><a href="http://searchenginewatch.com/3626945">The Paid Links Debate: Shades of Gray</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071025-101700">A look into the purpose of Google&#8217;s &#8220;PageRank update&#8221;</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071029-102542">Matt Cutts Confirms Google-Slap</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071203-093112">Google Makes Paid Link Guidelines Crystal Clear</a></li>
</ul>
<h3>Search in the Mix/Integration</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070129-100233">Plan Your SEO for the Long Haul</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070202-220922">Search Seen as Most Effective by Marketers</a></li>
<li><a href="http://searchenginewatch.com/3626025">In the Mix: Search in the Overall Marketing Mix</a></li>
<li><a href="http://searchenginewatch.com/3626276">Coordinating Search with External Media: Can Less Be More?</a></li>
<li><a href="http://searchenginewatch.com/3627080">Searching for Options: Integration Spells Sweet Success</a></li>
<li><a href="http://searchenginewatch.com/3627270">Is Your Paid Search Campaign Part of a Mix or a Mess?</a></li>
<li><a href="http://searchenginewatch.com/3627538">Making It Work: 6 Factors to Integrate Search with Other Best of Breed Partners</a></li>
<li><a href="http://searchenginewatch.com/showPage.html?page=3626347">The Beginning of the Fragmentation of Search</a></li>
<li><a href="http://searchenginewatch.com/3627017">Baking SEO into a Full Fledged Interactive Work Plan</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071204-000002">SEW Experts: SEMs Get &#8216;No Respect&#8217; No More</a></li>
</ul>
<h3>Click Fraud</h3>
<ul>
<li><a href="http://searchenginewatch.com/3624397">It&#8217;s Time to See What&#8217;s In the PPC Sausage</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070130-162405">Click Fraud Numbers Up, Content Networks Near 20%</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070201-103133">Google Refutes Click Fraud Numbers, Once Again </a></li>
<li><a href="http://searchenginewatch.com/3625316">Yahoo Steps Up Click Fraud Efforts</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070301-065431">Google Shares More Click Fraud Numbers</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070426-121615">Click Quality Council Outlines 8 Principles of Click Quality</a></li>
<li>Google, Yahoo Respond to Click Quality Council Guidelines</li>
<li><a href="http://blog.searchenginewatch.com/blog/070519-082108">Fair Isaac Click Fraud Report Spreads False Alarm</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070518-121721">Fair Isaac Pegs Billed Click Fraud at 10-15% – IN VERY LIMITED CASES</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070817-105801">Google, Yahoo Launch Click Fraud Resource Centers</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070914-101356">Forbes Looks at Click Fraud – and Doesn&#8217;t Sensationalize It!</a></li>
</ul>
<h3>G-phone/Android</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070313-173102">Google and Microsoft Rumors: Implications for Mobile Local Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070802-105638">The Gphone and Mobile Local Search</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071008-114817">GPhone a Linux-based Mobile OS?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071105-123949">Google Android: Mobile Platform, No Gphone (Yet)</a></li>
</ul>
<h3>Analytics</h3>
<li><a href="http://blog.searchenginewatch.com/blog/070905-101917">Stone Temple Consulting Publishes Web Analytics Shootout Results</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071016-120011">Google Analytics Update</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071030-113615">Microsoft &#8220;Gatineau&#8221; Analytics in Beta</a><br />
<h3>Aquisitions</h3>
<p><strong>Google-DoubleClick, Feedburner</strong></p>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070413-200905">Google to Buy DoubleClick</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070416-014859">Google/DoubleClick Deal Shakes Things Up</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070429-233127">Google and DoubleClick&#8217;s Integration Hurdles</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070601-142809">Google Acquires Feedburner</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070529-120302">FTC Looking at Google-DoubleClick Deal?</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070719-001440">Google-DoubleClick Deal Faces Congressional Scrutiny</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070928-112651">Google-DoubleClick Hearings Get Muddled</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071022-112404">Google, DoubleClick: Myths and Facts</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071101-130559">GoogleClick Clears One Hurdle, Another Looms</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071120-092235">Senate Subcommittee Cautions FTC on Google-DoubleClick Acquisition</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071220-101621">FTC approves Google&#8217;s acquisition of DoubleClick</a></li>
</ul>
<p><strong>Microsoft-aQuantive</strong></p>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070518-103342">Microsoft to Acquire aQuantive</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070813-114047">Microsoft Completes aQuantive Acquisition</a></li>
</ul>
<p><strong>Yahoo-Right Media</strong></p>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070430-000145">Yahoo to Acquire Right Media</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070713-141906">Yahoo Closes Right Media Deal</a></li>
</ul>
<p><strong>AOL</strong></p>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070516-084534">AOL on Ad Technology Buying Binge</a></li>
</ul>
<p><strong>Others</strong></p>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070412-130654">Interpublic Group Acquires Reprise Media</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070517-120022">WPP Snaps Up 24/7 Real Media</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070604-081431">360i in Management Buyout</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070726-114208">Business.com Acquired by Yellow Pages Publisher</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070809-180821">Marchex to Acquire Pay-per-Call Provider</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070907-094826">Omniture Acquires Offermatica</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070917-133259">Idearc Buys Switchboard, Other Assets from InfoSpace</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071026-095058">The Impact of the Omniture - Visual Sciences Merger</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/071119-125821">AT&amp;T To Acquire Pay Per Call Company Ingenio</a>
</ul>
<h3>SEW/SES</h3>
<ul>
<li><a href="http://blog.searchenginewatch.com/blog/070607-070000">Kevin Ryan Named VP &amp; Global Content Director, Search Engine Watch and Search Engine Strategies</a><a href="http://blog.searchenginewatch.com/blog/070609-121101">Happy 10th Birthday Search Engine Watch</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070614-124210">Kevin Ryan on the Future of Search Engine Strategies</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070618-080409">Update on SES Advisory Board</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070723-092615">New Look, New Awards</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070821-133745">Kevin Heisler Joins SEW as Executive Editor</a></li>
<li><a href="http://blog.searchenginewatch.com/blog/070904-091853">Interview with Kevin Ryan of SES</a></li>
</ul>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007&amp;title=Search+in+2007" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007&amp;title=Search+in+2007" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007&amp;title=Search+in+2007" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007&amp;title=Search+in+2007" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007&amp;title=Search+in+2007', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007&amp;title=Search+in+2007" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fsearch-in-2007&amp;title=Search+in+2007" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/spidering/search-in-2007/feed</wfw:commentRss>
		</item>
		<item>
		<title>What can ACAP do for me?</title>
		<link>http://blog.seo-hardcore.com/spidering/what-can-acap-do-for-me</link>
		<comments>http://blog.seo-hardcore.com/spidering/what-can-acap-do-for-me#comments</comments>
		<pubDate>Thu, 06 Dec 2007 00:28:33 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[spidering]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/uncategorized/what-can-acap-do-for-me</guid>
		<description><![CDATA[Implement ACAP
What do I need to do to use ACAP on my website?
ACAP may be already be used in a variety of different business contexts, but its main initial purpose is for communicating access and usage permissions to web crawlers (also known as &#8217;spiders&#8217; or &#8216;robots&#8217;) - the automated processes employed by search engine operators [...]]]></description>
			<content:encoded><![CDATA[<p>Implement ACAP<br />
What do I need to do to use ACAP on my website?<br />
ACAP may be already be used in a variety of different business contexts, but its main initial purpose is for communicating access and usage permissions to web crawlers (also known as &#8217;spiders&#8217; or &#8216;robots&#8217;) - the automated processes employed by search engine operators and others to &#8220;crawl&#8221; the web and harvest content for use in their services.</p>
<p>There are two stages to using ACAP on your website for communicating permissions to web crawlers. They are: <a href="http://www.the-acap.org/implement-acap.php#imp">implementation</a> and <a href="http://www.the-acap.org/implement-acap.php#ver">verification</a>.</p>
<p><a name="imp" title="imp"></a><br />
1. Implementation<br />
Implementing ACAP involves making some important changes to your website. These are of two kinds:</p>
<p>* changes to your &#8216;robots.txt&#8217; file<br />
* changes to content resources</p>
<p>The simplest way of communicating permissions information to web crawlers is through a &#8216;robots.txt&#8217; file placed on your web server. If you already have a &#8216;robots.txt&#8217; file, this will need to be modified to contain ACAP permissions in addition to the information that it already contains (remember that not all web crawlers will be able to interpret ACAP permissions, for a while at least, so you should retain the existing content of your &#8216;robots.txt&#8217; file).</p>
<p>To save time and effort in modifying existing &#8216;robots.txt&#8217; files, a tool for converting them to include the corresponding ACAP permissions is available <a href="http://www.the-acap.org/convert-robots-txt-to-acap.php">here: &gt;</a>.</p>
<p>However, to gain the full benefit of using ACAP, you will need to make further modifications to ensure that the permissions say exactly what you need them to say. How to modify your &#8216;robots.txt&#8217; file is explained in the ACAP Technical Framework documentation, which you will find <a href="http://www.the-acap.org/project_documents/ACAP-TF-CrawlerCommunications-Part1-V1.0.pdf">here: &gt;</a>.</p>
<p>Another way of communicating permissions information to web crawlers is by embedding permissions information directly in the content resources themselves. ACAP permissions can be embedded in almost any kind of web resource, and how to do so is explained in the ACAP Technical Framework documentation, which you will find <a href="http://www.the-acap.org/project_documents/ACAP-TF-CrawlerCommunications-Part2-V1.0.pdf">here: &gt;</a>.</p>
<p>Once you have implemented ACAP on your website, please add the &#8220;ACAP enabled&#8221; logo to your home page and other pages on your site. Go to: <a href="http://www.the-acap.org/add-acap-enabled.php">Add an “ACAP-enabled” logo: &gt;</a> to find out how to add the logo to your website.</p>
<p><a name="ver" title="ver"></a><br />
2. Verification<br />
Once you have implemented ACAP on your website, you will probably wish to verify the quality of your implementation. We expect there to be a number of ACAP verification services that can tell you whether your implementation conforms to the ACAP Technical Framework or not, and details will shortly be available on the ACAP website. Once your ACAP implementation has been verified, you will be entitled to add the &#8220;ACAP compliant&#8221; logo to your website.</p>
<h3>Looking for an answer</h3>
<p>Try searching through our comprehensive FAQ section</p>
<p><a href="http://www.the-acap.org/faqs.php">Frequently Asked Questions</a></p>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me&amp;title=What+can+ACAP+do+for+me%3F" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me&amp;title=What+can+ACAP+do+for+me%3F" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me&amp;title=What+can+ACAP+do+for+me%3F" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me&amp;title=What+can+ACAP+do+for+me%3F" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me&amp;title=What+can+ACAP+do+for+me%3F', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me&amp;title=What+can+ACAP+do+for+me%3F" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Fwhat-can-acap-do-for-me&amp;title=What+can+ACAP+do+for+me%3F" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/spidering/what-can-acap-do-for-me/feed</wfw:commentRss>
		</item>
		<item>
		<title>ACAP Launches, Robots.txt 2.0 For Blocking Search Engines?</title>
		<link>http://blog.seo-hardcore.com/spidering/acap-launches-robotstxt-20-for-blocking-search-engines</link>
		<comments>http://blog.seo-hardcore.com/spidering/acap-launches-robotstxt-20-for-blocking-search-engines#comments</comments>
		<pubDate>Thu, 06 Dec 2007 00:20:41 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[spidering]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/spidering/acap-launches-robotstxt-20-for-blocking-search-engines</guid>
		<description><![CDATA[After a year of discussions, ACAP &#8212; Automated Content Access Protocol &#8212; was released today as a sort of robots.txt 2.0 system for telling search engines what they can or can&#8217;t include in their listings. However, none of the major search engines support ACAP, and its future remains firmly one of &#8220;watch and see.&#8221; Below, [...]]]></description>
			<content:encoded><![CDATA[<p>After a year of discussions, ACAP &#8212; Automated Content Access Protocol &#8212; was released today as a sort of robots.txt 2.0 system for telling search engines what they can or can&#8217;t include in their listings. However, none of the major search engines support ACAP, and its future remains firmly one of &#8220;watch and see.&#8221; Below, more about the how and why of ACAP.<br />
Let&#8217;s start with some history. ACAP got going in September 2006, backed by major European newspaper and publishing groups that in particular felt Google was using content without proper permissions and wanting a more flexible means to provide this than allowed by the long-standing robots.txt and meta robots standards.</p>
<p>These two standards are found at the robotstxt.org, and ACAP has been referring to them often at &#8220;Robots Exclusion Protocol&#8221; or REP, though within the SEO world, they&#8217;re generally known by their actual names.</p>
<p>Robots.txt was born in 1994 as a way to block content on a server-wide basis; meta robots emerged in 1996 as a system to block on a page-by-page basis (see Meta Robots Tag 101: Blocking Spiders, Cached Pages &amp; More for more about it). Neither has been updated since those years ago, in terms of search engines coming together to agree on new universal standards. In short, REP has no &#8220;guardians&#8221; or group to take it forward.</p>
<p>Enter ACAP. If the search engines weren&#8217;t going to improve robots.txt, the aforementioned publishers decided they&#8217;d take on the challenge. Of course, creating a standard for search engine indexing is kind of a waste of time, if you don&#8217;t have the search engines themselves to actually support it. But ACAP didn&#8217;t let that be a deterrent. Over the past year, it has had a working group setting up a new system, with search engines Google and Ask.com, along with Exalead, taking part in the discussions. FYI, I&#8217;ve not been an active working member, but I&#8217;ve been included on the working group&#8217;s emails and chimed in from time to time with advice and thoughts.</p>
<p>The ACAP System</p>
<p>Now the new system has arrived, being unveiled at the ACAP conference in New York today. Before getting into support, let&#8217;s cover what&#8217;s in it. You&#8217;ll find an overview page for the specifications here, which leads to:</p>
<p><a href="http://www.the-acap.org/convert-robots-txt-to-acap.php" title="A robots.txt-to-ACAP conversoin tool">A robots.txt-to-ACAP conversion tool</a> (don&#8217;t worry; this should make your robots.txt file still work as a regular one and double as an ACAP file)<br />
 <br />
<a href="http://www.the-acap.org/project_documents/ACAP-TF-CrawlerCommunications-Part1-V1.0.pdf">ACAP extensions to use with robots.txt (here, PDF file)</a><br />
 <br />
<a href="http://www.the-acap.org/project_documents/ACAP-TF-CrawlerCommunications-Part2-V1.0.pdf">ACAP extensions to use with meta robots (here, PDF file)</a><br />
 <br />
<a href="http://www.the-acap.org/add-acap-enabled.php">ACAP logo</a> for those that want to show they&#8217;re using ACAP (not required to make ACAP work, but expect publishers pushing ACAP to make use of it)<br />
What does ACAP provide that robots.txt and meta robots does not? After going through the technical specs, which are pretty dense reading, I&#8217;d summarize it this way:</p>
<p>Emphasis on both granting permissions and blocking<br />
 <br />
Support for time-based inclusion or exclusion<br />
That&#8217;s it. Discussions have covered concepts such as how password-protected content could be indexed, or whether you could issue permissions on a country-by-country basis, but some of these ideas haven&#8217;t made it into the first cut.</p>
<p>AP has a nice overview article about the ACAP launch, and I found the companion piece a nice summary if you&#8217;re looking for some faster specifics. A key part:</p>
<p>Some search engines have interpreted &#8220;disallow&#8221; to mean that the site cannot be added to the index but could be fetched for use in various algorithms employed to determine how high a site appears in search results&#8230;.ACAP proposes to clarify that &#8220;disallow&#8221; refers to indexing.</p>
<p>A separate &#8220;crawl&#8221; command would be added to bar the indexing software or crawler entirely.</p>
<p>In addition, Web sites would be able to add qualifiers stipulating that the information expires from the search index on a specific date, in a given number of days or whenever the crawler returns to the site.</p>
<p>A &#8220;follow&#8221; command would permit or block the crawler from following links within a page.</p>
<p>&#8220;Preserve,&#8221; with similar time limits available for &#8220;index,&#8221; would stipulate whether a copy may be stored in a search engine&#8217;s cache.</p>
<p>&#8220;Present&#8221; would govern a search engine&#8217;s ability to display the copy, and a site may limit that further — for example, to a snippet or to a miniaturized version, or thumbnail.</p>
<p>As I said, there&#8217;s an emphasis on granting permission. By default, search engines assume everything is open to indexing. ACAP changes this assumption, asking those that create the files to explicitly indicate yes or no.</p>
<p>Should You Use It?</p>
<p>So now we have a new standard for expressing search engine permissions. Do site owners need to run out and immediately use it?</p>
<p>No. Not immediately. Not even long term.</p>
<p>Right now, none of the major search engines are supporting ACAP. If you were to use ACAP without ensuring that standard robots.txt or meta robots commands were also included, you&#8217;d fail to properly block search engines. Only Exalead, which is not a major multi-country service, would currently act upon your ACAP-only commands.</p>
<p>Even if ACAP were to magically get endorsed and supported by all the major search engines, robots.txt and meta robots support wouldn&#8217;t go away for many years. There are simply too many sites that use those systems, have used them for over a decade, and would fail to upgrade. Those two systems will continue to be supported in the same way Microsoft has had to support DOS programs despite the growth of Windows.</p>
<p>So why bother at all? Probably two reasons:</p>
<p>You want to personally test out how ACAP works, playing with the permissions and seeing what happens in Exalead<br />
 <br />
You want to support the ACAP system and hope that if enough people use it, perhaps the search engines will adopt it. FYI, ACAP is urging (PDF file) &#8220;universal adoption&#8221; by publishers by the end of next year.<br />
Search Engine Support</p>
<p>What&#8217;s up with the major services? I emailed the big three, Google, Microsoft, and Yahoo, all of whom either took part in the working group or are at today&#8217;s conference. Google&#8217;s canned answer:</p>
<p>We are interested in all initiatives that allow web publishers and search engines to work more closely together. We have undertaken many efforts in this direction over the years including supporting file-extension and wildcard specifications in robots.txt, SiteMaps, our Webmaster Console, extending per-item indexing specification to non-html documents and specifying how long a url would be available. We will examine ACAP proposals when they become available. As a broad-based search engine, we need to keep in mind the needs of millions of web publishers worldwide.</p>
<p>As it happens, I was at Microsoft yesterday, and while I haven&#8217;t gotten a formal statement to post, the sentiment was the same as Google. Microsoft is interesting in supporting publishers, is continuing to grow its own tools and will also watch ACAP, wanting to support publishers in general</p>
<p>Yahoo&#8217;s not sent a statement back yet, but when it arrives, you can expect it will be pretty much the same as Google and Microsoft.</p>
<p>Why not just jump into ACAP? Between the lines time here &#8212; no one really wants to hand over control of the standard to the ACAP group, especially in my view when it has been born out of some anti-search engine hype.</p>
<p>So why not jump behind improving robots.txt and meta robots? Another issue here is that no one is officially in charge of those standards. The search engines are sort of the gatekeepers, because it&#8217;s what they decide to support that effectively becomes &#8220;law.&#8221; If they don&#8217;t support a particular exclusion command, it might as well not exist.</p>
<p>The various search engines tell me they have been talking more about making some collective improvements. Individually, they&#8217;ve already added to both robots.txt and meta robots over the years, extensions that may work with their particular search engines. Perhaps they will become more unified.</p>
<p>In particular, they&#8217;ve united around the sitemaps standard. That sort of picks up what ACAP does in terms of being a system to provide express permission of indexing, and it&#8217;s where I&#8217;d expect any search engine-driven, collective agreement about improved blocking tools to emerge.</p>
<p>Also be sure to read Up Close &amp; Personal With Robots.txt, which summarizes the second robots.txt summit that I organized earlier this year. The article covers a lot of things that general site owners and SEOs have wished for, along with some search engine responses.</p>
<p>Conclusion</p>
<p>So has the entire ACAP project been a waste of time, or as Andy Beal&#8217;s great headline put it when ACAP was announced last year, Publishers to Spend Half Million Dollars on a Robots.txt File? That still makes me laugh.</p>
<p>No, I&#8217;d say not. I think it&#8217;s been very useful that some group has diligently and carefully tried to explore the issues, and having ACAP lurking at the very least gives the search engines themselves a kick in the butt to work on better standards. Plus, ACAP provides some groundwork they may want to use. Personally, I doubt ACAP will become Robots.txt 2.0 &#8212; but I suspect elements of ACAP will flow into that new version or a successor.</p>
<p>originally written by Danny Sullivan on Search Engine Land</p>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines&amp;title=ACAP+Launches%2C+Robots.txt+2.0+For+Blocking+Search+Engines%3F" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines&amp;title=ACAP+Launches%2C+Robots.txt+2.0+For+Blocking+Search+Engines%3F" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines&amp;title=ACAP+Launches%2C+Robots.txt+2.0+For+Blocking+Search+Engines%3F" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines&amp;title=ACAP+Launches%2C+Robots.txt+2.0+For+Blocking+Search+Engines%3F" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines&amp;title=ACAP+Launches%2C+Robots.txt+2.0+For+Blocking+Search+Engines%3F', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines&amp;title=ACAP+Launches%2C+Robots.txt+2.0+For+Blocking+Search+Engines%3F" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fspidering%2Facap-launches-robotstxt-20-for-blocking-search-engines&amp;title=ACAP+Launches%2C+Robots.txt+2.0+For+Blocking+Search+Engines%3F" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/spidering/acap-launches-robotstxt-20-for-blocking-search-engines/feed</wfw:commentRss>
		</item>
		<item>
		<title>Example uses of the visitor engagement metric</title>
		<link>http://blog.seo-hardcore.com/analytics/example-uses-of-the-visitor-engagement-metric</link>
		<comments>http://blog.seo-hardcore.com/analytics/example-uses-of-the-visitor-engagement-metric#comments</comments>
		<pubDate>Wed, 21 Nov 2007 13:44:41 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[analytics]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/analytics/example-uses-of-the-visitor-engagement-metric</guid>
		<description><![CDATA[Delivered by RSS from webanalyticsdemystified.com
My post last week on measuring visitor engagement was pretty long by the time I outlined the calculation, so I put off publishing examples of how the metric could be used until now. I’m excited to see that this topic has generated so much interest, both in terms of comments and [...]]]></description>
			<content:encoded><![CDATA[<p>Delivered by RSS from webanalyticsdemystified.com</p>
<p>My <strong><a target="_blank" href="http://blog.webanalyticsdemystified.com/weblog/2007/10/how-to-measure-visitor-engagement-redux.html">post last week on measuring visitor engagement</a></strong> was pretty long by the time I outlined the calculation, so I put off publishing examples of how the metric could be used until now. I’m excited to see that this topic <strong><a target="_blank" href="http://blog.webanalyticsdemystified.com/weblog/2007/10/nick-arnett-challenges-my-visitor-engagement-calculation.html">has generated so much interest,</a></strong> both in terms of comments and emails sent to me directly.</p>
<p>My goal for this post is to provide a few examples and explanations to show how the metric can be used to supplement our otherwise already-rich set of web analytics data. Since so many folks have been willing to explore the engagement metric, <em>I have embedded a bunch of questions in this post in italics</em> that I’d love your feedback on.</p>
<p><img border="1" src="http://blog.seo-hardcore.com/wp-content/plugins/wp-o-matic/cache/6c2c1_engagement_segments.gif" /></p>
<p><strong>Distribution of engagement scores and segmentation.</strong> Here is the distribution of engagement scores for about six months at Web Analytics Demystified by percent of visitors. As you can see, these scores are left-skewed and tail off as the score increases, showing that nearly half (47.6%) of visitors to my site are “poorly engaged”. <em>When I look at this distribution it makes perfect sense to me — what do you think?</em></p>
<p>I have created segments to group visitors by their engagement score: “Well engaged” visitors have engagement scores over 30%, “moderately engaged” visitors are those between 10% and 30%, and”poorly engaged” visitors score less than 10%. These segments can then be used to explore how the behavior of visitors in each engagement group differs by looking at my page and referring source dimensions (page, content group, referring domain, campaign, search phrase, etc.)</p>
<p><img border="1" src="http://blog.seo-hardcore.com/wp-content/plugins/wp-o-matic/cache/6c2c1_search_phrases_and_engageme.gif" /></p>
<p><strong>Identify relationships that might otherwise not be found.</strong> At the top of this report you can see the pronounced difference in visitor engagement (and traditional metrics) for “branded” and unbranded searches (”None”) bringing visitors to my site. Now, because <strong><a target="_blank" href="http://blog.webanalyticsdemystified.com/weblog/2007/10/how-to-measure-visitor-engagement-redux.html">branded searches are a component of the calculation (Brand Index),</a></strong> you definitely expect to see a difference between the two engagement scores. What is interesting is that while other metrics (duration, sessions per visitor, page views per session) show a slight difference, visitor engagement and conversion are all three times higher for branded searches. <em>I think this difference observed in all the metrics is further evidence that brand-driven searches are bringing more engaged visitors — what do you think?</em></p>
<p>In the middle table you can see search phrases bringing visitor to my site, showing visitor engagement, page views per session, and sessions per visitor. Here three phrases stand out to me:</p>
<ol>
<li><a target="_blank" href="http://www.google.com/search?source=ig&amp;hl=en&amp;rlz=&amp;q=web+analytics+book&amp;btnG=Google+Search">“web analytics book”</a> and <a target="_blank" href="http://www.google.com/search?hl=en&amp;q=web+analytics+process&amp;btnG=Search">“web analytics process”</a>, neither of which are particularly distinguished from other search phrases based on page views per session or sessions per visitor but both of which have visitor engagement scores over double my site-wide average of 8.8%. This is important to me because these are un-branded search terms that are critically important to my business.</li>
<li><a target="_blank" href="http://www.google.com/search?hl=en&amp;q=vendor+discovery+tool&amp;btnG=Search">“vendor discovery tool”</a> which would appear to be pretty important based on traditional metrics but only stands out slightly using the visitor engagement score (at 13.6%) I spend a lot of time trying to figure out how to drive folks using the vendor discovery tool to take other actions (buy books, inquire about consulting) and this data suggests that there is an unrealized opportunity.</li>
<li><a target="_blank" href="http://www.google.com/search?hl=en&amp;q=performance+indicators&amp;btnG=Search">“performance indicators”</a> which shows that the visitor engagement metric is useful to identify terms that you’d think are important to the site but aren’t attracting the right audience (average engagement score for these visitors is only 5.6%)</li>
</ol>
<p><em>I think this level of information is actually pretty helpful for identifying search marketing opportunities — what do you think?</em></p>
<p><img border="1" src="http://blog.seo-hardcore.com/wp-content/plugins/wp-o-matic/cache/6c2c1_engagement_vs_conversion.gif" /></p>
<p><strong>Engagement-derivative metrics like “Percent Highly Engaged Visitors” are useful.</strong> Here you can see a select group of referring domains showing the percent of highly and percent moderately engaged visitors they’re sending my way (with conversion to show that engagement and conversion are in fact different!) Avinash Kaushik is sending me a few (0.2%) highly engaged visitors (thanks!) but Ian Thomas is sending me a bunch (70.4%) of moderately engaged visitors, many of whom are purchasing books (1.2% conversion rate.)</p>
<p>By looking at traffic from Avinash’s site over time (bar graph) I can see peaks and valleys in overall engagement from folks coming from his site, which would be useful to back into those peaks to try and determine what other blogger’s readers might be reacting to when they’re exhibiting highly-engaged behavior on my site (see late August and early September.) Given that <a target="_blank" href="http://blog.instantcognition.com/research/2007/02/17/eric-peterson-poll-results/">Clint proved that conversion is a poor measure of success when trying to evaluate traffic from other bloggers,</a> <em>I think visitor engagement is useful for examining the non-revenue value of referring sources — what do you think?</em></p>
<p>Those of you who are looking for correlation between engagement and conversion, have a look at the data for Mr. Jim Sterne’s wonderful site <strong><a target="_blank" href="http://www.emetrics.org">emetrics.org</a></strong> —Â  5.6% of the folks coming from Jim’s site are highly engaged, 66.2% moderately engaged, and man-oh-man does Jim help <strong><a target="_blank" href="http://www.webanalyticsdemystified.com/web-analytics-books.asp">sell some copies of Web Analytics Demystified.</a></strong>Â  You’re the man, Jim!</p>
<p><img border="1" src="http://blog.seo-hardcore.com/wp-content/plugins/wp-o-matic/cache/6c2c1_engagement_is_global.gif" /></p>
<p><strong>Visitor engagement is globally useful.</strong> At least in Visual Sciences Visual Site you can apply engagement metrics and segments to pretty much any dimension tracked. Here I’m looking at the percentage of “highly” engaged visitors (50% or more) in my “well engaged” segment broken down by country. Now, this is certainly more interesting in light of the total volume of traffic coming from each geographic location, and as I think about localizing my books and planning <a target="_blank" href="http://blog.webanalyticsdemystified.com/weblog/2007/09/world-tour-day-two-stockolm-sweden.html">future trips around the world</a> this information becomes very helpful.</p>
<p>There is more, including some of the more granular visitor-level stuff I talked about in the first series of posts on the subject, but I want to be sensitive to protecting the identity of individual users on my site. If you’re interested in helping me collect some “ground truth” regarding the engagement calculation, <strong><a href="mailto:eric@webanalyticsdemystified.com">write me and I’ll explain how you can help.</a></strong></p>
<p>So what do you think? Do the screen-shots help you understand the calculation better? Or do they still make it look super-complicated and scary? Is there something specific you’d like to see me demonstrate with the calculation? Or do you think you could come up with these same insights using more traditional metrics?</p>
<hr noShade="true" />© 2007 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</a><strong>Looking for a new job in web analytics?</strong> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</a><img width="1" src="http://blog.seo-hardcore.com/wp-content/plugins/wp-o-matic/cache/8e34b_176726511" height="1" /></p>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric&amp;title=Example+uses+of+the+visitor+engagement+metric" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric&amp;title=Example+uses+of+the+visitor+engagement+metric" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric&amp;title=Example+uses+of+the+visitor+engagement+metric" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric&amp;title=Example+uses+of+the+visitor+engagement+metric" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric&amp;title=Example+uses+of+the+visitor+engagement+metric', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric&amp;title=Example+uses+of+the+visitor+engagement+metric" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fexample-uses-of-the-visitor-engagement-metric&amp;title=Example+uses+of+the+visitor+engagement+metric" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/analytics/example-uses-of-the-visitor-engagement-metric/feed</wfw:commentRss>
		</item>
		<item>
		<title>My thoughts about Omniture and WebTrends</title>
		<link>http://blog.seo-hardcore.com/analytics/my-thoughts-about-omniture-and-webtrends</link>
		<comments>http://blog.seo-hardcore.com/analytics/my-thoughts-about-omniture-and-webtrends#comments</comments>
		<pubDate>Wed, 21 Nov 2007 13:44:10 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[analytics]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/analytics/my-thoughts-about-omniture-and-webtrends</guid>
		<description><![CDATA[Delivered by RSS from webanalyticsdemystified.com
A number of you have commented that I have been oddly quiet on the subject of Omniture planning to acquire Visual Sciences and then the news that four senior-most managers at WebTrends were let go. It’s not that I don’t have an opinion — I can assure you that I do [...]]]></description>
			<content:encoded><![CDATA[<p>Delivered by RSS from webanalyticsdemystified.com</p>
<p>A number of you have commented that I have been oddly quiet on the subject of Omniture planning to acquire Visual Sciences and then the news that four senior-most managers at WebTrends were let go. It’s not that I don’t have an opinion — I can assure you that I do — but I wanted to take a little time to clarify my thoughts on these subjects before blogging about it.</p>
<p>On the Omniture/Visual Sciences deal, I sincerely do congratulate Josh James and the entire team at Omniture on building a company capable of completely taking out their biggest competitor. Over the years I have found myself having a somewhat topsy-turvy relationship with Mr. James and his organization: First I had to compete with them while at WebSideStory, winning some deals and losing others. Then I worked directly with them while at JupiterResearch, spending time both in their offices and also on their behalf through online seminars and client events. Finally I spent a little over a year competing with them again, this time at Visual Sciences, again winning some deals and losing others.</p>
<p>Regardless of where I worked, it was impossible to not develop a healthy respect for Omniture and their success. It pained me to watch deals like HP, AOL, CBS Sportsline, USAtoday, Overstock.com and others go their way, despite hard work from a talented group of individuals, and I absolutely hated going up against their particular version of salesmanship. But as an analyst it was encouraging to see Josh and John Pestana build a company that understood the underlying technology but also how that technology could make their customers more successful.</p>
<p>Their customers responded to this, and still do. It is not uncommon to meet Omniture customers who have “drank the kool aid” and for whom their customer status is very much a badge of honor. Hopefully Mr. James <em>et al.</em> will deliver the same Omniture experience for the 1,500-odd companies they’re purchasing from Visual Sciences/WebSideStory, because that is where I see the inherent risk in this deal.</p>
<p>All week last week people with <em>a lot</em> of money under their management asked me “what is the upside and what is the risk in this acquisition?” I’m not a financial analyst (<u>disclosure</u>: I don’t have any holdings in OMTR or VSCN) so all I could comment on was what I hoped the combined company would do and not do. And while nobody from Omniture has asked me — not that it would be particularly appropriate anyway — here are a few thoughts on what the new company needs to do to make this acquisition successful:</p>
<ol>
<li><strong>Suck it up and start the migration from HBX to SiteCatalyst immediately.</strong> I haven’t read all of the various transcripts on this deal, but nobody I am talking to expects HBX to survive the balance of 2008 if the deal is approved (which I sincerely believe it will be.) Omniture should smooth the transition path by splitting data collection at the gateways <em>now</em> and simultaneously loading whatever HBX-collected data into the SiteCatalyst data collectors, thusly giving HBX customers the easiest possible transition from one technology to the other. And while if I were on HBX I would be aggressively thinking about migrating to the SiteCatalyst code base, <a target="_blank" href="http://semphonic.blogs.com/semangel/2007/11/transitioning-f.html">this transition is far from a slam-dunk at the customer-level.</a> Splitting the data will give marketing something to show I.T. if they complain about needing to replace the JavaScript (again), and getting started on data collection now will potentially ease some of the pain associated with not being able to migrate years of HBX data that some customers might not want to lose (if that is the final assessment.)</li>
<li><strong>Admit that Visual Workstation is the right interface for serious analysts.</strong> Again, I have not read the transcripts, but comments I have read are unclear about whether Discover 2 or Visual Workstation will live past the acquisition point. And while I have spent much time looking at Discover 2, I can assure you that of the two, Visual Workstation is the technology to keep (<u>disclosure</u>: I recently entered into a licensing agreement with Visual Sciences to use Visual Workstation at Web Analytics Demystified.) No disrespect to Omniture’s fine product team, but Visual Workstation is unparalleled for sheer analyst-class power, and I’m fairly sure that without modification Visual Workstaion can leverage whatever format Omniture stores visitor-level data to get up and running quickly. This may cause problems from a pure SaaS-perspective, and I could be wrong, but I suspect that most analysts wouldn’t actually mind having to run the software locally in exchange for having the robust data manipulation capabilities that Workstation provides.</li>
<li><strong>As painful as it will be, resolve the internal stuff quickly.</strong> A huge potential pitfall in this deal is that it has tremendous potential to create confusion regarding who is managing what, when, where, and how all of these technologies are presented in sales and support situations. Not that this will be easy, any M&amp;A transaction has the potential to be messy, but Josh and Jim MacIntyre won’t be doing anyone favors by sugarcoating what this deal is or being vague about who might be reassigned and who might be let go (keep in mind that these companies that were <em>bitter enemies </em>in the marketplace up until two weeks ago.) Any internal confusion about the transition will inevitably impact customers in the form of unclear deadlines, changing account managers, and other miscommunication that will only open the door for other vendors …</li>
</ol>
<p>Which brings me to the other change in the web analytics market last week: <strong><a target="_blank" href="http://www.clickz.com/3627489">WebTrends announcing</a></strong> that Greg Drew, Jason Palmer, Tore Steen, and Hamid Bahadori had all been asked to leave the company. I have to admit, I was more-or-less shocked by this announcement, specially given that <a target="_blank" href="http://www.webtrends.com/AboutWebTrends/NewsRoom/NewsRoomArchive/2007/WebTrendsUnveilsthePowertoMeasureCustomerEngagementwithLaunchofWebTrendsMarketingLab2.aspx">I have been saying to folks since mid-July</a> that I believe, at least from a software perspective, that WebTrends is finally getting back on track. I really do believe that WebTrends Score is one of the few true innovations we’ve seen in the web analytics marketplace recently, and learned WebTrends users far and wide have commented that <a target="_blank" href="http://www.thebigintegration.com/blog/?p=34">they really like the stuff in the MarketingLab2 release.</a></p>
<p>I should also say that I personally really like Greg Drew and Jason Palmer. Now, I say that not having worked with or for them in any role other than that of an industry analyst, and anecdotally some of the recent flight from the company can be tied back to their leadership. But Greg has always struck me as one of the nicest guys in the entire industry and someone who was willing to do what it took to get the job done.</p>
<p><strong>Regardless of Greg’s personal disposition,</strong> I again find myself nearly flabbergasted that the folks at Francisco Partners who are calling the shots would give up Greg and Jason’s experience in the field and knowledge about web analytics in general. I mean, it’s not like Eli Shapira is going to come back and run the company, or that it will be easy to find someone else to run the ship as experienced with web analytics as Josh James from Omniture, Joe Davis at Coremetrics, or Dennis Mortensen at IndexTools. Especially on the heels of the Omniture/Visual Sciences announcement, this whole thing sounds so fishy it’s almost unbelievable, but I have to believe that these four guys will be harder to replace than people think.</p>
<p>Case-in-point: when Jeff Lunsford showed up at WebSideStory sans web analytics experience, some of us were worried. But Jeff was a natural born-leader, and given time it was clear that Jeff had what it took to get the job done. Unfortunately, in retrospect, it is no longer clear exactly what that job was aside from making a small number of people a huge sum of money, and Jeff has moved on to even bigger deals. I liked working for Jeff tremendously, but I’m not 100 percent sure he left WebSideStory in better shape than he found it.</p>
<p>I’ll admit, I don’t have the experience that these guys have … I’ve been running a company of two people for seven months. But just as I felt like WebTrends was well positioned (along with Coremetrics) to be a strong solution with a great customer base, a good set of features, that was incidentally “not Omniture”, I now find myself questioning how strong the organization will really be when run by folks largely new to web analytics. No disrespect to Tim, John, Leo or Bruce, but <strong>web analytics is hard,</strong> the competition is big and about to get bigger, and sophisticated web analytics buyers will easily differentiate between <em>passion </em>and <em>experience</em>.</p>
<p><strong>Trust me, I want to be wrong about this.</strong> I would like nothing more than to have someone clarify what happened at WebTrends and detail how the company is going to accelerate growth against Omniture given their recent momentum. I think despite <strong><a target="_blank" href="http://www.webanalyticsdemystified.com/vendor_discovery_tool.asp?vendor=Omniture&amp;compare=TRUE&amp;domain_table_name=fortune_1000&amp;fortune_industry=%25">Omniture’s strength</a></strong> and <strong><a target="_blank" href="http://www.webanalyticsdemystified.com/vendor_discovery_tool.asp?vendor=Google+Analytics&amp;compare=TRUE&amp;domain_table_name=fortune_1000&amp;fortune_industry=%25">Google Analytics widespread deployment</a></strong> that the “web analytics wars” are far from over. Like others, I worry that a two horse race isn’t very exciting to watch, and despite believing that “it’s not the technology, it’s how you use it” that it’s nice to see innovation from time to time. For this to happen, I believe we need a strong WebTrends, a strong Coremetrics, and at least a small handful of smaller innovators out there in the world (Nedstat, IndexTools, Clicktracks, etc.) nipping at everyone else’s heels.</p>
<p><strong>What do you think? Am I crazy?</strong> Am I just missing the most obvious thing? Am I too close to the situation, having worked with or for all of the companies involved in the past few weeks insanity? Or do you share some of the same concerns I do? Either way, I’d love to hear what you have to say.</p>
<hr noShade="true" />© 2007 Web Analytics Demystified | <a href="http://www.webanalyticsdemystified.com">www.webanalyticsdemystified.com</a><strong>Looking for a new job in web analytics?</strong> Check out the <a href="http://www.webanalyticsdemystified.com/job_list.asp">Web Analytics Demystified Job Board!</a><img width="1" src="http://blog.seo-hardcore.com/wp-content/plugins/wp-o-matic/cache/28676_180197795" height="1" /></p>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends&amp;title=My+thoughts+about+Omniture+and+WebTrends" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends&amp;title=My+thoughts+about+Omniture+and+WebTrends" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends&amp;title=My+thoughts+about+Omniture+and+WebTrends" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends&amp;title=My+thoughts+about+Omniture+and+WebTrends" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends&amp;title=My+thoughts+about+Omniture+and+WebTrends', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends&amp;title=My+thoughts+about+Omniture+and+WebTrends" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fanalytics%2Fmy-thoughts-about-omniture-and-webtrends&amp;title=My+thoughts+about+Omniture+and+WebTrends" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/analytics/my-thoughts-about-omniture-and-webtrends/feed</wfw:commentRss>
		</item>
		<item>
		<title>SharePoint SEO Fundamentals</title>
		<link>http://blog.seo-hardcore.com/moss/sharepoint-seo-fundamentals</link>
		<comments>http://blog.seo-hardcore.com/moss/sharepoint-seo-fundamentals#comments</comments>
		<pubDate>Wed, 21 Nov 2007 02:05:52 +0000</pubDate>
		<dc:creator>john</dc:creator>
		
		<category><![CDATA[MOSS]]></category>

		<guid isPermaLink="false">http://blog.seo-hardcore.com/moss/sharepoint-seo-fundamentals</guid>
		<description><![CDATA[Delivered by RSS from thesug.org
Body:

Many things factor into overall ranking of a web site. Some of the more obvious ones include: site availability, amount of content, keyword density, etc… But what about the things that harm your sites rankings which you may be doing and should NOT be? This list would include: keyword stuffing, trying [...]]]></description>
			<content:encoded><![CDATA[<p>Delivered by RSS from thesug.org</p>
<div><b>Body:</b>
<div>
<div><strong>Many things factor into overall ranking of a web site. </strong>Some of the more obvious ones include: site availability, amount of content, keyword density, etc… But what about the things that harm your sites rankings which you may be doing and should NOT be? This list would include: keyword stuffing, trying to “fool” the search engines with doorway pages, or hidden text. Are you using the exact same html title on every page of your site? Are you submitting your site to multiple search engines more than once – or worse yet; allowing some online “SEO” company to submit your site to multiple search engines on your behalf? All the aforementioned things are very common mistakes that may negatively affect your web sites rankings. </div>
<div> </div>
<div>These are not “trade secrets” in fact the major players all have information available online which explains how to improve your web sites rankings. Some less obvious things which could potentially harm rankings include: domain forwarding, content syndication, domain registration term, all those super-cool graphics and page layouts that require mind twisting table nesting or worse, complex URL’s (query strings, for example: <a href="http://www.mossseo.com?querystring=1&amp;anotherquerystring=2&amp;soOn=50">www.mossseo.com?querystring=1&amp;anotherquerystring=2&amp;soOn=50</a>). </div>
<div> </div>
<div>You can learn a lot about what not to do by reading what the search engine folks themselves advise you to do and not do. The most important thing to remember is do NOT try to fool the search engines. Google has <a href="https://www.google.com/webmasters/tools/docs/en/about.html">free advise and tools </a>which you should spend some time reviewing.</div>
</div>
</div>
<div><b>Category:</b> SEO for Beginners</div>
<div><b>Published:</b> 10/26/2007 9:12 PM</div>

<span class="slashdigglicious">
<a href="http://slashdot.org/bookmark.pl?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals&amp;title=SharePoint+SEO+Fundamentals" title="Slashdot It!"><img src="http://slashdot.org/favicon.ico" height="16" width="16" alt="[Slashdot]" /></a>
<a href="http://digg.com/submit?phase=2&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals&amp;title=SharePoint+SEO+Fundamentals" title="Digg This Story"><img src="http://digg.com/favicon.ico" width="16" height="16" alt="[Digg]" /></a>
<a href="http://reddit.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals&amp;title=SharePoint+SEO+Fundamentals" title="Reddit"><img src="http://reddit.com/favicon.ico" width="16" height="16" alt="[Reddit]" /></a>
<a href="http://del.icio.us/post?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals&amp;title=SharePoint+SEO+Fundamentals" title="Save to del.icio.us" onclick="window.open('http://del.icio.us/post?v=4&amp;noui&amp;jump=close&amp;url=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals&amp;title=SharePoint+SEO+Fundamentals', 'delicious', 'toolbar=no,width=700,height=400'); return false;"><img src="http://del.icio.us/favicon.ico" width="16" height="16" alt="[del.icio.us]" /></a>
<a href="http://www.facebook.com/share.php?u=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals" title="Share on Facebook"><img src="http://www.facebook.com/favicon.ico" width="16" height="16" alt="[Facebook]" /></a>
<a href="http://technorati.com/faves?add=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals" title="Add to my Technorati Favorites"><img src="http://technorati.com/favicon.ico" width="16" height="16" alt="[Technorati]" /></a>
<a href="http://www.google.com/bookmarks/mark?op=edit&amp;output=popup&amp;bkmk=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals&amp;title=SharePoint+SEO+Fundamentals" title="Save to Google Bookmarks"><img src="http://www.google.com/favicon.ico" width="16" height="16" alt="[Google]" /></a>
<a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fblog.seo-hardcore.com%2Fmoss%2Fsharepoint-seo-fundamentals&amp;title=SharePoint+SEO+Fundamentals" title="Stumble it!"><img src="http://www.stumbleupon.com/favicon.ico" width="16" height="16" alt="[StumbleUpon]" /></a>
</span>]]></content:encoded>
			<wfw:commentRss>http://blog.seo-hardcore.com/moss/sharepoint-seo-fundamentals/feed</wfw:commentRss>
		</item>
	</channel>
</rss>
