<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Drop statistics pages</title>
	<atom:link href="http://spamhuntress.com/2005/03/13/drop-statistics-pages/feed/" rel="self" type="application/rss+xml" />
	<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/</link>
	<description>writes on spam and admin issues</description>
	<pubDate>Tue, 06 Jan 2009 18:44:31 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Mike Boone</title>
		<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-89</link>
		<dc:creator>Mike Boone</dc:creator>
		<pubDate>Sun, 13 Mar 2005 19:57:07 +0000</pubDate>
		<guid isPermaLink="false">http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-89</guid>
		<description>If you wanted to block robots from an entire forum site, you could add the robots meta tag to the PHPBB templates/subSilver/overall_header.tpl file. If you only want to do it for certain pages of a forum, like the member list or user profile pages, you would have to try a method like I described in the blog.
</description>
		<content:encoded><![CDATA[<p>If you wanted to block robots from an entire forum site, you could add the robots meta tag to the PHPBB templates/subSilver/overall_header.tpl file. If you only want to do it for certain pages of a forum, like the member list or user profile pages, you would have to try a method like I described in the blog.</p>
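<p>A minimal sketch of the per-page idea, not the method from the linked blog post: it only picks the robots directive by script name. The script names memberlist.php and profile.php and the helper robots_content are assumptions for illustration; how the value is wired into overall_header.tpl is left out.</p>

```python
import posixpath
from urllib.parse import urlsplit

# Hypothetical set of scripts to keep out of search indexes.
# The file names are assumptions; adjust them to the forum's real scripts.
NOINDEX_SCRIPTS = {"memberlist.php", "profile.php"}


def robots_content(url):
    """Return the value for the content attribute of a robots meta tag."""
    # Look only at the script name, ignoring the directory and query string.
    script = posixpath.basename(urlsplit(url).path)
    if script in NOINDEX_SCRIPTS:
        return "noindex, nofollow"
    return "index, follow"
```

<p>The returned string would go into the content attribute of the robots meta tag in the page header, so only the listed pages are hidden from robots.</p>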
]]></content:encoded>
	</item>
	<item>
		<title>By: Administrator</title>
		<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-88</link>
		<dc:creator>Administrator</dc:creator>
		<pubDate>Sun, 13 Mar 2005 19:15:40 +0000</pubDate>
		<guid isPermaLink="false">http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-88</guid>
		<description>Sooo, if I hack the templates, I could add those nofollow tags into the links I need to remove from Google et al? Yeah, I'll look into that!</description>
		<content:encoded><![CDATA[<p>Sooo, if I hack the templates, I could add those nofollow tags into the links I need to remove from Google et al? Yeah, I&#8217;ll look into that!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Boone</title>
		<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-87</link>
		<dc:creator>Mike Boone</dc:creator>
		<pubDate>Sun, 13 Mar 2005 18:50:54 +0000</pubDate>
		<guid isPermaLink="false">http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-87</guid>
		<description>Here's how I did it for PHPBB, but it only uses meta 'robots' tags.

http://boonedocks.net/mike/index.php?/archives/70-PHPBB-Member-List-Link-Spam-Part-2.html

I'm sure you could set up .htaccess controls like you're suggesting with some sort of regexes.
</description>
		<content:encoded><![CDATA[<p>Here&#8217;s how I did it for PHPBB, but it only uses meta &#8216;robots&#8217; tags.</p>
<p><a href="http://boonedocks.net/mike/index.php?/archives/70-PHPBB-Member-List-Link-Spam-Part-2.html" rel="nofollow">http://boonedocks.net/mike/index.php?/archives/70-PHPBB-Member-List-Link-Spam-Part-2.html</a></p>
<p>I&#8217;m sure you could set up .htaccess controls like you&#8217;re suggesting with some sort of regexes.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Administrator</title>
		<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-86</link>
		<dc:creator>Administrator</dc:creator>
		<pubDate>Sun, 13 Mar 2005 17:56:15 +0000</pubDate>
		<guid isPermaLink="false">http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-86</guid>
		<description>That's the easy part.

I'm thinking of more complex things, like specific URLs.

viewtopic.php?p=
but not
viewtopic.php?t=10

See what I mean?</description>
		<content:encoded><![CDATA[<p>That&#8217;s the easy part.</p>
<p>I&#8217;m thinking of more complex things, like specific URLs.</p>
<p>viewtopic.php?p=<br />
but not<br />
viewtopic.php?t=10</p>
<p>See what I mean?</p>
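<p>The distinction above can be sketched as a small check, assuming the rule is "single-post links carry a p parameter, topic links carry a t parameter". The function name should_block is hypothetical, and this is only the matching logic, not an .htaccess rule.</p>

```python
from urllib.parse import parse_qs, urlsplit


def should_block(url):
    """Return True for viewtopic.php?p=... links, False for viewtopic.php?t=... links."""
    parts = urlsplit(url)
    # Only the viewtopic script is of interest here.
    if not parts.path.endswith("viewtopic.php"):
        return False
    # keep_blank_values so that a bare "p=" still counts as a p parameter.
    params = parse_qs(parts.query, keep_blank_values=True)
    return "p" in params and "t" not in params
```

<p>The same test could then be translated into whatever matching mechanism the server offers.</p>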
]]></content:encoded>
	</item>
	<item>
		<title>By: Arve</title>
		<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-85</link>
		<dc:creator>Arve</dc:creator>
		<pubDate>Sun, 13 Mar 2005 17:37:37 +0000</pubDate>
		<guid isPermaLink="false">http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-85</guid>
		<description>See http://www.searchengineworld.com/robots/robots_tutorial.htm for a tutorial on how to instruct robots not to index or follow links to certain pages.</description>
		<content:encoded><![CDATA[<p>See <a href="http://www.searchengineworld.com/robots/robots_tutorial.htm" rel="nofollow">http://www.searchengineworld.com/robots/robots_tutorial.htm</a> for a tutorial on how to instruct robots not to index or follow links to certain pages.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Administrator</title>
		<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-84</link>
		<dc:creator>Administrator</dc:creator>
		<pubDate>Sun, 13 Mar 2005 16:31:49 +0000</pubDate>
		<guid isPermaLink="false">http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-84</guid>
		<description>Could you do a tutorial on your site about how you disallow certain pages or sections in an .htaccess file?

I mean, what especially irritates me is when Googlebot starts following every little link on forums. Say for instance those "last post" links on forums. And the quote links and other stuff. Googlebot gets SO lost, it's not even funny! In my logs, the links are even truncated because of the session IDs in the URLs on IPB.</description>
		<content:encoded><![CDATA[<p>Could you do a tutorial on your site about how you disallow certain pages or sections in an .htaccess file?</p>
<p>I mean, what especially irritates me is when Googlebot starts following every little link on forums. Say for instance those &#8220;last post&#8221; links on forums. And the quote links and other stuff. Googlebot gets SO lost, it&#8217;s not even funny! In my logs, the links are even truncated because of the session IDs in the URLs on IPB.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Boone</title>
		<link>http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-83</link>
		<dc:creator>Mike Boone</dc:creator>
		<pubDate>Sun, 13 Mar 2005 15:31:48 +0000</pubDate>
		<guid isPermaLink="false">http://spamhuntress.com/2005/03/13/drop-statistics-pages/#comment-83</guid>
		<description>I think the stats page authors should make their pages non-indexed by default. It's so easy to add in the proper meta tags.

I ran into a similar situation with a PHPBB bulletin board I manage. I get spam user account signups where the bogus user leaves a link to their spam website. Some spammers will even create these accounts manually (I have a basic bot blocker). My solution was to set the robots meta tag to noindex, nofollow on the pages that list the members and their accounts. Unfortunately, that will probably not stop the tactic unless the authors make that the default.

Having the stats page authors change their indexing settings is a long-term solution, but a good one I think.
</description>
		<content:encoded><![CDATA[<p>I think the stats page authors should make their pages non-indexed by default. It&#8217;s so easy to add in the proper meta tags.</p>
<p>I ran into a similar situation with a PHPBB bulletin board I manage. I get spam user account signups where the bogus user leaves a link to their spam website. Some spammers will even create these accounts manually (I have a basic bot blocker). My solution was to set the robots meta tag to noindex, nofollow on the pages that list the members and their accounts. Unfortunately, that will probably not stop the tactic unless the authors make that the default.</p>
<p>Having the stats page authors change their indexing settings is a long-term solution, but a good one I think.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
