The first bots to reach a new blog

I thought I’d see which bots are checking out a brand new blog. A few pings have gone out to pingomatic, and it’s linked from my old blog, which pings a few services as well.

I see quite a few people searching for technoratibot. That one indexes your posts so other bloggers can find them by searching for keywords and such.

Here’s what I found:

  • 216.52.237.214 with user agent: geourl/2.0b4 - http://geourl.org/bot
  • 198.87.83.123 with user agent: Syndic8/1.0 (http://www.syndic8.com/)
  • 213.239.211.101 with user agent: A2B Location-Based Search Engine (+http://www.a2b.cc)
  • 170.224.8.126 was seen on my old blog on February 6th. On this one it shows up with two different user agents: 1) libwww-perl/5.65, which also checks robots.txt, and 2) Java/1.4.2_06, which goes straight for the feed and then individual posts.
  • Alexander Morozov’s bot was one of the first to reach it. Block 69.50.170.122 before you bring a new blog live to hopefully avoid his trackbacks.
  • A human with a Firefox browser leaves the user agent Sage in one of the accesses - the feed.
  • 66.151.189.7 with user agent: Feedster Crawler/1.0; Feedster, Inc. Checks several different feed types
  • A human with Firefox leaves the user agent Straw/0.25.1 when fetching the feed
  • 216.148.212.180 with user agent: Bloglines/2.0 (http://www.bloglines.com). And subscribers clicking on links follow right behind.
  • A human sets up his feed software. User agent: NetNewsWire/2.0b25(Mac OS X; http://ranchero.com/netnewswire/)
  • Googlebot comes sniffing for the root and robots.txt
  • Raggle/0.3.1 (i386-linux; Ruby/1.8.2) comes for the feed. Unsure if this is a bot or a human.
  • My first referrer spam, I believe: 61.210.180.74 with referrer: http://www.dela-grante.net/ and user agent: Mozilla/4.0 (compatible; MSIE 6.0)
  • 66.250.128.131 with user agent: ping.blo.gs/2.0 and referrer: http://blo.gs/ping.php
  • 64.26.171.196 with an empty user agent, fetching two different feeds. It’s all over my old blog as well.
  • 209.237.230.104 with user agent: Technoratibot/0.6
  • 205.147.9.200 with user agent: blogsnowbot (+http://www.blogsnow.com/bot.html)
  • A human comes with a Linux version of Firefox, then sends an aggregator back for the feed: Liferea/0.9.0b (Linux; fr_FR@euro; http://liferea.sf.net/)
  • Ask Jeeves/Teoma has been by

Phew! Quite a few bots and aggregators!
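
If you want to build a list like this for your own blog, here is a minimal sketch in Python of one way to do it. It assumes the server writes the Apache "combined" log format, which this post does not actually state, and it reports each IP and user agent pair in the order it first appears. The function name first_visitors, the LOG_LINE pattern, and the sample lines (including their timestamps, paths, and sizes) are all made up for illustration; only the addresses and user agents come from the list above.

```python
import re

# Assumption: Apache "combined" log format. The post does not say which
# server or log format was actually used.
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"'
)


def first_visitors(lines):
    """Return (ip, user_agent, first_path) tuples in order of first appearance."""
    seen = set()
    result = []
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that are not in the assumed format
        key = (match.group("ip"), match.group("agent"))
        if key in seen:
            continue  # only the first hit per (ip, agent) pair is reported
        seen.add(key)
        parts = match.group("request").split()
        path = parts[1] if len(parts) > 1 else ""
        result.append((key[0], key[1], path))
    return result


# Hypothetical sample lines: timestamps, paths, and sizes are invented.
SAMPLE = [
    '209.237.230.104 - - [10/Feb/2005:10:00:00 +0000] "GET /feed/ HTTP/1.1" 200 5120 "-" "Technoratibot/0.6"',
    '64.26.171.196 - - [10/Feb/2005:10:01:00 +0000] "GET /feed/ HTTP/1.0" 200 5120 "-" ""',
    '209.237.230.104 - - [10/Feb/2005:10:02:00 +0000] "GET / HTTP/1.1" 200 9000 "-" "Technoratibot/0.6"',
]

if __name__ == "__main__":
    for ip, agent, path in first_visitors(SAMPLE):
        print(f"{ip} with user agent: {agent or '(empty)'} first requested {path}")
```

Keying on the IP and user agent together is a deliberate choice here: it keeps cases like 170.224.8.126, which arrives with two different agents, as separate entries.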

3 Responses to “The first bots to reach a new blog”

  1. David Parrott Says:

    “A human with a Firefox browser leaves the user agent Sage in one of the accesses - the feed.”

    Ooh, I wonder if that’s me.

  2. Administrator Says:

    Well, one of them, yes…

  3. Dave Brewer Says:

    Since I added a blog to my site, I’ve gotten crawled by the obidos-bot? Ever heard of that one?
