The first bots to reach a new blog
I thought I’d see which bots are checking out a brand new blog. A few pings have gone out to pingomatic, and it’s linked from my old blog, which pings a few services as well.
I see quite a few searching for technoratibot:
That one makes your posts available for bloggers. You can search for keywords and such.
Here’s what I found:
- 216.52.237.214 with user agent: geourl/2.0b4 - http://geourl.org/bot
- 198.87.83.123 with user agent: Syndic8/1.0 (http://www.syndic8.com/)
- 213.239.211.101 with user agent: A2B Location-Based Search Engine (+http://www.a2b.cc)
- 170.224.8.126 was seen on my old blog February 6th. But on this one it’s accessed with two different user agents: 1) libwww-perl/5.65 (which also checks robots.txt) 2) Java/1.4.2_06 goes straight for the feed, and then individual posts.
- Alexander Morozov’s bot was one of the first to reach it. Block 69.50.170.122 before you bring a new blog live to hopefully avoid his trackbacks.
- A human with a Firefox browser leaves the user agent Sage in one of the accesses - the feed.
- 66.151.189.7 with user agent: Feedster Crawler/1.0; Feedster, Inc. Checks several different feed types
- A human with Firefox leaves the user agent Straw/0.25.1 when fetching the feed
- 216.148.212.180 with user agent: Bloglines/2.0 (http://www.bloglines.com). And subscribers clicking on links follow right behind.
- A human sets up his feed software. User agent: NetNewsWire/2.0b25(Mac OS X; http://ranchero.com/netnewswire/)
- Googlebot comes sniffing for the root and robots.txt
- Raggle/0.3.1 (i386-linux; Ruby/1.8.2) comes for the feed. Unsure if this is a bot or a human.
- My first referrer spam, I believe? 61.210.180.74 http://www.dela-grante.net/ and user agent: Mozilla/4.0 (compatible; MSIE 6.0)
- 66.250.128.131 with user agent: ping.blo.gs/2.0 and referrer: http://blo.gs/ping.php
- 64.26.171.196 with empty user agent. Two different feeds. It’s all over my old blog as well
- 209.237.230.104 with user agent: Technoratibot/0.6
- 205.147.9.200 with user agent: blogsnowbot (+http://www.blogsnow.com/bot.html)
- A human comes with a Linux version of Firefox, then sends an aggregator back for the feed: Liferea/0.9.0b (Linux; fr_FR@euro; http://liferea.sf.net/)
- Ask Jeeves/Teoma have been by
Phew! Quite a few bots and aggregators!
February 25th, 2005 at 8:54 am
“A human with a Firefox browser leaves the user agent Sage in one of the accesses - the feed.”
Ooh, I wonder if that’s me.
February 25th, 2005 at 8:57 am
Well, one of them, yes…
June 8th, 2005 at 10:17 am
Since I added a blog to my site, I’ve gotten crawled by the obidos-bot? Ever heard of that one?