State of The Blogosphere, March 2005, Part 1: Growth of Blogs
Dave Sifry presents "The State of the Blogosphere" in State of The Blogosphere, March 2005, Part 1: Growth of Blogs

It's a fascinating read, and part of it deals with the creation of automated fake blogs - blogs that build themselves, trackback themselves and others, ping the ping services and trackback the Technorati Tags system for traffic and SE rankings.

Only a little while back we were discussing automated blogs and automated site creation, and now we see that the practice (which is not at all new, Seth) is starting to grab mainstream attention. Here's a snippet from Dave's post on automated blogs:

There is a dark underbelly to these numbers, however: Part of the growth of new weblogs created each day is due to an increase in spam blogs - fake blogs that are created by robots in order to foster link farms, attempted search engine optimization, or drive traffic through to advertising or affiliate sites. We have been battling the spam situation in a significant way for about 2 months - prior to January, spam wasn't much of an issue. All of these charts reflect Technorati's databases after spam blogs have been removed, and we feel that we've been able to capture and identify most of the spam out there, but one should note that there is definitely blog spam that we don't catch (tell us if you see spam in the index!). I'd estimate that we currently catch about 90% of spam and remove it from the index, and notify the blog hosting operators. Most of this fake blog spam comes from hosted services or from specific IP addresses. One of the results of the extremely productive Spam Squashing Summit of a few weeks ago is the increased collaboration between services in order to report and combat this spam. Right now, about 20% of the aggregate pings Technorati receives are from spam blogs, so you won't see that in these numbers - these statistics show only "cleaned" data.

Seth's right, though, when he says that anonymity is the big problem; it's just too easy...


Go here: http://www.joepetrow.com/index-2.cgi?PAGE=resume

in the upper LH corner, click 'blog bitch' --read about it

then go to the site here: http://blog_bitch.blogspot.com/

look at the date.
three months ago, there was a blog script that passed the Turing test with flying colors while in beta. human initiated links (well, I think they were human initiated, hard to tell nowadays) from other blogs were coming in, linking to its musings (mostly political) ---oddly enough the site disappeared within days after a footnote was added that the script was for sale.

more breadcrumbs

now search on the term Artificial Intelligence Chat Bots

spend a day playing with some of them

when you're all done, download JanusNode and play with it.


To those of us who see the rise of the blog as a blight on the internet, automated blogs are really no different from the rest of the blogs.



The problem is significant --starting to ooze out of all the cracks now.

>The problem is significant

Well, I'll say once again I think it was a mistake for the envelope-pushing members of the webmaster community to target blogs. Having said that, I'd rather read the blog referenced than read Anil Dash's; it makes more sense and speaks to me directly.

Make of that what you will.


>makes more sense and speaks to me directly.

HHH! To tell the truth, I kinda liked the blog-bitch's way of thinking.

