Archive for the 'Search engine spam' Category

Whatever it is, you’ll find it on a scraper site

Thursday, October 27th, 2005

I was searching for info on a specific motherboard today. And one of the hits high up in Google was a scraper site. So OBVIOUSLY a scraper site. It had an on topic file name and heading, and then Adsense taking up much of the window. The bottom half of the screen was some generic nonsense about computers. It’s as if they’d used mail merge in Word to get the name of the motherboard in there, in a generic generated text. At the bottom there SEEMED to be the typical scraper links. Except they’d switched the real links for links to that SAME page.

Grrrr….

How many times have we heard scraper site/search engine spammers say they think their pages add value to search results? Exactly how dumb do they think we are?

And just so we can compare oranges to oranges, here’s the page:
computer-infocentre.com/intel-ga-6zma.html

Matt’s one of us now

Monday, August 15th, 2005

Matt Cutts is one of us now. Bloggers I mean. This is Google’s face towards webmasters. He’s got a LOT of mentions on various blogs. On the whole people like him. Except for some SEO’s. Considering what some SEO’s do, I guess I could live with that if I were him.

Anyway, he’s been blogging for a while, and I didn’t discover it until today. I guess I’m the one sleeping in class. Either that, or he picked the summer because there’s less traffic and he could experiment a bit until the unwashed masses found out?

Come on Matt, blog more about search engine spam!

Netsphere domain parking

Sunday, June 26th, 2005

I caught the jaja-jak-globusy.com domain referrer spammed on two of my sites. Turns out the domain is part of the Netsphere domain parking program, and uses Adsense for domainpark. BTW, Netsphere appears to be a closed program. No sign of it in Google.

Problem is, the pages look undistinguishable from Adsense scraper pages. They have a search function that puts keywords into the URL, which means relevant Adsense ads will be returned.

The server is full of such domains. I don’t know exactly how many, because webhosting.info refused to return any results, and whois.sc was struggling for a long time too, before reporting 389526 websites. A significant portion of those belong to the spammer, as far as I can tell.

And they’re referrer spamming a domain that was bought June 9th! It probably was previously owned, though.

Guys, am I the only one who thinks this looks awfully fishy?

MSN algo change more spammy

Sunday, June 26th, 2005

Remember a while ago, someone spamvertized this site? One of the key phrases was “lesbian movies”. I was struggling with hits from MSN and Yahoo for weeks. Eventually, it subsided. Most likely because the search engines gave precedence to newly spammed keywords.

But just lately I noticed more hits from MSN on those keywords.

Turns out there’s a new algo, and unfortunately, I’m now at 4th place for that keyword.

I guess I have to agree with the guys at Webmasterworld about the new algo being spammier, based on my rather limited experience.

Reporting scraper sites

Sunday, June 26th, 2005

Threadwatch has a story on something Matt Cutts said (top Google engineer/PR guy).

He says to follow the Ads by Google on Adsense ads if you want to report violations of the Adsense policies.

And in one of the comments, I found a spam report address for MSN. Long overdue, in my opinion.

High paying Adsense

Friday, June 17th, 2005

Got a trackback on annelisabeth.com today.

It advertized many subdomains on
purichee.info

And when I checked earlier trackbacks, I found some more June 15:
tellskin.info

And spamvertized on other sites:
cartmark.info
cheesquare.info
cheesquare.info
kesktown.info
kesktown.info
malknight.info
mapledark.info
minismall.info
netistar.com
plexsky.com
suptarg.com
tresdin.info
valitest.info
radcoin.info
dreamstin.info
samtailo.com
spritkin.com
kingdest.com
nexshine.com
synckin.com

I did some digging:

He’s been at this at least since June 1, 2005.

The point is to present a dead end to surfers, so they’re redirected to a page that just displays the name of the site (including subdomain).

However, spiders will see a directory with links. Googlebot is specifically instructed to not index the page, but is expected to follow the links.

The next layer of pages are also protected with javascript to throw off human visitors - same purple page.

The next layer is the final layer, and has Adsense on it:
pub-9219922125396009

It also contains what must be random text. In one case information about Mars.
One part of the text was scarfed from an encyclopedia article. Another from Biology Daily. A third section is from Biography

And the final section is scraper search engine results. Many of the links are dead.

I checked a page for another topic altogether. That one also had an astronomical article as the middle content.

Conclusion: This is a scheme to get pages to rank for high paying Adwords. There’s no other point to the pages. In other words, it’s both search engine spam and Adsense spam.

Whois:

Domain Name:PURICHEE.INFO
Created On:14-Jun-2005 00:26:06 UTC
Last Updated On:15-Jun-2005 23:45:52 UTC
Expiration Date:14-Jun-2006 00:26:06 UTC
Sponsoring Registrar:R171-LRMS
Registrant Name:Freddy Dirkson
Registrant Street1:PO Box 3411 345 Nesville Ave
Registrant City:Toronto
Registrant State/Province:Ontario
Registrant Postal Code:M3L 3K9
Registrant Country:CA
Registrant Phone:+1.4164542655
Registrant Email: afflink@hotmail.com

The phone number is from Toronto, Rogers Cantel wireless. Might be legit.

Artificially increasing link popularity

Sunday, June 5th, 2005

Link spam in general has one goal: Increasing link popularity in search engine through employing various tricks. Usually increasing the number of links on the net, or the number of search terms in anchor text.

If you are tempted to use techniques of that sort, whether it’s Free For All, blog submitters or spamming, keep this in mind:

Artificially increasing link popularity may get you penalized in Google.

Have a look at their webmaster guidelines.

They have an open ended set of rules. One that includes common sense. Basically, if it’s too simple, too good to be true, or too sneaky, chances are good Google will condemn it as search engine spam. That could result in the backlinks not counting, weighted link rank (sinking a couple hundred places in the serps), or banning of the domain, subdomain or folder name (They can take out individual blogs or sites on an otherwise non-banned domain).

Basically, if you care about being in Google, do not do any aggressive off site SEO. Organic link building is the key. That means no automatic tools of any sort. If you comment on forums, blogs or other interactive places, do it in moderation, on topic and with class. That’s the key to long term off site SEO.

And one more thing: Us bloggers tend to remove links to spammy looking sites, even if your comments are on topic. I’ve been known to keep the comment and remove spammy looking links. Many of us frown on sites that are built with the sole purpose of earning money without having any intrinsic value, because we see them as polluting the web. If it’s trash, we trash it. That simple.

My apologies to Maryanne Myers

Saturday, June 4th, 2005

Looks like I didn’t have enough information when I called her a spammer.

Her blog submitter is set up to only submit to her own list of blogs. I don’t know if they are tame blogs - owned by others who approved the postings, or by her herself. Either way, it’s technically not blogspam as it is defined that way, since the owners have approved it.

Life would have been much easier if you’d EXPLAINED this to the spam hunters, Maryanne! Without anyone having to sign up - which you know we’d never do.

However, there is one area where Maryanne, and the people who sign up, might have a problem. The problem is Google. The way they define search engine spam, then I believe this either falls within that definition, or they’re likely to expand it to incorporate what you guys are doing. More on Google and webmasters. And this warning at the bottom of the page here.

The likely outcome is that they’ll ban or weight all blogs they identify as being part of that network, and so any comments with links posted to them will not have any weight in Google. I’ve seen Google do a lot of stuff in the past, and this is - still pure speculation - something they’ll probably devote some thought to and come up with SOMETHING.

As a friend said: This is a new version of a linkfarm. Something Google already hates. I don’t know what they do to linkfarms, but they do …something. Look at a discussion on Webmaster world about link farms. And this article. Blogmysite is different from linkfarms, because as far as I know it’s not a reciprocal links scheme. That doesn’t insulate it against sanctions however, as all new techniques, it will be held to scrutiny.

Another problem is rel=”nofollow”. Blogger for instance, has that implemented. That means any comments with live links, will have code in the links (even though the commenter didn’t code them that way) that tells search engines (and not just Google) not to pay any attention to the links. It’s possible they’ll follow the links. It’s even possible they’ll index the pages linked to. But the links won’t count as backlinks. Most blog software has that implemented by default as well by now. The point was never to have people click on the links, but to get better rating in Google, right?

Updates
Here we go. The positioning has begun. I can’t remember having seen the sentences that are on top of blogmysite before. I won’t quote them directly, though, it’s probably not allowed by her TOS. Basically, she says the ads will be posted to two blog networks that are owned by her, and to blogs that have requested to be added. I would NOT have thought this was a spam submitter if that had been present on her page before!!!! So this must be very new. Oh yes, I found a cached affiliate ID page from May 22 (on MSN). It wasn’t there then.

The sentence on her page argueing that posting comments isn’t spam would be meaningless if the comments are meant to be posted to a network of blogs that have given permission. So it’s no wonder I thought she was going to spam me and others.

Hey, Maryanne, you may want to have your computer checked. There’s a string in your user agent that I suspect could mean you’re infected with something. Read more about it here.

I see she’s posted on usenet about this too. Maryanne, if Google was mistaken in banning you (because the software isn’t a blogspam submitter), then they will unban you. Simple as that. If however the ban is held, then it’s because they consider it search engine spamming (and I don’t know if they will or not). Wordpress got banned earlier this year, and was unbanned when they took down the linkfarm they got banned for. In high profile cases that get the attention of the top dogs (and this is certainly high profile enough), the status is often updated quickly.

There’s a link to my blog from her member login page. Some untruths on there about me. I am into truth. If I had to lie to win a fight, I’d back out. I call’em as I see em. And if I was wrong - through not having enough knowledge, I’ll admit to that, and make reparations.

last update:

I don’t see the logic in telling all her members about my site and my fight against webspam in general, and then saying she’s going to sue me. One of my pet peeves is search engine spam. It’s one of my categories on this blog. I was wrong about Maryanne’s program being blogspam. But as far as I can tell, it’s search engine spam. Which is something that I consider bad. No, not illegal. But since the search engines can do pretty much whatever they want - there’s no law that says they have to index or list your site, then laws don’t enter into it. They’ll do what they do, and we don’t know what that is. I was afraid I’d be banned myself for a while, and I could well have been, if the chips had fallen a bit differently. It happens quickly, and they don’t tell you why.

Scraper site

Saturday, May 28th, 2005

I got a referrer on annelisabeth from:

thesecuritypro

There IS a link to my site from that page. But the site itself is nothing but a search engine scraper page. It appears Google has banned the domain. It’s mostly linked from other scraper sites. Like:
howtopaks.com
divedch.de

Two payoffs:

1) A numbered link to anrdoezrs.net advertizing the “Spam X-Terminator”. The root domain redirects to Commission Junction.
2) Google Adsense:
pub-7164896056230752

Time to pull the plug for Adsense on all those search scraper sites, Google?