Powered by Blogger.
Menu :
Showing posts with label Spam Classification. Show all posts
Showing posts with label Spam Classification. Show all posts

Blogger Blog Content Needs To Be Unique, And Properly Targeted

I've been studying spam, in Blogger blogs, for over 5 years.

This year, we've seen improvement in the automated spam classification process, implied by a noticeable reduction in spam review requests overall - and in a considerable reduction in the proportions of false positive classifications. During the past few months, blogs requested for review are 2 or 3 times more likely to be confirmed as legitimate spam blogs (true positives), compared to this time last year.

Of the blogs confirmed to be spam hosts, when I am able to examine cached copies of the content, 3 out of 4 of those appear to contain material scraped or syndicated from other blogs or websites.
  • Content scraped (stolen), or syndicated (copied, with permission), from other blogs / websites. Content scraped or syndicated to other blogs / websites.

Google describes the problem, in Blogger Help: Spam, phishing, or malware on Blogger, quite simply.
Spam blogs cause various problems, beyond simply wasting a few seconds of your time when you happen to come across one. They can clog up search engines, making it difficult to find real content on the subjects that interest you. They may scrape content from other sites on the web, using other people's writing to make it look as though they have useful information of their own. And if an automated system is creating spam posts at an extremely high rate, it can impact the speed and quality of the service for other, legitimate users.

Long ago, spam blogs were first encountered as startup components in large spam blog farms.

Later, we explored the involvement of various "get rich quick" schemes, and of affiliate marketing.
  • Content or links which reference referral-based activities such as GPT ("Get Paid To"), MLM ("Multi-Level Marketing"), MMF ("Make Money Fast"), MMH ("Make Money from Home"), PTC ("Pay To Click"), or PTS ("Pay To Surf").
  • Affiliate marketing (Please, don't confuse this with "affiliate networking"!).

Of these three broad descriptions of confirmed spam blog content - spam blog startups, get rich quick schemes, and affiliate marketing - the one common feature in most of the blogs, confirmed as spam hosts, seems to be the lack of unique content. One of the features of the Panda update to Google Search was described as "content quality" in search results.

The past year tuning to Blogger spam classification appears to be in keeping with Panda, in that it is targeting blogs which rely upon content intentionally replicated from blog to blog - whether "scraped" (stolen, without permission), or "syndicated" (copied, with permission).

The end result here is that Blogger blogs, to avoid spurious spam classification, need to contain as much unique material as possible. While some amounts of quotation of other blogs and websites is beneficial, the majority of blog content needs to be written by the blog owners and contributors, and properly targeted to the reader population.

>> Top

Market Your Blog, To Those Who Are Interested

We see many questions in Blogger Help Forum: Something Is Broken, about blog content, and (lack of) appreciation by the readers.

Occasionally, people become concerned about activity of the people reading their blogs - why they get so many new visitors (but nobody returns later, to read more), or why the main page is so well read (but nobody reads the archived posts).

In other cases, they wonder why the blog was deleted - even though it had the required warning protecting it. And sometimes, they may wonder why so many people read the blog, but nobody comments on the posts.

If your blog just uses Stats, to provide you reader activity statistics, you'll probably be only concerned with visitor activity, and pageview count.

If you use an actual visitor activity log, like SiteMeter or StatCounter, you may also look at new / return visitor ratio, or pages read / visitor count. In both cases, you're examining visitor interest.

Visitor interest starts with how you advertise the blog. You get good visitor interest when you advertise the blog where it will be welcomed, and in a style where your advertisements will be welcomed.

People who don't like your blog may read the first page, then go elsewhere. If they read more than the first page, they will probably be doing that so they can report the blog, for TOS Violation or similar. This is why some blogs are unrighteously clasified as abusive.

No matter how objectively you may write a blog discussing alternative lifestyles, if you advertise your blog where Bible Belt USA Conservatives may gather, the best result that you may see is "single pageview" visitors.

Occasionally, we'll see a problem report that starts out with a common complaint.
Why was my blog deleted? I don't spam.
When we're able to retrieve a cached copy of the blog, we'll agree with the owner.

Some blogs, deleted as abusive, will have a little known problem.
Commercially funded adult content.
Other times, the problem will be more subtle.
I set the "Adult Content?" flag to "Yes"! Why am I getting complaints? (Why was the blog deleted?).
The problem here starts with the nature of the "Adult Content" warning.

Not every blog owner realises that the warning is only advisory. Anybody, no matter their age or religious preference, may (by accident, or intentionally) click on "Agree". Having clicked, they may be subject to a faceful of content which is not in their best interest, or which they do not appreciate.

If a link to your blog appears in the wrong forum discussion, or on the wrong website - either a Bible Belt forum or School Children's website - don't be surprised if your blog continues to get content complaints. And in some cases, we'll see you in the forums.
Why was my blog deleted? I don't spam.

Be sensitive to both the stated, and unstated, policies where you post. If your blog contains controversial material, be very conservative about how and where you advertise. Publish properly targeted posts, with unique content, for the best future of your blog.

>> Top