- Posted On: 12 Sept 2014
- Posted By: Crescentek
30 Jun 2017
Sad to say, duplicate content on websites is not only at a peak but, more often than not, occurs within the same domain of the same website! This article deals with the main aspects of duplicate content and describes ways of fixing it. It explains what duplicate content is, what canonicalization means (i.e. indicating your preferred URL to Google) and how best to use free tools to deal with the problem. To start with, it is worthwhile to define what duplicate content actually stands for.
As per Google’s own definition, “Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar. Mostly, this is not deceptive in origin.” As for canonicalization, Google explains it as follows: “Many sites make the same HTML content or files available via different URLs. [...] To gain more control over how your URLs appear in search results [...] we recommend that you pick a canonical (preferred) URL as the preferred version of the page. You can indicate your preference to Google in a number of ways. We recommend them all, though none of them are required (if you do not indicate a canonical URL, we’ll identify what we think is the best version).”
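One common way of indicating a preferred URL is a `<link rel="canonical">` element in the page's `<head>`. The short Python sketch below shows how you might check which canonical URL a page declares. It uses only the standard library; the sample page, the `example.com` address and the helper names `CanonicalFinder` and `find_canonical` are made up for illustration and are not part of any tool mentioned here.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Remembers the href of the first <link rel="canonical"> tag it sees."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        # Only <link> tags matter, and only until a canonical URL is found.
        if tag != "link" or self.canonical is not None:
            return
        attr_map = dict(attrs)
        # rel may hold several space-separated values, in any letter case.
        rel_values = (attr_map.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attr_map.get("href")


def find_canonical(html):
    """Return the declared canonical URL, or None if the page has none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


# Hypothetical page; example.com is a placeholder domain.
sample_page = """
<html><head>
<title>Blue widgets</title>
<link rel="canonical" href="https://www.example.com/widgets/blue">
</head><body>...</body></html>
"""

print(find_canonical(sample_page))
```

If the function returns `None`, the page declares no canonical URL in its markup and, as the quote above says, Google will pick what it thinks is the best version.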
Incidentally, there are three broad categories of duplicates, namely (a) True Duplicates, (b) Near Duplicates and (c) Cross-domain Duplicates. A True Duplicate is a page that is 100% identical in content to another page, differing only by the URL. A Near Duplicate differs only slightly from another page, perhaps in a block of text, the order of the content or an image. A Cross-domain Duplicate comes about when two websites share the same content. Cross-domain duplicates often give rise to issues even for legitimate, syndicated content.
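To make the difference between true and near duplicates concrete, here is a minimal Python sketch that compares the text of two pages. It is one possible approach, not how Google or any particular tool does it: pages count as true duplicates when their text is identical after collapsing whitespace and letter case (an assumption), and as near duplicates when the overlap of their three-word sequences reaches a threshold. The 0.5 threshold and all function names are assumptions chosen for illustration.

```python
import re


def normalize(text):
    """Collapse runs of whitespace and lowercase the text."""
    return re.sub(r"\s+", " ", text).strip().lower()


def shingles(text, size=3):
    """Return the set of overlapping word sequences of the given size."""
    words = normalize(text).split()
    if len(words) < size:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a, b):
    """Share of items the two sets have in common (1.0 means identical)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def classify(text_a, text_b, near_threshold=0.5):
    """Label a pair of page texts; the threshold is an arbitrary assumption."""
    if normalize(text_a) == normalize(text_b):
        return "true duplicate"
    score = jaccard(shingles(text_a), shingles(text_b))
    return "near duplicate" if score >= near_threshold else "distinct"


# Made-up page texts for illustration.
page_a = "Our blue widget ships free worldwide and comes with a two year warranty"
page_b = "Our red widget ships free worldwide and comes with a two year warranty"
page_c = "Contact us by phone or email for a custom quote today"

print(classify(page_a, page_a))
print(classify(page_a, page_b))
print(classify(page_a, page_c))
```

The same comparison applies to cross-domain duplicates; the only difference is that the two texts come from pages on different websites.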
One way of finding duplicate content is to use PlagSpotter or Copyscape, both of which are online duplicate content checking and monitoring tools. You enter your URL to obtain a list of sources or sites that duplicate your content. To check your whole site, just log in and use the tool's Batch Search feature, either by providing a sitemap or by copying and pasting the URLs that you wish to be checked.
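If you prefer to paste URLs rather than supply the sitemap itself, a few lines of Python can pull the page addresses out of a standard XML sitemap. This is only a sketch: the sample sitemap and the function name `urls_from_sitemap` are invented for illustration, and it assumes a plain `urlset` sitemap rather than a sitemap index.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def urls_from_sitemap(xml_text):
    """Return every <loc> value listed under <url> entries in the sitemap."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


# Hypothetical sitemap; example.com is a placeholder domain.
sample_sitemap = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.example.com/</loc></url>
  <url><loc>https://www.example.com/widgets/blue</loc></url>
</urlset>"""

# One URL per line, ready to copy and paste.
print("\n".join(urls_from_sitemap(sample_sitemap)))
```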