What is the best method to solve duplicate page content?
-
The issue I am having is an overwhelmingly large number of pages on cafecartel.com show that they have duplicate page content.
But when I check the errors on SEOmoz it shows that the duplicate content is from www.cafecartel.com not cafecartel.com.
So first of all, does this mean that there are two sites? and is this a problem I can fix easily? (i.e. redirecting the URL and deleting the extra pages)
Is this going to make all other SEO useless due to the fact that it shows that nearly every page has duplicate page content?
Or am I just completely reading the data wrong?
-
the wordpress just has a setting under general settings for www or non www.
-
I had the htaccess redirect, but the ccsnews is a wordpress blog. When I had that re-direct going, the blog complained of too many re-directs. I've seen this happen before even on seomoz.
So I'm using a joomla redirect plug in. I'm thinking the wordpress has a redirect plug in also, just haven't installed it yet.
-
The internal crawl report from SEOmoz is based on your internal links, not external inbound links. So if there are any errors, it is in your site.
At a quick glance, I see that you have setup the 301 to www, but if you click into the blog (news), then you aren't at the www anymore. http://cafecartel.com/ccsnews/ - (if wordpress, then it's just a simple settings change.)
Run a crawl test on it (http://pro.seomoz.org/tools/crawl-test) and keep on plugging away and fixing every issue until there are no more.
And make sure you use rel=canonical tags. This will help out with the duplicate content as well. http://www.seomoz.org/learn-seo/canonicalization
-
Thank you Brent, and Mark...
So taking your advice this is what happened...
At the tail end of last week, we implemented a 301 redirect to www.cafecartel.com, we adjusted the .htaccess file to implement it and it worked as far as always landing on www.cafecartel.com....BUT the errors didn't adjust after the crawl.
I fear that the mere existence of these links to cafecartel.com and www.cafecartel.com may need to be manually redirected for each page.
The pages that are showing the highest errors are the blog article pages, quote request pages, and the free download pages. These same pages have links going between pages on www.cafecartel.com and other blog sites, which we did as an organic SEO tactic. Is this possibly something that is causing errors?
Thank you all for your advice!
-
You need to setup your site Canonicalization so that you don't have the duplicates. SEOmoz has a great article here: http://www.seomoz.org/learn-seo/canonicalization
Since you are hosted on an Apache server, you will need to modify your .htaccess file in your root directory to take care of these.
Make sure you also setup the www or non www preference in GWT. (Google Webmaster Tools)
-
You are reading the correct data. You should be redirecting the pages to cafecartel.com/.... this will eliminate the duplicate content issues. You also might be able to see the issue with the sitemap....if the website was converted from another website then the pages might still be attached.
Another option, less SEO favorable, but will eliminate the duplicate content, is figuring out where the pages are and then installing robot no follows....
This will help your SEO not hurt it. You are being penalized for the duplicate content.
Hope this helps....
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Best Metrics but Consistently Outranked
I am hoping someone could help us determine why we generally rank quite poorly compared to our competition, despite leading in every single Competitive Metric. We get outranked on a term where the Page Grade gives us an "A", and we best the competitor on each of the metrics. Where would those with more experience suggest we start looking? rank.jpg
Moz Pro | | Yardboy0 -
On-page grader question
Hi there, Getting to know the Pro tools and can't find an answer to this. Can someone explain for me please? Using on page grader, I found a couple pages with an F. I scrolled downWTO where it shows the keyword phrases and under each, the URL. Clicking on the first keyword "Building site alarms"it tells me off essentially for not optimising the page for that term. The URL is "construction site security systems" which are different to building site alarms which also have their own page. I don't understand why is Moz associating this keyword with this page? I certainly haven't told it to. Please he
Moz Pro | | DaddySmurf0 -
Duplicate page report
We ran a CSV spreadsheet of our crawl diagnostics related to duplicate URLS' after waiting 5 days with no response to how Rogerbot can be made to filter. My IT lead tells me he thinks the label on the spreadsheet is showing “duplicate URLs”, and that is – literally – what the spreadsheet is showing. It thinks that a database ID number is the only valid part of a URL. To replicate: Just filter the spreadsheet for any number that you see on the page. For example, filtering for 1793 gives us the following result: | URL http://truthbook.com/faq/dsp_viewFAQ.cfm?faqID=1793 http://truthbook.com/index.cfm?linkID=1793 http://truthbook.com/index.cfm?linkID=1793&pf=true http://www.truthbook.com/blogs/dsp_viewBlogEntry.cfm?blogentryID=1793 http://www.truthbook.com/index.cfm?linkID=1793 | There are a couple of problems with the above: 1. It gives the www result, as well as the non-www result. 2. It is seeing the print version as a duplicate (&pf=true) but these are blocked from Google via the noindex header tag. 3. It thinks that different sections of the website with the same ID number the same thing (faq / blogs / pages) In short: this particular report tell us nothing at all. I am trying to get a perspective from someone at SEOMoz to determine if he is reading the result correctly or there is something he is missing? Please help. Jim
Moz Pro | | jimmyzig0 -
Getting rid of duplicate content
Hi everyone, I'm a newbie and at the moment don't know very much about SEO. I have a problem with some of my campaigns where i keep getting a report with either Duplicate Page and/or Duplicate Content errors. I have no idea how to rectify this error, remove it or fix it on the relevant websites. Can anyone please help explain how to do this, maybe step by step? I really appreciate your views and opinions! Regards, Hugh
Moz Pro | | DigitalAcademyZA0 -
In my errors I have 2 different products on the same page?
Hello, I have 2039 duplicate page errors and most of them are 2 different products on 1 page, I haven't set it up in the CMS, how has this happened? here's 2 examples, the 1st example has ghd's on the back of a different brand and the 2nd has gift packs on the back of the same brand 'rockaholic'? and what does 'norec' mean? http://www.thehairroom.co.uk/Tigi-Rockaholic-797658/ghd-straightening-irons/norec http://www.thehairroom.co.uk/Tigi-Rockaholic-797658/tigi-bed-head-gift-packs/norec Thanks Mark
Moz Pro | | smoki6660 -
Truncate page URLs
We have some pages (for example a contact us form) for which the URL is modified by the CMS depending on the referring page (this helps to put the form submission in context for the sales reps who get the contact submission). The SEOmoz crawler considers each URL a new page -- and so numbers like in diagnostics are all inflated as the same page is listed multiple times (e.g. for too many links) Is there a setting to change what the crawler considers to be the same page? Here are two URLs for the same page that the reports treat as separate pages: http://www.spirent.com/About-Us/Contact_us.aspx?referurl=0F528F4D703D8BB3523738D6373AA8AD http://www.spirent.com/About-Us/Contact_us.aspx?referurl=10ACDA6055244E369395223437FDCF30 The page is actually: http://www.spirent.com/About-Us/Contact_us.aspx Thanks Ken
Moz Pro | | spirent.marcom0 -
Why aren't canonical tags reducing duplicate page title/content?
We have canonical tags set up for a feature page on one of our sites. This site has an image gallery controlled by javascript. To aid the user experience the image can also be specified by a URL parameter (the javascript also uses this URL to fetch the images). The SEOMoz report complains that the links to these images have duplicate page titles and content. To try and combat this we set canonical tags to point only to the original page, without the slideshow parameter. e.g. http://www.example.com/feature-page/ http://www.example.com/feature-page/?slideshow=1 -> canonical tag set to http://www.example.com/feature-page/ http://www.example.com/feature-page/?slideshow=2 -> canonical tag set to http://www.example.com/feature-page/ The latest SEOMoz report has come back and the errors still exist. What can we do to remove these error messages? Thanks
Moz Pro | | TJSSEO1 -
Is there a Tool to compare Duplicate content for non web Live content?
Is there a tool that can give me % of duplicate content when comparing two pieces of content that are not Live on the web? Like copyscape but for content that may not be indexed by copyscape or not live on the web? Does Word or any other program allow you do do this?
Moz Pro | | bozzie3110