How can i see the pages that cause duplicate content?
-
SEOmoz PRO is giving me back duplicate content errors. However, i don't see how i can get a list of pages that are duplicate to the one shown. If i don't know which pages/urls cause the issue i can't really fix it. The only way would be placing canonical tags but that's not always the best solution.
Is there a way to see the actual duplicate pages?
-
The only other thing I can think of is there's duplicate page content and duplicate title content. If it says true in either of those columns then there's no URLs in the columns to the right of it (headed duplicate_page_content or duplicate_title) then I'd contact Moz and work with them. Mine populate fine.
-
That surely makes sense! But when i look at the column that says duplicate_page_content then there is nothing shown.. even if they are marked as true. I must be missing something...
-
OK, within that Excel file, there's a column header with "duplicate page content" - so, the URL in question will be in the far left (URL) then there's a column that says "duplicate page" (with true/false as the options) and if it's true, then there's another column with "duplicate page content" as a header and URLs in it. Those should be the ones that Moz caught duplicating the URL in the "URL" column - if that makes any sense at all!
-
True! it's really helpful! I might have one more question regarding this. When i export to csv i get a ton of data. I open the file in Excel and seperate the data to columns. The pages that have duplicate content issues are marked as "true". But how can i see within this document which pages are duplicate for another specific page?
-
No shame! There's a ton of data here and it can be a bit of a needle in a haystack at first to figure out That's why these forums are so helpful!
-
Exactly. The download gives much deeper data, however with a few clicks that Netlogiq suggested you can find it w/o downloading.
-
Ummm.. i just found it. Not having bright moments today. shame. You must click on the number which is in the column "Other urls's". I was clicking on the page title shown in the column: "Page title url"
Didn't really jump to mind to click on the number.
Everything in order! Thx for responding everyone!
-
Hmmm not quite clear yet..
When i click on the issue in the overview a list of pages which have a duplicate content issue, opens. Then when i click on one of those links the only thing i see is a bold URL and some information about the duplicate content. But i don't see the url that is duplicate to the one displayed bold.
-
Now, I'll preface this by saying I don't know what documents you may be looking at vs what I have access to. I see duplicate links from SEOMOz, so you can get to it.
For example, when I log into my SEOMoz campaign information and click on the red errors box, then the duplicate content box, there's a selection of duplicate URLs right below the chart. My current one is indicating it caught 29 duplicate pages of content for my Spanish signs product section, then I can see all the URLs listed out that it sees as duplicates.
Granted, SEOMoz only crawls 10,000URLs at a time, so for a major site like mine that's only part of what we have, but it's an indicator of stuff we need to fix. I download my campaign report into a CSV file and there's columns in that identifying what's duplicate, too.
-
You can also export the document:
Crawl Diagnosis - Duplicate page content - export to CVS. Or - click on the +x number of duplicate pages, and you will see all the duplicate pages for that URL.
-
Yes, you can click on the error/duplicate content link and the pages will list. It will list the other pages below the bolded listing. Hope that helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Magento Duplicate Content help!
How can I remove the duplicate page content in my Magento store from being read as duplicate. I added the Magento robots file that i have used on many stores and it keeps giving us errors. Also we have enabled the canonical links in magento admin I am getting 3616 errors and can't seem to get around it .. any suggestions?
Technical SEO | | adamxj20 -
Would Google Call These Pages Duplicate Content?
Our Web store, http://www.audiobooksonline.com/index.html, has struggled with duplicate content issues for some time. One aspect of duplicate content is a page like this: http://www.audiobooksonline.com/out-of-publication-audio-books-book-audiobook-audiobooks.html. When an audio book title goes out-of-publication we keep the page at our store and display a http://www.audiobooksonline.com/out-of-publication-audio-books-book-audiobook-audiobooks.html whenever a visitor attempts to visit a specific title that is OOP. There are several thousand OOP pages. Would Google consider these OOP pages duplicate content?
Technical SEO | | lbohen0 -
Duplicate Content in Wordpress.com
Hi Mozers! I have a client with a blog on wordpress.com. http://newsfromtshirts.wordpress.com/ It just had a ranking drop because of a new Panda Update, and I know it's a Dupe Content problem. There are 3900 duplicate pages, basically because there is no use of noindex or canonical tag, so archives, categories pages are totally indexed by Google. If I could install my usual SEO plugin, that would be a piece of cake, but since Wordpress.com is a closed environment I can't. How can I put a noindex into all category, archive and author peges in wordpress.com? I think this could be done by writing a nice robot.txt, but I am not sure about the syntax I shoud use to achieve that. Thank you very much, DoMiSol Rossini
Technical SEO | | DoMiSoL0 -
How do I deal with Duplicate content?
Hi, I'm trying SEOMOZ and its saying that i've got loads of duplicate content. We provide phone numbers for cities all over the world, so have pages like this... https://www.keshercommunications.com/Romaniavoipnumbers.html https://www.keshercommunications.com/Icelandvoipnumbers.html etc etc. One for every country. The question is, how do I create pages for each one without it showing up as duplicate content? Each page is generated by the server, but Its impossible to write unique text for each one. Also, the competition seem to have done the same but google is listing all their pages when you search for 'DID Numbers. Look for DIDWW or MyDivert.
Technical SEO | | DanFromUK0 -
Duplicate Page Titles and %3E, how can I avoid this?
In my crawl report I keep seeing duplicate page title warning with URL's being referenced twice: e.g. /company/ceo-message/ /company/ceo-message/%3E I'm using canonical link tags but after the new crawl report, I'm still seeing this duplicate page title crawl error. How can I avoid this? I've been looking for answers for a few days but don't seem to see this exact problem discussed. Any insight is appreciated!
Technical SEO | | mxmo0 -
How do I fix this type of duplicate page content problem?
Sample URLs with this Duplicate Page Content URLs Internal Links External Links Page Authority Linking Root Domains http://rogerelkindlaw.com/index.html 30 0 26 1 http://www.rogerelkindlaw.com/index.html 30 0 20 1 http://www.rogerelkindlaw.com/ | 1,630 | 613 | 43 | 110 | As you can see there are three duplicate pages; http://rogerelkindlaw.com/index.html http://www.rogerelkindlaw.com/index.html http://www.rogerelkindlaw.com/ What would be the best and most efficient way to fix this problem and also how to prevent this from happening? Thank you.
Technical SEO | | brianhughes0 -
Duplicate Page Content and Titles
A few weeks ago my error count went up for Duplicate Page Content and Titles. 4 errors in all. A week later the errors were gone... But now they are back. I made changes to the Webconfig over a month ago but nothing since. SEOmoz is telling me the duplicate content is this http://www.antiquebanknotes.com/ and http://www.antiquebanknotes.com Thanks for any advise! This is the relevant web.config. <rewrite><rules><rule name="CanonicalHostNameRule1"><match url="(.*)"><conditions><add input="{HTTP_HOST}" pattern="^www.antiquebanknotes.com$" negate="true"></add></conditions>
Technical SEO | | Banknotes
<action type="Redirect" url="<a href=" http:="" www.antiquebanknotes.com="" {r:1"="">http://www.antiquebanknotes.com/{R:1}" />
</action></match></rule>
<rule name="Default Page" enabled="true" stopprocessing="true"><match url="^default.aspx$"><conditions logicalgrouping="MatchAll"><add input="{REQUEST_METHOD}" pattern="GET"></add></conditions>
<action type="Redirect" url="/"></action></match></rule></rules></rewrite>0 -
Noindex duplicate content penalty?
We know that google now gives a penalty to a whole duplicate if it finds content it doesn't like or is duplicate content, but has anyone experienced a penalty from having duplicate content on their site which they have added noindex to? Would google still apply the penalty to the overall quality of the site even though they have been told to basically ignore the duplicate bit. Reason for asking is that I am looking to add a forum to one of my websites and no one likes a new forum. I have a script which can populate it with thousands of questions and answers pulled direct from Yahoo Answers. Obviously the forum wil be 100% duplicate content but I do not want it to rank for anyway anyway so if I noindex the forum pages hopefully it will not damage the rest of the site. In time, as the forum grows, all the duplicate posts will be deleted but it's hard to get people to use an empty forum so need to 'trick' them into thinking the section is very busy.
Technical SEO | | Grumpy_Carl0