Why Does SEOMOZ Crawl show that I have 5,769 pages with Duplicate Content?
-
Hello...
I'm trying to do some analysis on my site (http://goo.gl/JgK1e) and SEOMOZ Crawl Diagnostics is telling me that I have 5,769 pages with duplicate content.
Can someone, anyone, please help me understand:
-
How does SEOMOZ determine whether I have duplicate content?
-
Is it correct? Are there really that many pages of duplicate content?
-
How do I fix this, if true? <---- ** Most important **
Thanks in advance for any help!!
-
-
Looks like it's now sorted!
Checked link: http://www.plasticstorage.com/
Type of redirect: 301 Moved Permanently
Redirected to: http://plasticstorage.com/
-
Hey Lavellester,
I believe we got our 302/301 issue resolved. Is there any way you can check again with whatever tool you were using last week?
We are still trying to figure out how to fix the code creating the dupe content, but can you advise whether I fixed the redirect properly?
Thanks,
-
It will be related to the code/logic of your site creating the duplicate pages. You could work out why and update the code so you only have one page for each product, or you could use the rel canonical tag to resolve the issue.
Just a thought: as you appear to have so many duplicate pages, it may be quicker to look at the logic of the site and fix it all in one go.
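To illustrate the rel canonical option, here is a minimal Python sketch. It assumes that the trailing `category/<number>` path segment and query parameters such as `?related=1` (both seen in URLs posted in this thread) do not change what the page shows. The function names are hypothetical, and which URL should count as the preferred one is the site owner's decision, not something this sketch can determine.

```python
import re
from html import escape
from urllib.parse import urlsplit, urlunsplit

# Assumption: "/category/<number>" only records how the visitor reached the
# product and does not change the page content.
CATEGORY_SEGMENT = re.compile(r"/category/\d+(?=/|$)")


def canonical_url(url: str) -> str:
    """Return the URL with the category segment, query string and fragment removed."""
    parts = urlsplit(url)
    path = CATEGORY_SEGMENT.sub("", parts.path)
    # Assumption: no query parameter (e.g. "related=1") affects the content.
    return urlunsplit((parts.scheme, parts.netloc, path, "", ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link> element that would go in the page's <head>."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}" />'


if __name__ == "__main__":
    for example in (
        "http://plasticstorage.com/catalog/product/view/id/5079/s/305b1/category/100/",
        "http://plasticstorage.com/305b2.html?related=1",
    ):
        print(canonical_link_tag(example))
```

Every variant of a product URL would then carry the same canonical tag, which tells search engines which single address to index.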
-
Lavellester, thanks for all of your help. I am going to have that redirect changed to a 301 ASAP.
Do you have any idea where that duplicate content is coming from, since I only have that item in my database once?
Thanks
-
The redirect is still a 302:
Checked link: http://www.plasticstorage.com/
Type of redirect: 302 Moved Temporarily
Redirected to: http://plasticstorage.com
Change the type of redirect to a 301. This is usually done at the server level.
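If you want to re-check the redirect yourself rather than rely on an online tool, a rough Python sketch follows. It sends a single HEAD request without following redirects and reports the status code and `Location` header. The function names are made up for illustration, and some servers answer HEAD differently from GET, so treat the result as a hint.

```python
import http.client
from urllib.parse import urlsplit

PERMANENT = {301, 308}
TEMPORARY = {302, 303, 307}


def classify_redirect(status: int) -> str:
    """Label an HTTP status code as a permanent or temporary redirect."""
    if status in PERMANENT:
        return "permanent"
    if status in TEMPORARY:
        return "temporary"
    return "not a redirect"


def check_redirect(url: str, timeout: float = 10.0):
    """Send one HEAD request (http.client never follows redirects)
    and return (status, Location header or None)."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=timeout)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=timeout)
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    try:
        conn.request("HEAD", path)
        response = conn.getresponse()
        return response.status, response.getheader("Location")
    finally:
        conn.close()


if __name__ == "__main__":
    status, location = check_redirect("http://www.plasticstorage.com/")
    print(status, classify_redirect(status), location)
```

A "temporary" result for the www address would mean the 302 is still in place.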
-
Unless I'm mistaken they appear to be identical duplicate pages.
-
Little update:
In our GWT we have both the www version and the non-www version.
On the non-www version, we had set the preferred domain to non-www.
On the www version, we had also set the preferred domain to non-www.
Does this mean I've done it correctly or not?
Thanks
-
I am looking into the 301/302 issue right now and I will report back...
Here are 3 supposed duplicate content pages:
plasticstorage.com/catalog/product/view/id/5079/s/305b1/category/100/
plasticstorage.com/catalog/product/view/id/5079/s/305b1/category/101/
Those pages, when entered into your browser, will appear identical. There is only one 305b1 item in the database. Somehow you can change the "100" or the "101" to "166666662" and you will still see that same "305b1" item. Can you please help me figure out how or why this happens, and whether this is the "DUPLICATE CONTENT" or whether something else is being considered?
Thanks again
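One way to see how many URLs point at the same product is to group a crawl export by the product id embedded in the path. The Python sketch below assumes every affected URL follows the `/catalog/product/view/id/<number>/` pattern shown above. The function name is hypothetical, and this is not a description of how SEOMOZ itself detects duplicates.

```python
import re
from collections import defaultdict

# Assumption: the product id always appears as ".../view/id/<number>/...".
PRODUCT_ID = re.compile(r"/catalog/product/view/id/(\d+)/")


def group_by_product_id(urls):
    """Map each product id to its URLs, keeping only ids reached by more than one URL."""
    groups = defaultdict(list)
    for url in urls:
        match = PRODUCT_ID.search(url)
        if match:
            groups[match.group(1)].append(url)
    return {pid: found for pid, found in groups.items() if len(found) > 1}


if __name__ == "__main__":
    crawled = [
        "plasticstorage.com/catalog/product/view/id/5079/s/305b1/category/100/",
        "plasticstorage.com/catalog/product/view/id/5079/s/305b1/category/101/",
    ]
    for product_id, duplicates in group_by_product_id(crawled).items():
        print(product_id, len(duplicates), "URLs")
```

Any id that shows up with more than one URL is a candidate for a single canonical address.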
-
I'd still strongly recommend fixing the 301 redirect. The preferred-domain setting in GWT is more of a soft preference.
Can you show 2 URLs that are deemed to be the same/duplicate?
-
We have set the canonical through Google to choose the non-www version.
Some examples of pages that are deemed duplicates are:
| Page Title | URL | Other URLs | Page Authority | Linking Root Domains |
| Ultra-Clear InSight Bin - Stacking 10x5x5 305B1 Akro-Mils | http://plasticstorage.com/305b1.html | 50+ | 15 | 1 |
| Lids for Ultra-Clear InSight Bin - 305B1 - Akro-Mils 305B2 | http://plasticstorage.com/305b2.html?related=1 | 50+ | 1 | 0 |
| Dividers for Ultra-Clear InSight Bin - 305B1 - Akro-Mils 405B1 | http://plasticstorage.com/405b1.html?related=1 | 50+ | 1 | |
-
Hi there,
One issue is that your www.plasticstorage.com is a 302 redirect to plasticstorage.com. You should update this to a 301 redirect. This will be causing a few issues.
I think the SEOMoz tool shows content that it deems duplicate in the report. Can you post some examples of pages that are deemed duplicate?
Cheers.
-
I also had duplicate content. The cause was that I have two similar domains that both led to the same page: (1) www.msperformanceonline.com and (2) the same domain without the "www.", just msperformanceonline.com. Check whether this is the cause for you too.