Would Google Call These Pages Duplicate Content?
-
Our Web store, http://www.audiobooksonline.com/index.html, has struggled with duplicate content issues for some time. One aspect of duplicate content is a page like this: http://www.audiobooksonline.com/out-of-publication-audio-books-book-audiobook-audiobooks.html.
When an audio book title goes out of publication (OOP), we keep the page at our store and display http://www.audiobooksonline.com/out-of-publication-audio-books-book-audiobook-audiobooks.html whenever a visitor attempts to visit a specific OOP title. There are several thousand OOP pages.
Would Google consider these OOP pages duplicate content?
-
I'm confused. When a book goes out of print, does the URL change to this long OOP HTML page? Or does that book's URL then redirect to this page? Or (shudders) do you re-title the OOP page to whatever the OOP book's page was?
If it were me I'd do the first scenario here. It's essentially the same concept as a 404.
-
Yes, that is duplicate content. You should make these pages return a 404 instead, or leave the content in place with a sold-out banner or something similar.
Something I don't like is the index.html on your home page. People who link to you are likely to link to http://www.audiobooksonline.com/, which then gets a 301 redirect to http://www.audiobooksonline.com/index.html.
This will leak link juice, as all 301s leak link juice just the same as a link does: 15% if we go by the original published Google algorithm. Also, your internal pages link to http://www.audiobooksonline.com and are once again redirected to http://www.audiobooksonline.com/index.html.
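The figure above presumably refers to the damping factor of 0.85 in the original PageRank paper, i.e. roughly 15% lost per hop. As a rough sketch of that arithmetic only: it assumes every hop (link or redirect) passes on 85% of what it receives, which is the poster's claim and not something confirmed for redirects, and the function name is made up for illustration.

```python
def share_passed(hops, damping=0.85):
    """Fraction of link value left after `hops` links or redirects.

    Assumption: each hop multiplies the value by the damping factor from the
    original PageRank formulation. Illustrative arithmetic only.
    """
    if hops < 0:
        raise ValueError("hops must be non-negative")
    return damping ** hops


if __name__ == "__main__":
    # One hop: a direct link to /index.html.
    # Two hops: a link to / that is then 301-redirected to /index.html.
    for hops in (1, 2):
        print(hops, round(share_passed(hops), 4))
```

Under that assumption, linking straight to the final URL saves one hop compared with linking to a URL that redirects.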
-
Yes Larry, that is fine. So long as it is a single URL with a single HTML file on it, there are no duplicate content issues. If you want to verify this, I would suggest (if you aren't an SEOmoz PRO member) using a sitemap generator to make sure it isn't crawling multiple pages. But if that page is only listed once (and from what you are saying here that should be the case), then you have no duplicate content issues.
It's just the same as linking to one page from every page on your website. A redirect doesn't work much differently (although it does drop a small amount of link juice).
You might consider no-crawling that OOP page anyway if you're still concerned. Not sure why you would need that one indexed in the first place.
Good luck to you!
-
We use only one URL for the OOP pages. It is 301 redirected from each unique OOP title's page. Based on what you said, I understand that this is fine. Correct?
-
Hi Larry
Couple of questions - is that the only URL for the OOP pages, or are there other versions of the page and/or URL that exist?
If there are multiple pages, then that is definitely duplicate content. However, that can quite easily be fixed. If you add a robots meta tag with a noindex directive (for example, a meta tag with name="robots" and content="noindex") to the head of all those OOP pages, it will prevent Google from indexing the pages (and thus from seeing them as duplicates).
That way you can keep the page for the user but not have to worry about duplicate content. I would do this anyway even if there is only one version of the page, as the page is thin on content as it is.
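The usual mechanism for keeping a page out of the index is a robots meta tag carrying a noindex directive in the page head. A minimal sketch for checking whether a page's HTML contains one, using only the Python standard library (the class and function names are made up for illustration, and the sample markup is an assumption, not taken from the store's pages):

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives of every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").strip().lower() != "robots":
            return
        for part in attr.get("content", "").split(","):
            part = part.strip().lower()
            if part:
                self.directives.add(part)


def has_noindex(html):
    """Return True if the markup carries a robots meta tag with noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


if __name__ == "__main__":
    sample = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
    print(has_noindex(sample))
```

Running this over the saved HTML of the OOP page is a quick way to confirm the tag actually made it into the head.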
If you are displaying that image on other URLs that used to have products on them but have gone OOP, then those multiple URLs and pages would be duplicates. Again, if you add the above code to the head section, it removes the problem. You could also 301 redirect the URL of the product page to the OOP page. For example, if you had a page for a product at http://www.audiobooksonline.com/examplerecord.html that is now OOP, you could put in a 301 redirect to the http://www.audiobooksonline.com/out-of-publication-audio-books-book-audiobook-audiobooks.html page and it wouldn't be duplicate. You can learn more about redirection here.
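To check what a product URL actually returns, it can help to walk the redirects one hop at a time. The following is a sketch under stated assumptions: `fetch_head` and `trace_redirects` are hypothetical helper names, the server is assumed to answer HEAD requests, and the fetch function is injectable so the logic can be tried offline with stubbed responses.

```python
import http.client
from urllib.parse import urljoin, urlsplit

REDIRECT_CODES = (301, 302, 303, 307, 308)


def fetch_head(url):
    """Send one HEAD request without following redirects.

    Returns (status_code, location_header_or_None).
    """
    parts = urlsplit(url)
    conn_cls = (http.client.HTTPSConnection if parts.scheme == "https"
                else http.client.HTTPConnection)
    conn = conn_cls(parts.netloc, timeout=10)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("HEAD", path)
        resp = conn.getresponse()
        return resp.status, resp.getheader("Location")
    finally:
        conn.close()


def trace_redirects(url, fetch=fetch_head, max_hops=5):
    """Follow redirects hop by hop and return [(url, status), ...]."""
    chain = []
    for _ in range(max_hops + 1):
        status, location = fetch(url)
        chain.append((url, status))
        if status not in REDIRECT_CODES or not location:
            break
        url = urljoin(url, location)  # the Location header may be relative
    return chain


if __name__ == "__main__":
    # Stubbed responses (made-up URLs) so the example runs without a network.
    fake = {
        "http://example.com/old-title.html": (301, "/oop.html"),
        "http://example.com/oop.html": (200, None),
    }
    for hop in trace_redirects("http://example.com/old-title.html",
                               fetch=lambda u: fake[u]):
        print(hop)
```

A single 301 followed by a 200 matches the setup described here. A chain with more than one redirect would suggest pointing the first URL straight at the final page.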
Hope this helps.