What constitutes a duplicate page?
-
Hi, I have a question about duplicate page content and wondered if someone is able to shed some light on what actually constitutes a "duplicate". We publish hundreds of bus timetable pages that have similar, but technically with unique urls and content. For example http://www.intercity.co.nz/travel-info/timetable/lookup/akl
The template of the page is oblivious duplicated, but the vast majority of the content is unique to each page, with data being refreshed each night.
Our crawl shows these as duplicate page errors, but is this just a generalisation because the urls are very similar? (only the last three characters change for each page - in this case /akl)
Thanks in advance.
-
While you visit those pages that SEOMoz tags as duplicate, is the content duplicate? If it isn't, then there's nothing to worry about.
We have duplicate content notices too, and those are usually tag pages that at a certain moment have the same posts within the listing as all those posts use the same tag.
It would be great if you post a couple pages that are reporting duplicate content and where it can be found so we can take a look at that.
-
Thanks Federico, the page is republished each night to capture any timetable or stop changes. We just can't figure out why it is being tagged as duplicate content?
-
Hi Moosa, yes we refresh the data feed to the timetable page each night although in most cases the data does not change. What we can't understand is why the SEOMOZ crawl flags these pages as duplicates?
-
Not sure if I understand it correctly...I think you are saying that you create a new page every night for the new schedule! I mean if this is the case then why not you just simply refresh the information on the same page as technically Google will love it and duplication issue will be reduced to none.
-
My first guess is: if the information of the page is updated because the previous details are no longer valid, why not removing the old page entirely?
Anyway, removing or leaving the info there shouldn't cause any problem, the content isn't the same. But I guess for some days data does match previous dates, therefore my idea of removing the old (useless) time tables.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Webshop landing pages and product pages
Hi, I am doing extensive keyword research for the SEO of a big webshop. Since this shop sells technical books and software (legal books, tax software and so on), I come across a lot of very specific keywords for separate products. Isn't it better to try and rank in the SERP's with all the separate product pages, instead of with the landing (category) pages?
Intermediate & Advanced SEO | | Mat_C0 -
Article page canonicalization
Hey there, A client rents all kinds of party articles, like plates, bowles, etc. Currently, al his article pages have canonicals to their parent category pages, supposedly to have any pagevalue flow to these category pages, (which are much more relevant for SEO). Is there anyone who agrees with this method? I think a noindex,follow would be a better measure to prevent Google from accessing all these 'low value' article pages. Besides, a canonical should indicate that page A and B are (almost) identical, which they most certainly are not in this case. What are your thoughts?
Intermediate & Advanced SEO | | Adriaan.Multiply0 -
Dynamic pages
Hello Team, How can we create dynamic pages or more pages on website but maintaining SEO standards.
Intermediate & Advanced SEO | | Obbserv0 -
Pages with Duplicate Page Content (with and without www)
How can we resolve pages with duplicate page content? With and without www?
Intermediate & Advanced SEO | | directiq
Thanks in advance.0 -
Category Pages For Distributing Authority But Not Creating Duplicate Content
I read this interesting moz guide: http://moz.com/learn/seo/robotstxt, which I think answered my question but I just want to make sure. I take it to mean that if I have category pages with nothing but duplicate content (lists of other pages (h1 title/on-page description and links to same) and that I still want the category pages to distribute their link authority to the individual pages, then I should leave the category pages in the site map and meta noindex them, rather than robots.txt them. Is that correct? Again, don't want the category pages to index or have a duplicate content issue, but do want the category pages to be crawled enough to distribute their link authority to individual pages. Given the scope of the site (thousands of pages and hundreds of categories), I just want to make sure I have that right. Up until my recent efforts on this, some of the category pages have been robot.txt'd out and still in the site map, while others (with different url structure) have been in the sitemap, but not robots.txt'd out. Thanks! Best.. Mike
Intermediate & Advanced SEO | | 945010 -
Category Pages up - Product Pages down... what would help?
Hi I mentioned yesterday how one of our sites was losing rank on product pages. What steps do you take to improve the SERPS of product pages, in this case home/category/product is the tree. There isn't really any internal linking, except one link from the category page to each product, would setting up a host of internal links perhaps "similar products" linking them together be a place to start? How can I improve my ranking of these more deeply internal pages? Not just internal links?
Intermediate & Advanced SEO | | xoffie0 -
Wordpress Duplicate Content
We have recently moved our company's blog to Wordpress on a subdomain (we utilize the Yoast SEO plugin). We are now experiencing an ever-growing volume of crawl errors (nearly 300 4xx now) for pages that do not exist to begin with. I believe it may have something to do with having the blog on a subdomain and/or our yoast seo plugin's indexation archives (author, category, etc) --- we currently have Subpages of archives and taxonomies, and category archives in use. I'm not as familiar with Wordpress and the Yoast SEO plugin as I am with other CMS' so any help in this matter would be greatly appreciated. I can PM further info if necessary. Thank you for the help in advance.
Intermediate & Advanced SEO | | BethA0 -
Does rel=canonical fix duplicate page titles?
I implemented rel=canonical on our pages which helped a lot, but my latest Moz crawl is still showing lots of duplicate page titles (2,000+). There are other ways to get to this page (depending on what feature you clicked, it will have a different URL) but will have the same page title. Does having rel=canonical in place fix the duplicate page title problem, or do I need to change something else? I was under the impression that the canonical tag would address this by telling the crawler which URL was the URL and the crawler would only use that one for the page title.
Intermediate & Advanced SEO | | askotzko0