Multiple Instances of the Same Article
-
Hi, I'm having a problem with duplicate article listings that I cannot solve.
As you will see from the attached images, I have a page with multiple variants of the same URL in the Google index, as well as duplicate title tags reported in the Search Console of Webmaster Tools. For several months I have been using canonical link tags to resolve the issue, i.e. declaring that all variants point to a single URL, but the problem remains. It's not just old articles that stay like that; even new articles show the same behaviour as soon as they are published, even though they are presented correctly with canonical links and in the sitemap, as you will see from the example below.
Example URLs of the attached Image
-
All URLs belonging to the same article ID have the same canonical link inside the HTML head.
-
Also because I have a separate mobile site, I also include in every desktop URL an "alternate" link to the mobile site.
-
On the mobile version of the site, I have another canonical link pointing back to the original desktop URL. So the mobile version of the article also has
-
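As a rough way to verify the setup described above, the sketch below pulls the canonical and alternate links out of a page's head and compares them. It is a minimal Python illustration, not the site's actual code: the `HeadLinkParser` class, the `head_links` helper, and the two sample head fragments are hypothetical, and the URLs are taken from the sitemap entry quoted in this post.

```python
from html.parser import HTMLParser


class HeadLinkParser(HTMLParser):
    """Collect rel="canonical" and rel="alternate" <link> tags from a page."""

    def __init__(self):
        super().__init__()
        self.canonical = None
        self.alternates = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        rel = (attr.get("rel") or "").lower()
        if rel == "canonical":
            self.canonical = attr.get("href")
        elif rel == "alternate":
            self.alternates.append((attr.get("media"), attr.get("href")))


def head_links(html):
    """Return (canonical_href, [(media, href), ...]) for one HTML document."""
    parser = HeadLinkParser()
    parser.feed(html)
    parser.close()
    return parser.canonical, parser.alternates


# Hypothetical head fragments (assumed markup); only the URLs come from the post.
DESKTOP_HEAD = """<head>
<link rel="canonical" href="http://www.neakriti.gr/?page=newsdetail&amp;DocID=1300357">
<link rel="alternate" media="only screen and (max-width: 640px)"
      href="http://mobile.neakriti.gr/fullarticle.php?docid=1300357">
</head>"""

MOBILE_HEAD = """<head>
<link rel="canonical" href="http://www.neakriti.gr/?page=newsdetail&amp;DocID=1300357">
</head>"""

desktop_canonical, desktop_alternates = head_links(DESKTOP_HEAD)
mobile_canonical, _ = head_links(MOBILE_HEAD)

# Both versions should name the same canonical URL.
print(desktop_canonical == mobile_canonical)
print(desktop_alternates)
```

Running the same check against each fetched URL variant would show whether any of them declares a different canonical or none at all.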
Now, when it comes to the xml sitemap, I pass only the canonical URL and none of the other possible variants (to avoid multiple indexing), and I also point to the mobile version of the article.
<url><loc>http://www.neakriti.gr/?page=newsdetail&amp;DocID=1300357</loc>
<xhtml:link rel="alternate" media="only screen and (max-width: 640px)" href="http://mobile.neakriti.gr/fullarticle.php?docid=1300357" />
<lastmod>2016-02-20T21:44:05Z</lastmod>
<priority>0.6</priority>
<changefreq>monthly</changefreq>
<image:image><image:loc>http://www.neakriti.gr/NewsASSET/neakriti-news-image.aspx?Doc=1300297</image:loc>
<image:title>ΟΦΗ</image:title></image:image></url>
The above Sitemap snippet Source: http://www.neakriti.gr/WebServices/sitemap.aspx?&year=2016&month=2
The main sitemap of the website: http://www.neakriti.gr/WebServices/sitemap-index.aspx
Despite my efforts, you can see that Webmaster Tools reports three variants of the desktop URL, and Google search reports four URLs (three different desktop variants and the mobile URL).
I get this when I type the article code into Google search to see what is indexed: site:neakriti.gr 1300297
So far I believe I have done all I could to resolve the issue by adding canonical links and alternate links, as well as a correct sitemap.xml entry. I don't know what else to do... This was done several months ago and there is absolutely no improvement.
Here is a more recent example of an article added 5 days ago (10-April-2016). Just type
site:neakriti.gr 1300357
into Google search and you will see the variants of the same article in the Google cache. Open the cached pages and you will see that they contain the canonical link, but Google doesn't obey the direction given there.
Please help!
-
-
Hi all,
Sorry for the delay. I am away on a business trip, which is why I have not replied for the past few days.
I can confirm that the latest entries (those after March) come as a single instance.
However, there are some minor exceptions, like the one here.
Example of a recent article indexed under both the desktop URL (even though that desktop URL is not the canonical) and the mobile URL:
https://www.google.gr/search?q=site:neakriti.gr&biw=1527&bih=899&source=lnms&sa=X&ved=0ahUKEwiIxODGt5_MAhUsKpoKHdcUAkYQ_AUIBigA&dpr=1.1#q=site:neakriti.gr+1315539&tbs=qdr:w&filter=0
Also, I noticed that with the "alternate" and "canonical" links in place, the mobile version of the site doesn't get indexed anymore (with minor exceptions like the one above).
-
Hi Ioannis!
How's this going? We'd love an update.
-
Hmm, interestingly, when I followed your link, I only saw the canonical version of the article. Is this what you're seeing now?
Also, in response to your earlier question, yes, you can disallow parameters with robots.txt. If these canonical issues continue, that may be the best next step.
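On the robots.txt question: as far as I know, Google does not support full regular expressions in robots.txt, only `*` (any run of characters) and a trailing `$` (end of URL). The Python sketch below imitates that matching so candidate rules can be tried against sample URLs before going live. The helper names, the two Disallow patterns, and the sample paths are assumptions for illustration, and the sketch ignores Allow rules and rule precedence.

```python
import re


def robots_pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' and a trailing '$') to a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape everything literally, then let each '*' match any characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query, disallow_patterns):
    """True if any Disallow pattern matches from the start of the path."""
    return any(
        robots_pattern_to_regex(p).match(path_and_query)
        for p in disallow_patterns
    )


# Assumed rules: block the printer-friendly value of the "page" parameter,
# whether it is the first query parameter or a later one.
DISALLOW = ["/*?page=printerfriendly", "/*&page=printerfriendly"]

# Assumed URL shapes, based on the "page" values mentioned in the thread.
samples = [
    "/?page=newsdetail&DocID=1300357",
    "/?page=printerfriendly&DocID=1300357",
    "/default.aspx?page=printerfriendly&DocID=1300357",
]

for path in samples:
    print(path, is_disallowed(path, DISALLOW))
```

One caveat to keep in mind: a URL blocked in robots.txt is not crawled, so Google can no longer read a canonical link or noindex meta on that page.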
-
Thank you for your response, I will take a look at this.
However, I have a few questions regarding your suggestion:
- Since I have canonical links on the loading page, doesn't that resolve the issue?
- The printerfriendly variation has a noindex meta in the head. Shouldn't that be taken into account?
- Can I put regular expressions in my robots.txt? How can I block URL params? I ask because printerfriendly and newsdetailsports are values of the "page" GET param.
In fact, the printerfriendly page contains both a canonical link and a noindex meta, to tell search engines not to index the content and to let them know where the original content exists.
-
Hi there
The printer friendly URL is coming from the print this article button (attached) and the /default.aspx URL is coming from the ^ TOP button (attached).
What you could do is use your robots.txt to ignore these URLs. You can also tell Google which URL parameters to ignore, but please be EXTREMELY careful doing this. It's a hatchet, not a fine-tooth comb.
Let me know if you have any questions or comments, good luck!
Patrick