Duplicate Content
-
HI There,
Hoping someone can help me - before i damage my desk banging my head.
Getting notifications from ahrefs and Moz for duplicate content. I have no idea where these weird urls have came from , but they do take us to the correct page (but it seems a duplicate of this page).
correct url http://www.acsilver.co.uk/shop/pc/Antique-Vintage-Rings-c152.htm
Incorrect url http://www.acsilver.co.uk/shop/pc/vintage-Vintage-Rings- c152.htm
This is showing for most of our store categories
Desperate for help as to what could be causing these issues. I have a technical member of the ecommerce software go through the large sitemap files and they assured me it wasn't linked to the sitemap files.
Gemma
-
Hi Gemma,
Strange! Typically, %20 is the symbol that content management systems use to convert spaces into allowable characters in URLs. Have you found any URLs that were written in the HTML with an accidental space?
That said, I know I'm a Moz associate and all, but Moz and Ahrefs are not nearly as good at understanding the web as Google; it's completely possible that these are errors that their crawlers are picking up, but Google isn't having a problem. Try searching for "site:[duplicate URL]" to see if Google is indexing this "duplicate content." I just checked with the example you provided, and it's not in Google's index.
If some other duplicate content URLs are in Google's index, then I'd use Google Analytics to determine where the traffic is coming from to these pages, in order to find where the URLs are written incorrectly.
Hope this helps!
Kristina
-
It seems strange that the weird url i get takes me to the right page but it shows the sites homepage meta information !! I have no clue why this would occur
-
Thanks for the suggestion. I will try and get to the cause of these urls and then if i cant get to the bottom of it i will look at adding 301's however it will mean adding a lot of them
-
Hi,
Thanks for taking the time to assist.
I have checked the internal and external links to the url and there aren't any, also checked sitemap and these links aren't present there.
Regarding https, it was switched on at one point for a matter of minutes as it was done in error.
Regarding the link you state we had issues with these in the past which were generated from an external site which we had no control over.
Im hitting blanks as to locating how these are being generated.
Can you tell me what screaming frog can show me that Moz and ahrefs software doesn't? I haven't used it before.
Gemma
-
In addition to what Bryan suggested, have you crawled your site using screaming frog or some other service.
Did you try going https: at some point??
Or... http://www.acsilver.co.uk/www.acsilver.co.uk/shop/pc/vintage-Vintage-Rings- c152.htm
-
It seems like something about either your search functions, or the e-commerce functionality, is causing these duplicate pages. Without having access to that information, I can tell you that the best option from my point of view would be to redirect all of the "incorrect" urls to the "correct" ones using 301s. I suggest taking a look at this page on Google Search Console (formerly known as Webmaster Tools) if you're unfamiliar with how that works. No matter what platform your website uses, this should do you good.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Gallery Causing Duplicate Content Issues
Hi! I have a gallery on my website. When you click to view the next image it goes to a new page but the content is exactly the same as the first page. This is flagging up a duplicate content issue. What is the best way to fix this? Add a canonical tag to page 2,3,4 or add a noindex tag? I have found a lot of conflicting answers. Thanks in advance
Technical SEO | | emma19860 -
Content and url duplication?
One of the campaign tools flags one of my clients sites as having lots of duplicates. This is true in the sense the content is sort of boiler plate but with the different countries wording changed. The is same with the urls but they are different in the sense a couple of words have changed in the url`s. So its not the case of a cms or server issue as this seomoz advises. It doesnt need 301`s! Thing is in the niche, freight, transport operators, shipping, I can see many other sites doing the same thing and those sites have lots of similar pages ranking very well. In fact one site has over 300 keywords ranked on page 1-2, but it is a large site with an 12yo domain, which clearly helps. Of course having every page content unique is important, however, i suppose it is better than copy n paste from other sites. So its unique in that sense. Im hoping to convince the site owner to change the content over time for every country. A long process. My biggest problem for understanding duplication issues is that every tabloid or broadsheet media website would be canned from google as quite often they scrape Reuters or re-publish standard press releases on their sites as newsworthy content. So i have great doubt that there is a penalty for it. You only have to look and you can see media sites duplication everywhere, everyday, but they get ranked. I just think that google dont rank the worst cases of spammy duplication. They still index though I notice. So considering the business niche has very much the same content layout replicated content, which rank well, is this duplicate flag such a great worry? Many businesses sell the same service to many locations and its virtually impossible to re write the services in a dozen or so different ways.
Technical SEO | | xtopher660 -
Question about duplicate content in crawl reports
Okay, this one's a doozie: My crawl report is listing all of these as separate URLs with identical duplicate content issues, even though they are all the home page and the one that is http://www.ccisolutions.com (the preferred URL) has a canonical tag of rel= http://www.ccisolutions.com: http://www.ccisolutions.com http://ccisolutions.com http://www.ccisolutions.com/StoreFront/IAFDispatcher?iafAction=showMain I will add that OSE is recognizing that there is a 301-redirect on http://ccisolutions.com, but the duplicate content report doesn't seem to recognize the redirect. Also, every single one of our 404-error pages (we have set up a custom 404 page) is being identified as having duplicate content. The duplicate content on all of them is identical. Where do I even begin sorting this out? Any suggestions on how/why this is happening? Thanks!
Technical SEO | | danatanseo1 -
Tags and Duplicate Content
Just wondering - for a lot of our sites we use tags as a way of re-grouping articles / news / blogs so all of the info on say 'government grants' can be found on one page. These /tag pages often come up with duplicate content errors, is it a big issue, how can we minimnise that?
Technical SEO | | salemtas0 -
Large Scale Ecommerce. How To Deal With Duplicate Content
Hi, One of our clients has a store with over 30,000 indexed pages but less then 10,000 individual products and make a few hundred static pages. Ive crawled the site in Xenu (it took 12 hours!) and found it to by a complex mess caused by years of hack add ons which has caused duplicate pages, and weird dynamic parameters being indexed The inbound link structure is diversified over duplicate pages, PDFS, images so I need to be careful in treating everything correctly. I can likely identify & segment blocks of 'thousands' of URLs and parameters which need to be blocked, Im just not entirely sure the best method. Dynamic Parameters I can see the option in GWT to block these - is it that simple? (do I need to ensure they are deinxeded and 301d? Duplicate Pages Would the best approach be to mass 301 these pages and then apply a no-index tag and wait for it to be crawled? Thanks for your help.
Technical SEO | | LukeyJamo0 -
Duplicate Content
Hello All, my first web crawl has come back with a duplicate content warning for www.simodal.com and www.simodal.com/index.htm slightly mystified! thanks paul
Technical SEO | | simodal0 -
50+ duplicate content pages - Do we remove them all or 301?
We are working on a site that has 50+ pages that all have duplicate content (1 for each state, pretty much). Should we 301 all 50 of the URLs to one URL or should we just completely get rid of all the pages? Are there any steps to take when completely removing pages completely? (submit sitemap to google webmaster tools, etc) thanks!
Technical SEO | | Motava0 -
Duplicate content question with PDF
Hi, I manage a property listing website which was recently revamped, but which has some on-site optimization weaknesses and issues. For each property listing like http://www.selectcaribbean.com/property/147.html there is an equivalent PDF version spidered by google. The page looks like this http://www.selectcaribbean.com/pdf1.php?pid=147 my question is: Can this create a duplicate content penalty? If yes, should I ban these pages from being spidered by google in the robots.txt or should I make these link nofollow?
Technical SEO | | multilang0