Bad Duplicate content issue
-
Hi,
for grappa.com I have about 2700 warnings of duplicate page content. My CMS generates long url like: http://www.grappa.com/deu/news.php/categoria=latest_news/idsottocat=5 and http://www.grappa.com/deu/news.php/categoria%3Dlatest_news/idsottocat%3D5 (this is a duplicated content).
What's the best solution to fix this problem? Do I have to set up a 301 redirect for all the duplicated pages or insert the rel=canonical or rel=prev,next ?
It's complicated becouse it's a multilingual site, and it's my first time dealing with this stuff.
Thanks in advance.
-
Your original question had two URLs, one of where the "=" was replaced with "%3D". If that was an actual crawled URL (and not a copy-and-paste error), then it's likely coming from bad links within your own site. That's malformed, so you should definitely check it out. A desktop crawler like Xenu or Screaming Frog could help track down the culprit:
http://www.seomoz.org/blog/crawler-faceoff-xenu-vs-screaming-frog
-
Thanks Peter for the reply!
What do you mean by "bad internal links" ?
I'm well ranked so based on your suggestions what I have to do is to set up properly the rel=canonical tag and rel=alternate, right? I'm still bit scarred about duplicate content report in the SEOmoz campaign. 2.700 warnings is kind of a big deal.
-
One of these URLs just seems to be the encoded version of the other, which should appear as identical. I'm not seeing any evidence that Google is indexing both. I have a feeling that you may have some bad internal links that need to be fixed. I'm seeing the English/German version of this page in the index, but that should be fine. As Khem said, you could use .
Be careful about converting to a "static" version. It's not that it's a bad idea, but the problem is that you could end up turning 2 duplicates into 3 duplicates. You'll still have to canonicalize the dynamic version to the static version. In other words, done badly, changing your URLs could actually make the problem worse.
-
Rel=prev/next is for paginated series, such as internal search results. While I see you have a pagination parameter on these pages ("idpagina=13"), it doesn't seem like this is a series or that the two pages are even duplicates. I'm a bit confused on the intent, but my initial reaction is that rel=prev/next doesn't fit the bill here.
-
As long as you are managing a multilingual site, it is always recommended to use rel="alternative" even if you're redirecting your website.
For next, prev, don't use, unless you feel it is really required, as I could not find the need May be I missed something, could you be please bit more specific?
-
Thanks Raj! I will for sure re-write the dynamic urls into static and that's a starting point. Take for example these pages:
http://www.grappa.com/eng/grappa.php/argomento=grappa_in_italy/idsezione=1/idpagina=13
Do you suggest in this case to use rel=nex, prev ?
I thought about using rel="alternate" for the multilingual issue, but now my site redirects automatically from www.grappa.com to www.grappa.com/eng/index.php. is that bad for SEO? Should I put rel="canonical" to www.grappa.com ?
Many thanks
-
Hey Nicola, ~2700 is a huge no.
I would suggest you to talk to you programmer/developer to re-write the dynamic URLs into static, which I am sure they can easily do.
second thing, make sure to delete all the duplicate pages or use rel=unfollow. using 301 for all the duplicate pages is not a bad option but not a permanent solutions. It is better to re-write all the dynamics urls into static one, delete all the dups pages and then 301 redirect all the deleted pages to the originals.
for multilingual you can use the following code:
The tag enables you to say, “This is for Spain. this is for Germany
The rel="alternate" hreflang="es" annotations help Google serve the Spanish language or regional URL to searchers
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is My Boilerplate Product Description Causing Duplicate Content Issues?
I have an e-commerce store with 20,000+ one-of-a-kind products. We only have one of each product, and once a product is sold we will never restock it. So I really have no intention to have these product pages showing up in SERPs. Each product has a boilerplate description that the product's unique attributes (style, color, size) are plugged into. But a few sentences of the description are exactly the same across all products. Google Webmaster Tools doesn't report any duplicate content. My Moz Crawl Report show 29 of these products as having duplicate content. But a Google search using the site operator and some text from the boilerplate description turns up 16,400 product pages from my site. Could this duplicate content be hurting my SERPs for other pages on the site that I am trying to rank? As I said, I'm not concerned about ranking for these products pages. Should I make them "rel=canonical" to their respective product categories? Or use "noindex, follow" on every product? Or should I not worry about it?
Technical SEO | | znagle0 -
Duplicate Footer Content
A client I just took over is having some duplicate content issues. At the top of each page he has about 200 words of unique content. Below this is are three big tables of text that talks about his services, history, etc. This table is pulled into the middle of every page using php. So, he has the exact same three big table of text across every page. What should I do to eliminate the dup content. I thought about removing the script then just rewriting the table of text on every page... Is there a better solution? Any ideas would be greatly appreciated. Thanks!
Technical SEO | | BigStereo0 -
Whats with the backslash in the url adding as duplicate content?
Is this a bug or something that needs to be addressed? If so, just use a redirect?
Technical SEO | | Boogily0 -
Duplicate content, how to solve?
I have about 400 errors about duplicate content on my seomoz dashboard. However I have no idea how to solve this, I have 2 main scenarios of duplication in my site: Scenario 1: http://www.theprinterdepo.com/catalogsearch/advanced/result/?name=64MB+SDRAM+DIMM+MEMORY+MODULE&sku=&price%5Bfrom%5D=&price%5Bto%5D=&category= 3 products with the same title, but different product models, as you can note is has the same price as well. Some printers use a different memory product module. So I just cant delete 2 products. Scenario 2: toners http://www.theprinterdepo.com/brother-high-capacity-black-toner-cartridge-compatible-73 http://www.theprinterdepo.com/brother-high-capacity-black-toner-cartridge-compatible-75 In this scenario, products have a different title but the same price. Again, in this scenario the 2 products are different. Thank you
Technical SEO | | levalencia10 -
I'm getting a Duplicate Content error in my Pro Dashboard for 2 versions of my Homepage. What is the best way to handle this issue?
Hi SEOMoz,I am trying to fix the final issues in my site crawl. One that confuses me is this canonical homepage URL fix. It says I have duplicate content on the following pages:http://www.accupos.com/http://www.accupos.com/index.phpWhat would be the best way to fix this problem? (...the first URL has a higher page authority by 10 points and 100+ more inbound links).Respectfully Yours,Derek M.
Technical SEO | | DerekM880 -
Thin/Duplicate Content
Hi Guys, So here's the deal, my team and I just acquired a new site using some questionable tactics. Only about 5% of the entire site is actually written by humans the rest of the 40k + (and is increasing by 1-2k auto gen pages a day)pages are all autogen + thin content. I'm trying to convince the powers that be that we cannot continue to do this. Now i'm aware of the issue but my question is what is the best way to deal with this. Should I noindex these pages at the directory level? Should I 301 them to the most relevant section where actual valuable content exists. So far it doesn't seem like Google has caught on to this yet and I want to fix the issue while not raising any more red flags in the process. Thanks!
Technical SEO | | DPASeo0 -
Duplicate Content Issue
Hi Everyone, I ran into a problem I didn't know I had (Thanks to the seomoz tool) regarding duplicate content. my site is oxford ms homes.net and when I built the site, the web developer used php to build it. After he was done I saw that the URL's looking like this "/blake_listings.php?page=0" and I wanted them like this "/blakes-listings" He changed them with no problem and he did the same with all 300 pages or so that I have on the site. I just found using the crawl diagnostics tool that I have like 3,000 duplicate content issues. Is there an easy fix to this at all or does he have to go in and 301 Redirect EVERY SINGLE URL? Thanks for any help you can give.
Technical SEO | | blake-766240 -
Up to my you-know-what in duplicate content
Working on a forum site that has multiple versions of the URL indexed. The WWW version is a top 3 and 5 contender in the google results for the domain keyword. All versions of the forum have the same PR, but but the non-WWW version has 3,400 pages indexed in google, and the WWW has 2,100. Even worse yet, there's a completely seperate domain (PR4) that has the forum as a subdomain with 2,700 pages indexed in google. The dupe content gets completely overwhelming to think about when it comes to the PR4 domain, so I'll just ask what you think I should do with the forum. Get rid of the subdomain version, and sometimes link between two obviously related sites or get rid of the highly targeted keyword domain? Also what's better, having the targeted keyword on the front of Google with only 2,100 indexed pages or having lower rankings with 3,400 indexed pages? Thanks.
Technical SEO | | Hondaspeder0