Duplicated content detected with MOZ crawl with canonical applied
-
Hi there!
I have a slight problem.
I have a site with Joomla 3.3 that we recently migrated from 2.5. Joomla, for some reason that I don't really understand, creates hundreds of odd URLs for the site, like
mydomain.com/en -> Joomla creates en/home/149-xxx-xxx/xxxxxx-xxxxxx, which links to the first one.
The new version 3.3 is aware of this bug and applies a rel=canonical to the ones created "artificially", so they should not be identified as duplicates. Sample piece of code: en/home/149-all-en/xxxxxxx-xxxxxx" rel="canonical" /
The MOZ crawler still identifies these as duplicates, so I have thousands of duplicated pages, all with the same titles, content, etc., all of them created by Joomla. My site still has good SEO results and I cannot see any penalties, but I am a bit concerned they may come in the future.
Can anyone explain to me what is happening?
Thank you in advance for your time,
-
If it's a period of 2 weeks and you're going to do it anyways, I would just make the new content and not go to the expense of setting up redirects and then taking them down, which can cause issues when you plan on recreating a URL.
-
Thank you for your time!
We are going to set up 301 redirects (one colleague suggested importing them directly into the redirects table in the DB) from those duplicated pages until Joomla has a native solution and we have the time to write all the unique content, to avoid penalties.
At least that would solve the problem temporarily; it will take 2 weeks to write all the unique content.
Would that make sense?
Have a nice weekend!
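For anyone preparing a bulk list of redirects like the one described above, here is a minimal Python sketch of how the old/new pairs could be generated before importing them anywhere. It assumes the generated duplicates always look like /<lang>/<menu-alias>/<id>-<category>/<article-alias> (the shape of the two examples posted in this thread) and that each one should go to its language root. The pattern, the function names and the CSV column names are illustrative assumptions, not Joomla's own format.

```python
import csv
import io
import re
from urllib.parse import urlsplit

# Assumed shape of a generated duplicate (taken from the examples in this thread):
# /<lang>/<menu-alias>/<id>-<category>/<article-alias>
GENERATED = re.compile(r"^/(?P<lang>[a-z]{2})/[^/]+/\d+-[^/]+/[^/]+/?$")


def redirect_target(url):
    """Return the language root a generated URL would redirect to, or None."""
    parts = urlsplit(url)
    match = GENERATED.match(parts.path)
    if match is None:
        return None  # not a generated duplicate, leave it alone
    return f"{parts.scheme}://{parts.netloc}/{match.group('lang')}"


def build_redirect_csv(urls):
    """Return CSV text with one old_url,new_url row per generated duplicate."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["old_url", "new_url"])  # hypothetical column names
    for url in urls:
        target = redirect_target(url)
        if target is not None:
            writer.writerow([url, target])
    return buffer.getvalue()


if __name__ == "__main__":
    crawled = [
        "http://www.spain-internship.com/en/home/149-all-en/placement-spain",
        "http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fi",
        "http://www.spain-internship.com/en/internships-in-salamanca",
    ]
    print(build_redirect_csv(crawled))
```

Real pages such as /en/internships-in-salamanca do not match the pattern and are skipped, so it is worth reviewing the resulting list by hand before importing it.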
-
I personally would not generate new language sections unless the content has been translated and localized on those pages. Right now your Spanish homepage has English content in the body, so I would view this as incomplete. Ideally you'd translate the entire page for those sections.
When you do that, you'll want to use hreflang, not canonicals, to indicate different versions of the same content.
So, my recommendation is (A) get rid of the Spanish content sections which would solve the duplication problem, or (B) finish translating the content and then install hreflang code, which would also solve the duplication problem.
Unfortunately I don't know of a good hreflang tool for Joomla specifically.
Let me know if that makes sense?
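To make option (B) a bit more concrete, here is a minimal Python sketch that builds the hreflang tags for one page once its translations exist. The function name and the choice of English as the x-default version are assumptions for illustration; the same full set of tags would go into the head of every language version of the page, including the page itself.

```python
from html import escape


def hreflang_links(variants, default_lang=None):
    """Build <link rel="alternate" hreflang=...> tags for one page.

    variants maps a language code to the URL of that language version.
    """
    tags = [
        f'<link rel="alternate" hreflang="{escape(lang)}" href="{escape(url)}" />'
        for lang, url in sorted(variants.items())
    ]
    if default_lang in variants:
        # x-default marks the version to show when no language matches
        fallback = escape(variants[default_lang])
        tags.append(f'<link rel="alternate" hreflang="x-default" href="{fallback}" />')
    return tags


if __name__ == "__main__":
    salamanca = {
        "en": "http://www.spain-internship.com/en/internships-in-salamanca",
        "es": "http://www.spain-internship.com/es/internships-in-salamanca",
    }
    for tag in hreflang_links(salamanca, default_lang="en"):
        print(tag)
```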
-
Thank you Kane.
I would like to keep the content in all the languages, as I think it is useful for customers to easily reach certain areas.
The problem I always run into is the implementation. There are no really good canonical plugins (ones that would allow me to do a bulk import), and I am not advanced enough to do a 301 redirect in htaccess. Still, I would like someone on the NL or FI version who wants to find the Barcelona area to be able to see it.
Anything come to mind? Just to say, I tried SH404, which does all the work but rewrites the whole URL structure (not possible). I tried the canonical plugin at http://www.cmsplugin.com/products/components/4-canonical-url, which solves the duplication by language but not the random URLs created by 3.3.
Then I decided to leave the plugin I mentioned before in place; it deletes all the automatically generated duplicate URLs but does not solve the language problem. So, here I am.
Any suggestions?
-
Also, if you decide to keep the /es/ section of the website then you'll need to look into hreflang instead of canonical tags, because /es/ and /en/ will not be duplicate content once they're translated.
Read this Q&A from Google for details - https://sites.google.com/site/webmasterhelpforum/en/faq-internationalisation#q20
-
Hey Jose,
If you have an /es/ subfolder then ideally you would be translating that content to Spanish, not canonicalizing that content back to the English version.
I can see from http://www.spain-internship.com/es/internships-in-salamanca that not all /es/ pages are translated - is this true across the entire website?
If you don't have any Spanish content, then you should just kill off the /es/ version entirely.
-
Hi there,
Thanks for the update. Now that you have told me the problem, I found out this is a known bug in Joomla and I am working on it.
I found a plugin, http://styleware.eu/store/item/26-styleware-content-canonical-plugin, that points all the automatically generated duplicate URLs to the home page with a canonical. Sample:
http://www.spain-internship.com/en/home/149-all-en/placement-spain
Now with the link http://www.spain-internship.com" rel="canonical" />. This solves the problem of the core canonical bug.
Would this be a proper solution? Now I only have to change all the ones duplicated due to the language configuration, and block them in robots or add a canonical, but as long as I control it, it is OK.
Please let me know if this would be a proper solution.
Thank you in advance for your help. If I can help you with something at some point, here we are!
-
OK, the problem is that your pages are all canonical to themselves. The canonical tag should point at the main page for the content, not at every page. For your first example, all pages that get their content from http://www.spain-internship.com/en need to have canonical tags pointing to that page. Instead, the copy page has this:
<link href="http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fi" rel="canonical" />
it should have
<link href="http://www.spain-internship.com/fi/" rel="canonical" />
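If you want to check this on more than one page, here is a minimal Python sketch that reads a page's HTML and reports whether its canonical tag points back at the page itself. It uses only the standard library; the class and function names are made up for illustration, and fetching the HTML is left out, so the sample strings below stand in for real page source.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Remember the href of the first <link rel="canonical"> in a document."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        rel = (attributes.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel and self.canonical is None:
            self.canonical = attributes.get("href")


def find_canonical(html):
    """Return the canonical URL declared in the HTML, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


def is_self_canonical(page_url, html):
    """True when the page names itself as canonical (ignoring a trailing slash)."""
    canonical = find_canonical(html)
    return canonical is not None and canonical.rstrip("/") == page_url.rstrip("/")


if __name__ == "__main__":
    copy_url = "http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fi"
    current = f'<head><link href="{copy_url}" rel="canonical" /></head>'
    wanted = '<head><link href="http://www.spain-internship.com/fi/" rel="canonical" /></head>'
    print(is_self_canonical(copy_url, current))  # the copy points at itself
    print(is_self_canonical(copy_url, wanted))   # the copy points at the main page
```

A generated copy that returns True here is still telling crawlers it is its own main version, which is the situation described above.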
-
I will provide a few so you can look!
Detected as duplicated:
http://www.spain-internship.com/en
http://www.spain-internship.com/en/home/149-all-en/placement-spain
Same here:
http://www.spain-internship.com/fi
http://www.spain-internship.com/fi/etusivu/186-all-fi/home-page-fi
http://www.spain-internship.com/en/internships-in-salamanca
http://www.spain-internship.com/es/internships-in-salamanca
The first one is the original. The rest have a canonical. They are still detected as duplicated.
-
Do you have an example of one of these generated pages as well? Everything looks fine on the main page.
-
Hey,
Yes, sure.
This is the duplicate of /en:
http://www.spain-internship.com/en/home/149-all-en/placement-spain
Thanks!
-
Do you have a link to one of these pages so we can look at how it is deploying the canonical onto the page?