Scanning For Duplicate Canonical Tags
-
I'm looking for a solution for identifying pages on a site that have either empty/undefined canonical tags, or duplicate canonical tags (meaning the tag occurs twice within the same page).
I've used Screaming Frog to view sitewide canonical values, but the tool cannot identify when pages use the tag twice, nor can it differentiate between pages that have an empty canonical tag and pages that have no canonical tag at all.
Any help finding a tool of some sort that can assist me in doing this would be much appreciated, as I'm working with tens of thousands of pages and can't do this manually.
-
Paul,
Thanks for your reply! I have used the paid version of Screaming Frog with regex to exclude pages with certain parameters, but I have not tried the custom queries.
Could you give me an example of a custom query that would find empty canonical tags? That would be extremely helpful.
-
I think Screaming Frog is still the solution you want, John, but it's not configured to do what you need "out of the box". You're going to need to write a custom query for Screaming Frog to run while it's indexing your site.
This capability is only available in the paid version of the tool, but you'll need the paid version anyway to be able to crawl 10,000 page sites as the free tool cuts out at 500 pages.
You'll find the Custom settings link under the Configuration tab in the top navigation bar of the tool. Essentially what you're doing is writing custom filters.
You'll need to write a regex (regular expression) that is capable of finding pages with no canonical tag at all, and another which is capable of finding empty canonical tags. If your regex-fu is really strong, you may be able to write a single expression to capture both these states.
Had you already tried the custom queries with Screaming Frog?
Paul
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Are ALL duplicate title tags bad??
We’ve had some success recently by reducing the number of duplicate title tags on our website. We have managed to fix all the simple cases but there are a number of stubborn examples that we don’t know how to fix. A lot of the duplicate tags come from the website’s forums. Many questions have been asked multiple times over the years where the user has phrased the question in the same way. This has led to many cases where different forums posts have the same title tag. For example, there are six title tags with the words ‘’need help”! These are being highlighted as duplicates and currently we have several thousand of these. Would this be a problem? I’d be tempted to say that we should leave them as they don’t seem unnatural to me. One solution other solution we are considering is to append the forum name to the question to any post after the original, falling back to appending the date if that doesn’t distinguish it. Do people think that this is a good solution to implement or would it be better to leave these duplicate title tags as they are? Any help would be appreciated 🙂
Intermediate & Advanced SEO | | RG_SEO0 -
Tabs and duplicate content?
We own this site http://www.discountstickerprinting.co.uk/ and just a little concerned as I right clicked open in new tab on the tab content section and it went to a new page For example if you right click on the price tab and click open in new tab you will end up with the url
Intermediate & Advanced SEO | | BobAnderson
http://www.discountstickerprinting.co.uk/#tabThree Does this mean that our content is being duplicated onto another page? If so what should I do?0 -
Is legacy duplicate content an issue?
I am looking for some proof, or at least evidence to whether or not sites are being hurt by duplicate content. The situation is, that there were 4 content rich newspaper/magazine style sites that were basically just reskins of each other. [ a tactic used under a previous regime 😉 ] The least busy of the sites has since been discontinued & 301d to one of the others, but the traffic was so low on the discontinued site as to be lost in noise, so it is unclear if that was any benefit. Now for the last ~2 years all the sites have had unique content going up, but there are still the archives of articles that are on all 3 remaining sites, now I would like to know whether to redirect, remove or rewrite the content, but it is a big decision - the number of duplicate articles? 263,114 ! Is there a chance this is hurting one or more of the sites? Is there anyway to prove it, short of actually doing the work?
Intermediate & Advanced SEO | | Fammy0 -
Canonical url issue
Canonical url issue My site https://ladydecosmetic.com on seomoz crawl showing duplicate page title, duplicate page content errors. I have downloaded the error reports csv and checked. From the report, The below url contains duplicate page content.
Intermediate & Advanced SEO | | trixmediainc
https://www.ladydecosmetic.com/unik-colours-lipstick-caribbean-peach-o-27-item-162&category_id=40&brands=66&click=brnd And other duplicate urls as per report are,
https://www.ladydecosmetic.com/unik-colours-lipstick-plum-red-o-14-item-157&category_id=40&click=colorsu&brands=66 https://www.ladydecosmetic.com/unik-colours-lipstick-plum-red-o-14-item-157&category_id=40 https://www.ladydecosmetic.com/unik-colours-lipstick-plum-red-o-14-item-157&category_id=40&brands=66&click=brnd But on every these url(all 4) I have set canonical url. That is the original url and an existing one(not 404). https://www.ladydecosmetic.com/unik-colours-lipstick-caribbean-peach-o-27-item-162&category_id=0 Then how this issues are showing like duplicate page content. Please give me an answer ASAP.0 -
Alt tags
What options do I have if all my images are pulled in by tags background images? It's a customized CMS and the designer put all images in the CSS so he would have more control over image size. I would like to somehow add description elements to the alt tags Thank, Eric
Intermediate & Advanced SEO | | SeaDrive0 -
Canonical URL Question
Hi Everyone I like to run this question by the community and get a second opinion on best practices for an issue that I ran into. I got two pages, Page A is the original page and Page B is the page with duplicate content. We already added** ="Page A**" />** to the duplicate content (Page B).** **Here is my question, since Page B is duplicate content and there is a link rel="canonical" added to it, would you put in the time to add meta tags and optimize the title of the page? Thanks in advance for all your help.**
Intermediate & Advanced SEO | | DRTBA0 -
Duplicate page content
Hi. I am getting error of having duplicate content on my website and pages its showing there are: www.mysitename.com www.mysitename.com/index.html As my best knowledge it only one page, I know this can be solved with some conical tag used in header, but do not know how. Can anyone please tell me about that code or any other way to get this solved. Thanks
Intermediate & Advanced SEO | | onlinetraffic0 -
H1 Tags
Quick and easy most likely - Just need to clear a few point. I understand each page within the site should only have one H1 tag which should be the most important one. I also believe these only effect google ranking very slightly? right? Currently my CMS is system is pulling the H1 tag in from the page and automatically using the page heading that is on the page IE) the heading used for the content. Should this be a keyword / key phrase instead? and will it be duplicate if i used the same one on various pages in my site? Cheers guys look forward to hearing your feedback
Intermediate & Advanced SEO | | wazza19850