Scanning For Duplicate Canonical Tags
-
I'm looking for a solution for identifying pages on a site that have either empty/undefined canonical tags, or duplicate canonical tags (meaning the tag occurs twice within the same page).
I've used Screaming Frog to view sitewide canonical values, but the tool cannot identify when pages use the tag twice, nor can it differentiate between pages that have an empty canonical tag and pages that have no canonical tag at all.
Any help finding a tool of some sort that can assist me in doing this would be much appreciated, as I'm working with tens of thousands of pages and can't do this manually.
-
Paul,
Thanks for your reply! I have used the paid version of Screaming Frog with regex to exclude pages with certain parameters, but I have not tried the custom queries.
Could you give me an example of a custom query that would find empty canonical tags? That would be extremely helpful.
-
I think Screaming Frog is still the solution you want, John, but it's not configured to do what you need "out of the box". You're going to need to write a custom query for Screaming Frog to run while it's crawling your site.
This capability is only available in the paid version of the tool, but you'll need the paid version anyway to crawl a site with tens of thousands of pages, as the free version stops at 500 URLs.
You'll find the Custom settings link under the Configuration tab in the top navigation bar of the tool. Essentially what you're doing is writing custom filters.
You'll need to write a regex (regular expression) that is capable of finding pages with no canonical tag at all, and another which is capable of finding empty canonical tags. If your regex-fu is really strong, you may be able to write a single expression to capture both these states.
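To give a rough idea of what those expressions might look like, here is a minimal sketch in Python rather than in Screaming Frog itself. It assumes you have each page's raw HTML as a string; the names `CANONICAL_TAG`, `HREF_VALUE`, and `classify_canonical` are made up for illustration, and the patterns are written for Python's `re` module, so they would probably need adapting before being pasted into a crawler's custom filter. It also counts the tags, which covers the "tag occurs twice" case from the original question.

```python
import re

# Matches any <link ...> tag whose rel attribute starts with "canonical"
# (single quotes, double quotes, or no quotes around the value).
CANONICAL_TAG = re.compile(
    r"""<link\b[^>]*\brel\s*=\s*["']?canonical["']?[^>]*>""",
    re.IGNORECASE,
)

# Captures the quoted href value inside a single tag.
HREF_VALUE = re.compile(r"""\bhref\s*=\s*(?:"([^"]*)"|'([^']*)')""", re.IGNORECASE)


def classify_canonical(html: str) -> str:
    """Return 'missing', 'duplicate', 'empty', or 'ok' for one page's HTML."""
    tags = CANONICAL_TAG.findall(html)
    if not tags:
        return "missing"      # no canonical tag at all
    if len(tags) > 1:
        return "duplicate"    # the tag occurs more than once on the page
    match = HREF_VALUE.search(tags[0])
    href = (match.group(1) or match.group(2) or "") if match else ""
    # A tag with no href, or an href that is blank, counts as empty.
    return "ok" if href.strip() else "empty"


if __name__ == "__main__":
    sample = '<head><link rel="canonical" href=""></head>'
    print(classify_canonical(sample))
```

Regex on HTML is a blunt instrument (it will also match tags inside comments, for example), so treat the results as a shortlist to spot-check rather than a definitive audit.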
Had you already tried the custom queries with Screaming Frog?
Paul