Will rel canonical tags remove previously indexed URLs?
-
Hello,
7 days ago, we implemented canonical tags to resolve duplicate content issues that had been caused by URL parameters. These "duplicate content" had already been indexed.
Now that the URLs have rel canonical tags in place, will Google automatically remove from its index the other URLs with the URL parameters?
I ask because we have been tracking the approximate number of URLs indexed by doing a site: search in Google, and we have barely noticed a decrease in URLs indexed.
Thanks.
-
Thanks.
I think I will monitor for the next 2-3 weeks, and if there still is a lot of unwanted URLS with parameters in the index, I will start requesting removals.
-
You have two options here:
Let Google sort it out (which they will -- but it may take time)
Remove the unnecessary URLs yourself via Webmaster Tool's URL removal tool.
-
Hi Andrea,
yep - we did that.
7 days ago, we implemented the canonical tags because URLs such aswww.example.com/widget?color=blue
www.example.com/widget?size=largewere being indexed, along with the 'real' URL
We resubmitted the sitemap (which has all the 'real' URLs) as well.
At this time, many URLs with parameters are still indexed. I guess after reading this article:
http://www.seomoz.org/blog/catastrophic-canonicalization
I was expecting the change to happen a little quicker...
I just want to confirm no other action is needed on our part.
I understand canonical tags would tell the crawlers which page to index when it finds them for the first time, but I also wanted to confirm that if all URLs are already indexed (because, at the time, no canonical tags were present) implementing the tags would be enough to have the unwanted URLs removed automatically from the index. -
A week isn't very long. It can take Google months to recrawl and drop URLs from an index. Google will figure it out, you just need to give it time. If you haven't done so, update your sitemap to include the tagged pages and resubmit via Google. That will signal them to recrawl your site and could speed up the process.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Home Page Being Indexed / Referral URLs /
I have a few questions related to home page URLs being indexed, canonicalization, and GA reporting... 1. I can view the home page by typing in domain.com , domain.com/ and domain.com/index.htm There are no redirects and it's canonicalized to point to domain.com/index.htm -- how important is it to have redirects? I don't want unnecessary redirects or canonical tags, but I noticed the trailing slash can sometimes be typed in manually on other pages, sometimes not. 2. When I do a site search (site:domain.com), sometimes the HP shows up as "domain.com/", never "domain.com/index.htm" or "domain.com", and sometimes the HP doesn't show up period. This seems to change several times a day, sometimes within 15 minutes. I have no idea what is causing it and I don't know if it has anything to do with #1. In a perfect world, I would ask for the /index.htm to be dropped and redirected to .com/, and the canonical to point to .com/ 3. I've noticed in GA I see / , /index.htm, and a weird Google referral URL (/index.htm?referrer=https://www.google.com/) all showing up as top pages. I think the / and /index.htm is because I haven't setup a default URL in GA, but I'm not sure what would cause the referrer. I tracked back when the referrer URL started to show up in the top pages, and it was right around the time they moved over to https://, so I'm not sure what the best option is to remove that. I know this is a lot - I appreciate any insight anyone can provide.
Technical SEO | | DigMS0 -
Canonical Url Structure Vs. Google Search View
I recently set up a new site and set the "preferred" domain in Google Webmasters to show URLs WITHOUT the WWW for google search purposes. In the confirmation email from google, this confused me: "This setting defines which host - www or not - should be considered the canonical host when indexing your site." In the website, we have cononical URLS at the top of every page in the header, but still have the WWW in those. Any issues with that?
Technical SEO | | vikasnwu0 -
Removed Subdomain Sites Still in Google Index
Hey guys, I've got kind of a strange situation going on and I can't seem to find it addressed anywhere. I have a site that at one point had several development sites set up at subdomains. Those sites have since launched on their own domains, but the subdomain sites are still showing up in the Google index. However, if you look at the cached version of pages on these non-existent subdomains, it lists the NEW url, not the dev one in the little blurb that says "This is Google's cached version of www.correcturl.com." Clearly Google recognizes that the content resides at the new location, so how come the old pages are still in the index? Attempting to visit one of them gives a "Server Not Found" error, so they are definitely gone. This is happening to a couple of sites, one that was launched over a year ago so it doesn't appear to be a "wait and see" solution. Any suggestions would be a huge help. Thanks!!
Technical SEO | | SarahLK0 -
how to set rel canonical on wordpress.com sites
I know how to do this with a wordpress.org site but I have a client that does not want to switch and without a plugin I am lost. any help would be greatly appreciated. Jeremy Wood
Technical SEO | | SOtBOrlando0 -
301s vs. rel=canonical for duplicate content across domains
Howdy mozzers, I just took on a telecommunications client who has spent the last few years acquiring smaller communications companies. When they took over these companies, they simply duplicated their site at all the old domains, resulting in a bunch of sites across the web with the exact same content. Obviously I'd like them all 301'd to their main site, but I'm getting push back. Am I OK to simply plug in rel=canonical tags across the duplicate sites? All the content is literally exactly the same. Thanks as always
Technical SEO | | jamesm5i0 -
Will Google hit me because I hide my H1 tag?
I read an article this morning regarding keywords on a web page. In the article it said that Google would hit anyone putting keywords on a web page but then hiding them from anyone visiting the website. This makes sense. What it did make me think about though is the technique I use when building a website. If, for example, I'm building a website for "Acme Cheap Products". In the header of the page, I will have a H1 tag as well as an image tag for the company's logo. As the logo has the company name on it, I would usually put the company name in a H1 tag as well, and then hide the H1 tag, so I wouldn't have a logo and then a title next to it saying the same thing as the logo. The question is though, would this sort of technique trigger Google in to hitting my site?
Technical SEO | | -Al-0 -
Rel=canonical issue
Re. http://www.appetise.com. We have been alerted that we are "not making appropriate use of the rel=canonical tag". Please could someone just clarify this for us and let us know the recommended remedial action we need to take to rectify the issue? Many Thanks, RB
Technical SEO | | E-resistible0 -
Help with Rel Canonical on Wordpress?
Crawl Diagnostics is showing a lot of Rel Canonical warnings, I've installed Wordpress SEO by Joose De Valk and Home Canonical URL plugins without success. Any ideas? I'm getting a lot of URL's that I thought I blocked from being indexed, such as author pages, category pages, etc. I'm also getting stuff like "recessionitis.com/?homeq=recent" and "recessionitis.com/page/2/", those pages are similar to my homepage. I thought those plugins were suppose to automatically clean things up.. anyone use these plugins that have any helpful hints?
Technical SEO | | 10JQKAs0