Accidental Noindex/Mis-Canonicalisation - Please help!
-
Hi everybody,
I was hoping somebody might be able to help as this is an issue my team and I have never come across before.
A client of ours recently migrated to a new site design. 301 redirects were properly implemented and the transition was fairly smooth.
However, we realised soon after that a sub-section of pages had either one or both of the following errors:
- They featured a canonical tag pointing to the wrong page
- They featured the 'meta noindex' tag
After realising this, both the canonicals and the noindex tags were immediately removed. However, Google crawled the site while these were in place and the pages subsequently dropped out of Google's index.
We re-submitted the affected pages to Google's index and used WMT to 'Fetch' the pages as Google. We have also since 'allowed' the pages in the robots.txt file as an extra measure.
We found that the pages which just had the noindex tag were immediately re-indexed, while the pages which featured the noindex tag and which were mis-canonicalised are still not being re-indexed.
Can anyone think of a reason why this might be the case? One of the pages which featured both tags was one of our most important organic landing pages, so we're eager to resolve this.
Any help or advice would be appreciated.
Thanks!
-
I'm not sure how helpful it is, in the sense of being good news, but I did something like this to one of my sites on purpose once, and wrote it up:
http://www.seomoz.org/blog/catastrophic-canonicalization
A couple of tips:
(1) I think what Oleg is saying, which I agree with is that if Page A had a canonical to Page B, instead of just removing the canonical tag, put in a canonical tag pointing from Page A to Page A. Sometimes, the self-referencing canonical will help over-ride the old/bad canonical.
(2) Fetch is a good bet, but I'd also re-submit an XML sitemap with just the "bad" URLs. It's not a cure-all, but it can help nudge Google.
Unfortunately, it really can take time to sort out. Make sure your internal links are correct as well. You could temporarily build new internal links (list a few resources on your home-page, for example) to push link-juice temporarily. You could also post the proper URLs on Twitter/FB, etc., to kick them a bit. Of course, that only works for a few pages, not for hundreds.
-
Yes it may just be a waiting game as Oleg mentioned. But perhaps to help speed up the process you could link to some of those pages from a higher level page (like the homepage or a department landing page).Don't spam tho, no more than 100 links on a page (including navigation/footer etc).
I'd also recommend having an XML sitemap with all the URLs of your website on it. You'll need to upload this to Google Webmaster Tools as well.
When they do get re-indexed keep an eye out for how they have been indexed; so look at what keywords bring up that page in SERPs (Raven Tools is an easy way to track keywords and see which URL comes up). If you find that 'odd' pages are being indexed for a certain keyword search you should do some link building specific to the keyword you want ranked pointing to the page/URL you want ranked.
Good luck!
Davinia
-
Hi Oleg,
Thanks for your response. Unfortunately the canonical URL was another of our main organic landing pages so a redirect wouldn't be appropriate in this situation.
I agree that it's just a matter of time but it's frustrating that Google has crawled the site since we updated the pages and still hasn't re-indexed the page in question.
-
Can you set a canonical/redirect on the page that was incorrect pointing back to the correct page?
i.e. page1.html had wrong canonical to pgae1.html -> change pgae1.html canonical to page1.html
Overall, I think it's just a matter of time before Google is able to recrawl and fix itself... it's odd that canonical + noindex is slower than just noindex. Do whatever you can to get G to recrawl the pages.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Please help with serp placment question?
We own discount banner printing and we are trying to rank 1 for pvc banners or vinyl banners and cannot understand for example how the below is correct, we did suffer a link penalty years ago but we fixed this and the domain has some good links (more and better quality than the sites above us) and cannot understand how we rank below most of the sites above us? If we type on for example pvc banners we get http://www.bannershop.co.uk/cats/pvc_banners.htm https://www.hfe-signs.co.uk/banners.php http://bannerprintingandroid.co.uk/pvc-banners/ http://www.discountbannerprinting.co.uk/banners/vinyl-pvc-banners.html And if we type in vinyl banners we get http://www.vistaprint.co.uk/banners.aspx http://www.bigvaluebanners.co.uk/ http://vinylbannersprinting.co.uk/ http://www.discountdisplays.co.uk/html/vinyl_banners.html https://www.buildasign.co.uk/banners http://www.monkey-print.com/outdoor banners/budget-outdoor-banners http://www.discountbannerprinting.co.uk/banners/vinyl-pvc-banners.html
Intermediate & Advanced SEO | | BobAnderson0 -
Canonical Tag help
Hello everyone, We have implemented canonical tag on our website: http://www.indialetsplay.com/ For e.g. on http://www.indialetsplay.com/cycling-rollers?limit=42 we added canonical as http://www.indialetsplay.com/cycling-rollers?limit=all (as it showcase all products) Our default page is http://www.indialetsplay.com/cycling-rollers Is canonical tag implementation right? Or we need to add any other URL. Please suggest
Intermediate & Advanced SEO | | Obbserv0 -
Magento Help - Server Reset
Good Morning, After rebooting a server, a magento based website reset itself going back to December 2013. All changes to the site and orders dating up until yesterday (6/19/14) have disappeared. There are several folders on the root of the server that have files with yesterday's date but we don't know how to bring everything back and restore. Any Magento or server experts out there ever face this issue or have any ideas or potential solutions? Thanks
Intermediate & Advanced SEO | | Prime850 -
SEO within the URL /
If I were optimizing for 'marketing success' and my URL structure was domain.com/marketing/success would that count? I'm not sure if the '/' affects the keyword term. My assumption is that it does, but I wasn't 100% sure. Thanks!
Intermediate & Advanced SEO | | KristinaWitmer0 -
301 redirect with /? in URL
For a Wordpress site that has the ending / in the URL with a ? after it... how can you do a 301 redirect to strip off anything after the / For example how to take this URL domain.com/article-name/?utm_source=feedburner and 301 to this URL domain.com/article-name/ Thank you for the help
Intermediate & Advanced SEO | | COEDMediaGroup0 -
To noindex or not to noindex
Our website lets users test whether any given URL or keyword is censored in China. For each URL and keyword that a user looks up, a page is created, such as https://en.greatfire.org/facebook.com and https://zh.greatfire.org/keyword/freenet. From a search engines perspective, all these pages look very similar. For this reason we have implemented a noindex function based on certain rules. Basically, only highly ranked websites are allowed to be indexed - all other URLs are tagged as noindex (for example https://en.greatfire.org/www.imdb.com). However, we are not sure that this is a good strategy and so are asking - what should a website with a lot of similar content do? Don't noindex anything - let Google decide what's worth indexing and not. Noindex most content, but allow some popular pages to be indexed. This is our current approach. If you recommend this one, we would like to know what we can do to improve it. Noindex all the similar content. In our case, only let overview pages, blog posts etc with unique content to be indexed. Another factor in our case is that our website is multilingual. All pages are available (and equally indexed) in Chinese and English. Should that affect our strategy?References:https://zh.greatfire.orghttps://en.greatfire.orghttps://www.google.com/search?q=site%3Agreatfire.org
Intermediate & Advanced SEO | | GreatFire.org0 -
Newbie to SEO and SEOMOZ help
Hey everyone i just came across SEOMOZ today, i have been building websites for 3 years now but SEO is something which has always been a scary topic to consider trying to master. I have made a decision to do this in 2012 and i have been looking for a software package which can stear me and teach me. I have been reading the site help today and i feel totally swamped! i have created my campaign but a lot of the results dont make much sense to me and i am unsure of how to fix the errors they found. For instance the crawl diagnostics shows i have 5 4xx client errors. They show me a link to the page where the error is http://www.mydomain.com/category/latest-news/function.require but when i go to see what this is i just find an error 404 not found page.How do i go about removing this error if i have no idea where the problem is? I have started reading SEO User guide and beginers guide and i know it is going to take me a long time to get use to this all, but i am struggling to find the starting point and hope someone can possible help me find the first few steps. Thanks
Intermediate & Advanced SEO | | buntrosgali0 -
Noindex junk pages with inbound links?
I recently came across what is to me a new SEO problem. A site I consult with has some thin pages with a handful of ads at the top, some relevant local content sourced from a third party beneath that... and a bunch of inbound links to said pages. Not just any links, but links from powerful news sites. My impression is that said links are paid (sidebar links, anchor text... nice number of footprints.) Short version: They may be getting juice from these links. A preliminary lookup for one page's keywords in the title finds it top 100 on Google. I don't want to lose that juice, but do think the thin pages they link to can incur Panda's filter. They've got the same blurb for lots of [topic x] in [city y], plus the sourced content (not original...). So I'm thinking about noindexing said pages to avoid Panda filters. Also, as a future pre-emptive measure, I'm considering figuring out what they did to get these links and aiming to have them removed if they were really paid for. If it was a biz dev deal, I'm open to leaving them up, but that possibility seems unlikely. What would you do? One of the options I laid out above or something else? Why? p.s. I'm asking this on my blog (seoroi.com/blog/ ) too, so if you're up for me to quote you (and link to your site, do say so. You aren't guaranteed to be quoted if you answer here, but it's one of the easier ways you'll get a good quality link. p.p.s. Related note: I'm looking for intermediate to advanced guest posts for my blog, which has 2000+ RSS subs. Email me at gab@ my site if you're interested. You can also PM me here on SEOmoz, though I don't login as frequently.
Intermediate & Advanced SEO | | Gab-Goldenberg0