Tens of duplicate homepages indexed and blocked later: How to remove from Google cache?
-
Hi community,
Due to some WP plugin issue, many homepages indexed in Google with anonymous URLs. We blocked them later. Still they are in SERP. I wonder whether these are causing some trouble to our website, especially as our exact homepages indexed. How to remove these pages from Google cache? Is that the right approach?
Thanks
-
Hi Nigel,
Thanks for the suggestion. I'm going to use "Remove URLs" tool from GSC. They have been created due to a bug in the Yoast SEO plugin. Very unfortunate and we paid for no mistake from our end.
Removing from SERP means removing from Google index also? Or Google will still consider them and just stops showing us? My intention is: Anyway we blocked them, but whether they will cause some distraction to our ranking efforts being there in results being cached.
Thanks
-
Thanks!
A agree - I have just done a similar clean up by:
1. Don't let them be created
2. Redirect all previous versions!One site I just worked on had 8 versions of the home page! lol
http
https
/index.php
/index.php/A mess!
We stopped them all being created and 301'd all versions just in case they were indexed anywhere or linked externally.
Cheers
-
It is assuredly true that, just like in any number of fields (medicine) - in SEO, prevention is better than cleanup based methodology. If your website doesn't take its medicine, you get problems like this one
I think your advice here was really good
-
Good solid advice
They can be created in any number of ways but it's normally simple enough to specify the preferred URL on the server then move any variations in htaccess, such as those with www (if the none www is preferred), those with a trailing slash at the end etc.
The self canonical on all will sort out any other duplicates.
As for getting rid of them - the search console way is the quickest. If they don't exist after that then the won't be reindexed unless they are linked from somewhere else. In such cases, they will 301 from htaccess so it shouldn't be a problem.
if you 410 you will lose any benefit from those links going to the pages and it's a bad experience for a visitor. Always 301 do not 410 if it is a version.
410s are fine for old pages you never want to see in the index again but not for a home page version.
Regards
Nigel
-
It's likely that you don't have access to edit the coding on these weird plugin URLs. As such, normal techniques like using a Meta no-index tag in the HTML may be non-viable.
You could use the HTTP header (server level stuff) to help you out. I'd advise adding two strong directives to the afflicted URLs through the HTTP header so that Google gets the message:
-
Use the X-Robots deployment of the no-index directive on the affected URLs, at the HTTP header (not the HTML) level. That linked pages tells you about the normal HTML implementation, but also about the X-Robots implementation which is the one you need (scroll down a bit)
-
Serve status code 410 (gone) on the affected URLs
That should prompt Google to de-index those pages. Once they are de-indexed, you can use robots.txt to block Google from crawling such URLs in the future (which will stop the problem happening again!)
It's important to de-index the URLs before you do any robots.txt stuff. If Google can't crawl the affected URLs, it can't find the info (in the HTTP header) to know that it should de-index those pages
Once Google is blocked from both indexing and crawling these pages, they should begin to stop caching them too
Hope that helps
-
-
+1 for "Make sure that they are not created in the first place" haha
-
Hi again vtmoz!
1. Make sure that they are not created in the first place
2. Make sure that they are not in the sitemap
3. Go to search console and remove any you do not want - it will say temporary removal but they will not come back if they are not in the structure or the sitemap.More:
https://support.google.com/webmasters/answer/1663419?hl=en
Note: Always self canonicalize the home page to stop versions with UTM codes (created by Facebook, Twitter etc) appearing in SERPS
Regards
Nigel
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
My site is showing indexed in search console but not appearing in Serps
hi, i have recently made sites.google site and submitted to search console but when I copy paste in google , its not appearing
Algorithm Updates | | alan-shultis0 -
Need only tens of pages to be indexed out of hundreds: Robots.txt is Okay for Google to proceed with?
Hi all, We 2 sub domains with hundreds of pages where we need only 50 pages to get indexed which are important. Unfortunately the CMS of these sub domains is very old and not supporting "noindex" tag to be deployed on page level. So we are planning to block the entire sites from robots.txt and allow the 50 pages needed. But we are not sure if this is the right approach as Google been suggesting to depend mostly on "noindex" than robots.txt. Please suggest whether we can proceed with robots.txt file. Thanks
Algorithm Updates | | vtmoz0 -
Brand Video on homepage
Hope I'm not resurfacing this question. I don't mean about unrelated video. I wonder how much a brand video will entice the website visitors on homepage or any other landing page? And does Google consider it as a ranking factor?
Algorithm Updates | | vtmoz0 -
Google panda, penguin or Patience needed?
Dear friends, On 3rd of May, i suffered a Manual google unnatural outbound link penalty. I recovered from the said penalty on 27th of June. However, i have noticed that my traffic has been dropping since 23rd April. I am confused where to target my work. Should i work on thin content and is it an algorithmic Panda problem (but my keywords are still ranking good) or is it a Penguin problem (I had 6 domains with payday loans backlinks and i have dosavowed 32 backlinks recently) What should be my plan of action here and what would you recommend? An image is attached herewith for your reference, PHd8BzX.png
Algorithm Updates | | marketing911 -
Google Sign-In increasing organic encryption keywords?
I am curious how brands that have implemented Google Sign in dealing with the organic encryption keywords. Have encrypted keywords increased after applying Google Sign-in?
Algorithm Updates | | LNEseo
How are you dealing with the missing keyword information?0 -
Google Page Rank not improving
Hi All, I have a site live with a homepage rank of 5, Ever since relaunching (on the same domain) 6 months ago the inner page rank has remained at NA. Its crawled pretty consistently, Can anyone think of a reason this may be happening? www.glowm.com
Algorithm Updates | | thebluecubeuk0 -
When to remove bad links.
Hi everyone. We were hit on the 5th Oct with manual penalties - after building some good links and building good content we saw some gains in our SERPS, not to where they were, but they are definately improving for some low competition keywords. In this case would people recommend still trying to remove bad links? We have audited our links and identified ones which seem spammy. We were going to go through a step by step process, emailing bad link providers where possible, and then sending a disavow for any links we were not able to remove. If we have started to see gains through other means is it wise in people's opinion to start contacting google? We watched Matt Cutts video on disavow usage and he states not to use it unless in extreme situations, so we don't want to 'wake the beast'. Many thanks. James.
Algorithm Updates | | Quime0 -
Effect of new Google SSL policy on our Analytics - AACK!
So I went to look at our keyword reports in GA today and our most popular keyword was "(not provided)". It now accounts for 10% of our referred visits. Unfortunately, it also has a 125% avg order value compared to the rest of our site. This is a really annoying policy that Google has implemented and will clearly have an effect on our ability to effectively market our site.
Algorithm Updates | | IanTheScot0