Tens of duplicate homepages indexed and blocked later: How to remove from Google cache?
-
Hi community,
Due to some WP plugin issue, many homepages indexed in Google with anonymous URLs. We blocked them later. Still they are in SERP. I wonder whether these are causing some trouble to our website, especially as our exact homepages indexed. How to remove these pages from Google cache? Is that the right approach?
Thanks
-
Hi Nigel,
Thanks for the suggestion. I'm going to use "Remove URLs" tool from GSC. They have been created due to a bug in the Yoast SEO plugin. Very unfortunate and we paid for no mistake from our end.
Removing from SERP means removing from Google index also? Or Google will still consider them and just stops showing us? My intention is: Anyway we blocked them, but whether they will cause some distraction to our ranking efforts being there in results being cached.
Thanks
-
Thanks!
A agree - I have just done a similar clean up by:
1. Don't let them be created
2. Redirect all previous versions!One site I just worked on had 8 versions of the home page! lol
http
https
/index.php
/index.php/A mess!
We stopped them all being created and 301'd all versions just in case they were indexed anywhere or linked externally.
Cheers
-
It is assuredly true that, just like in any number of fields (medicine) - in SEO, prevention is better than cleanup based methodology. If your website doesn't take its medicine, you get problems like this one
I think your advice here was really good
-
Good solid advice
They can be created in any number of ways but it's normally simple enough to specify the preferred URL on the server then move any variations in htaccess, such as those with www (if the none www is preferred), those with a trailing slash at the end etc.
The self canonical on all will sort out any other duplicates.
As for getting rid of them - the search console way is the quickest. If they don't exist after that then the won't be reindexed unless they are linked from somewhere else. In such cases, they will 301 from htaccess so it shouldn't be a problem.
if you 410 you will lose any benefit from those links going to the pages and it's a bad experience for a visitor. Always 301 do not 410 if it is a version.
410s are fine for old pages you never want to see in the index again but not for a home page version.
Regards
Nigel
-
It's likely that you don't have access to edit the coding on these weird plugin URLs. As such, normal techniques like using a Meta no-index tag in the HTML may be non-viable.
You could use the HTTP header (server level stuff) to help you out. I'd advise adding two strong directives to the afflicted URLs through the HTTP header so that Google gets the message:
-
Use the X-Robots deployment of the no-index directive on the affected URLs, at the HTTP header (not the HTML) level. That linked pages tells you about the normal HTML implementation, but also about the X-Robots implementation which is the one you need (scroll down a bit)
-
Serve status code 410 (gone) on the affected URLs
That should prompt Google to de-index those pages. Once they are de-indexed, you can use robots.txt to block Google from crawling such URLs in the future (which will stop the problem happening again!)
It's important to de-index the URLs before you do any robots.txt stuff. If Google can't crawl the affected URLs, it can't find the info (in the HTTP header) to know that it should de-index those pages
Once Google is blocked from both indexing and crawling these pages, they should begin to stop caching them too
Hope that helps
-
-
+1 for "Make sure that they are not created in the first place" haha
-
Hi again vtmoz!
1. Make sure that they are not created in the first place
2. Make sure that they are not in the sitemap
3. Go to search console and remove any you do not want - it will say temporary removal but they will not come back if they are not in the structure or the sitemap.More:
https://support.google.com/webmasters/answer/1663419?hl=en
Note: Always self canonicalize the home page to stop versions with UTM codes (created by Facebook, Twitter etc) appearing in SERPS
Regards
Nigel
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Best and easiest Google Depersonalization method
Hello, Moz hasn't written anything about depersonalization for years. This article has methods, but I don't know if they are valid anymore. What's an easy, effective way to depersonalize Google search these days? I would just log out of Google, but that shows different ranking results than Moz's rank tracker for one of our main keywords, so I don't know if that method is correct. Thanks
Algorithm Updates | | BobGW0 -
Google sidebar advertising dropped
Has anyone noticed how the google sidebar advertising has completely disappeared? They only display top 4 adwords and then remaining on the bottom of each search page. I can't find any info on it or when it actually happened?
Algorithm Updates | | Purplesars110 -
Remove new Knowlage graph overlay
Hi Moz, Recently Google started showed new snippets in searches as seen in the attached below (1) but the problem I'm finding is I'm sure its showing me different results to "normal" searches especially when compared to a logged out or alternative account (2). Now I've got a solution to this but wanted to know if anyone else had solutions to removing this or at least gaining a more normal result with it on. Seems to almost personalize the search results which isn't ideal so if anyone knows how to get this to stop I would love to hear it You can also see some more info here - https://www.seroundtable.com/google-pop-up-information-box-18007.html thanks helping me with the mild irritant NNX9J8p.jpg Ntj7WbX.jpg
Algorithm Updates | | GPainter0 -
Where can I find a breakdown of google search volume by specific industry/vertical? For example, what % of people searching in google are looking for housing? Cars? Restaurants?
I"m looking for specific breakdowns of search volume in google by: #1 Vertical (Shopping/restaurants/Services etc). For example, how many people are searching in google for information pertaining to restaurants per month? Search volume for all of 2012, 2013, 2014? #2 More granular categories within verticals, people searching for: books,apartment rentals,cellphones) Is there a breakdown of google search somewhere online that gives this type of information? Thank you MOZ community, really appreciate it!
Algorithm Updates | | AppleSauceRules0 -
Same search term shows #1 on Bing but #140 on Google?
Hi, I am using the search term of my website domain i.e. "Series Digital" on both Bing and Google. Bing shows my website as the top most link. But on Google, my website appears on page 14!! Why is this happening when I am using the string within the " "?
Algorithm Updates | | Cloudguru990 -
Google is showing crazy results
Google is showing crazy results in these days sometimes my sites are on top of all keywords sometimes far behind in search engine in same day what is going on ????
Algorithm Updates | | GM0070 -
Disavow cache
Hey everyone, Currently helping a website that has been penalised and we've been going down a heavy link removal process as it has a pretty bad link profile. Our first disavow request has been rejected, and I was wondering.... When submitting a reconsideration request, do Google only know when a link has been removed when it's cached? If so, should I leave it a while for a reconsideration request as it might take a while for the cache to be updated Thanks
Algorithm Updates | | Sandeep_Matharu0 -
How Do I Make My Google SERP "SiteLinks" more relevant?
I have a shopping website with thousands of products, and the sitelinks that google has chosen for me (for a long time) are random product pages, which makes no sense to me. I do not emphasize those products on the home page, and I have a sitemap that clearly lists the directory of all the categories. I also added a "nofollow" attribute to almost every link on the home page that is not important. These products in the site links seem completely random and there isnt even a sitelink for "about" or any of the footer content! What gives? Also, my sitelinks never updated to the new, better version. Any suggestions?
Algorithm Updates | | cDNAInteractive0