Tens of duplicate homepages indexed and blocked later: How to remove from Google cache?
-
Hi community,
Due to some WP plugin issue, many homepages indexed in Google with anonymous URLs. We blocked them later. Still they are in SERP. I wonder whether these are causing some trouble to our website, especially as our exact homepages indexed. How to remove these pages from Google cache? Is that the right approach?
Thanks
-
Hi Nigel,
Thanks for the suggestion. I'm going to use "Remove URLs" tool from GSC. They have been created due to a bug in the Yoast SEO plugin. Very unfortunate and we paid for no mistake from our end.
Removing from SERP means removing from Google index also? Or Google will still consider them and just stops showing us? My intention is: Anyway we blocked them, but whether they will cause some distraction to our ranking efforts being there in results being cached.
Thanks
-
Thanks!
A agree - I have just done a similar clean up by:
1. Don't let them be created
2. Redirect all previous versions!One site I just worked on had 8 versions of the home page! lol
http
https
/index.php
/index.php/A mess!
We stopped them all being created and 301'd all versions just in case they were indexed anywhere or linked externally.
Cheers
-
It is assuredly true that, just like in any number of fields (medicine) - in SEO, prevention is better than cleanup based methodology. If your website doesn't take its medicine, you get problems like this one
I think your advice here was really good
-
Good solid advice
They can be created in any number of ways but it's normally simple enough to specify the preferred URL on the server then move any variations in htaccess, such as those with www (if the none www is preferred), those with a trailing slash at the end etc.
The self canonical on all will sort out any other duplicates.
As for getting rid of them - the search console way is the quickest. If they don't exist after that then the won't be reindexed unless they are linked from somewhere else. In such cases, they will 301 from htaccess so it shouldn't be a problem.
if you 410 you will lose any benefit from those links going to the pages and it's a bad experience for a visitor. Always 301 do not 410 if it is a version.
410s are fine for old pages you never want to see in the index again but not for a home page version.
Regards
Nigel
-
It's likely that you don't have access to edit the coding on these weird plugin URLs. As such, normal techniques like using a Meta no-index tag in the HTML may be non-viable.
You could use the HTTP header (server level stuff) to help you out. I'd advise adding two strong directives to the afflicted URLs through the HTTP header so that Google gets the message:
-
Use the X-Robots deployment of the no-index directive on the affected URLs, at the HTTP header (not the HTML) level. That linked pages tells you about the normal HTML implementation, but also about the X-Robots implementation which is the one you need (scroll down a bit)
-
Serve status code 410 (gone) on the affected URLs
That should prompt Google to de-index those pages. Once they are de-indexed, you can use robots.txt to block Google from crawling such URLs in the future (which will stop the problem happening again!)
It's important to de-index the URLs before you do any robots.txt stuff. If Google can't crawl the affected URLs, it can't find the info (in the HTTP header) to know that it should de-index those pages
Once Google is blocked from both indexing and crawling these pages, they should begin to stop caching them too
Hope that helps
-
-
+1 for "Make sure that they are not created in the first place" haha
-
Hi again vtmoz!
1. Make sure that they are not created in the first place
2. Make sure that they are not in the sitemap
3. Go to search console and remove any you do not want - it will say temporary removal but they will not come back if they are not in the structure or the sitemap.More:
https://support.google.com/webmasters/answer/1663419?hl=en
Note: Always self canonicalize the home page to stop versions with UTM codes (created by Facebook, Twitter etc) appearing in SERPS
Regards
Nigel
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Still no good search results after 2 months of indexation
Hi guys, One of our website (https://www.residentiebosrand.be/) has been online for about two months. It's indexed and Google shows search results. But the website is not ranking on the keywords it's supposed to be ranking: 'residentie bosrand'. How come we still don't find the website on the first pages in the search results, while these are the main keywords on the website's URL, page, ... ? Best regards,
Algorithm Updates | | conversal0 -
Google asking questions in SERPs
I just did s search for Hayley Kiyoko, and Google asked me which song is my favourite from her new album. Is this a new thing? I've never asked Google a question before and had it ask me something back, other than "did you mean... (the correct spelling for what I was looking for)?" u6qYnwq.png
Algorithm Updates | | 4RS_John1 -
Google Custom Search Engine: Good Idea?
I created a Google Custom Search Engine for our site, but I"m not sure implementing it is a good idea. When I tested it with the public URL, I noticed that ads show up on the search engine that could potentially move visitors away from our site to our competitors. Has anyone had success with implementing a Google Custom Search Engine? Do the pros outweigh the cons? Thanks, Ruben
Algorithm Updates | | KempRugeLawGroup0 -
Will Parked Domain hurt My SEO as Duplicate Content?
Hello, I have one website (Migration Lawyers) and I have an extra 8 domains Parked so they are basically cloning the content of the site. so if the main site is: migrationlawyers.co.za and I have an addon domain migration-lawyers.com is that good or bad? is there a proper way to redirect the sites, will redirecting (301) subdomains be more effective? Thanks for your Input 🙂 0i8VXqr.png
Algorithm Updates | | thealika0 -
Google Search CTR % By Position
Hello I am looking for an updated report regarding the CTR % by position for Google search results. I have the compete.com report which Gives the 1st organic position a 53% CTR but I have not be able to duplicate that number with any other report or research. I am just trying to validate this report before I suggest any recommendations to my company regarding our search efforts. Thank you Ben
Algorithm Updates | | bhalverson30 -
Google.uk rankings plummet, .com improves. What to do?
Hey Guys, Seems so much has changed with international SEO I'm not sure what to do with our site. We have a huge site with many country level landing pages that perform very well on google.com searches (IE; keyword + Jamaica) etc. We are not using a .co.uk version of our site and now our rankings have plummeted in the UK. Should we just make a .co.uk with similar (or the exact same content) or is there some newer strategy to follow?
Algorithm Updates | | iAnalyst.com0 -
Site name appended to page title in google search
Hi there, I have a strange problem concerning how the search results for my site appears in Google. The site is Texaspoker.dk and for some strange reason that name is appended at the end of the page title when I search for it in Google. The site name is not added to the page titles on the site. If I search in Google.dk (the relevant search engine for the country I am targeting) for "Unibet Fast Poker" I get the following page title displayed in the search results: Unibet Fast Poker starter i dag - få €10 og prøv ... - Texaspoker.dk If you visit the actual page you can see that there is no site name added to the page title: http://www.texaspoker.dk/unibet-fast-poker It looks like it is only being appended to the pages that contains rich snippets markup and not he forum threads where the rich snippets for some reason doesn't work. If I do a search for "Afstemning: Foretrukne TOPS Events" the title appears as it should without the site name being added: Afstemning: Foretrukne TOPS Events Anybody have any experience regarding this or an idea to why this is happening? Maybe the rich snippets are automatically pulling the publisher name from my Google+ account... edited: It doesn't seem to have anything to do with rich snippets, if I search for "Billeder og stuff v.2" the site name is also appended and if I search for "bedste poker bonus" the site name is not.
Algorithm Updates | | MPO0 -
How Google Determines Sitelinks
Does anyone have authoritative information on how Google determines which links to use as sitelinks? I thought I saw that Top Landing Pages was a metric Google used (in part).
Algorithm Updates | | joshfialkoff-778630