How Best to Handle Inherited 404s on Purchased Domain
-
We purchased a domain from another company and migrated our site over to it very successfully. However, we have one artifact of the original domain in that there was a page that was exploited by other sites on the web. This page allowed you to pass any URL to it and redirect to that URL (e.g. http://example.com/go/to/offsite_link.asp?GoURL=http://badactor.com/explicit_content).
This page does not exist on our site so the results always go to a 404 on our site. However, we find that crawlers are still attempting to access these invalid pages.
We have disavowed as many of the explicit sites as we can, but still some crawlers come looking for those links. We are considering blocking the redirect page in our robots.txt but we are concerned that the links will remain indexed but uncrawlable.
What's the best way to pull these pages from search engines and never have them crawled again?
UPDATE: Clarifying that what we're trying to do it get search engines to just never try to get to these pages. We feel the fact they're even wasting their time on getting a 404 is what we're trying to avoid. Is there any reason we shouldn't just block these in our robots.txt?
-
@gastonriera calm down mate. We have actually tested this at not seen any negative effect on any site we have done it on. It is the "easiest" option, but it won't cause the death and destruction your comment implies. Good day sir.
-
Hi there,
I'm considering that you have over 500k URLs, to be worrying about crawl efficiency. If you have less than that, please don't worry.
Having 404s is completely fine, and google will eventually lower their crawl frequency to those pages.
Blocking them in robots.txt will cause to google stop crawling them, but never to never remove them from the index.
My advice here: don't block them in robots.txtAs Rajesh pointed out, you could force those 404s into 410 to tell Google that they are gone forever. Yet, Google said that they treat 404s and 410s as the same.
John Mueller said over a year ago that 4xx status codes don't incur in crawl wastage. You can check it our in these Webmasters hangout notes - DeepcrawlHope it helps,
Best luck.
Gaston -
FOR THE LOVE OF GOD DONT REDIRECT 404s TO THE HOME!
This is terrible advice. Doing that you'll turn those 404s into soft 404s, making them more problematic than ever.
-
I would actually recommend redirecting it to the homepage. If you have a Wordpress website and a bunch of 404 pages, you can install a free plugin called "All 404 to Homepage" and this will solve the problem. I would, however, recommend that if you have replacement pages or pages covering similar content, that you redirect those to the corresponding replacement page.
-
You need to do one thing with those 404 pages. Move them as 410 status code. Redirection is not good practice for the same.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Domain name change
Here's the scenario... Client has two domain names: domain.com - targeting one country (Australia) otherdomain.com - targeting all other countries Both have identical products, but different currencies (AU$ and US$). The problem (as most of you will know) is that without using a sub-domain or country-code top-level domains, Google has no idea which domain should be served for which domain. Furthermore, because the root domain is different, Google doesn't see any connection between the two - other than the fact they have identical products! My recommendation to the client is to change to: domain.com to domain.com.au otherdomain.com to domain.com Arguably, we could leave the second one alone. But I think it's better for the brand to use the same root domain for each. Obviously this means both will need to be redirected. Since NONE of the pages within the sites will change, do we need to redirect every page, or just the root domain? Any other risks or concerns we should know about?
Intermediate & Advanced SEO | | muzzmoz0 -
Why is my domain authority still 1?
I changed the domain of my website from www.vanillacrush.co.uk to www.carissamay.co.uk at the end of December and yet my DA for carissamay is still 1. As advised, I set up a 301 redirect from VC to CM which seems to be working fine. However when I check on redirect detective it tells me I also have a 302 set up. Could this be confusing things? http://www.vanillacrush.co.uk http://www.vanillacrush.co.uk/ http://www.carissamay.co.uk Any help would be greatly appreciated! Many thanks
Intermediate & Advanced SEO | | Carissamay0 -
Best Approach to Redirect One Domain to Another
So I'm about to migrate one domain to another. Lets say I'm migrating boo.com to foo.com. Boo.com has good organic traffic & has some really well ranked pages. For this reason (I think) I want to send that traffic to some where other than the foo.com homepage. Perhaps a catered landing page. My question is can I redirect some of the specific pages on boo.com to a landing page on foo.com & then redirect the delta to foo.com's homepage? Or am a risking not fully transferring the full credit of one domain to another if I take that approach & therefore I should just redirect one domain to the other in its entirety? Thanks, Rich
Intermediate & Advanced SEO | | RPD0 -
How do you raise domain authority?
Hey guys, hoping you can help me out here. I've been tasked with raising several sites' domain authority to a level of 30. Right now, many of them are hovering around 20. Three weeks into this project and our numbers have dropped 1-2 points on average but I don't think our efforts would reflect that this quickly. From what I've read online, a good strategy is guest posting on relevant sites and collecting links from sites with higher DAs. I've also read at least one Moz article about this potentially being ineffective. I've read some of the related posts but they seem mostly dated and the answers didn't seem to help me. Hoping someone with some experience with this can help me out, I appreciate it.
Intermediate & Advanced SEO | | DustinAB0 -
Domain Forwarding for SEO
Hey guys, I recently created a new website for a client who was ranking #1 for the term "jupiter obgyn" but they have now dropped down to #4. This happened because their old home page was at www. instead of just jupiterobgyn.com. When you type in the www. version, it does take you to the root domain but it's not carrying the old PA! The www. version of the page had a 22 PA and the new root domain hosted page is a 1. How can I fix it so that "link juice" carries over? Is this something i need to do in 1and1 (their web host) or within Wordpress? Thanks!!!
Intermediate & Advanced SEO | | RickyShockley0 -
Sub domain on root domain
Hello,
Intermediate & Advanced SEO | | dror999
I have a question that I can't find a good answer on.
I have a site, actually a "portal"/ "directory" for service providers.
Now, for start, we opened every service provider own page on our site, but now we get a lot of applications from those providers that thy want sites from their own.
We want to make every service provider his own site, but on sub domain url. ( they don’t mind… its ok for them)
So, my site is www.exaple.com
There site will be: provider.exaple.com
Now I have two questions:
1. can it harm my site in SEO?
2. if one from those sub domain , punished by google because is owner do "black hat seo" , how it will affect the rood domin? It can make the root domain to get punished?
Thanks!!0 -
Is my other domain making me not rank?
Hi there, We have a .co.uk website which was ranking well for a number of highly competitive keywords, however in February 2012 those rankings for those keywords suddenly dropped off Google all together and have never came back. A few possibilties to why this has happened: We launched a .ie website which has exactly the same content, could this be the reason for the drop? I have put in all the necessary steps in making sure Google ranks these geographically correct by using hreflang and making sure everything is setup properly in webmaster tools. Why I think it could be this: If I copy and paste the first few paragraphs of text from the pages in the .co.uk website that were ranked highly in Google.co.uk it's the .ie version that appears not the .co.uk version. Here is the webpages in question: http://www.avogel.co.uk/health/menopause/ http://www.avogel.ie/health/menopause/ Forgot to mention, the reason we have these two websites is due to different currency and legalities. Hope someone can help me out with this.
Intermediate & Advanced SEO | | Paul780 -
Multi domain redirect to single domain
Hello, all SEOers. Today, I would like to get some ideas about handling multiple domains. I have a client who bought numerous domains under purpose of prevent abuse of their brand name and at the same time for future uses. This client bought more than 100 domains. Some domains are paused, parked, lived and redirected to other site. I don't worry too much of parked domains and paused domains. However, what I am worrying is that there are about 40 different domains are now redirected to single domain and meta refresh was used for redirections. As far as I know, this can raise red flag for Google. I asked clients to clean up unnecessary domains, yet they want to keep them all. So now I have to figure out how to handle all domains which are redirect to single domain. So far, I came up with following ideas. 1. Build gateway page which shows lists of my client sites and redirect all domains to gateway page. 2. Implement robots.txt file to all different domains 3. Delete the redirects and leave it as parked domains. Could anyone can share other ideas in order to handling current status? Please people, share your ideas for me.
Intermediate & Advanced SEO | | Artience0