[Very Urgent] More 100 "/search/adult-site-keywords" Crawl errors under Search Console
-
I just opened my G Search Console and was shocked to see more than 150 Not Found errors under Crawl errors. Mine is a Wordpress site (it's consistently updated too):
Here's how they show up:
Example 1:
- URL: www.example.com/search/adult-site-keyword/page2.html/feed/rss2
- Linked From: http://an-adult-image-hosting.com/search/adult-site-keyword/page2.html
Example 2 (this surprised me the most when I looked at the linked from data):
-
URL: www.example.com/search/adult-site-keyword-2.html/page/3/
-
Linked From:
-
www.example.com/search/adult-site-keyword-2.html/page/2/ (this is showing as if it's from our own site)
-
http://a-spammy-adult-site.com/search/adult-site-keyword-2.html
Example 3:
- URL: www.example.com/search/adult-site-keyword-3.html
- Linked From: http://an-adult-image-hosting.com/search/adult-site-keyword-3.html
How do I address this issue?
-
Here is what I would do
-
Disavow the domain that is linking to you from the adult site(s).
-
The fact that Google search console is showing that you have an internal page linking as well makes me want to know a) have you always owned this domain and maybe someone previously did link internally like this or b) you may have been or are hacked
In the case of b) this can be really tricky. I once had a site that in a crawl it was showing sitewide links to various external sites that we should not be linking to. When I looked at the internal pages via my browser, there was no link as far as I could see even though it showed up on the crawler report.
Here was the trick. The hacker had setup a script to only show the link when a bot was viewing the page. Plus, we were running mirrored servers and they had only hacked one server. So, the links only showed up when you were spidering a specific mirrored instance as a bot.
So thanks to the hacking, not only were we showing bad links to bad sites, we were doing this through cloaking methodology. Two strikes against us. Luckily we picked this up pretty quick and fixed immediately.
Use a spidering program or browser program to show a user agent of Googlebot and go visit your pages that are linking internally. You might be surprised.
Summary
Googlebot has a very long memory. It may be that this was an old issue that was fixed long ago. If that was the case, just show the 404s for the pages that do not exist, and disavow the bad domain and move on. Make sure that you have not been hacked as this would also be why this is showing.
Regardless, the fact that Google did find it at one point, you need to make sure you resolve. Pull all the URLs into a spreadsheet and run Screaming Frog in list mode to check them all to make sure you fix all of it.
-
-
Yep.. Looking if anyone can help with this..
-
Oh yea, I missed that. That's very strange, not sure how to explain that one!
-
Thanks for the response Logan. What you are saying definitely makes sense.. But it makes think why do I see something like Example 2 under Crawl errors. Why Google Search Console shows linked from as 2 URL - one the spammy site's and other is from my own website. How is that even possible?
-
I've seen similar situations, but never in bulk and not with adult sites. Basically what's happening is somehow a domain (or multiple) are linking to your site with inaccurate URLs. When bots crawling those sites find the links pointing to yours, they obviously hit a 404 page which triggers the error in Search Console.
Unfortunately, there's not too much you can do about this, as people (or automated spam programs) can create a link to any site and any time. You could disavow links from those sites, which might help from an SEO perspective, but it won't prevent the errors from showing up in your Crawl Error report.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Keyword Research: How best to target keywords without using a region as part of the search query.
When doing keyword research and trying to rank for a keyword. I am wondering if we need to localize the query by adding a city to it. For example Phoenix Web Design vs. just targeting web design since Google is localizing search results now. Then when creating content and optimizing the site do we just put the keyword in the title and page content or do we also add the region/city to the keyword phrase? Any insight would be appreciated.
Intermediate & Advanced SEO | | hireawizseo0 -
Monthly Searches from Google Keyword Planner
I've used Google Adwords, Google Analytics and competitors keywords to compile a master list. I'm now looking to evaluate metrics on the keywords / phrases / long tail phrases. My question is this ... Based on Googles use of Geo Targeting, would I be better to evaluate metrics (Avg. Monthly Searches, Competition, Avg CPC) based on United Kingdom or my local city (I only operate in my local city). I am looking to use the results to redesign my website. I will use the favorable keywords / phrases / long tail keywords to implement a new menu, new content page creation, articles, etc. Thanks Mark
Intermediate & Advanced SEO | | Mark_Ch0 -
Urgent Site Migration Help: 301 redirect from legacy to new if legacy pages are NOT indexed but have links and domain/page authority of 50+?
Sorry for the long title, but that's the whole question. Notes: New site is on same domain but URLs will change because URL structure was horrible Old site has awful SEO. Like real bad. Canonical tags point to dev. subdomain (which is still accessible and has robots.txt, so the end result is old site IS NOT INDEXED by Google) Old site has links and domain/page authority north of 50. I suspect some shady links but there have to be good links as well My guess is that since that are likely incoming links that are legitimate, I should still attempt to use 301s to the versions of the pages on the new site (note: the content on the new site will be different, but in general it'll be about the same thing as the old page, just much improved and more relevant). So yeah, I guess that's it. Even thought the old site's pages are not indexed, if the new site is set up properly, the 301s won't pass along the 'non-indexed' status, correct? Thanks in advance for any quick answers!
Intermediate & Advanced SEO | | JDMcNamara0 -
How important is the HTML structure for on-page/on-site SEO?
To be more specific, say a page layout has Header, Body, Left Sidebar, Footer sections. Which layout from the following options is more SEO-friendly? Header > Body > Right Sidebar > Footer Body > Header > Right Sidebar > Footer Does it make a big difference to code HTML so that the the copy of the body appears in front of all other sections when spiders crawl a website? Is it worth taking extra steps to make this happen? I am asking this question because our site has a header navigation with a lot of dropdown menus. So I assume that this is "noise" for spiders as it pushes the main content of the page down. Please bear in mind that the question is more geared towards how search engine see the page rather than how it appears to the end user as layout can be controlled by CSS.This question also assumes that all other on-site SEO best practices are followed for both options.
Intermediate & Advanced SEO | | Saugar0 -
Why does my home page show up in search results instead of my target page for a specific keyword?
I am using Wordpress and am targeting a specific keyword..and am using Yoast SEO if that question comes up.. and I am at 100% as far as what they recommend for on page optimization. The target html page is a "POST" and not a "Page" using Wordpress definitions. Also, I am using this Pinterest style theme here http://pinclone.net/demo/ - which makes the post a sort of "pop-up" - but I started with a different theme and the results below were always the case..so I don't know if that is a factor or not. (I promise .. this is not a clever spammy attempt to promote their theme - in fact parts of it don't even work for me yet so I would not recommend it just yet...) I DO show up on the first page for my keyword.. however.. instead of Google showing the page www.mywebsite.com/this-is-my-targeted-keyword-page.htm Google shows www.mywebsite.com in the results instead. The problem being - if the traffic goes only to my home page.. they will be less likely to stay if they dont find what they want immediately and have to search for it.. Any suggestions would be appreciated!
Intermediate & Advanced SEO | | chunkyvittles0 -
2 Year Old Keyword Focused Site Will Not Rank for Keyword
Hi All, I need your help. This site is confounding me. The site is turnstilefactory.com It's a few years old. Strong domain name and seo focused on the term 'turnstile'. In bound links are not abundant, but certainly not absent either. Considering the subject matter, content and competition in the space, I would expect this site by now to at least be in the top 10 pages for the search 'turnstile', but it's not. I've tried everything I can think of with this, but it just won't rank for anything other than it's domain name. Can anyone please take a look and let me know if they see something I'm missing? It would be appreciated. Thanks.
Intermediate & Advanced SEO | | seomozpaul0 -
Alexa site title shows as "302 Found" on search result pages
If you search for the site "ixl.com" in Alexa, for some reason, it's showing the site as "302 Found" instead of showing the website name, IXL. If you drill into that, it shows the site as ixl.com, but underneath that, it says "302 Found" again. Every other site I search for seems to show the site's name properly. I have no idea where it's getting this "302 Found" from. Does anyone know how to fix this? Here's a link directly to the search results page: http://www.alexa.com/search?q=ixl.com
Intermediate & Advanced SEO | | john4math0 -
<rel canonical="">and Query Strings</rel>
How are you supposed to <rel canonical="" tag="">a page with a query string that has already been indexed? It's not like you're serving that page from a CMS where you have an original page with content to add to the head tag.</rel> For example.... Original Page = http://www.example.com/about/products.php Query String Page = http://www.example.com/about/products.php?src=FrontDoorBox Would adding the <rel canonical="" tag="">to the original page, referencing itself, be the solution so that the next time the original page is crawled, the bot will know that the previously indexed URL with query string should actually be the "original"? That's the only solution I can come up with because there's no way to find the query string rendered page to tag with the canonical.....</rel>
Intermediate & Advanced SEO | | Yun0