Help with strange 404 Errors.
-
For the most part I have never had trouble tracking down 404's. Usually it's simply a broken link, but lately I have been getting these strange errors
http://gridironexperts.com/http%3A/www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
- What does; %C2%94 repersent?
- The error always points to NFL.com, but we don't link to them...like ever?
- Can I just 404: http://gridironexperts.com// to fix the problem, as all 404's start with this weird %C2%94 error.
- Is this error even on my site? Is in the backend...virus?
thanks
-Mike
-
When you say it did not fix them, do you mean that the 301 was not working, or that the 404s did not go away in the GWT report?
You will not see an immediate change in GWT for those errors. They may take 30-90 days to clear out. If you have them fixed, you can mark them as such and then take the error out of your console.
As a part of the SEO Membership, check out the SEOMoz report for some option. I have used Screaming Frog SEO spider with some success to look through my site and find random links.
P
-
Just to clarify. 404s dont always come from links on your site. Often, these are links on other sites etc that Google has in its index that they found somewhere and are trying to see if the 404s dont work.
Not saying that it is not malware, but clarifying the angle on these.
-
Actually. I tried to 301 direct http://gridironexperts.com// to the home page and it didn't fix the 404's.
can you send me a link to that spider reference you mentioned
-
Thanks - please mark as answered and like if you please!
-
cool that's what I was thinking. Thanks so much. awesome answer
you went above and beyond
-
Hey there. The %C2 %94 %3A are simply ASCII values of encoded versions of special characters in the URL.
http://www.w3schools.com/tags/ref_urlencode.asp
%3A is the same as a colon
%C2 is Â
%94 is "
This simply puts those characters in a format that is easier for the browser to read and then convert into a format it can use.
Couple of things to check on where this comes from.
Get a spider program and see if somewhere, waaay out there in the back ends of your content library that you have some crazy goofed up link that got planted here. Find it an delete it
Other than that, somewhere out on the internets, as my developer likes to say, "A bunch of monkeys banged heads on keyboards" There are site scrapers that do not do a good job and take your content and then repost it and they screw up all kinds of formatting and you end up with links like the above pointing to your site. The spiders look for it and you get a 404.
I just did a Google search on
www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
and you get all kinds of random pages linking to that.
Here is what I would do. You mention most errors start with
You can 301 all those to another page. Or, show a simple helpful page for the user to navigate off of with a noindex, nofollow meta tag. The noindex tag would get those pages out of the index at least and not show a 404 error.
-
You may want to check GWT under malware. That seems odd that you never link to NFL, but the 404 is coming from your site.
Check the source code of those particular pages that are giving the 404s. Check line by line for anything you don't recognize. Also, make SURE there aren't any .ru TLDs there.
When my site got attacked, my hosting company stepped up and did a scan of malware and they found severall things. So maybe your hosting company can do a scan for you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Hi anyone please help I use this code but now getting 404 error. please help.
#index redirect
Technical SEO | | roynguyen
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.html\ HTTP/
RewriteRule ^index.html$ http://domain.com/ [R=301,L]
RewriteCond %{THE_REQUEST} .html
RewriteRule ^(.*).html$ /$1 [R=301,L] hi anyone please help I use this code but now getting 404 error. please help. homepage and service.html page is working, but the rest pages like about.html, servicearea.html, and contact.html is not working showing 404 error. and also when you type this URL. generalapplianceserice.ca/about.html generalapplianceserice.ca/contact.html generalapplianceserice.ca/servicearea.html it automatically remove the .HTML extension and shows 404 error, the pages name in root directory is same. these pages work like generalapplianceservice.ca and generalapplianceservice.ca/services why? i also remove this code again but still same issue.0 -
Getting error in webmasters
My site was running perfectly from last one year... I don't know what happened now google is showing error while I am trying to use fetch option in webmasters. http://prntscr.com/6mtud5
Technical SEO | | Srinu0 -
My 404 page shows in the report as an error.
How can i make my actual 404 page not show up as a 404 error in the report?
Technical SEO | | LindseyNewman0 -
Google not found errors in webmaster tool help
Hi, Google Webmaster tools sent me a few messages recently about the jump in the number of 'not found' errors. From 0 to 290 errors, ouch. I know what it's from but I think Google is seeing things. We developed another page/subdomain we're working on with links back to the root domain. Basically a complete list of articles page that lists each article and links back to the root domain. Not sure what Google is crawling but the links that would result in a 'not found' error aren't there. Will these disappear over time? Thanks for the help!
Technical SEO | | astahl110 -
DNS error on webmaster tool
Google webmaster tool is showing DNS error and that is leading to many server error (502,500) almost 50+ in every crawl. Recently Google crawled one of our sub domains that we did not want google to crawl. We blocked it via Robots.txt and also removed all the URL's and since then we are having this issue. Any suggestions how to fix this DNS error? Thanks in advance.
Technical SEO | | tpt.com0 -
404 error
Both SEOmoz and Google webmaster tools are returning over 4000 error 404.The majority or returned error URLs are for images, and all URLs end up with %20target=as shown belowimages/products/detail/AD9058RoundGlassTableChairs.jpg%20target=images/products/detail/BM921ModernRoundDiningTable.jpg%20target=images/products/detail/CR701506CappuccinoCoffeeTableSet.jpg%20target=any suggestions?RegardsTony
Technical SEO | | OCFurniture0 -
Dealing with 404 pages
I built a blog on my root domain while I worked on another part of the site at .....co.uk/alpha I was really careful not to have any links go to alpha - but it seems google found and indexed it. The problem is that part of alpha was a copy of the blog - so now soon we have a lot of duplicate content. The /alpha part is now ready to be taken over to the root domain, the initial plan was to then delete /alpha. But now that its indexed I'm worried that Ill have all these 404 pages. I'm not sure what to do.. I know I can just do a 301 redirect for all those pages to go to the other ones in case a link comes on but I need to delete those pages as the server is already very slow. Or does a 301 redirect mean that I don't need those pages anymore? Will those pages still get indexed by google as separate pages? Please assist.
Technical SEO | | borderbound0 -
Strange duplicate content issue
Hi there, SEOmoz crawler has identified a set of duplicate content that we are struggling to resolve. For example, the crawler picked up that this page www. creative - choices.co.uk/industry-insight/article/Advice-for-a-freelance-career is a duplicate of this page www. creative - choices.co.uk/develop-your-career/article/Advice-for-a-freelance-career. The latter page's content is the original and can be found in the CMS admin area whilst the former page is the duplicate and has no entry in the CMS. So we don't know where to begin if the "duplicate" page doesn't exist in the CMS. The crawler states that this page www. creative-choices.co.uk/industry-insight/inside/creative-writing is the referrer page. Looking at it, only the original page's link is showing on the referrer page, so how did the crawler get to the duplicate page?
Technical SEO | | CreativeChoices0