Help with strange 404 Errors.
-
For the most part I have never had trouble tracking down 404's. Usually it's simply a broken link, but lately I have been getting these strange errors
http://gridironexperts.com/http%3A/www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
- What does; %C2%94 repersent?
- The error always points to NFL.com, but we don't link to them...like ever?
- Can I just 404: http://gridironexperts.com// to fix the problem, as all 404's start with this weird %C2%94 error.
- Is this error even on my site? Is in the backend...virus?
thanks
-Mike
-
When you say it did not fix them, do you mean that the 301 was not working, or that the 404s did not go away in the GWT report?
You will not see an immediate change in GWT for those errors. They may take 30-90 days to clear out. If you have them fixed, you can mark them as such and then take the error out of your console.
As a part of the SEO Membership, check out the SEOMoz report for some option. I have used Screaming Frog SEO spider with some success to look through my site and find random links.
P
-
Just to clarify. 404s dont always come from links on your site. Often, these are links on other sites etc that Google has in its index that they found somewhere and are trying to see if the 404s dont work.
Not saying that it is not malware, but clarifying the angle on these.
-
Actually. I tried to 301 direct http://gridironexperts.com// to the home page and it didn't fix the 404's.
can you send me a link to that spider reference you mentioned
-
Thanks - please mark as answered and like if you please!
-
cool that's what I was thinking. Thanks so much. awesome answer
you went above and beyond
-
Hey there. The %C2 %94 %3A are simply ASCII values of encoded versions of special characters in the URL.
http://www.w3schools.com/tags/ref_urlencode.asp
%3A is the same as a colon
%C2 is Â
%94 is "
This simply puts those characters in a format that is easier for the browser to read and then convert into a format it can use.
Couple of things to check on where this comes from.
Get a spider program and see if somewhere, waaay out there in the back ends of your content library that you have some crazy goofed up link that got planted here. Find it an delete it
Other than that, somewhere out on the internets, as my developer likes to say, "A bunch of monkeys banged heads on keyboards" There are site scrapers that do not do a good job and take your content and then repost it and they screw up all kinds of formatting and you end up with links like the above pointing to your site. The spiders look for it and you get a 404.
I just did a Google search on
www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
and you get all kinds of random pages linking to that.
Here is what I would do. You mention most errors start with
You can 301 all those to another page. Or, show a simple helpful page for the user to navigate off of with a noindex, nofollow meta tag. The noindex tag would get those pages out of the index at least and not show a 404 error.
-
You may want to check GWT under malware. That seems odd that you never link to NFL, but the 404 is coming from your site.
Check the source code of those particular pages that are giving the 404s. Check line by line for anything you don't recognize. Also, make SURE there aren't any .ru TLDs there.
When my site got attacked, my hosting company stepped up and did a scan of malware and they found severall things. So maybe your hosting company can do a scan for you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Who is correct - please help!
I have a website with a lot of product pages - often thousands of pages. As each of these pages is for a specific lease car they are often only fractionally different from other pages. The urls are too long, the H1 is often too long and the Title is often too long for "SEO best practice". And they do create duplication issues according to MOZ. Some people tell me to change them to noindex/nofollow whilst others tell me to leave them as they are as best not to hide from google crawler. Any advice will be gratefully received. Thanks for listening.
Technical SEO | | jlhitch0 -
How to get rid of bot verification errors
I have a client who sells highly technical products and has lots and lots (a couple of hundred) pdf datasheets that can be downloaded from their website. But in order to download a datasheet, a user has to register on the site. Once they are registered, they can download whatever they want (I know this isn't a good idea but this wasn't set up by us and is historical). On doing a Moz crawl of the site, it came up with a couple of hundred 401 errors. When I investigated, they are all pages where there is a button to click through to get one of these downloads. The Moz error report calls the error "Bot verification". My questions are:
Technical SEO | | mfrgolfgti
Are these really errors?
If so, what can I do to fix them?
If not, can I just tell Moz to ignore them or will this cause bigger problems?0 -
Href lang issues - help needed!
Hi, I have an issue with Google indexing the US version of our website rather than the UK version on Google.co.uk. I have added hreflang tags to both sites (https://www.pacapod.com/ and https://us.pacapod.com/), have updated and submitted an XML sitemap for each website and checked that the country targeting in search console is set-up correctly but Google are still indexing the wrong website. I would be grateful for any assistance with this issue. Many thanks Eddie
Technical SEO | | mypetgiftbox0 -
Help Crawl friendliness for large site
After watching Rand's video I am trying to think of the best way to make my large site more crawl friendly. Background I have a large site with over 100k product skus and so when you get to a particular page of products there are tons of different refinements and options that help you sort the products. Most of these are noindex followed, but I was wondering if I should be nofollowing the internal links as well in order to keep bots out of those pages and going to the pages that I want them to go too. Is this a good way to handle it? Also, does anyone have good recommendations of links to posts that deal with helping the crawl friendliness of a large site? Thanks!
Technical SEO | | Gordian0 -
Help with 301 redirect code
Hi, I can't work out how to make this one work and would apreciate if someone could help.
Technical SEO | | Paul_MC
i have a series of folders from a old site that are in the structure:
/c/123456/bags.html (the "123456" changes and is any series of 6 digit numbers), and the "bags.html" changes depending on the product.
I need that to be 301 redirected to the following format:
/default/bags/bags.html0 -
4XX(Client Error)
Hello there Please help! I am getting this kind of error in the whole site. http://www.mileycyrus-online.co.uk/leaked-hannah-montana-the-movie-pictures.html/comments Running on wordpress site. I chagned the template few times.. most of the error ends with a /comments. Infact all my post has the same issue: http://www.mileycyrus-online.co.uk/miley-cyrus-at-golden-globes-ceremony.html/comments http://www.mileycyrus-online.co.uk/miley-cyrus-at-president-obamas-inauguration-concert.html/comments 404 Error.
Technical SEO | | ExpertSolutions0 -
301 or a 404
Just had a discussion with a collegue about a page on our own website. We have some cases which are outdated. These pages receive some visitors but they arrive there when they search for the clients brand name, so for us they are irelevant. What's the best way to handle these kind of pages? Is a 301-redirect to the showcase overview the way to go or do we make it a 404 and include the showcase overview in this 404?
Technical SEO | | nvs.nim0 -
REL Canonical Error
In my crawl diagnostics it showing a Rel=Canonical error on almost every page. I'm using wordpress. Is there a default wordpress problem that would cause this?
Technical SEO | | mmaes0