4xx errors
-
Hi
I checked my campaign for errors on my site and got a report showing a lot of 404 (broken or dead link) errors. How can I view the source of each broken link so that I can fix it?
Thank you!
-
I don't see any "drop down" on the broken links listed in SEOmoz... I would also like to know where to find the source of each broken link! Why not show the source right inside the error report?
-
Google Webmaster Tools -- create an account there if you don't already have one -- is also a useful way to find 404 errors and track down their sources. Once your account is set up, go to Diagnostics > Crawl Errors > HTTP (this is the default tab for the "Crawl Errors" screen).
-
You want to see where the links to the broken page are coming from?
3 options:
-
Xenu - http://home.snafu.de/tilman/xenulink.html - run that on your site and it will tell you. It's not the prettiest solution though.
-
Webmaster Tools - Diagnostics > Crawl Errors - click on the URL that returns a 404 and it will tell you where the links are coming from.
-
SEOmoz - Set up a campaign in the Pro section and the crawl will report 4xx errors. Click on that, then open the drop down on each broken link; there is an 'Explore links' option. That will open up OpenSiteExplorer for that link and show you where you're getting links from.
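If you'd rather script the check yourself, here is a minimal Python sketch of the same idea the tools above use: crawl your own pages, record which page each link was found on, and list the source pages for every URL that returns a 4xx. It uses only the standard library. The names `find_broken_links`, `fetch`, `fetcher`, and `max_pages` are made up for this example and do not come from Xenu, Webmaster Tools, or SEOmoz, and the sketch ignores robots.txt, redirect chains, and rate limiting, so treat it as a starting point rather than a finished crawler.

```python
from collections import defaultdict, deque
from html.parser import HTMLParser
from urllib.error import HTTPError, URLError
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import Request, urlopen


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch(url):
    """Return (status_code, html_text); status is None on a network failure."""
    request = Request(url, headers={"User-Agent": "link-check-sketch"})
    try:
        with urlopen(request, timeout=10) as response:
            return response.status, response.read().decode("utf-8", errors="replace")
    except HTTPError as err:
        return err.code, ""
    except URLError:
        return None, ""


def find_broken_links(start_url, fetcher=fetch, max_pages=200):
    """Crawl from start_url and map each 4xx URL to the pages that link to it."""
    host = urlparse(start_url).netloc
    sources = defaultdict(set)  # target URL -> set of pages linking to it
    status = {}                 # URL -> status code already fetched
    queue = deque([start_url])

    while queue and len(status) < max_pages:
        url = queue.popleft()
        if url in status:
            continue
        code, html = fetcher(url)
        status[url] = code
        # Only follow links found on working pages of our own host.
        if code != 200 or urlparse(url).netloc != host:
            continue
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.hrefs:
            target = urldefrag(urljoin(url, href)).url  # resolve and drop #fragment
            if urlparse(target).scheme not in ("http", "https"):
                continue  # skip mailto:, javascript:, etc.
            sources[target].add(url)
            if target not in status:
                queue.append(target)

    return {
        url: sorted(sources[url])
        for url, code in status.items()
        if code is not None and 400 <= code < 500
    }
```

Calling `find_broken_links("http://www.example.com/")` (a placeholder address) returns a dictionary whose keys are the broken URLs and whose values are the pages that link to them, which is the "source" list asked about at the top of this thread.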
-