The crawl report shows a lot of 404 errors
-
They are inactive products, and I can't find any active links to these product pages. How can I tell where the crawler found the links?
-
That's too easy Keri.
-
If you download the CSV of the report, there is a column that will list the referring URL for the 404.
-
Mike's suggestion is a great one. SF gives some great data that's easy to play with in Excel. The free version only crawls 500 pages, so if you have a small site, you'll probably get what you need. (I got 30k+ pages, so I use the paid version that's only about $160/yr.)
-
There is nothing wrong with having a "404 error" if the page is an expired product. Obviously, you don't wont to be linking to a 404 page on your site, so I'd suggest using a tool like Screaming Frog, potentially even OSE, and monitoring your 404 pages in Google Webmaster Tools to see if it is still currently being linked to.
If the page has external links pointing to it, I'd recommend 301 redirecting it to whatever category/subcategory the product belongs to.
-
Do a scan with Screaming Frog. It can point out exact pages where you are having broken links.
They have a free trial and a paid version.
This program is great for diagnosing many different website related issues.
Hope this helps.
Mike
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why doesn't Moz crawl whole pages of our website to report All On-Page issues?
Hi friends & mozzers, How can't Moz crawl whole pages of our website: https://www.4atvtires.com/ to report All Serious On-Page issues. We have more than 15000 product pages. And how could it be possible that Moz isn't able to crawl whole, just got crawl report upto 258 pages of our website, and also I can experience the same in Google webmaster ?? Please help to fix this issue as early as possible. Regards,
Moz Pro | | BigSlate
Rann0 -
403 error for a member site
Perhaps a stupid question but SEOmoz registers 403 errors for pages behind a membersite (ie. they are restricted on purpose). Should I noindex these pages or just let SEOmoz register these "errors"?
Moz Pro | | Crunchii0 -
Crawl Diagnostics - Crawling way more pages than my site has?
Hello all, I'm fairly new here, more of a paid search guy dabbling in SEO on the side. I have a client that I have in SEOMoz and the Crawl Diagnostics report is showing 10,000+ pages crawled and I think the site has at most 800 pages (e-commerce site using freewebstore.org as the platform). Any reasons this would be happening?
Moz Pro | | LodestoneGen0 -
More complete campaign reports
SEOMoz campaigns include a lot more data than can be sent via email automatically with the custom reporting feature. So what do people do to send that data to clients? Do you not include it? Or export reports manually and send them on? Or something else?
Moz Pro | | antdesign0 -
Why does Rel Canonical show up as a notice?
In the crawl diagnostics screen "Rel Canonical" shows up as a notice for every page that has a rel="canonical" meta tag in it. Why is this the case? Shouldn't every page have a canonical tag on it to show the absolute URL to the content? Wouldn't a better notice be to display pages that do not have a canonical tag instead? I could be wrong but that would make more sense to me. (In fact.. let's be honest here.. I probably am wrong.. but I'd like someone to explain it if they could.) Thanks
Moz Pro | | rrolfe1 -
How to run down the actual source of a 404 error that is reported.
In my 404 errors, the second entry is as follows: URL: http://www.virginiahomesandforeclosures.com/listing/0428387-lot-k-commerce-park-franklin-va-23851/REWIDX_URL_CDNimg/no-image.gif Is there a simple way to find the root or page in which this error was generated? IF I visit this page " http://www.virginiahomesandforeclosures.com/listing/0428387-lot-k-commerce-park-franklin-va-23851" without the attached gobble de gook, I see a good page. So bottom line its possible it could be in one of my sitemaps, but I have 50 of those so its time consuming to search thru all 50 for each error like this since I have so many. I am pretty sure its not in my sitemaps, since google has not picked up any of these errors and they have crawled over 12,000 urls so far. When google gives me a 404 error I can click on the link and find what pages they found the link and go there and correct it at the root. Any suggestions would be greatly appreciated. I have more than 1,000 of these errors with the bad url with the junk attached to the end and have not been able to isolate the cause yet. Thanks in advance.
Moz Pro | | tommytx0 -
Archived campaign and automatic reports
Hi I set up the standard reports under Reports new and am still getting them emailed with no data. Just want to stop receiving as I have archived the campaign Thanks
Moz Pro | | Alexanders0 -
Why are inbound links not showing up?
I'm new to SEOmoz but have a question regarding inbound links that I don't see posted in the forum. In order to become more familiar with SEOmoz tools, I've been checking out sites that friends and family members have created as practice. Things have been going really smooth until I came across a 2+ year old page that should have included an inbound link from wsj.com but said link is not appearing in OSE for this page. Background: A friend of mine has a (basically) defunct blog that had a pretty well trafficked posting in 2009. However, when I use OSE to check out both the domain and page inbound links, I don't see the aforementioned inbound link from wsj.com. Why is that? Or, it's insanely late - am I missing something? Friend's blog posting: http://bcclist.com/2009/04/21/craigslist-killer-megan-philipcom-removed/ WSJ posting with a link to my friend's blog (4th paragraph...anchor text = "taken down"): http://blogs.wsj.com/digits/2009/04/21/who-is-megan-mcallister/ No rush. Again, I'm doing this as practice and being new to the site, I figure I'm overlooking something. Any feedback would be greatly appreciated. Thanks!
Moz Pro | | ICM0