Sitemap 404 error
-
I have generated an XML sitemap for the site www.ihc.co.uk. The generated sitemap looks fine, but when I submit it to Webmaster Tools it returns a 404 error. Has anyone experienced this before? I have deleted the sitemap and redone the process, tried different XML sitemap generators, and even cleared the cache along the way.
-
Hi,
Webmaster Tools will normally tell you what the problem is if you click through to the details. I've also tried to access the sitemap and it seems fine. I've used a couple of different user agents on Web-Sniffer (http://web-sniffer.net/) and I've checked robots.txt for anything funny. All looks good.
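If you want to repeat the user-agent check yourself, here is a rough sketch in Python. The user-agent strings and the function names are my own choices for illustration, and you would need to pass in the real sitemap URL. Bear in mind that sending Googlebot's user-agent string from your own machine is not the same as a request from Google's servers, so a server that filters by IP address could still respond differently.

```python
import urllib.error
import urllib.request

# Illustrative user-agent strings (assumptions; swap in any you want to test).
USER_AGENTS = {
    "browser": "Mozilla/5.0 (Windows NT 10.0; Win64; x64)",
    "googlebot": "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
}


def fetch_status(url, user_agent, opener=urllib.request.urlopen):
    """Return the HTTP status code the server sends for url with this user agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with opener(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urlopen raises on 4xx/5xx responses; the code is what we are after.
        return err.code


def check_sitemap(url, opener=urllib.request.urlopen):
    """Request the same URL once per user agent and collect the status codes."""
    return {name: fetch_status(url, agent, opener) for name, agent in USER_AGENTS.items()}
```

Call `check_sitemap()` with the full sitemap URL and compare the codes. If the browser entry comes back 200 while the Googlebot entry comes back 404, the server is probably treating the two user agents differently.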
My advice would be to try Fetch as Googlebot in WMT. If that works, I wouldn't worry about it. You could also check the server logs to see what response code is returned when Googlebot requests the sitemap.
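For the server log check, something along these lines might do as a starting point. It is only a sketch: it assumes the access log is in the common "combined" format used by Apache and nginx, and that the sitemap lives at /sitemap.xml, neither of which I know for this site. The function name and the regular expression are mine.

```python
import re

# Matches the request, status and (optionally) user-agent fields of a
# combined-format access log line. The format is an assumption.
LOG_LINE = re.compile(
    r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+'
    r'(?: "[^"]*" "(?P<agent>[^"]*)")?'
)


def googlebot_sitemap_hits(lines, sitemap_path="/sitemap.xml"):
    """Return (path, status) for each Googlebot request whose path starts with sitemap_path."""
    hits = []
    for line in lines:
        match = LOG_LINE.search(line)
        if not match:
            continue  # skip lines that are not in the expected format
        agent = match.group("agent") or ""
        if "Googlebot" in agent and match.group("path").startswith(sitemap_path):
            hits.append((match.group("path"), int(match.group("status"))))
    return hits
```

You can pass an open log file straight in, for example `googlebot_sitemap_hits(open("access.log"))`, where the file name is a placeholder. A 404 in the results would mean the server really did return a 404 to that request, while a 200 would point back to a reporting problem on the Webmaster Tools side.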
I hope this helps. It seems like a bug to me.
Craig