My sitemap in Google is coming back with an error
-
I submitted my XML sitemap to Google Webmaster Tools, and it comes back with a "not found" (404) error. I can't figure out why my sitemap is returning a 404. Why?
-
It's always a good idea to include the directive in the robots.txt anyway, Courtney, so I'd say yes.
If typing the address into your browser works, but entering that exact same address in GWT results in a 404, then I'm stumped. The only thing I can figure is that your GWT account is experiencing a glitch, and you might want to report it in the Google Webmaster forums.
If you want to send the site URL by personal message here, I can take a last look.
Paul
-
The address does work when typed into the browser. And I did use an installed plugin to create the sitemap.
So if the address works, do you think my next step is to add the robots.txt?
Courtney
-
Just because your site has a sitemap doesn't mean it's guaranteed to be located at www.yoursite.com/sitemap.xml, Courtney. What happens when you actually type www.yoursite.com/sitemap.xml in your browser's address bar? I'm betting you'll get a 404, which means your sitemap isn't located or named what you think it is.
Many WordPress plugins that automatically create XML sitemaps don't put them in the expected standard location with the standard file name! For example, Yoast's WordPress SEO plugin creates its XML sitemap at www.yourdomain.com/sitemap_index.xml. It looks close, but it's not the same.
Double check the sitemap plugin you're using to confirm where it's writing the xml file and then use that address to submit it to both Google and Bing Webmaster Tools.
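If you'd rather check from a script than by hand, here is a minimal Python sketch of the same idea: try a few likely sitemap addresses and report the first one that answers with a 200. The candidate file names, the `sitemap-check/0.1` user agent, and the function names are assumptions for illustration only; your plugin's settings page is still the authoritative source for the real location.

```python
from typing import Callable, Optional, Sequence
from urllib.error import HTTPError
from urllib.request import Request, urlopen

# Assumed candidates: the conventional name plus the Yoast-style index name.
# Add whatever file name your own plugin reports.
CANDIDATE_PATHS = ("sitemap.xml", "sitemap_index.xml")


def http_status(url: str) -> Optional[int]:
    """Return the final HTTP status for url, or None if no response arrived."""
    request = Request(url, headers={"User-Agent": "sitemap-check/0.1"})
    try:
        with urlopen(request, timeout=10) as response:
            return response.status
    except HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; the code is what we want.
        return err.code
    except OSError:
        # DNS failures, refused connections, and timeouts end up here.
        return None


def find_sitemap(
    base_url: str,
    fetch_status: Callable[[str], Optional[int]] = http_status,
    paths: Sequence[str] = CANDIDATE_PATHS,
) -> Optional[str]:
    """Return the first candidate URL that answers 200, or None if none do."""
    base = base_url.rstrip("/")
    for path in paths:
        url = f"{base}/{path}"
        if fetch_status(url) == 200:
            return url
    return None
```

Calling `find_sitemap("http://www.yoursite.com")` returns the working address, if any, and that is the one to submit. The `fetch_status` parameter exists only so the lookup can be tried out without touching the network.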
It's also best practice to add a directive to the robots.txt file at the root of your site pointing to the correct sitemap.xml location as well. Just add this line at the bottom:
Sitemap: http://www.yoursite.com/actualsitemapfilename.xml
That address will be the same one that you were able to successfully submit to GWT.
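To confirm the directive is actually being picked up, a short Python sketch like the one below reads the contents of a robots.txt file and lists every Sitemap line it declares. It relies on the standard library's `urllib.robotparser` (the `site_maps()` method needs Python 3.8 or later); the helper name and the sample file contents are made up for illustration.

```python
from typing import List
from urllib.robotparser import RobotFileParser


def sitemaps_in_robots(robots_txt: str) -> List[str]:
    """Return the Sitemap URLs declared in the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


# Hypothetical robots.txt contents, with the directive added at the bottom.
sample = """User-agent: *
Disallow: /wp-admin/

Sitemap: http://www.yoursite.com/actualsitemapfilename.xml
"""

print(sitemaps_in_robots(sample))
```

If the list comes back empty for your real file, the Sitemap line is missing or malformed.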
Does all that make sense?
Paul
P.S. One extra thought, since you mention that this is your first WordPress build. WP doesn't automatically create an XML sitemap for you - you'll have to have installed a plugin or used a 3rd-party tool to do that. Sorry if that's obvious to you, but I wanted to cover all the bases.
-
I agree: if the XML version of the page is live in your browser, then Google should be able to crawl it.
-
Hi Courtney,
Are you saying that the sitemap itself is returning a 404, or that Webmaster Tools is reporting 404s for pages listed in your sitemap?
If it's the sitemap itself, can you navigate to it directly? Does it render in a browser?
If it's an error for a page listed in the sitemap, and the page currently renders, there is a good chance it didn't at some stage. If that's the case, you can ask Google to recrawl it as an individual page; see:
http://support.google.com/webmasters/bin/answer.py?hl=en&answer=1352276
Hope this helps.
Dan
-
It is the .xml itself. Any other ideas? Totally stuck... (this is the first WordPress site I have built)
-
You have to submit the .xml version, and it has to be live in a directory on your server.
On our site, for example:
The page for users: http://www.boastingbiz.com/sitemap
The page for the spiders: http://www.boastingbiz.com/sitemap.xml <--- Submit the .xml from wherever you place it on your server.
Hope this helps.