Submitting a new sitemap index file. Only one file is getting read. What is the error?
-
Hi community,
I am working on submitting a new sitemap index file that references about five files of 50,000 SKUs each. Google Webmasters is reporting that only 50k SKUs have been submitted.
Google Webmasters is accepting the index, but only the first file is getting read. I have 2 errors and need to know if they are the reason the other files are not getting read.
Errors:
| 1 | Warnings | Invalid XML: too many tags | Too many tags describing this tag. Please fix it and resubmit. |
| 2 | Warnings | Incorrect namespace | Your Sitemap or Sitemap index file doesn't properly declare the namespace. | 1 |
Here is the url I am submitting: http://www.westmarine.com/sitemap/wm-sitemap-index.xml
-
Thank you,
I grabbed this and put it into a text file for my developer to see, and have made sense of your corrections.
-
The forum is butchering my formatting, but I hope you get the idea.
-
Try changing the first line of your index file to:
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
That should get rid of the second warning. For the first warning, you are not using enough <sitemap> tags, you need to encapsulate each of the <loc> tags in one, so your sitemap should look like:
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<sitemap><loc>http://www.westmarine.com/sitemap/wm-products01.xml.gz</loc></sitemap>
<sitemap><loc>http://www.westmarine.com/sitemap/wm-products02.xml.gz</loc></sitemap>
<sitemap><loc>http://www.westmarine.com/sitemap/wm-brands01.xml.gz</loc></sitemap>
<sitemap><loc>http://www.westmarine.com/sitemap/wm-categories01.xml.gz</loc></sitemap>
<sitemap><loc>http://www.westmarine.com/sitemap/wm-content-pages01.xml.gz</loc></sitemap>
<sitemap><loc>http://www.westmarine.com/sitemap/wm-commerce-pages01.xml.gz</loc></sitemap>
</sitemapindex>
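If your developer would rather generate the index than hand-edit it, here is a minimal Python sketch of the same structure. It uses only the standard library's xml.etree.ElementTree. The function names build_sitemap_index and check_sitemap_index are made up for illustration, and the check only looks at the two things discussed above (the namespace on the root element and one <loc> inside each <sitemap>). It is not a full validator and may not catch everything Google Webmasters reports.

```python
import xml.etree.ElementTree as ET

# Namespace required by the sitemaps.org 0.9 protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap_index(urls):
    """Return a sitemap index string with one <sitemap><loc> pair per URL.

    Hypothetical helper, not part of any library.
    """
    # Register the namespace as the default so it is written as xmlns="...".
    ET.register_namespace("", SITEMAP_NS)
    root = ET.Element(f"{{{SITEMAP_NS}}}sitemapindex")
    for url in urls:
        sitemap = ET.SubElement(root, f"{{{SITEMAP_NS}}}sitemap")
        loc = ET.SubElement(sitemap, f"{{{SITEMAP_NS}}}loc")
        loc.text = url
    return ET.tostring(root, encoding="unicode")


def check_sitemap_index(xml_text):
    """Return a list of structural problems; an empty list means none found.

    Hypothetical helper covering only the namespace and <sitemap>/<loc> nesting.
    """
    root = ET.fromstring(xml_text)
    problems = []
    if root.tag != f"{{{SITEMAP_NS}}}sitemapindex":
        problems.append("root is not <sitemapindex> in the sitemaps.org 0.9 namespace")
    for child in root:
        if child.tag != f"{{{SITEMAP_NS}}}sitemap":
            problems.append(f"unexpected top-level element: {child.tag}")
            continue
        locs = child.findall(f"{{{SITEMAP_NS}}}loc")
        if len(locs) != 1 or not (locs[0].text or "").strip():
            problems.append("<sitemap> must contain exactly one non-empty <loc>")
    return problems


if __name__ == "__main__":
    # URLs taken from the thread; the list is shortened for the example.
    urls = [
        "http://www.westmarine.com/sitemap/wm-products01.xml.gz",
        "http://www.westmarine.com/sitemap/wm-products02.xml.gz",
    ]
    index_xml = build_sitemap_index(urls)
    print(index_xml)
    print(check_sitemap_index(index_xml))
```

Running check_sitemap_index on the file you already have before resubmitting should flag a missing namespace or any <loc> that is not wrapped in a <sitemap>.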