Image Sitemap Indexing Issue
-
Hello Folks,
I've been running into some strange issues with our XML Sitemaps.
1) The XML Sitemaps won't open on a browser and it throws the following error instead of opening the XML Sitemap.
Sample XML Sitemap - www.veer.com/sitemap/images/Sitemap0.xml.gz
Error - "XML Parsing Error: no element found
Location: http://www.veer.com/sitemap/images/Sitemap0.xml
Line Number 1, Column 1:"
2) Image files are not getting indexed. For instance, the sitemap - www.veer.com/sitemap/images/Sitemap0.xml.gz has 6,000 URLs and 6,000 Images. However, only 3,481 URLs and 25 images are getting indexed. The sitemap formatting seems good, but I can't figure out why Google's de-indexing the images and only 50-60% of the URLs are getting indexed.
Thank you for your help!
-
Hi Cyrus,
Thank you for your note, and my apologies for the delay in responding.
The indexation number is from Google Webmaster Tools.
The two are identical, and I've tested other XML sitemap files in GZ format that opened fine in the browser without being unzipped or prompting a download. The sitemaps were uploaded to GWT as .gz files only, since we have many pages to upload.
I'll check with our Dev Team regarding the XML parsing error.
Please let me know what other areas we need to look into based on my answers to your questions. Thank you for your help, I greatly appreciate it!
-
Some possible suggestions:
- Make sure every image has a width and height attribute defined in the HTML. Images are much more likely to be indexed this way.
- Same with the "alt" attribute
- Make sure your image subdirectory isn't blocked (robots.txt for example)
- Same with the pages
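The checks in the list above can be scripted. Here is a minimal Python sketch, assuming you already have a page's HTML and your robots.txt contents as strings. The function names (`images_missing_attributes`, `blocked_urls`), the sample data and the example.com URLs are made up for illustration, and `Googlebot-Image` is only an assumed choice of user agent. Python's `urllib.robotparser` may not treat wildcard rules exactly the way Google does, so take the result as a first pass rather than a verdict.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class _ImgAudit(HTMLParser):
    """Collects <img> tags that lack width, height or alt attributes."""

    REQUIRED = ("width", "height", "alt")

    def __init__(self):
        super().__init__()
        self.problems = []  # list of (src, [missing attribute names])

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        present = {name for name, _ in attrs}
        missing = [a for a in self.REQUIRED if a not in present]
        if missing:
            self.problems.append((dict(attrs).get("src"), missing))


def images_missing_attributes(html):
    """Return (src, missing_attributes) for every incomplete <img> tag."""
    audit = _ImgAudit()
    audit.feed(html)
    audit.close()
    return audit.problems


def blocked_urls(robots_txt, urls, user_agent="Googlebot-Image"):
    """Return the URLs that the given robots.txt text disallows."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [u for u in urls if not parser.can_fetch(user_agent, u)]


# Hypothetical sample data, not taken from any real site.
SAMPLE_HTML = (
    '<img src="a.jpg" width="200" height="100" alt="Red umbrella">'
    '<img src="b.jpg" alt="">'
)
SAMPLE_ROBOTS = "User-agent: *\nDisallow: /images/private/\n"
SAMPLE_URLS = [
    "http://www.example.com/images/public/a.jpg",
    "http://www.example.com/images/private/b.jpg",
]

if __name__ == "__main__":
    print(images_missing_attributes(SAMPLE_HTML))
    print(blocked_urls(SAMPLE_ROBOTS, SAMPLE_URLS))
```

Run it against a handful of your product pages and the image and page URLs listed in the sitemap; anything either function returns is worth a closer look.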
It may be Google actually is indexing those images, but not reporting them in GWT. Do an image search and narrow results to your site, to see if your images actually appear.
Aside from accessibility issues, make sure the images are on well-linked pages. It's much more likely for an image to be indexed on a page with good link metrics and a lack of crawl problems.
-
@Cyrus
You have given a very good explanation, but I have a similar issue with my image sitemap. As far as the crawling and indexing ratio goes, it's quite good. You can see more in the attachment.
You can check the syntax of the image sitemap in the following XML.
http://www.vistastores.com/patio_umbrellas_sitemap.xml
Can you give me some input: how can I improve crawling and indexing for images?
-
Hi Corbis,
Man, you've got some tough questions! I may have to call in some outside support on this one if we can't figure it out.
First of all, are you getting the indexation #s from Google Webmaster Tools? What I mean by this - is Google saying there are 6000 URLs in your sitemap, but they are only indexing 3,481?
When I unzipped the compressed sitemap file, it opened fine in my browser, while the 2nd uncompressed file did not. Are they identical? And have you submitted both to Google?
There could be many reasons why you're getting the XML parsing error. One issue might be in the second line, which references http://www.google.com/schemas/sitemap-image/1.1/ as a schema location, because this is an HTML webpage and not an XML or DTD file. You might try removing the reference to this URL and see if that helps.
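To narrow down a parsing error like this one, it can help to run the file through a parser outside the browser. The Python sketch below is only an illustration: it takes the raw bytes of a sitemap (gzipped or not), checks that they parse, and counts the URL and image entries. The function name `summarize_sitemap` and the sample sitemap are made up. One assumption worth stating: "no element found" at line 1, column 1 is often what a browser reports for an empty response body, so the sketch checks for that case first. Also note that an XML parser treats a namespace URI as a plain identifier and does not fetch it, so the code only matches on the namespace strings.

```python
import gzip
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"


def summarize_sitemap(raw):
    """Return (url_count, image_count) for raw sitemap bytes.

    Accepts plain XML or gzip-compressed XML. Raises ValueError for an
    empty body and xml.etree.ElementTree.ParseError for malformed XML.
    """
    # gzip streams start with the magic bytes 1f 8b.
    if raw[:2] == b"\x1f\x8b":
        raw = gzip.decompress(raw)
    if not raw.strip():
        raise ValueError("sitemap body is empty")
    root = ET.fromstring(raw)
    urls = root.findall(f"{{{SITEMAP_NS}}}url")
    images = root.findall(f".//{{{IMAGE_NS}}}image")
    return len(urls), len(images)


# Hypothetical two-page sitemap used only to exercise the function.
SAMPLE_SITEMAP = b"""<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:image="http://www.google.com/schemas/sitemap-image/1.1">
  <url>
    <loc>http://www.example.com/page1.html</loc>
    <image:image><image:loc>http://www.example.com/a.jpg</image:loc></image:image>
    <image:image><image:loc>http://www.example.com/b.jpg</image:loc></image:image>
  </url>
  <url>
    <loc>http://www.example.com/page2.html</loc>
  </url>
</urlset>"""

if __name__ == "__main__":
    print(summarize_sitemap(SAMPLE_SITEMAP))
    print(summarize_sitemap(gzip.compress(SAMPLE_SITEMAP)))
```

If the counts match what you expect for both the .gz file and the uncompressed file, the sitemap itself is probably fine and the browser error more likely comes from how the server delivers the uncompressed URL.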
Otherwise, if Google is reporting the correct number of URLs and Images, then you know they are aware of those URLs, and the problem may not be with the sitemap. Google doesn't necessarily index all URLs in a sitemap, but instead bases its indexing on factors like your domain authority, link structure and crawl allowance. Addressing these issues will usually help get more pages indexed than a sitemap alone.
So if you can fix internal crawl errors and duplicate content issues, and make sure your site has a good navigational architecture, you should see a good rise in indexation.
-
Hi Folks,
Just following up on this query. Any insights? Thank you for your help!
-Corbis