Why are my images not being indexed?
-
I have submitted an image sitemap with over 2,000 images yet only about 35 have been indexed.
Could you please help me understand why Google is not indexing my images?
-
Every image I checked on the site was indexed in its large version. I imagine you got this solved already, but let me know if there are any questions.
-
Thanks for checking this out.
Doug:
This is the sitemap: http://www.creative-calendars.com/sitemap-image.xml
The CDN is very new; I added it to try to solve this problem, since I thought speed might be the issue.
There are no errors.
George:
I have about 750 links from Pinterest to the various images.
-
My first suspicion was that it might be something to do with the content delivery network. I notice that the URL of the images is: "creativecalendars.nocompany1412096296.netdna-cdn.com/"
Looking at the HTTP headers on the CDN-hosted images, I can see that they are being correctly canonicalised to the local images.
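For anyone who wants to repeat this header check, here is a minimal Python sketch. It assumes the CDN sends the canonical URL in an HTTP Link header with rel="canonical"; the function name and the example image path are made up for illustration.

```python
import re


def canonical_from_link_header(link_header):
    """Return the URL marked rel="canonical" in an HTTP Link header, or None."""
    # A Link header may hold several comma-separated entries: <url>; rel="..."
    for match in re.finditer(r'<([^>]*)>\s*((?:;\s*[^,;]+)*)', link_header):
        url, params = match.group(1), match.group(2)
        # The rel parameter may list several space-separated relation types.
        rel = re.search(r'rel\s*=\s*"?([^";]+)"?', params, re.IGNORECASE)
        if rel and "canonical" in rel.group(1).lower().split():
            return url
    return None


# Hypothetical header value, as a CDN might return it for one image.
example = '<http://www.creative-calendars.com/images/example.jpg>; rel="canonical"'
print(canonical_from_link_header(example))
```

To run it against a live image, the header value can be read with the standard library, for example by sending a HEAD request through urllib.request and calling response.headers.get("Link") on the result.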
What's the URL of the image sitemap? Can you share it so we can take a look?
I take it there are no errors being reported in Webmaster Tools for this image sitemap (Crawl -> Sitemaps)?
The alt text on your images appears to be very short, generic, or missing. The image file names aren't very descriptive either. You might want to take a look at how you can improve these.
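One way to find the weak spots is to scan a page's HTML for images with missing or very short alt text. Below is a rough Python sketch using only the standard library; the 10-character threshold and the function name are arbitrary assumptions, not a Google rule.

```python
from html.parser import HTMLParser


class AltAudit(HTMLParser):
    """Collect <img> tags whose alt text is missing or shorter than min_length."""

    def __init__(self, min_length=10):
        super().__init__()
        self.min_length = min_length
        self.flagged = []  # list of (src, alt) pairs

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attributes = dict(attrs)
        # A missing or valueless alt attribute is treated as empty text.
        alt = (attributes.get("alt") or "").strip()
        if len(alt) < self.min_length:
            self.flagged.append((attributes.get("src"), alt))


def audit_alt_text(html, min_length=10):
    """Return (src, alt) pairs for images that need better alt text."""
    parser = AltAudit(min_length)
    parser.feed(html)
    parser.close()
    return parser.flagged


# Hypothetical markup for illustration only.
sample = (
    '<img src="a.jpg" alt="cal">'
    '<img src="b.jpg">'
    '<img src="c.jpg" alt="2015 wall calendar with lighthouse photos">'
)
for src, alt in audit_alt_text(sample):
    print(src, repr(alt))
```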
-
Hi Nicole,
Personally I've had lots of issues getting images indexed on large websites - and I've come across other webmasters with the same problems. If you really want to get diagnostic then you need to start splitting out content into different sitemaps as SEO-Buzz suggests so you can see a clearer breakdown in Webmaster Tools.
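As a rough illustration of the splitting idea, the Python sketch below groups page URLs by their first path segment so that each section of the site can get its own sitemap file. The file naming scheme and the example URLs are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlparse


def group_by_section(page_urls):
    """Map a sitemap file name to the page URLs that belong in it."""
    groups = defaultdict(list)
    for url in page_urls:
        segments = [s for s in urlparse(url).path.split("/") if s]
        # Top-level pages (zero or one path segment) share a "root" sitemap.
        section = segments[0] if len(segments) > 1 else "root"
        groups[f"sitemap-image-{section}.xml"].append(url)
    return dict(groups)


# Hypothetical site structure for illustration only.
pages = [
    "http://www.creative-calendars.com/",
    "http://www.creative-calendars.com/wall/lighthouses",
    "http://www.creative-calendars.com/wall/gardens",
    "http://www.creative-calendars.com/desk/birds",
]
for filename, urls in group_by_section(pages).items():
    print(filename, len(urls))
```

Submitting each file separately means the submitted and indexed counts are reported per section, which makes it easier to see where indexing stalls.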
Another approach you might want to try is doing some image link building - your image content is ripe for being active on Pinterest and other photo sharing platforms. Getting the content placed like this should help with indexing.
Regards,
George
-
Thanks for checking this out.
I tried to analyze the behavior in crawls as you suggested and I noticed that most of the images that have been indexed are from the homepage. What does that mean? I do have a link to other pages from the homepage but that didn't seem to help.
Also, on the few internal pages that were indexed, only the first image was indexed. Could that indicate that it is a speed problem?
-
Hmmm... 6 months. I don't have any good ideas, but I took a deeper look at your website. Although it appears you are using alt text well, the site has very little content and tons of images. I think that is appropriate for your site from a user's perspective, but I wonder whether Google views the website as "too shallow" to be crawl-worthy. In your situation, I would try a few tests, such as creating a standalone sitemap containing one of your calendar content pages and its images, and submitting it. Another test would be to write more content on one of your calendar pages and submit a standalone sitemap for that page as well. See whether the crawl behavior differs for either of these. If so, it may give you some insight.
Another test you might try is to link to the images with anchor text rather than only with a thumbnail in the content. Not that you would want to use this a lot on your site, but the test would tell you whether it helps with indexing, and if it does, you can go from there.
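A standalone image sitemap for a single page could be generated along these lines. This is only a sketch: the page and image URLs are placeholders, and it uses the standard sitemap namespace plus Google's image sitemap extension namespace.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"


def build_image_sitemap(page_url, image_urls):
    """Return sitemap XML listing one page and the images that appear on it."""
    # Register prefixes so the output uses a default namespace and "image:".
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("image", IMAGE_NS)

    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
    ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = page_url
    for image_url in image_urls:
        image = ET.SubElement(url, f"{{{IMAGE_NS}}}image")
        ET.SubElement(image, f"{{{IMAGE_NS}}}loc").text = image_url
    return ET.tostring(urlset, encoding="unicode")


# Placeholder URLs for illustration only.
xml_text = build_image_sitemap(
    "http://www.creative-calendars.com/example-calendar",
    [
        "http://www.creative-calendars.com/images/example-1.jpg",
        "http://www.creative-calendars.com/images/example-2.jpg",
    ],
)
print(xml_text)
```

Keeping the test page in its own file means any change in its indexed count can be attributed to that page alone.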
I still recommend that you place a link to your sitemap or sitemap index in your footer. And I hope others will chime in!
-
The sitemap was submitted at least 6 months ago. All of the pages/posts have been indexed but not the images.
-
When did you submit your sitemap? Perhaps not enough time has passed for everything to be indexed? This can take weeks, even months. Also, consider adding your sitemap.xml or sitemap index to the root directory of your website.