Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Image Indexing Issue by Google
-
Hello all,
My URL is www.thesalebox.com.
I submitted my image sitemap in Google Webmaster Tools on 10th Oct 2013, but Google still has not indexed any of my site's images. Please refer to my sitemaps, www.thesalebox.com/AppliancesHomeEntertainment.xml and www.thesalebox.com/Hardware.xml. My Webmaster Tools status and image indexing status are below.
Can you please help me understand why my images are not being indexed by Google yet? Is there an issue? Any suggestions would be appreciated. Thanks!
-
Hi there, I'm just checking in to see what the current status of this issue is. Please let us know, thanks!
Christy
-
Hi there, you've received a lot of thoughtful responses. Did any of them answer your question? Please let us know, thanks!
Christy
-
Hi Sorina,
Yes, I can do that. I will, and I'll let you know whether it works or not.
Thanks for your suggestions.
-
As I said, you can add a reference to your sitemaps in the robots.txt file:
At the end of the file http://www.thesalebox.com/robots.txt add the following lines:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml
-
Hi, I have seen a situation before where GWT says that no images are indexed but they have indexed them. I don't know why.
Checking Google directly, by searching site:thesalebox.com and then clicking the Images tab, shows that Google does have images from your site indexed. Maybe not all of them, but there are some, so more may be on the way.
Peter
-
Hi Peter,
Thanks for your valuable suggestions,
but I would like the images to be indexed with the subdomain path.
I have already verified this domain in Google Webmaster Tools and checked robots.txt for blocking rules, and everything is working properly.
Can you please help me figure out why the images are still not being indexed, and how long Google takes to index them the first time?
Thanks,
-
Hi Sorina,
Thanks for pointing out Google's Webmaster policy on image indexing with a subdomain.
=> I have already verified my subdomain http://pics.thesalebox.com in Google Webmaster Tools.
=> I have also already added the sitemaps to this account.
Please check the following links for more information:
http://pics.thesalebox.com/ShopByDepartment.xml
http://pics.thesalebox.com/SportingGoods.xml
=> I have also checked whether the current robots.txt blocks this path, and there is no problem.
http://pics.thesalebox.com/robots.txt
Is there anything else I am still missing? Please advise.
Thanks,
-
Here is a quote from Google's Webmasters Help:
In some cases, the image URL may not be on the same domain as your main site. This is fine, as long as both domains are verified in Webmaster Tools. If, for example, you use a content delivery network (CDN) to host your images, make sure that the hosting site is verified in Webmaster Tools OR that you submit your Sitemap using robots.txt. In addition, make sure that your robots.txt file doesn’t disallow the crawling of any content you want indexed.
Source: https://support.google.com/webmasters/answer/178636
According to the above, now that you have also verified the subdomain where you are hosting your images you should be fine.
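The two robots.txt conditions in the quote (the sitemaps are referenced there, and the image paths are not disallowed) can be checked with a short script. This is only a sketch using Python's standard `urllib.robotparser`: the `ROBOTS_TXT` content, the `check_robots` helper, and the `Googlebot-Image` user agent string passed to it are assumptions for illustration, not the site's real file. Note that robots.txt applies per host, so for images on pics.thesalebox.com the file that matters is the one served from that subdomain.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only (not the site's real file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /checkout/
Disallow: /catalog/product/private/

Sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
Sitemap: http://www.thesalebox.com/Hardware.xml
"""


def check_robots(robots_text, image_url, user_agent="Googlebot-Image"):
    """Return (sitemaps listed in robots.txt, whether image_url may be crawled)."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() returns None when no Sitemap lines are present (Python 3.8+).
    sitemaps = parser.site_maps() or []
    # Only the path of image_url is matched against the rules; the host is ignored.
    allowed = parser.can_fetch(user_agent, image_url)
    return sitemaps, allowed


if __name__ == "__main__":
    url = ("http://pics.thesalebox.com/catalog/product/cache/1/small_image/"
           "175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg")
    sitemaps, allowed = check_robots(ROBOTS_TXT, url)
    print("Sitemaps listed:", sitemaps)
    print("Image crawlable:", allowed)
```

To test a live site, the same parser can load the real file with `set_url()` and `read()` instead of `parse()`.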
You don't have to submit the sitemap to the GWT account of the subdomain where you host your images, but you may add a reference to your sitemaps in the robots.txt located in the root folder of your website, by adding something like this to the robots.txt file:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml
-
Hi Will2112,
Thanks for the focus on robots.txt. I have double-checked whether anything is blocked by robots.txt, but it all looks fine.
Are there any other suggestions?
Thanks!
-
Hi Sorina,
Thanks for your reply,
Yes, I have submitted http://pics.thesalebox.com to Google WMT, verified it, and submitted the same sitemap.
Can you please look into this issue further?
Thanks!
-
Yes, if your images are on a CDN server you must add that subdomain to GWT too, in order to be able to see whether the images are indexed by Google or not.
-
If my images are hosted on a CDN server, would I need to add that subdomain to Webmaster Tools as well?
I have a site with lots of images, and I can confirm that images take much longer to be indexed than regular webpages. I see that your robots.txt has a lot of Disallow rules in it. Is it possible that you are blocking those images from being crawled in robots.txt?
-
Hi,
I noticed your images are all hosted on a subdomain, http://pics.thesalebox.com. Did you add this subdomain to Google Webmaster Tools?
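One way to see which hosts would need verifying is to list the hosts that appear in the image sitemap entries. The following is a rough sketch, assuming the sitemap follows the standard sitemaps.org format with Google's image extension namespace. The `SAMPLE_SITEMAP` content and the `image_hosts` helper are hypothetical, made up for illustration, and are not taken from the real sitemap files.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Namespaces used by the sitemap protocol and Google's image sitemap extension.
NS = {
    "sm": "http://www.sitemaps.org/schemas/sitemap/0.9",
    "image": "http://www.google.com/schemas/sitemap-image/1.1",
}

# Hypothetical sitemap content, for illustration only.
SAMPLE_SITEMAP = """\
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:image="http://www.google.com/schemas/sitemap-image/1.1">
  <url>
    <loc>http://www.thesalebox.com/example-product.html</loc>
    <image:image>
      <image:loc>http://pics.thesalebox.com/catalog/product/example.jpg</image:loc>
    </image:image>
  </url>
</urlset>
"""


def image_hosts(sitemap_xml):
    """Return the set of host names used by <image:loc> entries in a sitemap."""
    root = ET.fromstring(sitemap_xml)
    hosts = set()
    for loc in root.findall("sm:url/image:image/image:loc", NS):
        if loc.text:
            hosts.add(urlparse(loc.text.strip()).netloc)
    return hosts


if __name__ == "__main__":
    # Every host printed here is one that should be verified in Webmaster Tools.
    print(image_hosts(SAMPLE_SITEMAP))
```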
-
Hi, from experience it can take Google quite some time to index images on a site, and if this is the first time you have submitted a sitemap, that is probably going to be a factor as well.
Just one thing, though, about the images on your site. The ecommerce CMS you are using does not help search engines take an interest in the images, because the images don't have descriptive names. This is one I found on the home page: http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg. The image is named 100706555.jpg, and although you have used alt tags on your images, the non-descriptive file name doesn't help. Neither does the depth of your URLs: the image is located 10 folders down.
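Both points (file name and URL depth) can be checked across a list of image URLs with a small script. This is a minimal sketch, not an official rule: the `describe_image_url` helper is hypothetical, and treating a name as "descriptive" when it contains at least three consecutive letters is an assumed heuristic. The depth is counted in path segments, including the file name itself.

```python
import re
from pathlib import PurePosixPath
from urllib.parse import urlparse


def describe_image_url(url):
    """Summarise an image URL: file name, path depth, and a naive name check."""
    path = PurePosixPath(urlparse(url).path)
    # Drop the leading "/" so only real path segments are counted.
    segments = [part for part in path.parts if part != "/"]
    return {
        "file_name": path.name,
        # Number of path segments, counting the file name as the last one.
        "path_segments": len(segments),
        # Assumed heuristic: a descriptive name has 3+ consecutive letters.
        "descriptive_name": bool(re.search(r"[A-Za-z]{3,}", path.stem)),
    }


if __name__ == "__main__":
    url = ("http://pics.thesalebox.com/catalog/product/cache/1/small_image/"
           "175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg")
    print(describe_image_url(url))
```

A name such as cordless-drill.jpg (a made-up example) would pass the name check, while 100706555.jpg does not.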
I hope that helps,
Peter