Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Image Indexing Issue by Google
-
Hello All,
My URL is: www.thesalebox.com
I submitted my image sitemap in Google Webmaster Tools on 10th Oct 2013, but Google still has not indexed any of my site's images. Please refer to my sitemaps, www.thesalebox.com/AppliancesHomeEntertainment.xml and www.thesalebox.com/Hardware.xml. My Webmaster Tools status and image indexing status are below.
Can you please help me understand why my images are not being indexed by Google yet? Is there an issue? Please give me your suggestions. Thanks!
-
Hi there, I'm just checking in to see what the current status of this issue is. Please let us know, thanks!
Christy
-
Hi there, you've received a lot of thoughtful responses. Did any of them answer your question? Please let us know, thanks!
Christy
-
Hi Sorina,
Yes, I can do that. I will, and I'll let you know whether it works or not.
Thanks for your suggestions
-
As I said, you can add a reference to your sitemaps in the robots.txt file:
At the end of the file http://www.thesalebox.com/robots.txt add the following lines:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml
-
Hi, I have seen a situation before where GWT reports that no images are indexed even though Google has in fact indexed them. I don't know why.
Checking Google directly, by searching site:thesalebox.com and then clicking the Images tab, shows that Google does have images from your site indexed. Maybe not all of them, but there are some, so more may be on the way.
Peter
-
Hi Peter,
Thanks for your valuable suggestions,
But I would like the images to be indexed under the subdomain path.
I have already verified this domain in Google Webmaster Tools and checked robots.txt for anything blocking it, but everything is working properly.
Can you please help me work out why the images are still not being indexed, and how long Google takes to index images the first time?
Thanks,
-
Hi Sorina,
Thanks for pointing out Google's webmaster policy on image indexing with a subdomain.
=> I have already verified my subdomain http://pics.thesalebox.com in Google Webmaster Tools.
=> Also, I have already added the sitemaps to this account.
Please check the following links for more information:
http://pics.thesalebox.com/ShopByDepartment.xml
http://pics.thesalebox.com/SportingGoods.xml
=> I have also checked the current robots.txt to see whether it blocks this path, but there is no problem.
http://pics.thesalebox.com/robots.txt
Is there anything else I am still missing? Please advise.
Thanks,
-
Here is a quote from Google's Webmasters Help:
In some cases, the image URL may not be on the same domain as your main site. This is fine, as long as both domains are verified in Webmaster Tools. If, for example, you use a content delivery network (CDN) to host your images, make sure that the hosting site is verified in Webmaster Tools OR that you submit your Sitemap using robots.txt. In addition, make sure that your robots.txt file doesn’t disallow the crawling of any content you want indexed.
Source: https://support.google.com/webmasters/answer/178636
According to the above, now that you have also verified the subdomain where you are hosting your images you should be fine.
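For reference, an image sitemap entry can list an image that lives on a different host than the page it appears on, which is the situation the quote describes. Below is a minimal Python sketch, assuming the standard sitemap namespace and Google's image sitemap namespace. The product page URL is a made-up placeholder and `build_image_sitemap` is only an illustrative helper name, not something from the sitemaps in this thread.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"


def build_image_sitemap(pages: Dict[str, List[str]]) -> str:
    """Build an image sitemap from a mapping of page URL -> image URLs."""
    # Serialize the sitemap namespace as the default and the image one as "image:".
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("image", IMAGE_NS)

    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for page_url, image_urls in pages.items():
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        # <loc> is the page on the main site that displays the images.
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = page_url
        for image_url in image_urls:
            # <image:loc> may point at another host, such as an image subdomain.
            image = ET.SubElement(url, f"{{{IMAGE_NS}}}image")
            ET.SubElement(image, f"{{{IMAGE_NS}}}loc").text = image_url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical page URL; the image URL is the one quoted elsewhere in this thread.
sitemap_xml = build_image_sitemap({
    "http://www.thesalebox.com/example-product.html": [
        "http://pics.thesalebox.com/catalog/product/cache/1/small_image/"
        "175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg",
    ],
})
print(sitemap_xml)
```

The point of the sketch is only the shape of the entry: the page URL stays on www.thesalebox.com while the image URL points at pics.thesalebox.com.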
You don't have to submit the sitemap to the GWT account of the subdomain where you host your images, but you may add a reference to your sitemaps in the robots.txt located in the root folder of your website, by adding something like this to the robots.txt file:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml
-
Hi Will2112,
Thanks for the focus on robots.txt. I have double-checked whether anything is blocked by robots.txt, but it all looks fine.
Are there any other suggestions?
Thanks!
-
Hi Sorina,
Thanks for your reply,
Yes, I have submitted http://pics.thesalebox.com to Google WMT, verified it, and submitted the same sitemap.
Can you please look into this issue further?
Thanks!
-
Yes, if your images are on a CDN server you must also add that subdomain to GWT in order to see whether the images are indexed by Google or not.
-
If my images are hosted on a CDN server, would I need to add that subdomain to Webmaster Tools as well?
I have a site with lots of images and I can confirm that images take much longer to be indexed than regular web pages. I see that your robots.txt has a lot of Disallows in it. Is it possible that you are blocking those images from being crawled in robots.txt?
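One way to test the Disallow question is to run the image URLs through a robots.txt parser. Here is a minimal Python sketch using the standard library's `urllib.robotparser`. The rules below are invented for illustration and are not the real contents of http://pics.thesalebox.com/robots.txt, and the second image URL is a made-up placeholder. Bear in mind that this parser does simple prefix matching and does not understand the `*` and `$` wildcards that Google supports, so treat it as a rough check only.

```python
from urllib.robotparser import RobotFileParser

# Invented rules for illustration only; paste in the real robots.txt text
# from the host that serves the images (robots.txt applies per host).
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /catalog/product/cache/
Disallow: /checkout/

Sitemap: http://pics.thesalebox.com/ShopByDepartment.xml
"""


def load_rules(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_crawlable(parser: RobotFileParser, url: str,
                 agent: str = "Googlebot-Image") -> bool:
    """Return True if the parsed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


rules = load_rules(SAMPLE_ROBOTS_TXT)

# Under the invented rules above, the cache folder is disallowed.
cached_image_allowed = is_crawlable(
    rules,
    "http://pics.thesalebox.com/catalog/product/cache/1/small_image/"
    "175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg",
)

# Hypothetical image path that no rule matches.
other_image_allowed = is_crawlable(
    rules, "http://pics.thesalebox.com/media/example-kettle.jpg"
)

# Sitemap lines found in the file (site_maps() needs Python 3.8 or later).
listed_sitemaps = rules.site_maps()

print(cached_image_allowed, other_image_allowed, listed_sitemaps)
```

If a real image URL comes back as not crawlable for `Googlebot-Image`, the matching Disallow line is the first thing to look at.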
-
Hi,
I noticed your images are all hosted on a subdomain, http://pics.thesalebox.com. Did you add this subdomain to Google Webmaster Tools?
-
Hi, from experience it can take Google quite some time to index images on a site, and if this is the first time you have submitted a sitemap, that is probably going to be a factor as well.
Just one thing, though, about the images on your site. The ecommerce CMS you are using is not helping search engines take an interest in the images, because the image files don't have descriptive names. This is one I found on the home page: http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg - the image is named 100706555.jpg. Although you have used alt tags on your images, the non-descriptive file name doesn't help. Neither does the depth of your URLs: the image is located nine folders down.
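The two points above (file name and folder depth) can be checked in bulk with a short script. This is a rough Python sketch, not an SEO rule: `image_url_report` is an illustrative helper name, and treating "contains at least one letter" as "descriptive" is purely an assumption made for the example.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def image_url_report(url: str) -> dict:
    """Summarize an image URL: file name, folder depth, and a naming heuristic."""
    path = PurePosixPath(urlparse(url).path)
    # parts looks like ('/', 'folder', ..., 'file.jpg'); drop the root and the file.
    folder_depth = max(len(path.parts) - 2, 0)
    # Assumed heuristic: a name made only of digits or symbols is not descriptive.
    has_letters = any(ch.isalpha() for ch in path.stem)
    return {
        "filename": path.name,
        "folder_depth": folder_depth,
        "descriptive_name": has_letters,
    }


IMAGE_URL = (
    "http://pics.thesalebox.com/catalog/product/cache/1/small_image/"
    "175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg"
)
report = image_url_report(IMAGE_URL)
print(report)
```

Running it over every image URL in a sitemap would show how many files share the numeric naming pattern.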
I hope that helps,
Peter
