Images on sub domain fed from CDN
-
I have a client that uses a CDN to fill images, from a sub domain ( images.domain.com). We've made sure that the sub domain itself is not blocked. We've added a robots.txt file, we're creating an image sitemap file & we've verified ownership of the domain within GWT.
Yet, any crawler that I use only see's the first page of the sub domain (which is .html) but none of the subsequent URL's which are all .jpeg.
Is there something simple I'm missing here?
-
Alphonse it sounded like they were just waiting for the sitemap to launch. Other than that, I couldn't think of anything else to add because the sitemap should solve their issue. However, I have marked this as "Discussion" again.
-
I am a little confused. The question was marked answered, but which one is the answer?
-
We have the same issue however we have image XML sitemaps on each country subdomain's XML Index which point to the image files on images.domain.com.
Example:
https://uk.domain.com/image-sitemap1.xml
https://us.domain.com/image-sitemap1.xml
These 2 files are the same.
We also don't have a homepage on images.domain.com and it currently responds with a 404.
Do you think we need to create a landing page on the homepage and host the image XML sitemap at https://images.domain.com/images-sitemap1.xml rather than in each sub-domain?
Thanks.
-
Yes, we are doing everything correctly, aside from waiting for IT department to create a sitemap.
-
Are you using your own subdomain or one somewhere else (e.g. akamai.com)? You should use your own subdomain, if possible.
Was this a change from a previous version that didn't use a CDN? If those images were/are hosted on your primary domain be sure to match the filenames and paths as closely as possible to what they were before.
If you're doing that you shouldn't have a problem once the sitemap is submitted.
For more information please check out this post:
http://www.goinflow.com/four-seo-best-practices-for-using-a-content-delivery-network-cdn/How do you know that Google only attempts to crawl the primary domain URL (i.e. the .html page)? Are you checking log files?
Is the crawler you're using set to crawl external URLs? If not, that could be the issue. Technically a subdomain is a totally separate website so most tools don't crawl them by default.
-
We've correctly applied the CNAME directive from the CDN to reflect the subdomain. Yet, when Google or any other tool attempts to crawl it only shows ONE URL. Not the images that are residing on their own independent URL's.
-
In order to put those image URLs for the crawler to be able to access them you should either:
- Link to the URLs of the images (does that .html page in the subdomain contain these URLs?)
or
- Use the images URLs as resources in the pages already been crawled. Unfortunately this could be tricky when dealing with CDNs since those resources are dynamic.
In either case, the sitemap will solve your problem.
-
The sitemap is not completed yet. Server logs show Googlebot only indexing one page the .html page, not other pages.
-
Did you reference the sitemap in the robots.txt file or did you set up it in GWT?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do you Index your Image Repository?
On our backend system, when an image is uploaded it is saved to a repository. For example: If you upload a picture of a shark it will go to - oursite.com/uploads as shark.png When you use a picture of this shark on a blog post it will show the source as oursite.com/uploads/shark.png This repository (/uploads) is currently being indexed. Is it a good idea to index our repository? Will Google not be able to see the images if it can't crawl the repository link (we're in the process of adding alt text to all of our images ). Thanks
Technical SEO | | SteveDBSEO0 -
How to get the most value out of an additional domain?
Hi, My company domain is wire.com. We recently also acquired wire.chat domain What's the best way to get value out of that additional domain? Wire is a messaging platform so "chat" is relevant but "wire chat" is not really a high volume term We don't have a whole lot of content, certainly no resources to build out additional valuable website Is simply redirecting traffic the only sensible to thing given the above restriction? Any technical issues to be mindful of? Thanks, Siim
Technical SEO | | Siim-at-Wire0 -
Mobile site domain authority
Hello, I think this may be a coding issue, but hoping someone can help me. I am still having issues with our mobile site ranking, even though we created redirects/canonical to identify similar content between desktop version and mobile. I did notice through MOZ analysis of backlinks that we have no domain authority. If the mobile site is automatically detected dependent on the user, shouldn't we also have the same domain authority? How does that work exactly? How can we build up the domain authority for our mobile site? Any help would be greatly appreciated! Thanks
Technical SEO | | lfrazer0 -
Followed Linking Root Domains and No Followed Linking Domains
If you have more NoFollowed Linking Root Domains than Followed Linking Root Domains is that a problem?
Technical SEO | | INN0 -
New Website and Domain Question
Hi all, I am launching a new website around the end of October and I have purchased a great domain to use for it. My question is should I put some kind of holding page up to try and start building up some domain authority in preperation for launch? Or maybe a blog at www.domain.com/blog and then keep all the blog content at the same location when the full site goes up? Or is it best to wait and just launch the site when the first version is complete? Thanks, Ben
Technical SEO | | BenInder0 -
Can I do a redirect to a new domain name only a couple of weeks after having redirected to another domain?
I have a client with two website with very similar content. Both had a lot of inbound links and performed fairly well in SERPS. We recently combined both sites and have redirected one of the domains to the other. The traffic dipped slightly initially, but is recovering nicely. Now the client registered a new domain name he would like to use for the site. Should I wait a few weeks for everything to settle down after the first redirect/consolidation of sites before doing a new redirect to a new domain name, or should I not worry about having any issues with doing it right away?
Technical SEO | | Drewco0 -
Two spelling of a domain
I have a customer with two spellings of their domain name. I set up an account for spelling A and forwarded all the email boxes to spelling B becuase people tend to remember spelling A more of the time. Spelling B is the real web site. I also want any www. traffic for spelling A to go to spelling B so I used this .htaccess file in the root of spelling A Options +Indexes +FollowSymLinks
Technical SEO | | freestone
RewriteEngine On
RewriteRule ^(.*)$ http://www.B.com/$1 [R=301,L] I use to just forward A to B from the registrar but made this change to allow for email spelled either way. My question is does this create a duplicate site issue for the bots? Is this in anyway an SEO negative and if so is there a better way to do this. Thanks jw0 -
Multiple domain names with similar content
Hi, we've got multiple domains that point to the same website and same content. The only difference is the currency and some text, you could say only about 5% difference in each domain's content: http://www.redwrappings.com.au/
Technical SEO | | Essentia
http://www.redwrappings.com/ Will Google penalise us for having 95% similar content for each domain (they sell the same products but in different currencies)? We shoudn't really put canonical link, should we? Because 5% of the content is different, which means they are not identical. What would be the best solution if this is a problem? Thanks0