SEO dealing with a CDN on a site.
-
This one is stumping me and I need some help. I have a client who's site is www.site.com and we have set them up a CDN through Max CDN at cdn.site.com which is basically a cname to the www.site.com site. The images in the GWT for www.site.com are de-indexing rapidly and the images on cdn.site.com are not indexing. In the Max CDN account I have the images from cdn.site.com sending a canonical header from www.site.com but that does not seem to help, they are all still de-indexing.
-
In my experience Google does a pretty good job of applying the rankings to the CDN version of the image if you follow those best practices.
Good luck!
-
They were being de-indexed until I did this, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
The images are not dynamic at all, they have static urls. The only thing that was being changes was the sub domain from www to cdn. When that happened the images started to de-index on the www sub domain, even though the cdn sub domain image had a canonical header pointing back to the www domain. You are welcome to check it out in action, the site is https://www.redwrappings.com.au/
-
Very odd, then, that they're being removed from the index. Do you think it's possible that the images have different URLs depending on which server they're cached on? That could definitely do it. I'd have a friend across the country pull them up and see if the image URL changes.
I'm assuming that the image has some dynamic characters on it, which is pretty common with CDNs under certain configurations. Unfortunately, I've never used MaxCDN. If the image is just cdn.site.com/image.png - I'm afraid I have absolutely no idea why they wouldn't be re-indexed. I have similar CDN images that pull in fine.
-
That article touches on a lot of the issues. Here are my thoughts and you can tell me if I am incorrect in my thinking. For a couple of years the images have been on the www part of the domain, now they are on a CDN sub directory. I was trying to keep them indexed under the www part of domain so that they would keep the authority of the domain. My thoughts were if they are de-indexed under the www and re-indexed under the cdn they would have to climb their way back in the image search rankings. Basically that is what I was trying to avoid.
-
They are static. It is a passthrough CDN that basically strips off the www of the image url and replaces it with cdn.
-
Hello Lesley,
Here is an article that may help, or provide some links to other resources at the bottom: Four Best Practices for Using a CDN .
Are you keeping the same filenames or do those change?
What is in the robots.txt file on the CDN?
Have you set up and verified the CDN in Google Webmaster Tools? If so, have you submitted an XML Sitemap?
-
Hi there,
Could you tell me whether the URLs on your images are static on the CDN sub-domain? Or do they change regularly?
-
Christy,
Sure thing, this is what is in place now, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
-
Thanks for the update, Lesley. I'm sorry to hear that you haven't found a solution you're happy with. Let me see if any of the other Associates can help you troubleshoot this. In the meantime, are you able to to share the details of your workaround?
Christy
-
None of the answers really applied. I did a work around that I am not too happy with at the current time.
-
Hi Lesley, what is the current status of this issue? Were you able to resolve it, or are you still having problems? We'd love an update, thanks!
Christy
-
I have not actually set up the robots.txt in maxcdn. But the cdn is not indexing, which is what I am wanting, doing the site: search for the cdn shows no results. For the main site though the images are falling out of the index, even though there is a site map for them and they are still accessible from their normal url.
-
I am not using Wordpress, the site is using PrestaShop, so it does not have those plugins.
This is how it is set up.
cdn.site.com is a cname of www.site.com so images are accessible from www.site.com/image.jpg and cdn.site.com/image.jpg but when they are served from the cdn.site.com/image.jpg they have the canonical header that points to www.site.com/image.jpg I cannot understand why that would de-index all of the images on www.site.com though
-
I had this problem.
Are you using W3 Total Cache... if so you should activate the Yoast SEO extension (presumably you use Yoast, and their sitemaps). You will find it at Performance -> Extensions in the W3 Total Cache admin area.
In addition, with the robots file in Maxcdn you should have something like:
User-agent: *
Disallow:/
Allow: /wp-content/
Allow: /wp-includes/You will find this under the SEO settings of your Pull zone in MaxCDN. Make sure you have robots & canonical header ticked as well.
The W3 Total cache extension will sort all the sitemap problems with pointing to the right URL and canonical. There is no need to do anything manually as some of the articles suggest.
-
I have no problem helping you look into this. I can play with it more tomorrow morning. I have a few more questions though. Did you setup the robots.txt in MaxCDN to point to the origin directory of the images? What happens when you search site:cdn.domain.com or site:www.domain.com in Google?
There are a few ideas that I have into why this is happening but I would like to test them prior to posting.
-
It is a pull zone.
-
Are you currently using a "Pull" or "Push" zone on MaxCDN?
-
That is actually what has been done, I have seen the article before. But there are two issues, one they are not indexing and it has been a couple of weeks. But the more major issue to me is that using the cdn url none of the link juice from the main domain is being passed to the cdn sub domain. I am trying to figure out why Google is not respecting the canonical header for the images. It would seem to me, that according to what Matt Cutts says that it would. But it is not.
-
I have had this issue in past as well when working with MaxCDN. I was able to apply a fix using some of the methods in this article to fix the issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Pagination, SEO, and you
So, i have done some research on this and I am running into 2 problems. We run a review site for a specific niche. wordpress is viewing our category pages as an "archive" which i don't know that it really is. Google seems to only be indexing the first 9 pages of this category. We would like google to be indexing these pages because thats the only place on our website where all our specific products are linked. Any thoughts are greatly appreciated.
Reporting & Analytics | | HashtagHustler1 -
High Bounce Rate on traffic generating area of our site
Hi, Our eCommerce site currently includes a blog section known as Igloo which we have filled with unique and helpful content that is useful to a fair few people, not just customers of ours. It currently attracts a large number of visitors (more than the actual eCommerce side of the site in actual fact) organically who aren't currently customers of ours. Very few of these turn in to paying clients so it's not really a money spinner but it has worked quite well from a linkbait perspective / traffic generation perspective and undoubtedly a few of these people do end up making a purchase on the actual shopping end of our site. We're look at ways to encourage these people finding help on this free resource to take a look at our homepage and hopefully make an order but in the meantime I am worried that there may be a few downsides to us creating this content: Google may see us more as a help site than a shopping site. Since selling products is where we make our money this could ultimately be a bad thing. Our bounce rate is REALLY high (I'm talking around 94%) on the help site versus around 20% on the eCommerce site. I guess people land on the article they want, read it and then disappear. Would this bounce rate skew our entire site stats and ultimately result in decreased performance in the SERPS. I would appreciate your opinions and, in the event you do feel it may be hurting us overall perhaps some suggestions on how to mitigate the effects? Many thanks!
Reporting & Analytics | | ChrisHolgate0 -
Webmaster Tools Suddenly Asking For Verification of Site Registered for 5 Years
Google Webmaster Tools has been successfully installed on my website, (www.nyc-officespace-leader.com) for more than five years. Suddenly, today I have received a request to Verify this Site". This makes no sense. The only possibility I can think of is that this is somehow tied to the following events in the last month: 1. Launch of new version of website on June 4th
Reporting & Analytics | | Kingalan1
2. Installation of Google of Tag Manager
3. Sudden Increase in number of pages indexed by Google. Unexplained indexing of an additional 175 pages. About 625 pages should be indexed, while 800 are now indexed. In the last month ranking and traffic have fallen sharply. Could it be tat these issues are all linked? But the strangest issue is the request to verify the site. Does anyone have any ideas? Thanks,
Alan0 -
Looking for an Automated SEO report Software Solution
Buon Giorno from 4 degrees C mostly cloudy Wetherby UK 🙂 I love Google Analytics but I'm bogged down with analytics report writting. I'm looking for a web analytics softeare package that: 1. White Label ie we can brand the reports up
Reporting & Analytics | | Nightwing
2. Bespoke ie i can pick and choose what I report on
3. Automated ie I can set a time & date when the client receives the report. Any recommendations appreciated 🙂 Grazie tanto, David0 -
Does prevent links from being included in Google Webmaster linking sites report?
My client has clean links in edit from nytimes.com. The links do not have nofollow tags. Google Webmaster stopped including links from nytimes.com in the external linking domains report and we don't know why since the URL is still live. The nytimes.com URL includes this tag in the source code: Are links on pages with NOARCHIVE still counted in Google Webmaster linking domains reports?
Reporting & Analytics | | ebenthurston0 -
Why did I loose all my product page rankings (e-commerce site)
This friday I noticed that I'd lost pretty much all my product pages in the SERP and also their rankings for the product names. These are products I both have introduced to the market (sweden) and also some that I've been the only one selling. I've analyzed a couple of different ranking-faults. Examples: **"super mario väggdekaler" should rank **http://www.roligaprylar.se/Super-Mario-Vaeggdekaler.html as #1 and has done for several years. Instead this search in my internal search engine ranks #10-#15 with no relevance. www.roligaprylar.se/?q=mario%20v%E4g "jedi morgonrock" should rank www.roligaprylar.se/Jedi-Morgonrock.html as #1 or #2 but instead this url ranks as #12 www.roligaprylar.se/product_detail.php?pid=Jedi-Morgonrock "Charlie sheen bobblehead" (in the swedish serp this should be the most simple term to rank on. previously #1) my internal search engine ranks for #8 with this url <cite>www.roligaprylar.se/?q=Charlie%20Sheen%20Bobblehead</cite>J So I've drawn these conclusions and actions Products that don't rank well longer but still ranks with their alternative non-rewritten url has gotten deep links from affilliates (i track affilliate ids and stuff via this link) and have replaced the original url which is rewritten. Action: Canonical urls for these non-rewritten products to the rewritten version. For example on this product page www.roligaprylar.se/product_detail.php?pid=Jedi-Morgonrock I've placed a canonical for this url www.roligaprylar.se/Jedi-morgonrock.html With the products not ranking at all or when searches in my search engine shows up I suspect some kind of dup content punishment where Google thinks the search result is more important than the product page. Action: All search-pages are now noindex,follow I also increased product name density in terms of keywords on the product page. But I'm still owned and losing tons of money during the holidays (buying adwords at obscene amounts instead hehe). So just wanted to hear with you guys. Are my conclusions and actions correct? What have I missed, what more could I do to reverse this? Thanks Dan
Reporting & Analytics | | nuttinalle0 -
How to Refesh site comapign?
How to Refesh site comapign? its displaying 3 days old data. now fixed some contents. unable to test it. kindly guide me for howto refresh the report?
Reporting & Analytics | | peanut20100 -
GoDaddy SEO Services?
So one of my client's wants to know why he shouldn't just hire GoDaddy to "do SEO" for him! He found this page: http://www.godaddy.com/search-engine/seo-services.aspx Without actually trying it, It looks like a bunch of stuff I already do (rank tracking, PR tracking, traffic tracking, ROI projections), stuff I shouldn't do (site submissions to search engines), and stuff I probably don't need ("keyword wizard"). That said, I am intrigued by: the dedicated phone number with track and monitor (is it better than Hosted Numbers?), the business listing (is it a link worth the $6/mo?), site analysis and optimization, and ROI dashboard. Does anyone have any experience with this? Does any piece of it have any value?
Reporting & Analytics | | TheEspresseo1