SEO dealing with a CDN on a site.
-
This one is stumping me and I need some help. I have a client whose site is www.site.com, and we have set them up with a CDN through MaxCDN at cdn.site.com, which is basically a CNAME to www.site.com. In GWT (Google Webmaster Tools), the images for www.site.com are de-indexing rapidly and the images on cdn.site.com are not indexing. In the MaxCDN account I have the images from cdn.site.com sending a canonical header that points to www.site.com, but that does not seem to help; they are all still de-indexing.
-
In my experience Google does a pretty good job of applying the rankings to the CDN version of the image if you follow those best practices.
Good luck!
-
They were being de-indexed until I did this: http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
The images are not dynamic at all; they have static URLs. The only thing that was being changed was the subdomain, from www to cdn. When that happened, the images started to de-index on the www subdomain, even though the cdn subdomain image had a canonical header pointing back to the www domain. You are welcome to check it out in action; the site is https://www.redwrappings.com.au/
-
Very odd, then, that they're being removed from the index. Do you think it's possible that the images have different URLs depending on which server they're cached on? That could definitely do it. I'd have a friend across the country pull them up and see if the image URL changes.
I'm assuming that the image has some dynamic characters on it, which is pretty common with CDNs under certain configurations. Unfortunately, I've never used MaxCDN. If the image is just cdn.site.com/image.png - I'm afraid I have absolutely no idea why they wouldn't be re-indexed. I have similar CDN images that pull in fine.
-
That article touches on a lot of the issues. Here are my thoughts, and you can tell me if I am incorrect in my thinking. For a couple of years the images have been on the www part of the domain; now they are on a CDN subdomain. I was trying to keep them indexed under the www part of the domain so that they would keep the authority of the domain. My thinking was that if they are de-indexed under www and re-indexed under cdn, they would have to climb their way back up the image search rankings. That is what I was trying to avoid.
-
They are static. It is a passthrough CDN that basically strips off the www of the image url and replaces it with cdn.
-
Hello Lesley,
Here is an article that may help, or provide some links to other resources at the bottom: Four Best Practices for Using a CDN.
Are you keeping the same filenames or do those change?
What is in the robots.txt file on the CDN?
Have you set up and verified the CDN in Google Webmaster Tools? If so, have you submitted an XML Sitemap?
-
Hi there,
Could you tell me whether the URLs on your images are static on the CDN sub-domain? Or do they change regularly?
-
Christy,
Sure thing, this is what is in place now, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
-
Thanks for the update, Lesley. I'm sorry to hear that you haven't found a solution you're happy with. Let me see if any of the other Associates can help you troubleshoot this. In the meantime, are you able to share the details of your workaround?
Christy
-
None of the answers really applied. I did a workaround that I am not too happy with at the moment.
-
Hi Lesley, what is the current status of this issue? Were you able to resolve it, or are you still having problems? We'd love an update, thanks!
Christy
-
I have not actually set up the robots.txt in MaxCDN. But the CDN is not being indexed, which is what I want: a site: search for the cdn subdomain shows no results. On the main site, though, the images are falling out of the index, even though there is a sitemap for them and they are still accessible from their normal URLs.
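For reference, here is a minimal sketch of what an image sitemap listing the www image URLs could look like, built with Python's standard library. The page URL `https://www.site.com/product.html` and the function name `build_image_sitemap` are hypothetical placeholders; the `image:image` and `image:loc` elements and their namespace are the ones used by Google's image sitemap extension. It is not the sitemap actually in use on the site.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"


def build_image_sitemap(pages: Dict[str, List[str]]) -> str:
    """Build a sitemap that lists, for each page URL, the images shown on it."""
    # Register prefixes so the output uses a default namespace and "image:".
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("image", IMAGE_NS)

    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for page_url, image_urls in pages.items():
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = page_url
        for image_url in image_urls:
            image = ET.SubElement(url, f"{{{IMAGE_NS}}}image")
            ET.SubElement(image, f"{{{IMAGE_NS}}}loc").text = image_url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical example: one product page with one image on the www host.
sitemap_xml = build_image_sitemap(
    {"https://www.site.com/product.html": ["https://www.site.com/image.jpg"]}
)
```

The image URLs here deliberately use the www host, since that is the version meant to stay indexed.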
-
I am not using WordPress; the site runs on PrestaShop, so it does not have those plugins.
This is how it is set up.
cdn.site.com is a CNAME of www.site.com, so images are accessible from both www.site.com/image.jpg and cdn.site.com/image.jpg. When they are served from cdn.site.com/image.jpg, they carry a canonical header that points to www.site.com/image.jpg. I cannot understand why that would de-index all of the images on www.site.com, though.
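A minimal sketch of how that canonical header could be checked, assuming the CDN sends it as an HTTP `Link` header with `rel="canonical"`. The helper names `canonical_from_link_header` and `expected_canonical` are made up for illustration, and the parser is simplified: it does not handle commas or semicolons inside quoted parameter values.

```python
import re
from typing import Optional
from urllib.parse import urlsplit


def canonical_from_link_header(link_header: str) -> Optional[str]:
    """Return the URL marked rel="canonical" in a Link header value, or None."""
    # Each link-value looks like: <URL>; rel="canonical"; other="param"
    for match in re.finditer(r'<([^>]*)>\s*((?:;\s*[^;,]+)*)', link_header):
        url, params = match.group(1), match.group(2)
        for param in params.split(";"):
            name, _, value = param.strip().partition("=")
            if name.strip().lower() == "rel":
                rels = value.strip().strip('"').lower().split()
                if "canonical" in rels:
                    return url
    return None


def expected_canonical(cdn_url: str, origin_host: str) -> str:
    """Swap the host of a CDN URL for the origin host, keeping path and scheme."""
    return urlsplit(cdn_url)._replace(netloc=origin_host).geturl()


# Example with the placeholder hosts from this thread.
header = '<https://www.site.com/image.jpg>; rel="canonical"'
cdn_url = "https://cdn.site.com/image.jpg"
header_matches = (
    canonical_from_link_header(header) == expected_canonical(cdn_url, "www.site.com")
)
```

In practice the header value would come from the response to a request for the image on the cdn host; the string above only stands in for it.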
-
I had this problem.
Are you using W3 Total Cache... if so you should activate the Yoast SEO extension (presumably you use Yoast, and their sitemaps). You will find it at Performance -> Extensions in the W3 Total Cache admin area.
In addition, the robots file in MaxCDN should contain something like:
User-agent: *
Disallow: /
Allow: /wp-content/
Allow: /wp-includes/
You will find this under the SEO settings of your Pull zone in MaxCDN. Make sure you have robots & canonical header ticked as well.
The W3 Total Cache extension will sort out the sitemap problems by pointing to the right URL and canonical. There is no need to do anything manually, as some of the articles suggest.
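To see which paths those robots rules leave crawlable, here is a rough sketch that applies the precedence Google documents for robots.txt: the matching rule with the longest path wins, and on a tie the less restrictive (Allow) rule wins. The function name `is_allowed` and the sample paths are illustrative assumptions, and wildcards (`*`, `$`) are not handled.

```python
from typing import List, Tuple

# The rules quoted above, as (directive, path prefix) pairs.
RULES: List[Tuple[str, str]] = [
    ("disallow", "/"),
    ("allow", "/wp-content/"),
    ("allow", "/wp-includes/"),
]


def is_allowed(path: str, rules: List[Tuple[str, str]] = RULES) -> bool:
    """Return True if the longest matching rule for this path is an Allow."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, prefix in rules:
        if prefix and path.startswith(prefix):
            longer = len(prefix) > best_len
            tie_prefers_allow = len(prefix) == best_len and kind == "allow"
            if longer or tie_prefers_allow:
                best_len = len(prefix)
                allowed = kind == "allow"
    return allowed


# Hypothetical paths: an uploaded image and a regular page.
image_ok = is_allowed("/wp-content/uploads/image.jpg")
page_ok = is_allowed("/index.php")
```

Under these rules only the WordPress asset directories stay crawlable on the CDN host, so a non-WordPress site would need its own image directory in the Allow lines.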
-
I have no problem helping you look into this. I can play with it more tomorrow morning. I have a few more questions, though. Did you set up the robots.txt in MaxCDN to point to the origin directory of the images? What happens when you search site:cdn.domain.com or site:www.domain.com in Google?
I have a few ideas as to why this is happening, but I would like to test them before posting.
-
It is a pull zone.
-
Are you currently using a "Pull" or "Push" zone on MaxCDN?
-
That is actually what has been done; I have seen the article before. But there are two issues. First, the images are not indexing, and it has been a couple of weeks. Second, and the bigger issue to me, when the CDN URL is used, none of the link juice from the main domain is passed to the cdn subdomain. I am trying to figure out why Google is not respecting the canonical header for the images. According to what Matt Cutts says, it seems it should, but it does not.
-
I have had this issue in the past as well when working with MaxCDN. I was able to fix it using some of the methods in this article.