SEO dealing with a CDN on a site.
-
This one is stumping me and I need some help. I have a client who's site is www.site.com and we have set them up a CDN through Max CDN at cdn.site.com which is basically a cname to the www.site.com site. The images in the GWT for www.site.com are de-indexing rapidly and the images on cdn.site.com are not indexing. In the Max CDN account I have the images from cdn.site.com sending a canonical header from www.site.com but that does not seem to help, they are all still de-indexing.
-
In my experience Google does a pretty good job of applying the rankings to the CDN version of the image if you follow those best practices.
Good luck!
-
They were being de-indexed until I did this, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
The images are not dynamic at all, they have static urls. The only thing that was being changes was the sub domain from www to cdn. When that happened the images started to de-index on the www sub domain, even though the cdn sub domain image had a canonical header pointing back to the www domain. You are welcome to check it out in action, the site is https://www.redwrappings.com.au/
-
Very odd, then, that they're being removed from the index. Do you think it's possible that the images have different URLs depending on which server they're cached on? That could definitely do it. I'd have a friend across the country pull them up and see if the image URL changes.
I'm assuming that the image has some dynamic characters on it, which is pretty common with CDNs under certain configurations. Unfortunately, I've never used MaxCDN. If the image is just cdn.site.com/image.png - I'm afraid I have absolutely no idea why they wouldn't be re-indexed. I have similar CDN images that pull in fine.
-
That article touches on a lot of the issues. Here are my thoughts and you can tell me if I am incorrect in my thinking. For a couple of years the images have been on the www part of the domain, now they are on a CDN sub directory. I was trying to keep them indexed under the www part of domain so that they would keep the authority of the domain. My thoughts were if they are de-indexed under the www and re-indexed under the cdn they would have to climb their way back in the image search rankings. Basically that is what I was trying to avoid.
-
They are static. It is a passthrough CDN that basically strips off the www of the image url and replaces it with cdn.
-
Hello Lesley,
Here is an article that may help, or provide some links to other resources at the bottom: Four Best Practices for Using a CDN .
Are you keeping the same filenames or do those change?
What is in the robots.txt file on the CDN?
Have you set up and verified the CDN in Google Webmaster Tools? If so, have you submitted an XML Sitemap?
-
Hi there,
Could you tell me whether the URLs on your images are static on the CDN sub-domain? Or do they change regularly?
-
Christy,
Sure thing, this is what is in place now, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
-
Thanks for the update, Lesley. I'm sorry to hear that you haven't found a solution you're happy with. Let me see if any of the other Associates can help you troubleshoot this. In the meantime, are you able to to share the details of your workaround?
Christy
-
None of the answers really applied. I did a work around that I am not too happy with at the current time.
-
Hi Lesley, what is the current status of this issue? Were you able to resolve it, or are you still having problems? We'd love an update, thanks!
Christy
-
I have not actually set up the robots.txt in maxcdn. But the cdn is not indexing, which is what I am wanting, doing the site: search for the cdn shows no results. For the main site though the images are falling out of the index, even though there is a site map for them and they are still accessible from their normal url.
-
I am not using Wordpress, the site is using PrestaShop, so it does not have those plugins.
This is how it is set up.
cdn.site.com is a cname of www.site.com so images are accessible from www.site.com/image.jpg and cdn.site.com/image.jpg but when they are served from the cdn.site.com/image.jpg they have the canonical header that points to www.site.com/image.jpg I cannot understand why that would de-index all of the images on www.site.com though
-
I had this problem.
Are you using W3 Total Cache... if so you should activate the Yoast SEO extension (presumably you use Yoast, and their sitemaps). You will find it at Performance -> Extensions in the W3 Total Cache admin area.
In addition, with the robots file in Maxcdn you should have something like:
User-agent: *
Disallow:/
Allow: /wp-content/
Allow: /wp-includes/You will find this under the SEO settings of your Pull zone in MaxCDN. Make sure you have robots & canonical header ticked as well.
The W3 Total cache extension will sort all the sitemap problems with pointing to the right URL and canonical. There is no need to do anything manually as some of the articles suggest.
-
I have no problem helping you look into this. I can play with it more tomorrow morning. I have a few more questions though. Did you setup the robots.txt in MaxCDN to point to the origin directory of the images? What happens when you search site:cdn.domain.com or site:www.domain.com in Google?
There are a few ideas that I have into why this is happening but I would like to test them prior to posting.
-
It is a pull zone.
-
Are you currently using a "Pull" or "Push" zone on MaxCDN?
-
That is actually what has been done, I have seen the article before. But there are two issues, one they are not indexing and it has been a couple of weeks. But the more major issue to me is that using the cdn url none of the link juice from the main domain is being passed to the cdn sub domain. I am trying to figure out why Google is not respecting the canonical header for the images. It would seem to me, that according to what Matt Cutts says that it would. But it is not.
-
I have had this issue in past as well when working with MaxCDN. I was able to apply a fix using some of the methods in this article to fix the issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What is the most effective way of selecting a top keyword per page on a site?
We are creating fresh content for outdated sites and I need to identify the most significant keyword per page for the content developers, What is the best way to do this?
Reporting & Analytics | | Sable_Group0 -
Is there a way to map your on-page SEO changes with the organic growth?
Hi Mozzers, I was just wondering if there's a way we can map our on-page SEO changes with the increase/decrease in organic traffic. For instance, I introduced brand pages' link the product page breadcrumbs and suddenly organic traffic for my brand pages increase from X to 2X in 1 couple of weeks. Now, this can be because of this breadcrumb change purely or because of some algorithm update or may be, bots started finding the content interesting and hence, started ranking them up (in case the brand pages were launched recently). So, you can't say which change should be mapped to what increase/decrease in organic traffic. Or, is there a way to map this?
Reporting & Analytics | | _nitman0 -
Weird visitors to my site
Hi, I am in the process of disentangling myself from a dodgy SEO company. At some point they set up another GA account on my site without consulting me. I replaced the tracking code with my original account on my wordpress site, placing the tracking code on the dashboard. There is a box in the dashboard for you to do this. For some reason the account he created is still giving me analytics but from mostly one url :forum.topic55622342.darodar.com. It has marked it as a referral? When you click it it redirects to this site : http://activities.aliexpress.com/computers_channel.php?aff_platform=aaf&sk=vV3B2RJYB%3A&cpt=1421321021096&null There have been 218 visits from this "referral" in the last month and also 2 direct visits to a clients online gallery (i'm a photographer). I am guessing the code for this new account is still on the site somewhere? Funnily enough in the first month I was getting targeted by spam using my contact form and I was a bit perplexed as to why. We had to put captchas on the contact forms which I was loathe to do as its another step for a client to have to go through causing resistance. Has this link got something to do with it? I have recently disavowed a lot of toxic links he created, so maybe they had something to do with it? Best wishes. David.
Reporting & Analytics | | WallerD0 -
Site re-crawled?
I've fixed many of my errors, but they're still showing in my dashboard. When will the site be crawled again?
Reporting & Analytics | | sakeith0 -
Can you link several sites together in Google Webmaster Tools?
I have a client saying that there is a way to link 3 separate websites (A website for each department of a company) in Google Webmaster tools to tell Google it's basically one site but really its 3. Or to tell Google it's the same company and all the sites are one. I have never heard of this & I don't see the point in making 3 separate small sites & "linking" them as one in Webmaster tools. Is there in fact a way? Am I out to lunch on what they might be referring to? I am recommending they create one larger authority site with a page on each department & earn links for each department page & provide informative unique content for each page. Thoughts? Thanks for the help!
Reporting & Analytics | | DCochrane0 -
Google SEO - Where have I disappeared to?
Okay, so first off Google, I hate you. Before I signed up for SEOMoz, my website was hitting page 9 and page 10 for some ultra difficult keywords. After spending a month using Hubspot and SEOMoz, I finally made it on to page 2 of google, for said 'impossible to rank high' keywords, which I was super happy with. But last week, I login to find that I have disappeared off the top 100 pages of Google for about 100 of my top keywords!!!! What the hell did I do wrong? I tried to please Google, but my website is still indexed, but just not ranked at all for any of my top keywords. The last thing I did before I disappeared overnight was add "follow me" buttons to all my pages and "share this" buttons to all my blogs. Could this be the problem? My website and main keyword is Process Server Is there anyone who could help push me in the right direction? I have no idea what I did wrong. 😕 Martyn
Reporting & Analytics | | spymore0 -
Any thoughts on why Nextag and MonsterMarketPlace are linking to our site?
I'm looking in WMT at the crawl errors and I noticed that our website has gotten a lot of Not Found crawl errors that seem strange. A lot of these not found pages are Display URLs that I use in PPC advertising, but not actual redirects (i.e. explorica.com/EducationalTrips). When I looked at how these links were being found, the inbound links were coming from Nextag.com and monstermarketplace.com, two sites that our company has never had a relationship with. We're an educational travel company, so we'd have no reason to. When I followed the links, it looks like it's coming from their "Sponsored Links," but these aren't Google or Bing Ads. We don't even advertise on the content network. Example link: http://www.monstermarketplace.com/starters-and-alternators/alternator-motorola-style-12v-51a-10376 (the ads do rotate so my site might not appear when you check it out). Anyone ever had experience with this type of issue?
Reporting & Analytics | | Explorica0 -
Something strange going on with new client's site...
Please forgive my stupidity if there is something obvious here which I have missed (I keep assuming that must be the case), but any advice on this would be much appreciated. We've just acquired a new client. Despite having a site for plenty of time now they did not previously have analytics with their last company (I know, a crime!). They've been with us for about a month now and we've managed to get them some great rankings already. To be fair, the rankings weren't bad before us either. Anyway. They have multiple position one rankings for well searched terms both locally and nationally. One would assume therefore that a lot of their traffic would come from Google right? Not according to their analytics. In fact, very little of it does... instead, 70% of their average 3,000 visits per month comes from just one referring site. A framed version of their site which is through reachlocal, which itself doesn't rank for any of their terms. I don't get it... The URL of the site is: www.namgrass.co.uk (ignore there being a .com too, that's a portal as they cover other countries). The referring site causing me all this confusion is: http://namgrass.rtrk.co.uk/ (see source code at the bottom for the reachlocal thing). Now I know reach local certainly isn't sending them all that traffic, so why does GA say it is... and what is this reachlocal thing anyway?? I mean, I know what reachlocal is, but what gives here with regards to it? Any ideas, please??
Reporting & Analytics | | SteveOllington0