SEO dealing with a CDN on a site.
-
This one is stumping me and I need some help. I have a client who's site is www.site.com and we have set them up a CDN through Max CDN at cdn.site.com which is basically a cname to the www.site.com site. The images in the GWT for www.site.com are de-indexing rapidly and the images on cdn.site.com are not indexing. In the Max CDN account I have the images from cdn.site.com sending a canonical header from www.site.com but that does not seem to help, they are all still de-indexing.
-
In my experience Google does a pretty good job of applying the rankings to the CDN version of the image if you follow those best practices.
Good luck!
-
They were being de-indexed until I did this, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
The images are not dynamic at all, they have static urls. The only thing that was being changes was the sub domain from www to cdn. When that happened the images started to de-index on the www sub domain, even though the cdn sub domain image had a canonical header pointing back to the www domain. You are welcome to check it out in action, the site is https://www.redwrappings.com.au/
-
Very odd, then, that they're being removed from the index. Do you think it's possible that the images have different URLs depending on which server they're cached on? That could definitely do it. I'd have a friend across the country pull them up and see if the image URL changes.
I'm assuming that the image has some dynamic characters on it, which is pretty common with CDNs under certain configurations. Unfortunately, I've never used MaxCDN. If the image is just cdn.site.com/image.png - I'm afraid I have absolutely no idea why they wouldn't be re-indexed. I have similar CDN images that pull in fine.
-
That article touches on a lot of the issues. Here are my thoughts and you can tell me if I am incorrect in my thinking. For a couple of years the images have been on the www part of the domain, now they are on a CDN sub directory. I was trying to keep them indexed under the www part of domain so that they would keep the authority of the domain. My thoughts were if they are de-indexed under the www and re-indexed under the cdn they would have to climb their way back in the image search rankings. Basically that is what I was trying to avoid.
-
They are static. It is a passthrough CDN that basically strips off the www of the image url and replaces it with cdn.
-
Hello Lesley,
Here is an article that may help, or provide some links to other resources at the bottom: Four Best Practices for Using a CDN .
Are you keeping the same filenames or do those change?
What is in the robots.txt file on the CDN?
Have you set up and verified the CDN in Google Webmaster Tools? If so, have you submitted an XML Sitemap?
-
Hi there,
Could you tell me whether the URLs on your images are static on the CDN sub-domain? Or do they change regularly?
-
Christy,
Sure thing, this is what is in place now, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
-
Thanks for the update, Lesley. I'm sorry to hear that you haven't found a solution you're happy with. Let me see if any of the other Associates can help you troubleshoot this. In the meantime, are you able to to share the details of your workaround?
Christy
-
None of the answers really applied. I did a work around that I am not too happy with at the current time.
-
Hi Lesley, what is the current status of this issue? Were you able to resolve it, or are you still having problems? We'd love an update, thanks!
Christy
-
I have not actually set up the robots.txt in maxcdn. But the cdn is not indexing, which is what I am wanting, doing the site: search for the cdn shows no results. For the main site though the images are falling out of the index, even though there is a site map for them and they are still accessible from their normal url.
-
I am not using Wordpress, the site is using PrestaShop, so it does not have those plugins.
This is how it is set up.
cdn.site.com is a cname of www.site.com so images are accessible from www.site.com/image.jpg and cdn.site.com/image.jpg but when they are served from the cdn.site.com/image.jpg they have the canonical header that points to www.site.com/image.jpg I cannot understand why that would de-index all of the images on www.site.com though
-
I had this problem.
Are you using W3 Total Cache... if so you should activate the Yoast SEO extension (presumably you use Yoast, and their sitemaps). You will find it at Performance -> Extensions in the W3 Total Cache admin area.
In addition, with the robots file in Maxcdn you should have something like:
User-agent: *
Disallow:/
Allow: /wp-content/
Allow: /wp-includes/You will find this under the SEO settings of your Pull zone in MaxCDN. Make sure you have robots & canonical header ticked as well.
The W3 Total cache extension will sort all the sitemap problems with pointing to the right URL and canonical. There is no need to do anything manually as some of the articles suggest.
-
I have no problem helping you look into this. I can play with it more tomorrow morning. I have a few more questions though. Did you setup the robots.txt in MaxCDN to point to the origin directory of the images? What happens when you search site:cdn.domain.com or site:www.domain.com in Google?
There are a few ideas that I have into why this is happening but I would like to test them prior to posting.
-
It is a pull zone.
-
Are you currently using a "Pull" or "Push" zone on MaxCDN?
-
That is actually what has been done, I have seen the article before. But there are two issues, one they are not indexing and it has been a couple of weeks. But the more major issue to me is that using the cdn url none of the link juice from the main domain is being passed to the cdn sub domain. I am trying to figure out why Google is not respecting the canonical header for the images. It would seem to me, that according to what Matt Cutts says that it would. But it is not.
-
I have had this issue in past as well when working with MaxCDN. I was able to apply a fix using some of the methods in this article to fix the issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why are these sites outranking me?
I've been hit by every update and have spent thousands of $ and hundreds of hours trying to survive. Survival looks doubtful if I can't get turned around in 4 weeks or less. I have found adwords and google errors and fixed them. Alexa says us-nano.com is the best ranked site. I used my moz bar and they are doing everything wrong, keyword stuffing, no H1's tags, poor design. How are they ranking & I'm not? My duplicate meta tags are from this week when I added alexa and bing ID's to my header to verify my site ownership. http://imgur.com/a/I1bsw I1bsw
Reporting & Analytics | | cheaptubes0 -
Link Activity to site keeps decreasing
Recently I have started seeing daily declines in the total inbound links to my site. 4 of my top 5 sources are all experiencing pretty significant declines and in fact, Pinterest has declined from a high of 16,295 links in Nov '15 to a current number that is 8,479 on 1/6/16. Any ideas on what could be causing these declines? I did upload a new sitemap in Nov, could this be the cause?
Reporting & Analytics | | ctripp10100 -
How Am I Linked to These Sites?
I was going through Google Analytics, specifically looking at the Channels under Acquisition. I found a number of sites that Google is giving credit as sources of traffic.
Reporting & Analytics | | TowerMarketing
When I went to the sites, however, and searched the page source for links to my site, I could not find any. My site is www.towermarketing.net. The sites being listed by GA are:
blackhawkhardware.com
musicas.baixar-musicas-gratis.com If anyone can tell me how Tower is getting traffic from these sites, and what I can do to fix it - because at this point I am considering them spam - I would appreciate it. Thanks!0 -
My GWT tells me that verification has failed numerous occasions - will this stop my site being crawled?
I launched www.over50choices.co.uk 6 weeks ago and have had trouble with google indexing and crawling all pages. It tells me 143 submitted & 129 Indexed, but the site has 166 pages? It still shows the old home page image in GWT - which is v annoying! Whilst the site is verified by GA & HTML Tag, it tells me in the Verification section that "reverification failed" on numerous occasions - they seem correspond with when google trys to process the site map. Is this a coincidence ie verification fails when its trying to process the site map, which in turn is leaving me with an out of date site map and therefore not all my pages submitted or crawled? Or will this not effect the googles ability to crawl the site? Your help please. Ash
Reporting & Analytics | | AshShep10 -
Anyone notice a drop in results using site operator?
I set our site's preferred domain back on January 28. We had a www and non www domain being indexed. Since then, I've seen the number or results for our site site operator (site:) decline dramatically. Not sure if this is a good thing or bad thing. So, I'm trying to see if it's unique to our site. My gut is that the numbers are probably leveling out to where they should be and the duplicates are falling out, but I would think that as I see number of results for non www decline, the number of results for www would increase. Any thoughts? Anyone else seeing fluctuations in results using site: ? Lisa
Reporting & Analytics | | Aggie0 -
Is there any way to find out how many people blocked our site in Google?
With it being a factor in Panda, I would like to know what the number is and whether it's high / low compared to other sites etc. I kind of doubt a lot of people would have blocked us, but I'd like to at least be aware of whether that's a problem for us.
Reporting & Analytics | | rango0 -
Top Notch SEOs on the forum - I have a challenge for you!
I have been hitting my head again the wall trying to figure out why Google does not move my page up. All the SEOMOZ numbers show that the page has higher authority than the competition, more links, more domain authority, older domain, and higher Page rank. In addition the page is error free and I've gotten a W3C HTML 4.01 approval. About a month ago We were at #10 , then after the changes I've made to content and the page Google moved the page to #8, Now the Page is back to #9. I've done plenty of SEO to drive traffic to the page and created some very powerful and quality links. However, it seems like the page is being punished for something. Much weaker competitors are outranking our page and I am not sure why. On page optimization, title tags, Meta Tags, keyword density, etc - everything is there. In fact I have a check list that I go by and everything is by the book. Not only that, one competitor now shows up twice. First the domain and then specific page. I've looked into specific page and all they did is create 600 internal links from their own website. So now they are number #2 and #3 If anyone can take a look and give me an idea of what is going on and maybe suggest Keyword: Laser Marking Url: http://tinyurl.com/3nt3g2h Any help would be appreciated..
Reporting & Analytics | | DmitryP0 -
Help with local SEO strategy for service industries
Here is the scenario I often wonder about: My client's tree removal service is ranking in #1 in local search for
Reporting & Analytics | | MozMan2
"tree removal town state." His Google Places account is set for a 30 mile radius. He has lots of directory listings and positive reviews. Some inbound links as well. The same client is ranking #1 in organic listing for "tree removal county state" ...I chose to target the county for organic listings because the client was dominating local search for the town. My reasoning: I thought, Google local search would bring all of the local specific searches for "tree removal town state" and organic listings would bring the broader searches for "tree removal county state." That is exactly what's happening and stats show there are some visitors coming to the site searching with the county name. Not a ton of traffic but a lot of keyword variations using the county name. The bulk of the traffic comes from the his Google Places listing for the town the business is located in which is great. Dilemma: My client is not ranking in local search results for neighboring towns just a few miles away and certainly not ranking in organic listings for neighboring towns either because we are targeting the county. He has a long list of town names he services in the footer area and this does seem to help for organic search in neighboring towns with little competition. Broad Question: How can I optimize pages for the same services in neighboring towns without duplicating content. For example, the home page title tag and H1 reads:
Tree Removal, Tree Trimming, Stump Removal, County State It would be very easy to create identical pages with title tags and page headings for the different towns but that would undoubtedly create duplicate content and would look weird to someone browsing the site. Specific Questions: Should I put the town name where the business is located in the title tag even though the site already ranks #1 for that town in local search, without having the town in the title tag? Why not use this importunity for an area that we are not ranking for? Do I nix the county and state and try to insert another town or two in the title and H1? Ideally I would like to have this site rank well in local search for all of the neighboring towns. This may be too broad of a post, (it is my first one) but perhaps there are a few of you out there that can outline strategies that work for service industries like, lawn care, tree removal, landscaping, etc. Thanks for reading.0