SEO dealing with a CDN on a site.
-
This one is stumping me and I need some help. I have a client who's site is www.site.com and we have set them up a CDN through Max CDN at cdn.site.com which is basically a cname to the www.site.com site. The images in the GWT for www.site.com are de-indexing rapidly and the images on cdn.site.com are not indexing. In the Max CDN account I have the images from cdn.site.com sending a canonical header from www.site.com but that does not seem to help, they are all still de-indexing.
-
In my experience Google does a pretty good job of applying the rankings to the CDN version of the image if you follow those best practices.
Good luck!
-
They were being de-indexed until I did this, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
The images are not dynamic at all, they have static urls. The only thing that was being changes was the sub domain from www to cdn. When that happened the images started to de-index on the www sub domain, even though the cdn sub domain image had a canonical header pointing back to the www domain. You are welcome to check it out in action, the site is https://www.redwrappings.com.au/
-
Very odd, then, that they're being removed from the index. Do you think it's possible that the images have different URLs depending on which server they're cached on? That could definitely do it. I'd have a friend across the country pull them up and see if the image URL changes.
I'm assuming that the image has some dynamic characters on it, which is pretty common with CDNs under certain configurations. Unfortunately, I've never used MaxCDN. If the image is just cdn.site.com/image.png - I'm afraid I have absolutely no idea why they wouldn't be re-indexed. I have similar CDN images that pull in fine.
-
That article touches on a lot of the issues. Here are my thoughts and you can tell me if I am incorrect in my thinking. For a couple of years the images have been on the www part of the domain, now they are on a CDN sub directory. I was trying to keep them indexed under the www part of domain so that they would keep the authority of the domain. My thoughts were if they are de-indexed under the www and re-indexed under the cdn they would have to climb their way back in the image search rankings. Basically that is what I was trying to avoid.
-
They are static. It is a passthrough CDN that basically strips off the www of the image url and replaces it with cdn.
-
Hello Lesley,
Here is an article that may help, or provide some links to other resources at the bottom: Four Best Practices for Using a CDN .
Are you keeping the same filenames or do those change?
What is in the robots.txt file on the CDN?
Have you set up and verified the CDN in Google Webmaster Tools? If so, have you submitted an XML Sitemap?
-
Hi there,
Could you tell me whether the URLs on your images are static on the CDN sub-domain? Or do they change regularly?
-
Christy,
Sure thing, this is what is in place now, http://moz.com/community/q/would-this-be-considered-cloaking-and-would-it-be-a-bad-move
-
Thanks for the update, Lesley. I'm sorry to hear that you haven't found a solution you're happy with. Let me see if any of the other Associates can help you troubleshoot this. In the meantime, are you able to to share the details of your workaround?
Christy
-
None of the answers really applied. I did a work around that I am not too happy with at the current time.
-
Hi Lesley, what is the current status of this issue? Were you able to resolve it, or are you still having problems? We'd love an update, thanks!
Christy
-
I have not actually set up the robots.txt in maxcdn. But the cdn is not indexing, which is what I am wanting, doing the site: search for the cdn shows no results. For the main site though the images are falling out of the index, even though there is a site map for them and they are still accessible from their normal url.
-
I am not using Wordpress, the site is using PrestaShop, so it does not have those plugins.
This is how it is set up.
cdn.site.com is a cname of www.site.com so images are accessible from www.site.com/image.jpg and cdn.site.com/image.jpg but when they are served from the cdn.site.com/image.jpg they have the canonical header that points to www.site.com/image.jpg I cannot understand why that would de-index all of the images on www.site.com though
-
I had this problem.
Are you using W3 Total Cache... if so you should activate the Yoast SEO extension (presumably you use Yoast, and their sitemaps). You will find it at Performance -> Extensions in the W3 Total Cache admin area.
In addition, with the robots file in Maxcdn you should have something like:
User-agent: *
Disallow:/
Allow: /wp-content/
Allow: /wp-includes/You will find this under the SEO settings of your Pull zone in MaxCDN. Make sure you have robots & canonical header ticked as well.
The W3 Total cache extension will sort all the sitemap problems with pointing to the right URL and canonical. There is no need to do anything manually as some of the articles suggest.
-
I have no problem helping you look into this. I can play with it more tomorrow morning. I have a few more questions though. Did you setup the robots.txt in MaxCDN to point to the origin directory of the images? What happens when you search site:cdn.domain.com or site:www.domain.com in Google?
There are a few ideas that I have into why this is happening but I would like to test them prior to posting.
-
It is a pull zone.
-
Are you currently using a "Pull" or "Push" zone on MaxCDN?
-
That is actually what has been done, I have seen the article before. But there are two issues, one they are not indexing and it has been a couple of weeks. But the more major issue to me is that using the cdn url none of the link juice from the main domain is being passed to the cdn sub domain. I am trying to figure out why Google is not respecting the canonical header for the images. It would seem to me, that according to what Matt Cutts says that it would. But it is not.
-
I have had this issue in past as well when working with MaxCDN. I was able to apply a fix using some of the methods in this article to fix the issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
SEO Keyword Research
Is there a way to insert Google API into Excel? I want to have a search volume column that would automatically fill out with query Another question is that I have a list of keywords in one column in Excel with another column that have search volume. I then go and do some research on keyword groups with a grouping tool but if this tool doesn't find a group for a keyword it removes it. So then when I want to copy and past keywords and match keyword columns in excel that already exist with search volume I have to do it manually because some rows are missing. Is there a way this could be done automatically so that when I paste it in the column with keywords it automatically finds that the current column exists and adds a group column ?
Reporting & Analytics | | aliciaporrata10090 -
Google Analytics SEO Queries Not Showing
Hi All, This might be a silly question, but for all the properties I monitor in Google Analytics, I'm now showing no data for SEO Queries under Acquisition for the past 6 days. Normally I would expect a few day delay in queries, but nothing for 6 days is somewhat peculiar especially as it was functioning fine prior to November 12th. Does anyone have insight into what might be going on? Thanks! URaNMa3
Reporting & Analytics | | amichaels0 -
Site operator result anomaly
"Site:" search for site:http://www.mycity4kids.com/Bangalore/activity-based-approach is showing 76 results.I am using SERPS Redux to collect all the indexed pages, but when I re-checked indexed status of these pages using "site operator" google showed that these pages are not indexed. What is the possible explanation for this? Thanks
Reporting & Analytics | | prsntsnh0 -
Does anyone know what's happened to google analytics -> traffic sources -> SEO -> queries many of my accounts are showing a drop to zero in the laste few days
Howdy mozzers It's in the question title really. Zero impressions showing for the last few days on. Multiple accounts Any thought out there
Reporting & Analytics | | Big_Partnership0 -
What is the impact of a panda refresh on a Pandalized site?
When a panda refresh hits and you have a pandalized site, If the site were to de-pandalized, would you see traffic back to pre-panda levels right away? Or any type of movement right away?
Reporting & Analytics | | jessefriedman0 -
Is there a tool to automatically gather website SEO data?
I am looking for a tool that will crawl a website and create a spreadsheet listing out all key data such as title, meta description, etc. Anyone know of an available tool to do that?
Reporting & Analytics | | jfeitlinger0 -
I am new to SEO. Would appreciate anyone share with me how can i further optimize my website to attract more visitors based on this scenario?
Hi all, Thanks and really appreciate for your time helping me out here, and i am very new to SEO. Just a brief introduction, our team have been developing a gaming review website (specifically targeting MMO game, facebook game, and browser-based game) since September 2011, and the website show significant improvement for the 1st month, and we got daily UV spiking up to 2k in November. The issue here is that the visitors start to become very stagnant, and even dropping now. Even though we try hard to publish more game reviews and there is NO IMPROVEMENT AT ALL for past 2 months in terms of number of visitors, and we have no idea how should we further proceed to attract more visitors...We have about 7 k site pages (however not all are with content). For the past 2 months it seems like there is nothing going on that proportionate to the effort we put in. Pretty discouraging, so we decided to sign up SEOMOZ to get SEO help from the community here. Some general questions we have: 1. Is it because of the the quality of the content we write ? (not enough keyword density, review too short, etc). 2. Is it because of the website designed not user friendly and not optimized? 3. Is it because of the general gaming trend, or other competitors? 4. Are we penalized by google by any chance? Are we spamming or anything wrong that deter our improvement? 5. Our main revenue source is on advertising, and currently we are using adsense. Would the ads placement affect anything? 6. Based on your knowledge and expertise, any advice would you give us? Would appreciate if you can take a kind look at the website to understand the scenario better: www.gameonline2.com, with 3 subdomain of: wiki.gameonline2; gameplay.gameonline2; cheats.gameonline2. For your reference, example our competitors are: www.onrpg.com; www.bbgsite.com; www.mmohut.com I would be happy to share with you any further information including google analytics if needed. I have report generated from SEOMOZ but i would appreciate the pros to help me take a look to give me a more complete ideas of how can i further improve to gain at least 10k visitors per day. Thanks again for your kind help and time. Andrew
Reporting & Analytics | | andrewsoh070 -
Will Google start trimming 'stale' sites rank?
With the recent focus on Google to reduce rank of farms and low value sites, I am interested to get SEO view on if you think Google will start devaluing stale sites. I do find it a bit frustrating that in the top 5 for my main key phrase, there is one site that has NO content just an error and another blog that has not updated content in 2 years. How can blogs that do not blog be considered high enough value by Google to rank in the top 5? How can sites that just return 404 or 500 for ALL their pages be even considered a site let alone rank 2nd. I am interested so see others experiences and thoughts on 'user experience' clean ups by Google and why these types of sites get missed?
Reporting & Analytics | | oznappies0