How was cdn.seomoz.org configured?
-
The SEOmoz CDN appears to have a "pull zone" that is set to the root of the domain, such that any static file can be addressed from either subdomain:
http://www.seomoz.org/q/moz_nav_assets/images/logo.png
http://cdn.seomoz.org/q/moz_nav_assets/images/logo.png
The risk of this configuration is that web pages (not just images/CSS/JS) also get cached and served by the CDN. I won't put the URL here for fear of Google indexing it, but if you replace the 'www' in the URL below with 'cdn', you'll see a cached copy of the original:
http://www.seomoz.org/ugc/the-greatest-attribution-ever-graphed
The worst-case scenario is that the homepage gets indexed. But that doesn't happen here: the homepage URL on the cdn subdomain issues a 301 redirect back to the canonical www subdomain, as it should.
Here's my question: how was that done?
Because maxcdn.com can't do it. If you set a "pull zone" to your entire domain, they'll cache your homepage and everything else. Googlebot has a field day with that; it will reindex your entire site off the CDN.
Maybe the SEOmoz CDN provider (CloudFront) allows specific URLs to be blocked? Or do you detect the CloudFront IPs and serve them a 301 (which they'd proxy out to anyone requesting cdn.seomoz.org)?
One solution is to create a pull zone that points to a folder, like example.com/images... but this doesn't help a complex site that has cacheable content in multiple places (do you WordPress users really store ALL your static content under /wp-content/?).
Or, as suggested above, dynamically detect requests from the CDN's proxy servers, and give them a 301 for any HTML-page request. This gets complex quickly, and is both prone to breakage and very difficult to regression-test.
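To make the detect-and-redirect idea concrete, here is a rough Python sketch of the decision logic only. It is not how SEOmoz or CloudFront actually do it (that is exactly what I'm asking). Everything named here is an assumption for illustration: the canonical host www.example.com, the list of static file extensions, the function names, and the idea that you can get a trustworthy list of the CDN's proxy IP ranges from your provider.

```python
import ipaddress
from pathlib import PurePosixPath
from typing import Iterable, Optional
from urllib.parse import urlsplit

# Assumption: the canonical hostname that HTML pages should live on.
CANONICAL_HOST = "www.example.com"

# Assumption: extensions we are happy to let the CDN cache and serve.
STATIC_EXTENSIONS = {
    ".png", ".jpg", ".jpeg", ".gif", ".ico", ".svg",
    ".css", ".js", ".woff",
}


def is_cdn_ip(remote_ip: str, cdn_networks: Iterable[str]) -> bool:
    """Return True if remote_ip falls inside any of the given CIDR ranges."""
    addr = ipaddress.ip_address(remote_ip)
    return any(addr in ipaddress.ip_network(net) for net in cdn_networks)


def cdn_redirect_target(
    remote_ip: str, request_path: str, cdn_networks: Iterable[str]
) -> Optional[str]:
    """Return the URL to 301 to, or None if the request should be served normally.

    Only requests that come from the CDN's proxies AND do not look like
    static files are redirected back to the canonical host.
    """
    if not is_cdn_ip(remote_ip, cdn_networks):
        return None  # ordinary visitor or crawler hitting www directly

    # Ignore any query string when looking at the file extension.
    extension = PurePosixPath(urlsplit(request_path).path).suffix.lower()
    if extension in STATIC_EXTENSIONS:
        return None  # let the CDN pull and cache the static asset

    # Anything else is treated as an HTML page: send it back to www.
    return f"http://{CANONICAL_HOST}{request_path}"
```

The origin server would call `cdn_redirect_target` on each request and answer with a 301 whenever it returns a URL; the CDN would then cache and relay that redirect to anyone asking for the page on the cdn subdomain. The fragile part is the IP list: if the provider adds proxy ranges and the list is stale, pages start leaking onto the CDN again without any visible error.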
Properly retrofitting a complex site to use a CDN, without creating a half-dozen new CDN subdomains, does not appear to be easy.
-
It's a SEOmoz secret...