How was cdn.seomoz.org configured?
-
The SEOmoz CDN appears to have a "pull zone" that is set to the root of the domain, such that any static file can be addressed from either subdomain:
http://www.seomoz.org/q/moz_nav_assets/images/logo.png
http://cdn.seomoz.org/q/moz_nav_assets/images/logo.png
The risk of this configuration is that web pages (not just images/CSS/JS) also get cached and served by the CDN. I won't put the URL here for fear of Google indexing it, but if you replace the 'www' in the URL below with 'cdn', you'll see a cached copy of the original:
http://www.seomoz.org/ugc/the-greatest-attribution-ever-graphed
The worst-case scenario is that the homepage gets indexed. But this doesn't happen here: the homepage on the cdn subdomain issues a 301 redirect back to the canonical www subdomain. As it should.
Here's my question: how was that done?
Because maxcdn.com can't do it. If you set a "pull zone" to your entire domain, they'll cache your homepage and everything else. Googlebot has a field day with that; it will reindex your entire site off the CDN.
Maybe the SEOmoz CDN provider (CloudFront) allows specific URLs to be blocked? Or do you detect the CloudFront IPs and serve them a 301 (which they'd proxy out to anyone requesting cdn.seomoz.org)?
One solution is to create a pull zone that points to a folder, like example.com/images... but this doesn't help a complex site that has cacheable content in multiple places (do you WordPress users really store ALL your static content under /wp-content/?).
Or, as suggested above, dynamically detect requests from the CDN's proxy servers, and give them a 301 for any HTML-page request. This gets complex quickly, and is both prone to breakage and very difficult to regression-test.
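To make the detect-and-redirect idea concrete, here is a minimal sketch of the decision logic, not a description of how SEOmoz actually does it. Everything in it is an assumption for illustration: the CDN_NETWORKS ranges are placeholder documentation networks (a real setup would need the provider's published edge ranges), the extension list is arbitrary, and redirect_for is a hypothetical helper you would call from your own request handler.

```python
import ipaddress
from urllib.parse import urlsplit

CANONICAL_HOST = "www.seomoz.org"

# Hypothetical CDN edge ranges (RFC 5737 documentation networks, NOT real CDN IPs).
CDN_NETWORKS = [
    ipaddress.ip_network("203.0.113.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]

# Assumed list of file types the CDN is allowed to pull and cache.
STATIC_EXTENSIONS = (".png", ".jpg", ".jpeg", ".gif", ".css", ".js", ".ico", ".svg")


def is_cdn_request(remote_ip):
    """True if the request comes from one of the (assumed) CDN edge networks."""
    addr = ipaddress.ip_address(remote_ip)
    return any(addr in net for net in CDN_NETWORKS)


def is_static_asset(path):
    """True if the URL path (ignoring any query string) ends in a static extension."""
    return urlsplit(path).path.lower().endswith(STATIC_EXTENSIONS)


def redirect_for(remote_ip, path):
    """Return (301, location) when the CDN pulls a non-static URL, else None."""
    if is_cdn_request(remote_ip) and not is_static_asset(path):
        return 301, f"http://{CANONICAL_HOST}{path}"
    return None


if __name__ == "__main__":
    print(redirect_for("203.0.113.10", "/"))
    print(redirect_for("203.0.113.10", "/q/moz_nav_assets/images/logo.png"))
```

The CDN would then cache and replay the 301 for anything that is not a static file, while ordinary visitors hitting the www origin directly are never redirected. The fragile parts are exactly the ones mentioned above: the IP list goes stale, and the extension list has to track every kind of cacheable file the site serves.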
Properly retrofitting a complex site to use a CDN, without creating a half-dozen new CDN subdomains, does not appear to be easy.
-
It's a SEOmoz secret...