Why are there lots of 404s after setting up CDN?
-
I just setup Cloudfront CDN through W3 Total Cache. Everything looks good but there is one problem that I have encountered:
After activating the CDN none of the images are available at the older image URLs and they are throwing a 404 error.
Let me give you an example for this:
1. Before I setup the CDN, let's say an image was available at http://example.com/wp-content/uploads/2015/03/leap-of-faith.jpg
2. After I setup the CDN, the image is available at http://cdn.example.com/wp-content/uploads/2015/03/leap-of-faith.jpg and the good part is the URLs in the blog posts where this image was attached is updated to reflect the above mentioned URL. But the problem is that when visit the older URL of the image (which is what Google has crawled earlier, I get a 404 error).
Can you help me how to avoid this problem?
Ravi C
-
Thanks Dirk.. That sounds good.
-
Hi,
Do you get a lot of traffic coming through image search? Most of the images you use on the site seem to be stock images, so normally the % of image search traffic shouldn't be that big.
If you receive limited or no search traffic from image search, you don't really have to do anything special. There 404 errors in WMT will disappear after a while & the new images will get indexed. Normally the 404's will have no impact on search traffic.
If all CDN's contain all the images, you could always redirect the original image folder to one of the cdn's - but it not strictly necessary.
rgds
Dirk
-
I would say you may look at the set up process of CDN may not be as per the required criteria. IF you just check it thoroughly you may be able to get rid of it.
-
Thanks for that reply Dirk.
I think what you are referring to is quite applicable when the CDN is setup via using a S3 bucket. I followed the following guide to setup my CloudFront CDN:
https://www.doitwithwp.com/set-up-w3-total-cache-with-amazon-cloudfront-cdn
Here are the 2 problems that I'm facing currently:
1. The images appear at multiple CDN locations - http://cdn5.sarkarilife.com/wp-content/uploads/2015/01/bank-awareness-gk-ibps-bank-exams.jpg as well as http://cdn1.sarkarilife.com/wp-content/uploads/2015/01/bank-awareness-gk-ibps-bank-exams.jpg .
2. The same image is not available at the original location - http://sarkarilife.com/wp-content/uploads/2015/01/bank-awareness-gk-ibps-bank-exams.jpg
Looking forward to your response.
-
Hi,
You could put a 301 redirect of the old image folder to the new location. The easiest alternative is to keep the images in both places for a while, until Google has indexed the new location (which can take a few weeks/months). Normally, if all the internal links have been updated, there should be no links to old location, so these images will disappear from the index and replaced by the ones in the new location. Once they are indexed on their new location, you can delete them in the old location
rgds,
Dirk
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Setting up analytics for a website redesign
Hey all, so in the past when I make changes to a site, I make the changes, review the analytics in the wake of the changes, analyze and go from there. Little things here and there, no biggie. With my new company, we're doing a full website redesign from scratch (Currently on Wordpress, moving to custom). They are asking me about analytics and reporting and I was hoping to get some insight here. When the new site is ready, they are launching it at www2.ourdomain.com and sending 25% of traffic to ourdomain.com to that with the other 75% going to www.ourdomain.com (current site). So two questions- how would you go about setting up analytics for that? And how do you ensure the www2 version doesn't get indexed but stay in Google's good graces? If you de-index your "home page" that 25% are seeing I can't imagine that's helpful for SEO. Hopefully that makes sense! Trying to look at how to A/B test to ensure the new site is working and converting before pushing all traffic to it.
Web Design | | DanDeceuster0 -
Increase in Soft 404s due to Custom 404 page?
Hi all, We have noticed recently soft 404s are increasing day by day; which are landing on our custom 404 page created a month back. Other 404 pages are NOT landing on custom 404 page. Does this custom 404 page hurting us by causing an increase in soft 404s? Our CMS is WordPress. Thanks
Web Design | | vtmoz0 -
Web Hosting and CDN for Wordpress Site Load Speed - Suggestions Needed
We all know that website load speed is more important than ever. While I love the look and feel of parallax and Wordpress, I want to do everything I can to keep the load speed down. I see a lot of conflicting information regarding web hosting services, CDN services and other service (Cloudflare for example). I am looking to hear from those with their own experiences to let me know what they think is the ideal setup for a parallax Wordpress site is as far as which services to use, including: 1. Web Hosting
Web Design | | Gauge123
2. CDN
3. Any other service or product that would help to provide and extremely fast site load time. Thank you!0 -
Should Blog Category Archive URLs be Set to "No-Index" in Wordpress?
It appears that Google Webmaster Tools is listing about 120 blog archives URLs in Google Index>Index Status that should not be listed. Our site map contains 650 pages, but Google shows 860. Pages like: <colgroup><col width="464"></colgroup>
Web Design | | Kingalan1
| http://www.nyc-officespace-leader.com/blog/category/manhattan-office-space | With Titles Like: <colgroup><col width="454"></colgroup>
| Manhattan Office Space Archives - Metro Manhattan Office Space | Are listed when in the Rogerbot crawl report for the site. How can we remove such pages from Google Webmaster Tools, Index Status? Our site map shows about 650 pages, yet Google show these extra pages. We would prefer that they not be indexed. Note that these pages do not appear when we run a site:www.nyc-officespace-leader.com search. The site has suffered a drop in ranking since May and we feel it prudent to keep Google from indexing useless URLs. Before May 650 pages showed on the Webmaster Tools Index status, and suddenly in early June when we upgraded the site the index grew by about 175 pages. I suspect the 120 blog archives URLs may have something to do with it. How can we get them removed? Can we set them to "No-Index", or should the robot text be used to remove them? Or can some type of removal request be made to Google? My developers have been struggling with this issue since early June. The bloat on the site is about 175 URLs not on the site map. Is there any go to authority on this issue (it is apparently rather complicated) that can provide a definitive answer? Thanks!!
Alan0 -
New website put up and ALL my keywords fell a LOT!???
I helped a client redesign their new website and we just went live a couple weeks ago. This morning I checked his campaign and 53 keywords fell DRAMATICALLY. Like 35-50 places down in Google for dozens of keywords!? I haven't ever seen a drop that's so dramatic when putting up a new site. Have you ever seen this? Will they bounce back? This site isn't significantly different than the last one. We did forward two other domains to this new site but that wouldn't make a difference, would it? Any feedback would be greatly appreciated! Matthew
Web Design | | Mrupp440 -
Getting a lot more duplicate content warnings than I expected.
I run WordPress on many of my sites and a site crawl has found MANY duplicate content pages on the latest domain I started a campaign for. I expected to see quite a lot on the tag pages that only had one post but even tag pages with multiple posts and author and category pages with many posts are showing as duplicate content. Is this normal for a WordPress site to have so much duplicate content warnings from the taxonomy pages? I have the option to bulk noindex, follow the category and tag pages but should I do it? I get some traffic directly to the tag pages so removing the pages from search results would dent the traffic of the site a little (generally high bounce rate, low engagement traffic anyway) but could removing the apparent duplicate content actually improve the article pages themselves? Or does anyone have any WordPress specific advice for making the pages not duplicate content? I've toyed with the idea of just displaying excerpts but creating manual excerpts for the 4 years worth of posts, some of which I have no personal knowledge of the subject matter so other suggestions are welcome.
Web Design | | williampatton0 -
Thoughts on using a CDN?
I'm trying to decide if I should use a CDN to speed up my site. I've never used one before, but I do know that there are a lot of images on my site and I think that could help a great deal. Are they hard to set up? Are there any down sides? And are the pricey? Also I'm not sure even sure if it would make a difference for me, but I want to work on getting my site faster. Thanks.
Web Design | | NoahsDad1 -
Need help setting up google analytics goals / tracking
I don't use Google analytics to see much more than how many visits I'm getting and what sort of keywords people are using to find our site. I'd like to step up my GA skills a bit. I'm wondring if you guys could give me some advice. I've never really set up any GA goals, or used it to track specific things, but I'd like to. Here are a few things off the top of my head that I would like to track. I'm wondring if these are posable in GA, and if someone could give me some feedback on how to track it / set up goals. Thanks.1 1. I'd like to know how many people click play on a video when they are on a page that has a video. 2. I'd like to know how many people are clicking "like" " google plus, etc.) 3. I'd like to know the path people are taking on our site. For instance, if they click a link from Facebook, and go to a landing page, what page are they visiting next..... 4. How long people are staying on the page I would really like to break this down further by people that visit a link I posted on Facebook, or twitter, or from the link on my twitter profile page, etc... Also if there are any other valuable goals / reports that would be useful for a blogger to track I'd appreciate your feedback. Thanks.
Web Design | | NoahsDad0