What's my best strategy for Duplicate Content if only www pages are indexed?
-
The MOZ crawl report for my site shows duplicate content with both www and non-www pages on the site. (Only the www are indexed by Google, however.) Do I still need to use a 301 redirect - even if the non-www are not indexed? Is rel=canonical less preferable, as usual?
Facts:
- the site is built using asp.net
- the homepage has multiple versions which use 'meta refresh' tags to point to 'default.asp'.
- most links already point to www
Current Strategy:
- set the preferred domain to 'www' in Google's Webmaster Tools.
- set the Wordpress blog (which sits in a /blog subdirectory) with rel="canonical" to point to the www version.
- Ask programmer to add 301 redirects from the non-www pages to the www pages.
- Ask programmer to use 301 redirects as opposed to meta refresh tags & point all homepage versions to www.site.org.
Does this strategy make the most sense? (Especially considering the non-indexed but existent non-www pages.)
Thanks!!
-
Very informative - thank you! It seems when I think I have a relatively firm grip on SEO, I stumble upon something new - like the dangerous potential for an infinite loop in a 301 redirect in IIS. (That was Chinese to me two days ago;))
Your response solved my concerns - hopefully it will help somebody else when they face the same problem.
-
Well the reason Google has picked the www version as it's preferred version automatically is most likely because of all the links you mentioned that were already pointing to that iteration of your domain. Google can figure this out on their own. That said, it still sees both of the sites (non www and www) as duplicates of each other. Best practice is to 301 one to another.
I've waged this war with a programmer before so I know how it goes. The one I dealt with didn't think there was any reason and told me all websites work that way. So I asked him to go to http://google.com and tell me how it resolves. Repeat that step with every major brand you can think of until he/she gets the point and that might help you.
They should be able to 301 this one time, no matter whether they're using an Apache or IIS server. This should be a quick fix. If they're unsure how to do it, have them Google "IIS 301 redirects" if it's a Windows server or "htaccess 301" if it's a Linux/Apache server.
-
Thanks for easing my mind Jesse! One thing still confuses me - the fact that the non-www pages are not indexed. They are not disallowed in robots.txt, there are no rel=canonical tags (except for pages in the blog subdirectory), they are not meta-refreshed, and obviously not 301 redirected. Could it be the doing of a sitemap (though I cant find one)? Or did Google simply decide all the www pages were more relevant? Am I missing something here? I don't want to ask the programmer to add a ton of 301 redirects (which I did) only to get a 'DUH!' response;)
FYI - site is asp.net - not sure if that matters, except when they redirect homepage to avoid creating infinite loop. (right?)
Thanks again!
-
This is an incredibly easy topic to address because you've already laid out exactly what needs to happen.
In other words, yes! That strategy is exactly the way you should go.
Good job and good luck!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content/Missing Meta Description | Pages DO NOT EXISIT!
Hello all, For the last few months, Moz has been showing us that our site has roughly 2,000 duplicate content errors. Pages that were actually duplicate content, I took care of accordingly using best practice (301 redirects, canonicalization,etc.). Still remaining after these fixes were errors showing for pages that we have never created. Our homepage is www.primepay.com. An example of pages that are being shown as duplicate content is http://primepay.com/blog/%5BLink%20to%20-%20http:/www.primepay.com/en/payrollservices/payroll/payroll/payroll/online-payroll with a referring page of http://primepay.com/blog/%5BLink%20to%20-%20http:/www.primepay.com/en/payrollservices/payroll/payroll/online-payroll. Some of these are even now showing up as 403 and 404 errors. The only real page on our site within that URL strand is primepay.com/payroll or primepay.com/payroll/online-payroll. Therefore, I am not sure where Moz is getting these pages from. Another issue we are having in relation to duplicate content is that moz is showing old campaign url’s tacked on to our blog page i.e. http://primepay.com/blog?title=&page=2&utm_source=blog&utm_medium=blogCTA&utm_campaign=IRSblogpost&qt-blog_tabs=1. As of this morning, our duplicate content went from 2,000 to 18,000. I exported all of our crawl diagnostics data and looked to see what the referring pages were, and even they are not pages that we have created. When you click on these links, they take you to a random point in time from the homepage of our blog; some dating back to 2010. I checked our crawl stats in both Google and Bing’s Webmaster tool, and there are no duplicate content or 400 level errors being reporting from their crawl. My team is truly at a loss with trying to resolve this issue and any help with this matter would be greatly appreciated.
Moz Pro | | PrimePay0 -
Why does a word show as 'Google en-US Mobile Friendly' = FALSE when it also shows that it ranks 3 on mobile?
I pulled a recent report for all words I'm tracking locally and it states that a specific term ranks #4 on desktop and #3 on mobile. But in separate columns, specifically "Google en-US Mobile Friendly" that same terms shows as 'FALSE'. Any idea what this means? Thanks!
Moz Pro | | KristinaWitmer0 -
Duplicate Content even when Canonical is used
Hi Everyone, Our website uses the Magento platform which is notorious for creating duplicate content. I tried to make sure that all the duplicate content it creates should be "canonicalized" to the correct page. While looking through the moz Page Diagnostics I see that I have 1003(!) pages of duplicate content. When I downloaded the csv I saw that over 95% of them had a canonical url. Does that mean there is really no issue but moz analytics is still reading it as duplicate content and titles? Is there an issue with them being canonicals as opposed to being redirected? Thanks!
Moz Pro | | EcomLkwd1 -
Moz crawl only shows 2 pages, but we have more than 1000 pages.
Hi Guys Is there anyway we can test Moz crawler ? it showing only 2 pages crawls. We are running website on HTTPS ? Is HTTPS is issues for Moz ?
Moz Pro | | dotlineseo0 -
Track client's facebook page with Social
I use SEOMoz for a client and want to track the client'sprogress on Facebook. But I can not get connected becasuse the facebook page is not my own. How to solve this? Patrick
Moz Pro | | paddydaddy0 -
Finding the source of duplicate content URL's
We have a website that displays a number of products. The product has variations (sizes) and unfortunately every size has its own URL (for now anyway). Needless to say, this causes duplicate content issues. (And of course, we are looking to change the URL's for our site as soon as possible) However, even though these duplicate URL's exist, you should not be able to land on them by navigating through the site. In theory, the site should always display the link to the smallest size. It seems that there is a flaw in our system somewhere, as these links are now found in our campaign here on SEOmoz. My question: is there any way to find the crawl path that lead to the URL's that shouldn't have been found, so we can locate the problem?
Moz Pro | | DocdataCommerce0 -
Duplicate Content Issue from using filters on a directory listing site
I have a directory listing site of harpists and have alot of issues coming up that say: Content that is identical (or nearly identical) to content on other pages of your site forces your pages to unnecessarily compete with each other for rankings. Because this is a directory listing site the content is quite generic.The main issue appears to be coming from the functionality of the page. It appears that the "spider" is picking up each different choice of filter as a new page? If you have a look at this link you will see what I mean. People searching the site can filter the results of the songs played by this harpist by changing the dropdowns etc... but for some reason the filter arguments are being picked up...? Do you have any good approaches to solving this issue? A similar issue comes from the video pages for each harpist. They are being flagged as identical content - as there are currently no videos on the page. | http://www.find-a-harpist.co.uk/user/39/videos | http://www.find-a-harpist.co.uk/user/37/videos | Do you have any suggestions? Many thanks for taking the time to read this and respond. | | | | | |
Moz Pro | | dseo241
| |0 -
SEOMoz Pro still hasn't crawled 10k pages for one campaign
I looked for a question in the forum already for this but couldn't find anything. Perhaps I am using the wrong keywords, so I apologize if this is a duplicate. I recently signed up with SEOMoz Pro and added two campaigns. For one campaign, 10,000 pages were crawled. For the other campaign, only about 300. It's been 2 weeks since I created the campaigns. Is there a way to force a crawl of the site associated with the second campaign?
Moz Pro | | SharieBags0