Duplicate Content Question
-
Just signed up for pro and did my first diagnostic check - I came back with something like 300 duplicate content errors which suprised me because every page is unique.
Turns out my pages are listed as www.sportstvjobs.com and just sportstvjobs.com does that really count as duplicate? and if so does anyone know what I should be doing differently?
I thought it was just a canonical issue, but best I can tell I have the canonical in there but this still came up as a duplicate error....maybe I did canonical wrong, or its some other issue?
Thanks
Brian Clapp
-
You want them on all pages as I'm sure Google is seeing copies of all your pages. This is really a best practices thing but I add them to all pages when the page is created to avoid just such issues. If you choose the www version you would use this tag for the Career center page:
In addition I've added a link to this great post on canonical
Hope this helps
-
its pretty awesome. Welcome
-
less than 24 hours into my seomoz pro membership and I'm already blown away - thanks for all of your assistance. Christine it appears I don't have a .htaccess file so I will follow what you wrote and create a new blank file with just that code in it...
you guys rock. I've got lots more learning to do....
Brian
-
Something that was challenging for me at first was finding .htaccess file. It should be in the root directory of where all of you website files are stored. It may be hidden or you may need to create one. If its hidden, find out how to un-hide files in your specific FTP program. If you need to create it, simply create a blank file with the name .htaccess and paste the recommendation that Ryan made.
-
Ryan thanks for your great advice. I'm the site founder but by no means a tech expert... I'm more the content guy, but I'm learning. I
understand what you have written...all except the part of where I put that code. Where do I find my .htaccess file?
-
Thank you for your great response Dave! I'm the founder of the site, but my background is more in content and I'm learning the SEO as I go, so forgive me if this is a silly follow up.
In reference to response #4 - my original designer said canonical was only on the home page, but your response sounds like it should be on every page? If so do I put the whole url for that page, or just the root? Again not an expert here so all this advice really helps me learn.
Thanks
Brian
-
- In addition to your 301 redirect issue Ryan mentioned you are internally linking to the non www version for your logo link: (this leaks juice on the redirect as well if you are going for the www version)
2 ) You really want to choose one version and go with it everywhere (be precise down to the trailing slash) - Google webmaster tools has a setting for this as well - sign up and choose your preferred setting / then save if you haven't already
-
Make the changes and check Open Site Explorer for both versions to see your links that appear the non-preferred way and see if you can contact your external linking sources to fix the issue to the preferred URL
-
I didn't see any canonical for the other pages (other than the home page) add those as well
-
Rerun OSE after your next crawl completes or use Xenu for immediate testing for the internal links - external may take a bit longer.
-
This matters mainly because of the potential for link dilution - most people agree you won't be explicitly penalized. See http://www.herseo.com/blog/2009/05/30/how-important-is-it-to-htaccess-redirect-to-www/
The best way to address this is to set up a 301 redirect in your .htaccess file. I recently did this for my own domain with the following code:
<ifmodule mod_rewrite.c="">RewriteEngine on
# Redirect adoptionhelp.org to www.adoptionhelp.org
RewriteCond %{HTTP_HOST} !^www.adoptionhelp.org [NC]
RewriteRule ^(.*)$ http://www.adoptionhelp.org/$1 [R=301,L]</ifmodule>Add that to your .htaccess file and drop it in your root directory.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Headers & Footers Count As Duplicate Content
I've read a lot of information about duplicate content across web pages and was interested in finding out about how that affected the header and footer of a website. A lot of my pages have a good amount of content, but there are some shorter articles on my website. Since my website has a header, footer, and sidebar that are static, could that hurt my ranking? My only concern is that sometimes there's more content in the header/footer/sidebar than the article itself since I have an extensive amount of navigation. Is there a way to define to Google what the header and footer is so that they don't consider it to be duplicate content?
Technical SEO | | CyberAlien0 -
Duplicate Content in Wordpress.com
Hi Mozers! I have a client with a blog on wordpress.com. http://newsfromtshirts.wordpress.com/ It just had a ranking drop because of a new Panda Update, and I know it's a Dupe Content problem. There are 3900 duplicate pages, basically because there is no use of noindex or canonical tag, so archives, categories pages are totally indexed by Google. If I could install my usual SEO plugin, that would be a piece of cake, but since Wordpress.com is a closed environment I can't. How can I put a noindex into all category, archive and author peges in wordpress.com? I think this could be done by writing a nice robot.txt, but I am not sure about the syntax I shoud use to achieve that. Thank you very much, DoMiSol Rossini
Technical SEO | | DoMiSoL0 -
Testing for duplicate content and title tags
Hi there, I have been getting both Duplicate Page content and Duplicate Title content warnings on my crawl diagnostics report for one of my campaigns. I did my research, and implemented the preferred domain setting in Webmaster Tools. This did not resolve the crawl diagnostics warnings, and upon further research I discovered the preferred domain would only be noted by Google and not other bots like Roger. My only issue was that when I ran an SEOmoz crawl test on the same domain, I saw none of the duplicate content or title warnings yet they still appear on my crawl diagnostics report. I have now implemented a fix in my .htaccess file to 301 redirect to the www. domain. I want to check if it's worked, but since the crawl test did not show the issue last time I don't think I can rely on that. Can you help please? Thanks, Claire
Technical SEO | | SEOvet0 -
Caps in URL creating duplicate content
Im getting a bunch of duplicate content errors where the crawl is saying www.url.com/abc has duplicate at www.url.com/ABC The content is in magento and the url settings are lowercase, and I cant figure out why it thinks there is duplicate consent. These are pages with a decent number of inbound links.
Technical SEO | | JohnBerger0 -
Press Releases & Duplicate Content
How do you do press releases without duplicating the content? I need to post it on my website along with having it on PR websites. But isn't that considered bad for SEO since it's duplicate content?
Technical SEO | | MercyCollege0 -
Multiple URLs in CMS - duplicate content issue?
So about a month ago, we finally ported our site over to a content management system called Umbraco. Overall, it's okay, and certainly better than what we had before (i.e. nothing - just static pages). However, I did discover a problem with the URL management within the system. We had a number of pages that existed as follows: sparkenergy.com/state/name However, they exist now within certain folders, like so: sparkenergy.com/about-us/service-map/name So we had an aliasing system set up whereby you could call the URL basically whatever you want, so that allowed us to retain the old URL structure. However, we have found that the alias does not override, but just adds another option to finding a page. Which means the same pages can open under at least two different URLs, such as http://www.sparkenergy.com/state/texas and http://www.sparkenergy.com/about-us/service-map/texas. I've tried pointing to the aliased URL in other parts of the site with the rel canonical tag, without success. How much of a problem is this with respect to duplicate content? Should we bite the bullet, remove the aliased URLs and do 301s to the new folder structure?
Technical SEO | | ufmedia0 -
Duplicate content domains ranking successfully
I have a project with 8 domains and each domain is showing the same content (including site structure) and still all sites do rank. When I search for a specific word-string in google it lists me all 8 domains. Do you have an explanation, why Google doesn't filter those URLs to just one URL instead of 8 with the same content?
Technical SEO | | kenbrother0 -
Duplicate content handling.
Hi all, I have a site that has a great deal of duplicate content because my clients list the same content on a few of my competitors sites. You can see an example of the page here: http://tinyurl.com/62wghs5 As you can see the search results are on the right. A majority of these results will also appear on my competitors sites. My homepage does not seem to want to pass link juice to these pages. Is it because of the high level of Dup Content or is it because of the large amount of links on the page? Would it be better to hide the content from the results in a nofollowed iframe to reduce duplicate contents visibilty while at the same time increasing unique content with articles, guides etc? or can the two exist together on a page and still allow link juice to be passed to the site. My PR is 3 but I can't seem to get any of my internal pages(except a couple of pages that appear in my navigation menu) to budge of the PR0 mark even if they are only one click from the homepage.
Technical SEO | | Mulith0