Duplicate Content
-
Hello All, my first web crawl has come back with a duplicate content warning for
and
slightly mystified!
thanks
paul
-
If you're still in contact with a web developer, that would be great. If you're not, a note to everyone else on this thread: the website in question is running IIS 6.0, so Apache info isn't going to help in this case.
-
Just a 301 from /index to /
-
Hi Cesar,
There is no drawback. At the root, www.simodal.com and www.simodal.com/ end up as the same request, because the path is always sent as /. Deeper in the site it is different: www.simodal.com/randompage and www.simodal.com/randompage/ are distinct URLs and can be treated as different pages. Most people would consider /randompage a page and /randompage/ a directory. But from an SEO perspective .com and .com/ are equally good.
What you should do is decide whether you want to use a trailing slash or not and stick to it. Whichever form you choose for your site's root page, use it consistently everywhere.
Generally speaking, there are three common conventions: .html for pages and / for directories; no suffix for pages (domain.tld/page) and / for directories; or / for all pages and directories (WordPress uses /, AFAIK). It doesn't really matter much; pick one and stick to it.
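To make "pick one and stick to it" concrete, here is a rough Python sketch that rewrites a URL to one trailing-slash convention. The function name and the "a dot in the last segment means it is a file" rule are my own assumptions for illustration, not something any CMS or crawler prescribes.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_trailing_slash(url: str, use_trailing_slash: bool = True) -> str:
    """Rewrite url to one consistent trailing-slash convention (illustrative only)."""
    parts = urlsplit(url)
    path = parts.path or "/"  # an empty path means the site root

    # Assumption: a dot in the last segment means a file such as page.html,
    # which is left untouched.
    last_segment = path.rsplit("/", 1)[-1]
    looks_like_file = "." in last_segment

    if path != "/" and not looks_like_file:
        if use_trailing_slash and not path.endswith("/"):
            path += "/"
        elif not use_trailing_slash:
            path = path.rstrip("/") or "/"

    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


print(normalize_trailing_slash("http://www.simodal.com/randompage"))
print(normalize_trailing_slash("http://www.simodal.com/randompage/", use_trailing_slash=False))
```

Running every internal link through one function like this before publishing is one way to keep the convention consistent across the site.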
-
What is the drawback of the 301 redirect without the "/"?
-
Hi Paul
I can fully identify with your frustrations - been there!
A simple question may help you: did you have a web developer, and are you still in contact with him or her? If so, get them to do a 301 redirect from www.simodal.com/index.htm to your chosen version. Most seem to use www.simodal.com/, with a trailing forward slash at the end. Someone else might like to comment on that.
Also, as Aaron says, do the same for the version without the www, i.e. http://simodal.com/, and 301 it to exactly the same URL as above.
If you haven't got a developer, there is some info around telling you exactly how to do it.
Hope this helps
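If it helps to see the rule spelled out, here is a small Python sketch of the decision a redirect rule has to make: given a requested URL, which URL (if any) should it 301 to? It assumes the www version with a trailing slash was chosen and that index.htm and index.html are the default documents; both are assumptions for illustration, and this is not IIS configuration.

```python
from typing import Optional
from urllib.parse import urlsplit, urlunsplit

CANONICAL_HOST = "www.simodal.com"         # assumption: the www version was chosen
INDEX_FILES = {"index.htm", "index.html"}  # assumption: default documents to collapse


def redirect_target(url: str) -> Optional[str]:
    """Return the URL to 301 to, or None if url is already the chosen version."""
    parts = urlsplit(url)
    path = parts.path or "/"

    # Collapse /index.htm (or /dir/index.htm) to the directory URL.
    head, _, last = path.rpartition("/")
    if last.lower() in INDEX_FILES:
        path = head + "/"

    canonical = urlunsplit(
        (parts.scheme, CANONICAL_HOST, path, parts.query, parts.fragment)
    )
    return None if canonical == url else canonical


for candidate in (
    "http://simodal.com/",
    "http://www.simodal.com/index.htm",
    "http://www.simodal.com/",
):
    print(candidate, "->", redirect_target(candidate))
```

The point of the sketch is that every variant ends up at exactly one URL, which is what a developer would need to reproduce in the server's own redirect settings.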
-
Hey Paul,
here is the explanation:
www.simodal.com and www.simodal.com/index.htm are considered separate pages by Google, although both are your site's "starting point". Some content management systems (CMS) make this mistake, i.e. delivering the same page and not distinguishing between simodal.com/ and simodal.com/index.htm.
As said before, you should decide whether all your pages should be on www.simodal.com or just simodal.com. There is a great Whiteboard Friday video by Rand on this topic. Then you should rewrite your URLs to the version you chose.
Additionally, you might want to add a rel canonical to your page, maybe just to your starting page. A
<link rel="canonical" href="http://www.simodal.com/" />
on your starting page would tell Google to ignore the /index.htm and use /
But watch out, rel canonical is somewhat tricky... but there are good tutorials here.
To be honest, I know quite a lot of sites that make this mistake. Google should be able to correct this, so don't worry about rankings. You should, however, redirect the non-www version to www (or the opposite), as serving both can trigger Google's duplicate content (DC) filter. Also, if you plan to use SSL (https://), make sure those pages are not indexed as duplicates either, ideally by using rel canonical.
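Since rel canonical is easy to get wrong, a quick check that the tag is really in the page can help. Below is a minimal Python sketch using only the standard library's html.parser; the class and function names are made up for this example, and the sample HTML simply reuses the tag from above.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Remember the href of the first <link rel="canonical"> tag seen."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        rels = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rels and self.canonical is None:
            self.canonical = attr.get("href")


def find_canonical(html: str):
    """Return the canonical URL declared in html, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


sample = '<html><head><link rel="canonical" href="http://www.simodal.com/" /></head></html>'
print(find_canonical(sample))
```

Feeding it the source of both / and /index.htm should give the same canonical URL for each if the tag is set up as described.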
-
Hello Paul!
Because the URLs are different, the crawlers see them as different pages, but as you know, they're not! They're just two ways to get to the same page!
To solve this, you have to redirect the /index page to the non-/index version using a 301 (permanent) redirect.
Tutorial here: http://www.tamingthebeast.net/articles3/spiders-301-redirect.htm
Got it?
Hope it helps! =]
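For anyone curious what the 301 looks like at the response level, here is a tiny Python WSGI sketch. It is purely illustrative: the app and the paths it checks are assumptions for this example, and the site in this thread would need the equivalent rule set up in its own web server instead.

```python
def app(environ, start_response):
    """Minimal WSGI app: permanently redirect /index.htm(l) to /."""
    path = environ.get("PATH_INFO", "/")

    if path.lower() in ("/index.htm", "/index.html"):
        # 301 tells crawlers the move is permanent; Location names the new URL.
        start_response("301 Moved Permanently", [("Location", "/")])
        return [b""]

    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"home page"]


if __name__ == "__main__":
    # Serve locally for a manual check with a browser or curl.
    from wsgiref.simple_server import make_server

    make_server("localhost", 8000, app).serve_forever()
```

A crawler requesting /index.htm gets the 301 status plus a Location header pointing at /, so only the / version ends up being treated as the page.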
-
Thanks Aaron, this is very new to me and you will have to forgive my DOH! moments.
Still don't get it. Can you point me in any direction so I can understand?
best
paul
-
It is indeed duplicate content! You might want to consider doing a redirect. I also noticed that you haven't set up a redirect from the non-www domain either!