Duplicate content issue index.html vs non index.html
-
Hi
I have an issue. In my client's profile, I found that the "index.html" are mostly authoritative than non "index.html", and I found that www. version is more authoritative than non www. The problem is that I find the opposite situation where non "index.html" are more authoritative than "index.html" or non www more authoritative than www.
My logic would tell me to still redirect the non"index.html" to "index.html". Am I right?
and in the case I find the opposite happening, does it matter if I still redirect the non"index.html" to "index.html"?
The same question for www vs non www versions?
Thank you
-
Yes, I like using rewrites in an .htaccess file, which is covered in the links above.
-
I fix the 2 URLs.
In this case domain.com/index.html is the code for domain.com/.
Do you mean to use mode_rewrite and create a 301 redirect from domain.com/index.html to domain.com/ ?
Thank you for your time.
-
<colgroup><col span="30" width="64"></colgroup>
Hi Taysir, first of all ypou must take an overview with what is duplicate content? Solving the cannonical problems with www. Duplicate Content Issues in www & non www I hope that your query had been solved. -
It's very likely that the "index.html" version is more authoritative because you're using it in internal links. The problem is that that often creates a duplication issue - you refer to the root (non-index.html) version in inbound links, social, etc. (and people tend to link and bookmark the root version), but then link internally to "index.html", so Google will end up indexing both.
If the authority is coming from internal links, and you:
(1) Switch the internal links to the root ("/")
(2) 301-redirect "index.html" to the root ("/")
...you shouldn't lose any authority, as you'll have re-routed it by doing step (1). You'll also consolidate your signals and be better off all-around, IMO.
Kane's right, though - it's a bit tough to tell without knowing the specifics.
-
Redirecting the authoritative link to the less authoritative URL is not ideal.
However, in my opinion being consistent with URLs throughout the site takes precedent.
Implementing 301 redirects will indicate that there has been a permanent relocation of that pages content, and you will get most of the link value from the authoritative link. That said, if you feel comfortable emailing the person who created that authoritative link, it's worth a little effort to ask them to change it, but if it's a hassle to do so, don't push it.
-
How to redirect domain.com/index.html to domain.com/index.html?
Those two URLs are the same, so there is nothing to change. If you wanted to redirect domain.com/index.html to domain.com/ then you would do so with 301 redirects. Here's a guide on getting started:
http://www.seomoz.org/learn-seo/redirection
http://www.seomoz.org/blog/url-rewrites-and-301-redirects-how-does-it-all-work
-
I personally would rewrite & redirect everything using the 2nd option above.
Can you explain me how to do that, please?
How to redirect domain.com/index.html to domain.com?
Thanks
-
thank you for your detailed answer but one more thing does it matter if I redirect a more authoritative link to a weaker one for the benefit of staying consistent and vice versa?
let s say I redirect a non index.html to an index.html and vice versa for the sake of consistency?
-
You should stick with one format across the site:
-
domain.com/index.html and domain.com/subfolder/index.html
**OR **
I typically choose the second option because it is agnostic of CMS or file type, and it looks better in my opinion. I would not mix the two across the site because it causes a confusing user experience.
So, to answer your questions directly:
My logic would tell me to still redirect the non"index.html" to "index.html". Am I right?
No, not necessarily. By telling us that there are examples where .html is more authoritative and there are examples where it isn't as authoritative, it's impossible for us to say which is the better choice. I personally would rewrite & redirect everything using the 2nd option above.
**The same question for www vs non www versions? **
I believe that WWW vs non-WWW is less important. You could decide based upon which format has more links or which one has been historically used. Consistency (using the same across the entire site), proper 301 redirects, and proper rel canonical tags are your priorities here.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Best way to fix duplicate content issues
Another question for the Moz Community. One of my clients has 4.5k duplicate content issues. For example: http://www.example.co.uk/blog and http://www.example.co.uk/index.php?route=blog/blog/listblog&year=2017. Most of the issues are coming from product pages. My initial thoughts are to set up 301 redirects in the first instance and if the issue persists, add canonical tags. Is this the best way of tackling this issue?
Technical SEO | | Laura-EMC0 -
Duplicate Content Showing up on Moz Crawl | www. vs. no-www.
Hello Moz Community! I am new to SEO, Moz and this is my first question. My questions; I have a client that is getting flagged for Duplicate Content. He is getting flagged for having two domains that have the same content i.e. www.mysite.com & mysite.com. I read into this and set up a 301 redirect through my hosting site. I evaluated which site had a stronger Page Authority and had the weaker site redirect to the stronger site. However, I am still getting hit for Duplicate pages caused by the www.mysite.com & mysite.com being duplicates. How should I go about resolving this? Is this an example of a Canonical tag needed in the head of the HTML? Any direction is appreciated. Thank You. B/R Will H.
Technical SEO | | MarketingChimp100 -
Modx revolution- getting around index.php vs. root duplicate content issue?
Basically, SEOMoz bots are flagging our index.php and root files as duplicate content of one another, therefore cutting the page authority of each. What we want to do is make the root the canonical preference over index.php. Ordinarily, we should be able to do this in the htaccess file. For some reason, as the site has been built into a cms using ModX Revolution, this does not seem to work. We've tried A TON of htaccess rewrite mods to resolve this issue to no avail. We have also tried revising our sitemap to include only the root address. Any ideas? We'll try most anything at this point. Thanks in advance.
Technical SEO | | G2W0 -
Issue Duplicate Page Title
I'm having some really strange issues with duplicate page titles and I can't seem to figure out what's going on. I just got a new crawl from SEOMOZ and it's showing some duplicate page titles. http://www.example.com/blog/ http://www.example.com/blog/page/2/ http://www.example.com/blog/page/3/ Repeat .............. I have no idea what's going on, how these were duplicated, or how to correct it. Does anyone have a chance to take a look and see if you can figure out what's happening and what I need to do to correct the errors? I'm using Wordpress and all in one SEO plugin. Thanks so much!
Technical SEO | | KLLC0 -
Duplicate Footer Content
A client I just took over is having some duplicate content issues. At the top of each page he has about 200 words of unique content. Below this is are three big tables of text that talks about his services, history, etc. This table is pulled into the middle of every page using php. So, he has the exact same three big table of text across every page. What should I do to eliminate the dup content. I thought about removing the script then just rewriting the table of text on every page... Is there a better solution? Any ideas would be greatly appreciated. Thanks!
Technical SEO | | BigStereo0 -
How can i resolve Duplicate Page Content?
Hello, I have created one campaign over SEOmoz tools for my website AutoDreams.it i have found 159 duplicate page content. My problem is that this web site is about car adsso it is easy to create pages with duplicate content and also Car ads are placed byregistered users. How can i resolve this problem? Regards Francesco
Technical SEO | | francesco870 -
Are RSS Feeds deemed duplicate content?
If a website content management system includes built-in feeds of different categories that the client can choose from, does that endanger them of having duplicate content if their categories are the same as another client's feed? These feeds appear on templated home page designs by default. Just trying to figure out how big of an issue these feeds are in terms of duplicate content across clients' sites. Should I be concerned? Obviously, there's other content on the home page besides the feed and have not really seen negative effects, but could it be impacting results?
Technical SEO | | KyleNeuberger0 -
Duplicate Content issue
I have been asked to review an old website to an identify opportunities for increasing search engine traffic. Whilst reviewing the site I came across a strange loop. On each page there is a link to printer friendly version: http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes That page also has a link to a printer friendly version http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes&printfriendly=yes and so on and so on....... Some of these pages are being included in Google's index. I appreciate that this can't be a good thing, however, I am not 100% sure as to the extent to which it is a bad thing and the priority that should be given to getting it sorted. Just wandering what views people have on the issues this may cause?
Technical SEO | | CPLDistribution0