Duplicate Content From My Own Site?!
-
When I ran the SEO Moz report it says that I have a ton of duplicate content.
The first one I looked at was my home page.
All of the above 3 have varying internal links, page authority, and link root domains. Only the first has any external links.
All of the others only seem to have 1 other duplicate page. It's a difference between the www and the non-www version.
I have a verified acct for www.kisswedding.com in google webmaster tools. The non-www version is in there too but has not been verified.
Under settings for the verified account (www.kisswedding.com), "Don't set a preferred domain" is checked off. Is that my mistake. And if so, which should I select? The www version or the non-www version?
Thanks!
-
Hi Ryan
I'm really sorry, but could you tell me what rule(s) to add to my htaccess file. I've been to my host (bluehost) but they don't know. Nice eh!?
I'd really appreciate it. And SO sorry!
-
I think I just had a major Duh! moment. Instead of linking back to index.html could I have been linking back to / all this time?
YES! Once you decide upon a correct URL structure for your website, you should always link directly to the appropriate URL.
On your site, you have chosen http://www.kisswedding.com/ to represent your home page. 100% of your links to the home page should use that URL. Any non-www links and /index.html links should be changed to this format.
Also, any signatures you use and content on other sites which you have control over should be updated to the proper format. Facebook, twitter, articles you published on other sites, etc. should all be updated.
-
omg.
I think I just had a major Duh! moment. Instead of linking back to index.html could I have been linking back to / all this time?
And thank you for confirming. Much appreciated. It all seems logical and it's all stuff I thought needed to happen - but was hoping didn't.
-
Your understanding is correct. The most important item is #3.
You asked how do you know the www version because is getting more links
Well, I was taking your word for it. In your original question you stated "Only the first has any external links."
Since you asked I just checked and actually all 3 URLs have external links, but the www version has by far the most.
With respect to the index.html file, it does not have to exist but that is the way your site is currently set up. Either way, it is ok. Simply 301 redirect the /index.html page to the / page.
What you need to understand is that while on your site the /index.html page is the same as the / page, it is not necessarily that way on every site. Additionally while on your site the www and non-www version of a URL is the same page, it is not necessarily that way on every site. For this reason Google will treat each of these pages separately. It will appear to Google that you are duplicating content. Also, your link juice will be divided. A properly configured 301 redirect will resolve these issues.
-
Thank you Maurizio!!! Just what I needed to know!
-
Thank you so much Ryan.
You answered a lot of the questions I was contemplating. Can you confirm I've understood your suggestions?
1. Verify the non-www version in GWT
2. Set preferred domain to the www version because that's getting more links (how do you know that btw?).
3. Set up 301 redirects in my htaccess file.
4. You've recommended that i set my home page as http://www.kisswedding.com VERSUS http://kisswedding.com/index.html or http://kisswedding.com
--- question about this last one. Isn't the index.html always going to exist because if I link back to my site within my own site I use the index.html extension? In fact, all my links are like that - I don't include my entire site url when i'm linking from one page to another on the site.
-
Hi Susan.
I agree with Maurizio and will offer a few more details.
It is important for each web page on your site to only be accessible from a single URL. It is up to you to decide which URL you prefer, then ensure all other URLs are removed or redirected to your chosen URL.
With respect to www or non-www, it does not make any difference but is rather your choice. In this case, since you already have links to the www version and not the non-www version, I would recommend choosing the www version and sticking with it.
The most important change you need to make is on your site. You need to 301 redirect all non-www URLs to their www equivalent. If you are unsure how to make this change and have managed hosting (most small sites do) then contact your host and they can easily make the change. Once the change has been made, test it! First go to your non-www home page and make sure you are redirected. Next, go to any of your inner pages, remove the "www" from the URL and make sure you are redirected to the same page but with the "www" added to the URL.
For Google WMT, you need to verify the site first. You should not have to upload any new file or change any code, but simply click through the options and the site will likely verify. Next, choose the preferred domain as the "www" version. If you wish to be very thorough, you can log into Bing Webmaster Tools and perform the same action. Once your 301 redirect is in place, the search engines would automatically make the change anyway, but it will likely take a month for all the pages of your site to be adjusted.
For the index.html, I would recommend 301 redirecting that URL to your http://www.kisswedding.com/ url.
Lastly, please review all links on your site and ensure they all use the "www" prefix. Include any signatures, social pages (facebook, twitter, etc).
-
Hi
i think that is not different from wwww and without wwww but you must decide in the google webmaster tools.
and i think that you can rediret (301 permanent) with htaccess to show only one site online.
example if you decide to have http://www.kisswedding.com/ when somebody digit a url without wwww ( http://kisswedding.com) you can redirect to wwww.
So you can avoid a duplicate content
Ciao
Maurizio
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
New Site Worries
To cut a long story short, our old web developers who built us a bespoke site decided that they could no longer offer us support so we decided to move our back end to the latest Magento 2 software and move over to https with a new company. The new setup has been live for 3 weeks, I have checked in webmaster tools and it says we have 4 pages indexed, if I type in site:https://www.mydomain.com/ we have 6560 pages indexed, our robots.txt file looks like this:Sitemap: https://www.mydomain.com/sitemap.xml Sitemap: https://www.mydomain.com/sitemaps/sitemap_default.xml I use Website Auditor and Screaming Frog, Website Auditor returns a 302 for my domain and Screaming Frog returns a 403 which means I cannot scan any of these. If I check my domain using an https checking tool some sites return an error but some return a 200.
Reporting & Analytics | | Palmbourne
I have spoken to my new developer and he says everything is fine, in Webmaster tools I can see some redirects from his domain to mine when the site was in testing mode. I am concerned that something is not right as I always check my pages on a regular basis. Can anyone shed any light on this, is it right or am I right to be concerned. Thank you in advance0 -
Excluding Cookieless Static Content Sub-domain from GA/GTM
For the purposes of this question our ecommerce site url is www.ecommerce.com Our TLD is ecommerce.com We have, following advice from Yslow, Pagespeed and others, moved our static content to a subdomain - static.ecommerce.com We have Google Analytics and Enhance Ecommerce installed, fired from GTM. The cookieDomain setting in GTM is 'auto' At present cookies are being attached to our static resources. What changes do I need to make to to prevent this happening? Many thanks Julian
Reporting & Analytics | | jdeb0 -
Ecommerce site product link. How to handle a link that doesn't exist.
Suppose we have this product A, and we just have a single item for this. When the item is sold out we do not want to show it on the website saying "out of stock". Instead we would like to remove the product from out store which will now result in a url that doesn't exist. And google webmaster tool and Moz analytic will show them as page not found after they crawl over the site. Should i be generating a new sitemap.xml and update ? How do i handle those pages that don't exist anymore ? Thanks
Reporting & Analytics | | MindlessWizard0 -
Site account in Google Analytics
Hello I have a question about my site account. On 2014, during a week, my ID tracking of Google Analytics was removed of the site, in this period the volume of users and sessions is lower than the other weeks. But I don't understand why are the sessions and users still reporting during this period without ID Tracking
Reporting & Analytics | | Arkix0 -
Which Algorithm Change Hurt the Site? A causation/correlation issue
The attached graph is from google analytics, a correlation of about 14 months of Organic Google visits with algo changes, data from moz naturally 🙂 Is there any way to tell from this which will have affected the site? for example #1 or #2 seems to be responsible for the first dip, but #4 seems to fix it and it broke around 6, or is the rise between 4 and 7 an anomaly and actually 1 or 2 caused a slip from when it was released all the way to when 7 was released. Sorry if the graph is a little cloak and dagger, that is partly because we don't have permissions to reveal much about the identity, and partly because we were trying to do a kind of double blind, separating the data from our biases 🙂 We can say though the different between the level at the start and end of the graph is at least 10,000 visits per day JarMzoK.png
Reporting & Analytics | | Fammy0 -
Any harm and why the differences - multiple versions of same site in WMT
In Google Webmaster Tools we have set up: ourdomain.co.nz
Reporting & Analytics | | zingseo
ourdomain.co.uk
ourdomain.com
ourdomain.com.au
www.ourdomain.co.nz
www.ourdomain.co.uk
www.ourdomain.com
www.ourdomain.com.au
https://www.ourdomain.co.nz
https://www.ourdomain.co.uk
https://www.ourdomain.com
https://www.ourdomain.com.au As you can imagine, this gets confusing and hard to manage. We are wondering whether having all these domains set up in WMT could be doing any damage? Here http://support.google.com/webmasters/bin/answer.py?hl=en&answer=44231 it says: "If you see a message that your site is not indexed, it may be because it is indexed under a different domain. For example, if you receive a message that http://example.com is not indexed, make sure that you've also added http://www.example.com to your account (or vice versa), and check the data for that site." The above quote suggests that there is no harm in having several versions of a site set up in WMT, however the article then goes on to say: "Once you tell us your preferred domain name, we use that information for all future crawls of your site and indexing refreshes. For instance, if you specify your preferred domain as http://www.example.com and we find a link to your site that is formatted as http://example.com, we follow that link as http://www.example.com instead." This suggests that having multiple versions of the site loaded in WMT may cause Google to continue crawling multiple versions instead of only crawling the desired versions (https://www.ourdomain.com + .co.nz, .co.uk, .com.au). However, even if Google does crawl any URLs on the non https versions of the site (ie ourdomain.com or www.ourdomain.com), these 301 to https://www.ourdomain.com anyway... so shouldn't that mean that google effectively can not crawl any non https://www versions (if it tries to they redirect)? If that was the case, you'd expect that the ourdomain.com and www.ourdomain.com versions would show no pages indexed in WMT, however the oposite is true. The ourdomain.com and www.ourdomain.com versions have plenty of pages indexed but the https versions have no data under Index Status section of WMT, but rather have this message instead: Data for https://www.ourdomain.com/ is not available. Please try a site with http:// protocol: http://www.ourdomain.com/. This is a problem as it means that we can't delete these profiles from our WMT account. Any thoughts on the above would be welcome. As an aside, it seems like WMT is picking up on the 301 redirects from all ourdomain.com or www.ourdomain.com domains at least with links - No ourdomain.com or www.ourdomain.com URLs are registering any links in WMT, suggesting that Google is seeing all links pointing to URLs on these domains as 301ing to https://www.ourdomain.com ... which is good, but again means we now can't delete https://www.ourdomain.com either, so we are stuck with 12 profiles in WMT... what a pain.... Thanks for taking the time to read the above, quite complicated, sorry!! Would love any thoughts...0 -
If you have G+ buttons on your site, does google still suggest you add them?
We've had G+ buttons on the site for many months now (Can't remember exactly when they were added.) Yet in Google Webmaster Tools, they still give me this message: "Get more recommendations in Google Search and grow your audience on Google+. Add the Google+ badge to your site." Is this happening to everyone, or is it just me? Do they think the buttons aren't there? Also, they say this: "Your site doesn't have enough +1's yet to show characteristics." According to the stats, 551 unique people have +1'd our pages. How many does it take, to get stats? Anyone willing to give stats?
Reporting & Analytics | | loopyal0