Duplicate Content after Moz Site Audit
-
Hello folks,
So I signed up for the trial version of the Moz tool and ran an initial site audit. One of the site audit results is confusing me.
It reports that there are two pages with duplicate content ( Each page has a duplicate page with duplicate content in it).
When I take a look at what those pages are, here is what I see:mysite.com/Contact-Us.html
mysite.com/contact-us.html
( The difference in the above is the Contact and Us, the first letters are capitalized on one of the URLS)mysite.com/index.html
mysite.comNow I am confused because for one thing, I don't have 2 Contact Us html files uploaded on my hosting server.
Why is Moz seeing 2 Contact Us pages? How to remove one?Regarding my home page, why is it flagging the same page as two different pages? How to remove of them?
-
Sure thing,
Using a canonical only would still let you access mysite.com/index.html and would display that url in the browser. This means 2 things, firstly a user can see this url (and it can look a little messy) if they happen to find their way onto this page and 2, they may link to your website using this url (many people copy and paste links from the browser window). Whilst this isn't a problem as the canonical would pass link juice anyway it makes things a little "messy".
A 301 would do exactly the same as the canonical in terms of passing link juice etc but it wouldn't let the user access mysite.com/index.html they would be redirected to mysite.com removing the possibility anyone would see or link to index.html
Both solutions fix your problem, one is just a little neater.
-
No worries on the delayed response. It is important to enjoy your weekend!
Regarding the 301 redirect, now I must ask, what do you mean by "neater" in the browser?
Just trying to get all the information and understand what I am doing before I go ahead and modify anything.
Appreciate the help.
-
Hi Jorge, sorry for the delay responding i was away for the weekend.
You most likely don't have any links pointing the mysite.com/index.html
Index.html is the default hompage for most websites. mysite.com technically points to a folder and searches for the index.html file within this folder. As such, both address for your homepage are nearly always found.
A canonical will fix this, if you have the non-www version as your preferred domain go with
Many people prefer to 301 redirect this page as its neater in the browser. But the canonical will do the job.
-
ATP,
Thank you for the information. So I did a bit of poking around on the site and found that on a few pages, the Contact-Us.html link was in fact capitalized on some pages and on others it was not. I proceeded to capitalize the first letters of each word on all the link references on all the pages, and re-ran the site audit, and the tool no longer flags the Contact-Us pages as being duplicates. Great stuff.
I then proceeded to look for links in any of my pages which have either www.mysite.com or www.mysite.com/index.html and did not find any differences. All of the links in the code are pointing to the home page using:
[This would tell the search engines that the real version and all the "link juice" should go to www.mysite.com.
Which brings up another question, should I use the www. version or the non-www. version? See I have the non www. version as my preferred domain set in my hosting provider, as well as in Google Webmaster Tools ( Google Search Console ).](/index.html)
-
Hi Jorge,
lets take it from the top
Moz tries to show you, and report on how google would see you site.
When you type in a url, the browser and server holding and displaying the website doesn't care if you use capitals or lowercase, for their purpose it is the same page. This is why you will have only created this page once on whatever web platform you are using. However, google sees them differently, each one as a different page.
You could access this page from any combination of capital letters even something stupid like
These hundred of variations are never picked up on simply because we dont use them.
Lets presume you wanted the the page to be reachable at "mysite.com/contact-us.html" and made it this way. The reason the second variation has been picked up on is most likely because you have used it (or someone else has) to link to that page. Somewhere somebody will have Link Text
Because of this link the second variation is found and because google treats it as a different page, moz is reporting it as a different page.
It is a similiar case with your
mysite.com
mysite.com/index.htmlIs is the same page accessible at 2 different urls.
To combat this, you need to use a solution such as
1. Canonical Tags (Recomended)
On your homepage get this code inserted between the tags
On your contact page get this code inserted between the tags
This will cause all versions of this page that are "accidentally made" to say "Hey, im just a copy of this page"
2. 301 Redirects
The second solution is to put a 301 redirect in place, this varies depending on what web platform you are on. This simply redirects the user and any crawl bot to the intented pagei.e. someone tries to go to mysite.com/index.html and your website stops it loading and sends them to mysite.com
This is normally done by editing your htaaccess file. If you want to go this road tell us what platform you website is on and we can give you instructions.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How do I check PA in moz
i see where I can check the DA for a site, but how can I check the PA of a page?
Getting Started | | Konvertica0 -
Moz unable to crawl my Zenfolio website
Hey guys, I am attempting to optimize a website for my wife's business but Moz is unable to crawl it. Zenfolio is the web hosting service (she is a photographer). The error message is: **Moz was unable to crawl your site on Apr 1, 2019. **Our crawler was not able to access the robots.txt file on your site. This often occurs because of a server error from the robots.txt. Although this may have been caused by a temporary outage, we recommend making sure your robots.txt file is accessible and that your network and server are working correctly. Typically errors like this should be investigated and fixed by the site webmaster. Read our troubleshooting guide. I did read the troubleshooting guide but nothing worked. My robots.txt file disallows a few bots, but not roger bot. Anyone have any idea what is going on? Or do I need to request server logs from Zenfolio? Thanks
Getting Started | | bpenn111 -
Duplicate titles issue
Hi, For the second week in a row MOZ is finding duplicate titles crawling my website.
Getting Started | | AlessiaCamera
But as you see in the attached screenshot it doesn't seem it's a clear duplicate title thing, as it's mainly due to the different pages having the same title.
What should I do? Is it really affecting my SERP positioning? ZANRT0 -
Can't track my site, keep getting "Ooops. Our crawlers are unable to access that URL"
Hello, So i keep getting this message and I went to hurl.it and I get 200 response. But it appears its not my actual homepage bc it says the body is empty and in the title it says "COMING SOON" which is not what my actual homepage says. Does anyone know what this means?? Thank you in advance! Rena
Getting Started | | Palila-Studio0 -
My question is, when you translate your website to another language, does moz crawl both or do i have to add another campaign to moz so that they can crawl it seperately?
Hello, i recently translated my website to spanish, keywords,meta tags, content etc. All of our URL for the translated pages start "ep", which symbolizes espanol. My question is, does moz crawl those pages along with my english pages? or do i have add another campaign for moz to crawl my spanish pages seperately?
Getting Started | | prestigeluxuryrentals.com0 -
A lot of duplicate content issues - does Moz understand canonical URL?
Hi, Since I subscribed to Moz my Magento store has given a lot of duplicate content issues. However, I did have a problem with Canonical URL at the time. It has been settled for a couple of weeks by now and although I had 302 redirects before, I configured Magento to 301 today. Since Moz has been crawling and showing duplicate content for exactly the same Magento pages but with endings like store=us, store=aus etc (since I have several store views enabled), I am wondering whether canonical URL does actually help Google to skip these versions of the duplicate pages and does Moz also understand it and will it reduce the amount of duplicate content errors once the 301 redirects and canonical URLs have been properly set for a week or so? Thanks!
Getting Started | | speedbird12290 -
Moz's official stance on Subdomain vs Subfolder - does it need updating?
Hi, I am drawing your attention to Moz's Domain basics here: http://moz.com/learn/seo/domain It reads: "Since search engines keep different metrics for domains than they do subdomains, it is recommended that webmasters place link-worthy content like blogs in subfolders rather than subdomains. (i.e. www.example.com/blog/ rather than blog.example.com) The notable exceptions to this are language-specific websites. (i.e., en.example.com for the English version of the website)." I am wondering if this is still Moz's current recommendation on the subfolders vs subdomains debate, given that the above (sort of) implies that SE's may not combine ranking factors to the domain as a whole if subdomains are used - which (sort of) contradicts Matt Cutts last video on the matter ( http://www.youtube.com/watch?v=_MswMYk05tk ) which implies that this is not the case and there is so little difference that their recommendation is to use whatever is easiest. It would also seem to me that if you were looking through the eyes of Google, it would be silly to treat them differently if there were no difference at all other than subdomain vs subfolder as one of the main reasons a user would use a sud-domain is a technical on for which it would not make sense for Google to treat differently in terms of its algorithm. I notice that in terms of Moz, while most of the site uses subfolders, you do have http://devblog.moz.com/ - and I was wondering if this is due to a technical reason or conscious decision, as it would seem to me that the content within this section is indeed linkworthy (as it has external links pointing to it from external sources), therefore it would seem to not be following the initial advice that is posted in Moz's basics on domains. Therefore I am assuming it is due to a technical reason - or that Moz's adive is out of date with current Moz thinking, and is indeed in line with Matt C in that it doesn't matter. Cheers
Getting Started | | James773 -
Why does moz show "not in top 50" for all my keywords???
Hello, I signed up to moz pro 4 days ago. And so far it seems to be tracking visits etc. But all my keywords say "not in top 50" . Why is this? Is this normal? Just to confirm most of the keywords i pasted in from my webmaster tools and i only chose the ones that were in top 50
Getting Started | | casper09030