Duplicate Content after Moz Site Audit
-
Hello folks,
So I signed up for the trial version of the Moz tool and ran an initial site audit. One of the site audit results is confusing me.
It reports that there are two pages with duplicate content ( Each page has a duplicate page with duplicate content in it).
When I take a look at what those pages are, here is what I see:mysite.com/Contact-Us.html
mysite.com/contact-us.html
( The difference in the above is the Contact and Us, the first letters are capitalized on one of the URLS)mysite.com/index.html
mysite.comNow I am confused because for one thing, I don't have 2 Contact Us html files uploaded on my hosting server.
Why is Moz seeing 2 Contact Us pages? How to remove one?Regarding my home page, why is it flagging the same page as two different pages? How to remove of them?
-
Sure thing,
Using a canonical only would still let you access mysite.com/index.html and would display that url in the browser. This means 2 things, firstly a user can see this url (and it can look a little messy) if they happen to find their way onto this page and 2, they may link to your website using this url (many people copy and paste links from the browser window). Whilst this isn't a problem as the canonical would pass link juice anyway it makes things a little "messy".
A 301 would do exactly the same as the canonical in terms of passing link juice etc but it wouldn't let the user access mysite.com/index.html they would be redirected to mysite.com removing the possibility anyone would see or link to index.html
Both solutions fix your problem, one is just a little neater.
-
No worries on the delayed response. It is important to enjoy your weekend!
Regarding the 301 redirect, now I must ask, what do you mean by "neater" in the browser?
Just trying to get all the information and understand what I am doing before I go ahead and modify anything.
Appreciate the help.
-
Hi Jorge, sorry for the delay responding i was away for the weekend.
You most likely don't have any links pointing the mysite.com/index.html
Index.html is the default hompage for most websites. mysite.com technically points to a folder and searches for the index.html file within this folder. As such, both address for your homepage are nearly always found.
A canonical will fix this, if you have the non-www version as your preferred domain go with
Many people prefer to 301 redirect this page as its neater in the browser. But the canonical will do the job.
-
ATP,
Thank you for the information. So I did a bit of poking around on the site and found that on a few pages, the Contact-Us.html link was in fact capitalized on some pages and on others it was not. I proceeded to capitalize the first letters of each word on all the link references on all the pages, and re-ran the site audit, and the tool no longer flags the Contact-Us pages as being duplicates. Great stuff.
I then proceeded to look for links in any of my pages which have either www.mysite.com or www.mysite.com/index.html and did not find any differences. All of the links in the code are pointing to the home page using:
[This would tell the search engines that the real version and all the "link juice" should go to www.mysite.com.
Which brings up another question, should I use the www. version or the non-www. version? See I have the non www. version as my preferred domain set in my hosting provider, as well as in Google Webmaster Tools ( Google Search Console ).](/index.html)
-
Hi Jorge,
lets take it from the top
Moz tries to show you, and report on how google would see you site.
When you type in a url, the browser and server holding and displaying the website doesn't care if you use capitals or lowercase, for their purpose it is the same page. This is why you will have only created this page once on whatever web platform you are using. However, google sees them differently, each one as a different page.
You could access this page from any combination of capital letters even something stupid like
These hundred of variations are never picked up on simply because we dont use them.
Lets presume you wanted the the page to be reachable at "mysite.com/contact-us.html" and made it this way. The reason the second variation has been picked up on is most likely because you have used it (or someone else has) to link to that page. Somewhere somebody will have Link Text
Because of this link the second variation is found and because google treats it as a different page, moz is reporting it as a different page.
It is a similiar case with your
mysite.com
mysite.com/index.htmlIs is the same page accessible at 2 different urls.
To combat this, you need to use a solution such as
1. Canonical Tags (Recomended)
On your homepage get this code inserted between the tags
On your contact page get this code inserted between the tags
This will cause all versions of this page that are "accidentally made" to say "Hey, im just a copy of this page"
2. 301 Redirects
The second solution is to put a 301 redirect in place, this varies depending on what web platform you are on. This simply redirects the user and any crawl bot to the intented pagei.e. someone tries to go to mysite.com/index.html and your website stops it loading and sends them to mysite.com
This is normally done by editing your htaaccess file. If you want to go this road tell us what platform you website is on and we can give you instructions.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Changing the Moz Crawl Date
Hello, I am wondering where I can change the date of Crawl by Moz. I would like to change this crawl period from one week to 2 or even 3 weeks for Moz to crawl my website. Hope to hear from anyone soon. Kind regards, Koen.
Getting Started | | Koenniiee1 -
Moz site crawl doesn't work
The Moz site crawl isn't working for my campaign, but works for the site's on demand crawl. The search should not be disallowed by robots.txt or the headers. I'd like to be able to track the website for the campaign so I can see SEO gains / losses and increases / decreases in indexing.
Getting Started | | DrainKing0 -
Does MOZ pick up every issue in one crawl?
Hi, Does MOZ pick up every error/warning in one crawl? Or does it take numerous crawls? Many thanks Lee
Getting Started | | lbagley0 -
How Do I Scan My New Site & Grade My Work With The Robots Turned Off? For Pre-Inspection before I launch my Site?
I have a new site that has all the bots turned off so google can't index my site until I'm finished it. I've been working on this site for a couple months now optimizing and I was wondering if there was anyway I can run a preliminary scan on the site for my titles, URLs, Headers, Alt Tags and pretty much anything else that will grade my work and tell me if i did anything wrong? Can MOZ do this with the Bots turned off? Thanks
Getting Started | | Inframan0 -
Trending Bugs in Moz Analytics
The FAQ’s We hope we can help! Below are the trending issues at the moment. If you don’t see your question addressed or need further help, send us a message at http://moz.com/help/contact. Our Help Team gumshoes will investigate your issue and respond shortly. Where did my Campaigns Go?
Getting Started | | Abe_Schmidt
Whenever your account is suspended, your campaigns are archived. The good news is that reviving archived accounts is really simple. Go to your Campaigns section: http://pro.moz.com/campaigns
At the top of the page there is a "Archived Campaigns" Tab, give it a click.
From here, you should easily be able to "activate" your campaigns. My Google Analytics profile won’t stay connected to my campaign and/or I am missing profiles in my GA Settings. Google goes through a process called Oauth when it comes to authorizing access of services that are linked to your Google account. Under any single Google account, there are about 20 tokens account wide. Those tokens are used to provide access for a variety of services from Gmail to apps on your phone. Once you hit the limit, which is 20 for most users, the system automatically revokes the oldest token to provide a new one. I'm not certain if this is causing the issue on your account, but it is a great place to start troubleshooting. You may be able to correct the issue by manually revoking your tokens to make room for new ones on your account and then reconnecting the account to Moz. Here are the instructions on how to do that: Follow this link https://accounts.google.com/b/0/IssuedAuthSubTokens. This page displays the current OAuth tokens you are using. Once you reach the page, simply press the revoke button (illustrated on this screenshot: http://screencast.com/t/vjh3KrjRRIe) for services that you are not using right now. Once you are done with, that simply go back to your campaign settings, disconnect your GA profile and reconnect. The below process may also fix this issue for you. Head to your campaign settings page on your overview page. Disconnect your Google Analytics connection. Go back to your settings page and click on "connect account." Please make sure you log into the correct GA account. 🙂 Hurray! This should let you grab the most current traffic data! Oops! try refreshing page! I can’t access any of my Campaign data.
This issue is normally machine specific and can be a bit complex. An individual forum has been created to address this: https://seomoz.zendesk.com/entries/28203486-Oops-Try-Refreshing-the-Page-Error-showing-on-all-pages- Unable to retrieve historical ranking CSVs. Some users did not receive their requested Historical Rankings CSVs. This has mostly been affected by a change or update to your competitors after a campaign has been setup for a while. Please send your request to retrieve this data with http://moz.com/help/contact.8 -
MOZ Starter Crawl not happeneing
Hi I added a new site 48hours + ago and the starter crawler has not even begun collecting data. Any help would be appreciated. cheers Isaac
Getting Started | | sodafizz0 -
Custom reports in Moz Analytics?
I am trying to find where to configure whitelabled custom reports in moz analytics. Where can I find them? If they are only accessible through the old version, how do I switch back?
Getting Started | | keybroker1 -
Where to find answers to really dumb questions about setting up Moz campaigns...
I had a Moz account earlier this year, let it drop for a few months, then decided it really was important to have one. So I'm back. But there is still something about it that I really find frustrating. I can't seem to find any answers to what seem to be basic questions. Maybe they are too simple, maybe they are just dumb questions, but I sure would appreciate it if somebody could lead me to another source of information. Maybe somebody has written a "Moz for Dummies" book? Here's an example: I'm setting up campaigns for my websites, and I get to the Brand and Mentions section. It tries to autofill the space using my campaign name, but that's not accurate. So I go look at the Help Hub for Brands and Mentions, and it gives an example using MOZ as the brand name. But not many sites have a three-letter domain name, do they?. One of my domains is EasyDigging.com (2 words) and the other is BestDryingRack.com (3 words). So I look and look, but can't find any examples of longer domain names. So I have no idea how to enter my Brand. Should it be the whole "EasyDigging.com" or should it just be "EasyDIgging" or should it be broke up into "Easy Digging"? This is just 1 example. I find these sort of unanswered questions almost every time I try something new. Is there a collection of examples anywhere? I really hope so. Surely there are others who learn best from examples, who hate having to guess based on slim or vague instructions. Keeping my fingers crossed that somebody can lead me to the goldmine of good examples. Thanks!
Getting Started | | GregB1230