Duplicate pages
-
Hi I have recently signed up to Moz Pro and the first crawl report on my wordpress site has brought up some duplicate content issues. I don't know what to do with this data!
The original page : http://www.dwliverpoolphotography.co.uk/blog/
and the duplicate content page : http://www.dwliverpoolphotography.co.uk/author/david/
If anyone can point me to a resource or explain what I need to do thanks!
David.
-
Awesome - thanks for all the extra info David!
Just to clarify, do you mean 301 the author archives to the homepage? Yoast does this when you check "disable author archives".
-
Just wanted to add that he may want to process the 301's, even after setting the plugin correctly, as Google may have already indexed a lot of the non-SEF and author pages. On the new canoncial pages, it's probably a good idea to also check "no-archive" so the search engine only shows the most recent result.
David, for you to reference:
- NOINDEX tag tells Google not to index a specific page
- NOFOLLOW tag tells Google not to follow the links on a specific page
- NOARCHIVE tag tells Google not to store a cached copy of your page
- NOSNIPPET tag tells Google not to show a snippet (description) under your Google listing, it will also not show a cached link in the search results
Best of luck with your edits!
-
thank's guy's I will let you know how I get on!
Best wishes.
David
-
301 redirects are processed through your htaccess file, and your server. Here is an example of what the code looks like. This particular example is used to force "www" on our site, so that a user cannot access multiple versions of the home page: (btw if you don't have this in place, you should)
force www
RewriteCond %{HTTP_HOST} !^www.webdesignandcompany.com$ [NC]
RewriteRule ^(.*)$ http://www.webdesignandcompany.com/$1 [R=301,L]The # symbol allows the description to be ignored, so that you can organize your htaccess rules easily, and have them labeled.
If you use cpanel for hosting, or have a host provider that uses it, here is how to process through the backend admin:
http://docs.cpanel.net/twiki/bin/view/AllDocumentation/CpanelDocs/ReDirects"I know you can use a plug in for wordpress, have you had any experience yourself implementing the rel'canonical' or 301 redirect?"
Super simple, so don't sweat it. By using the Yoast SEO Plugin, you can set the canonical page directly from the page's editor.
Here is the link explaining how to do just that:
https://yoast.com/wordpress/plugins/canonical/Hope this helps! If you need any further assistance let me know.
-
David
You have Yoast SEO installed, so follow these steps;
- Go to SEO->Titles/Meta->Other
- and for "author archives" check "noindex, follow"
- and if this is a single author blog, check "disable author archives"
For more details on setting up WordPress for SEO, you can check out my guide here: http://moz.com/blog/setup-wordpress-for-seo-success
-Dan
-
You are most welcome David. You can use Yoast for Wordpress for handling rel=canonial.
Here for more: https://yoast.com/wordpress/plugins/seo/#canonical
Here you go for implementing 301 permanent redirection using .htaccess file on Apache server(Linux hosting):
If you are on a Windows server, here are the steps for 301 redirection:
http://www.iis.net/configreference/system.webserver/httpredirect
301 redirection is definitely faster than rel=canonical and almost same when it comes to passing on the SEO goodies to the canonical page. With 301 in place, in this case with the URLs mentioned by you for example(suppose, the author page is redirected to the homepage via 301), no one will be able to see the author page as it would take you to the homepage but with rel=canonical in place, everyone will be able to see the author page and its just that the search engines like Google will not index the author page as the cononical or the preferred page would be the homepage. So, ideally, you should be going with the rel=canonical implementation here. Hope it helps.
Good Luck my friend.
Best regards,
Devanur Rafi
-
Hi Devanur, thank you for your response.
I have read that the 301 redirect passes more link juice to the original page. Although I am still trying to figure out how to actually physically do it!
I know you can use a plug in for wordpress, have you had any experience yourself implementing the rel'canonical' or 301 redirect?
Best wishes.
David.
-
Hi David,
I will be very quick here. 'rel=canonial' can come to your rescue.
Here you go for more: https://support.google.com/webmasters/answer/139066?hl=en
Here is another article from Moz regarding duplicate content and how to go about it:
http://moz.com/learn/seo/duplicate-content
Best regards,
Devanur Rafi
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicated content by the product pages
Hi,Do you thing those pages have duplicate content:https://www.nobelcom.com/Afghanistan-phone-cards/from-Romania-235-2.htmlhttps://www.nobelcom.com/Afghanistan-phone-cards-2.htmlhttps://www.nobelcom.com/Afghanistan-Cell-phone-cards-401.htmlhttps://www.nobelcom.com/Afghanistan-Cell-phone-cards/from-Romania-235-401.html.And also how much impact will it have on a panda update?I'm trying to figure out if all the product pages, (that are in the same way as the ones above) are the reson for a Panda Penalty
On-Page Optimization | | Silviu0 -
How do I fix duplicate page issue on Shopify with duplicate products because of collections.
I'm working with a new client with a site built on Shopify. Most of their products appear in four collections. This is creating a duplicate content challenge for us. Can anyone suggest specific code to add to resolve this problem. I'm also interested in other ideas solutions, such as "don't use collections" if that's the best approach. I appreciate your insights. Thank you!
On-Page Optimization | | quiltedkoala0 -
Identifying Duplicate Page Title
Moz weekly reports, among other things, the "Duplicate Page Title". How can I identify which two urls/pages have duplicate page titles? Is there any simple way to trace?
On-Page Optimization | | Sequelmed0 -
On Brand Queries Google does not shows my home page first instead of it shows internal pages.
Also on my brand query it doesn't shows sitelinks. What may be the reason?
On-Page Optimization | | vivekrathore0 -
How to overcome blog page 1, 2, 3, etc having no or duplicate meta info?
As the above what is the best way to overcome having the same meta info on your blog pages (not blog posts) So if you have 25 blog posts per page once you exceed this number you then move onto a second blog page, then when you get to 50 you then move onto a 3rd blog page etc etc So if you have thousands f blog pages what is the best method to deal with this rather than having to write 100s of different meta titkes & descriptions? Cheers
On-Page Optimization | | webguru20141 -
Duplicate Page Content
Hey Moz Community, Newbie here. On my second week of Moz and I love it but have a couple questions regarding crawl errors. I have two questions: 1. I have a few pages with duplicate content but it say 0 duplicate URL's. How do I know what is duplicated in this instance? 2. I'm not sure if anyone here is familiar with an IDX for a real estate website. But I have this setup on my site and it seems as though all the links it generates for different homes for sale show up as duplicate pages. For instance, http://www.handyrealtysa.com/idx/mls...tonio_tx_78258 is listed as having duplicate page content compared with 7 duplicate URLS: http://www.handyrealtysa.com/idx/mls...tonio_tx_78247
On-Page Optimization | | HandyRealtySA
http://www.handyrealtysa.com/idx/mls...tonio_tx_78253
http://www.handyrealtysa.com/idx/mls...tonio_tx_78245
http://www.handyrealtysa.com/idx/mls...tonio_tx_78261
http://www.handyrealtysa.com/idx/mls...tonio_tx_78258
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260 I've attached a screenshot that shows 2 of the pages that state duplicate page content but have 0 duplicate URLs. Also you can see somewhat about the idx duplicate pages. rel="canonical" is functioning on these pages, or so it seems when I view the source code from the page. Any help is greatly appreciated. skitch.png0 -
Locating Duplicate Pages
Hi, Our website consists of approximately 15,000 pages however according to our Google Webmaster Tools account Google has around 26,000 pages for us in their index. I have run through half a dozen sitemap generators and they all only discover the 15,000 pages that we know about. I have also thoroughly gone through the site to attempt to find any sections where we might be inadvertently generating duplicate pages without success. It has been over six months since we did any structural changes (at which point we did 301's to the new locations) and so I'd like to think that the majority of these old pages have been removed from the Google Index. Additionally, the number of pages in the index doesn't appear to be going down by any discernable factor week on week. I'm certain it's nothing to worry about however for my own peace of mind I'd like to just confirm that the additional 11,000 pages are just old results that will eventually disappear from the index and that we're not generating any duplicate content. Unfortunately there doesn't appear to be a way to download a list of the 26,000 pages that Google has indexed so that I can compare it against our sitemap. Obviously I know about site:domain.com however this only returned the first 1,000 results which all checkout fine. I was wondering if anybody knew of any methods or tools that we could use to attempt to identify these 11,000 extra pages in the Google index so we can confirm that they're just old pages which haven’t fallen out of the index yet and that they’re not going to be causing us a problem? Thanks guys!
On-Page Optimization | | ChrisHolgate0 -
How can I make it so that the various iterations (pages) do not come up as duplicate content ?
Hello, I wondered if somebody could give me some advice. The problem of various iterations of the clanedar page coming up as duplicate content. There is a large calendar on my site for events and each time the page is viewed it is seen as duplicate content . How can I make it so that the various iterations (pages) do not come up as duplicate content ? Regards
On-Page Optimization | | Tony14Aug0