Duplicate pages
-
Hi, I have recently signed up to Moz Pro, and the first crawl report on my WordPress site has brought up some duplicate content issues. I don't know what to do with this data!
The original page : http://www.dwliverpoolphotography.co.uk/blog/
and the duplicate content page : http://www.dwliverpoolphotography.co.uk/author/david/
If anyone can point me to a resource or explain what I need to do, I'd be grateful. Thanks!
David.
-
Awesome - thanks for all the extra info David!
Just to clarify, do you mean 301 the author archives to the homepage? Yoast does this when you check "disable author archives".
-
Just wanted to add that he may want to process the 301s even after setting the plugin correctly, as Google may have already indexed a lot of the non-SEF and author pages. On the new canonical pages, it's probably a good idea to also check "noarchive" so the search engine only shows the most recent result.
David, for you to reference:
- NOINDEX tag tells Google not to index a specific page
- NOFOLLOW tag tells Google not to follow the links on a specific page
- NOARCHIVE tag tells Google not to store a cached copy of your page
- NOSNIPPET tag tells Google not to show a snippet (description) under your Google listing; it will also not show a cached link in the search results
Best of luck with your edits!
-
Thanks, guys! I will let you know how I get on.
Best wishes.
David
-
301 redirects are processed through your .htaccess file and your server. Here is an example of what the code looks like. This particular example forces "www" on our site, so that a user cannot access multiple versions of the home page (by the way, if you don't have this in place, you should):
# force www
RewriteEngine On
RewriteCond %{HTTP_HOST} !^www\.webdesignandcompany\.com$ [NC]
RewriteRule ^(.*)$ http://www.webdesignandcompany.com/$1 [R=301,L]
The # symbol marks a line as a comment that the server ignores, so you can label your .htaccess rules and keep them organized.
If you use cPanel for hosting, or have a host provider that uses it, here is how to set up redirects through the back-end admin:
http://docs.cpanel.net/twiki/bin/view/AllDocumentation/CpanelDocs/ReDirects
"I know you can use a plugin for WordPress. Have you had any experience yourself implementing rel=canonical or a 301 redirect?"
Super simple, so don't sweat it. By using the Yoast SEO Plugin, you can set the canonical page directly from the page's editor.
Here is the link explaining how to do just that:
https://yoast.com/wordpress/plugins/canonical/
Hope this helps! If you need any further assistance, let me know.
-
David
You have Yoast SEO installed, so follow these steps:
- Go to SEO->Titles/Meta->Other
- and for "author archives" check "noindex, follow"
- and if this is a single author blog, check "disable author archives"
For more details on setting up WordPress for SEO, you can check out my guide here: http://moz.com/blog/setup-wordpress-for-seo-success
-Dan
-
You are most welcome, David. You can use Yoast for WordPress to handle rel=canonical.
Here for more: https://yoast.com/wordpress/plugins/seo/#canonical
Here you go for implementing a 301 permanent redirect using the .htaccess file on an Apache server (Linux hosting):
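For the two URLs in this thread, a 301 in .htaccess might look roughly like the sketch below. It assumes WordPress is installed at the document root, that mod_rewrite is enabled, and that /blog/ is the page you want to keep. The target URL is only an illustration, so swap in the homepage if that is where the author archive should go. The rule would need to sit above the "# BEGIN WordPress" block so it runs before WordPress's own rewrite rules.

```apache
# Sketch only: 301 the single-author archive to the blog index.
# Assumes WordPress lives at the document root and mod_rewrite is available.
<IfModule mod_rewrite.c>
RewriteEngine On
# Matches /author/david and /author/david/ only, not paged archives such as /author/david/page/2/
RewriteRule ^author/david/?$ http://www.dwliverpoolphotography.co.uk/blog/ [R=301,L]
</IfModule>
```

If Yoast's "disable author archives" option is already switched on, the plugin handles the redirect itself and a hand-written rule like this should not be needed.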
If you are on a Windows server, here are the steps for 301 redirection:
http://www.iis.net/configreference/system.webserver/httpredirect
A 301 redirect generally takes effect faster than rel=canonical, and the two are almost the same when it comes to passing the SEO goodies on to the canonical page. The difference is what visitors see. With a 301 in place (suppose the author page from your example is redirected to the homepage), no one can see the author page, because every request for it is sent on to the homepage. With rel=canonical in place, everyone can still see the author page; search engines like Google simply will not index it, because the canonical (preferred) page is the homepage. So, ideally, you should go with the rel=canonical implementation here. Hope it helps.
Good Luck my friend.
Best regards,
Devanur Rafi
-
Hi Devanur, thank you for your response.
I have read that a 301 redirect passes more link juice to the original page, although I am still trying to figure out how to actually do it!
I know you can use a plugin for WordPress. Have you had any experience yourself implementing rel=canonical or a 301 redirect?
Best wishes.
David.
-
Hi David,
I will be very quick here. rel=canonical can come to your rescue.
Here you go for more: https://support.google.com/webmasters/answer/139066?hl=en
Here is another article from Moz regarding duplicate content and how to go about it:
http://moz.com/learn/seo/duplicate-content
Best regards,
Devanur Rafi