404 for duplicate content?
-
Sorry, I think this is my third question today...
But I have a lot of duplicated content on my site. I use joomla so theres a lot of unintentional duplication. For example, www.mysite.com/index.php exists, etc.
Up till now, I thought I had to 301 redirect or rel=canonical these "duplicated pages."
However, can I just 404 it? Is there anything wrong with this rpactice in regards to SEO?
-
I agree with Andy here. Too many 404's can hurt your site. EVEN Google says that in GWT. I wouldn't do any 404s. I would 301 or robot.txt folders.
You may want to robots.txt some folders. Sometimes you can get a plugin and fix things quickly.
-
Hi Kyu,
Remember, canonical is only a suggestion to google of which page should be delivering the content - it is still up to them what they do. In practice though, this is what many opt for.
301's are a permanent redirect and too many can suggest a poor underlying site - you wouldn't want a 301 for every page if there were a lot of them.
You could also think about Robots to remove some of the duplicated pages so they never get spidered, or just no-index them.
404's for me wouldn't be the ideal scenario because somewhere in the site, it can lead to what is basically a dead page. Too many 404's can actually harm your ranking because when Google spiders and finds them, if you have a large enough site, they could be met with 200+ dead pages!
On some sites, you are able to just remove the pages altogether, but you can't do this with the likes of Joomla.
Think about no-indexing / robots because although the pages will still be there, you are telling Google not to bother. This is the route many SEO's are taking now.
Andy
-
You are very welcome. I think "simpler" could be a relative term
All three are appropriate in different situations. However, there are times when people have very limited access to source code or to the backends of their websites, so then one solution might work better than another.
As far as 404s go it's really all about what's best and most appropriate from a user standpoint. If you can guide visitors to content relevant to their search query via a 301-redirect, they are probably going to be more satisfied with that than a 404. This could potentially indirectly effect your SEO because if your bounce rate increases or your 404 pages results in a lot of pogo-sticking by potential visitors, your site could be effected negatively by Googe's algorithm.
When at all possible, I try to do a 301-redirect. But in the cases of really old content that may no longer accurately represent our content or products (and that also doesn't have veyr many inbound links) a 404 might be just fine.
Sorry, that's a bit of a long answer, but I hope it helps!
Dana
-
Thanks Dana! Youve been so helpful!
But one thing I am confused about, when i read articles about how to fix duplicate content, they always talk about the best two options being 301 or rel=canonical. Why is that?
Isnt 404 error simpler?
Hmm, or is 404 just simpler in my case beacuse all my duplicated pages are pages that users will never go to?
-
Yes, you could allow those pages to 404 and in some instances that may be preferable to you. No, there is no negative effect on SEO from 404's. The only negative impact is really on your users. To minimize this, you might consider creating a nice, friendly, customer 404 page instead of using Google's defult 404 error page. Hope that helps!
Dana
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
SEM Rush & Duplicate content
Hi SEMRush is flagging these pages as having duplicate content, but we have rel = next etc implemented: https://www.key.co.uk/en/key/brand/bott https://www.key.co.uk/en/key/brand/bott?page=2 Or is it being flagged as they're just really similar pages?
Intermediate & Advanced SEO | | BeckyKey0 -
Tools to scan entire site for duplicate content?
HI guys, Just wondering if anyone knows of any tools to scan a site for duplicate content (with other sites on the web). Looking to quickly identify product pages containing duplicate content/duplicate product descriptions for E-commerce based websites. I know copy scape can which can check up to 10,000 pages in a single operation with Batch Search. But just wondering if there is anything else on the market i should consider looking at? Cheers, Chris
Intermediate & Advanced SEO | | jayoliverwright0 -
How do you reduce duplicate content for tags and categories in Wordpress?
Is it possible to avoid a duplicate content error without limiting a post to only one category or tag?
Intermediate & Advanced SEO | | Mivito0 -
WMT Index Status - Possible Duplicate Content
Hi everyone. A little background: I have a website that is 3 years old. For a period of 8 months I was in the top 5 for my main targeted keyword. I seemed to have survived the man eating panda but not so sure about the blood thirsty penguin. Anyway; my homepage, along with other important pages, have been wiped of the face of Google's planet. First I got rid of some links that may not have been helping and disavowed them. When this didn't work I decided to do a complete redesign of my site with better content, cleaner design, removed ads (only had 1) and incorporated social integration. This has had no effect at all. I filed a reconsideration request and was told that I have NOT had any manual spam penalties made against me, by the way I never received any warning messages in WMT. SO, what could be the problem? Maybe it's duplicate content? In WMT the Index Status indicates that there are 260 pages indexed. However; I have only 47 pages in my sitemap and when I do a site: search on Google it only retrieves 44 pages. So what are all these other pages? Before I uploaded the redesign I removed all the current pages from the index and cache using the remove URL tool in WMT. I should mention that I have a blog on Blogger that is linked to a subdomain on my hosting account i.e. http://blog.mydomain.co.uk. Are the blog posts counted as pages on my site or on Blogger's servers? Ahhhh this is too complicated lol Any help will be much appreciated! Many thanks, Mark.
Intermediate & Advanced SEO | | Nortski0 -
How can I remove duplicate content & titles from my site?
Without knowing I created multiple URLs to the same page destinations on my website. My ranking is poor and I need to fix this problem quickly. My web host doesn't understand the problem!!! How can I use canonical tags? Can somebody help, please.
Intermediate & Advanced SEO | | ZoeAlexander0 -
How To Handle Duplicate Content Regarding A Corp With Multiple Sites and Locations?
I have a client that has 800 locations. 50 of them are mine. The corporation has a standard website for their locations. The only thing different is their location info on each page. The majority of the content is the same for each website for each location. What can be done to minimize the impact/penalty of having "duplicate or near duplicate" content on their sites? Assuming corporate won't allow the pages to be altered.
Intermediate & Advanced SEO | | JChronicle0 -
Multiple cities/regions websites - duplicate content?
We're about to launch a second site for a different, neighbouring city in which we are going to setup a marketing campaign to target sales in that city (which will also have a separate office there as well). We are going to have it under the same company name, but different domain name and we're going to do our best to re-write the text content as much as possible. We want to avoid Google seeing this as a duplicate site in any way, but what about: the business name the toll free number (which we would like to have same on both sites) the graphics/image files (which we would like to have the same on both sites) site structure, coding styles, other "forensic" items anything I might not be thinking of... How are we best to proceed with this? What about cross-linking the sites?
Intermediate & Advanced SEO | | webdesignbarrie0 -
Duplicate Content on Blog
I have a blog I'm setting up. I would like to have a mini-about block set up on every page that gives very brief information about me and my blog, as well as a few links to the rest of the site and some social sharing options. I worry that this will get flagged as duplicate content because a significant amount of my pages will contain the same information at the top of the page, front and center. Is there anything I can do to address this? Is it as much of a concern as I am making it? Should I work on finding some javascript/ajax method for loading that content into the page dynamically only for normal browser pageviews? Any thoughts or help would be great.
Intermediate & Advanced SEO | | grayloon0