Duplicate Forum Content
-
Hi everyone,
Great to be here, and I'm absolutely loving everything. Please go easy on me, I'm quite a noob when it comes to SEO, but hopefully my question isn't too basic.
After running the initial checks on my websites, I found there are 7,646 duplicate pages. Some are easy fixes but the majority are not: being forum pages, the edit, quote and new post links are coming up as duplicates of the main post.
Does anyone know how to fix this?
Best, Lee
-
How delightful that you refer to Invisionpower, as I recently took the software for a test drive and plan to purchase it in January.
There is a question, though, that has been troubling me. I use a paid-for hosting service, which forwards to one of my subdomains. The problem is I have nearly 7k pages of duplicate content (all from the forum). I really want to add a nofollow, then remove the forum pages from the search index and so remove all of the duplicate content. I'm worried, though, that adding nofollow to the subdomain may affect the main site.
I have a lot of page-one search positions: many long tails which, combined, bring in a lot of visits, plus a couple of short phrases that alone get 50% of the visits. So I can't risk making any changes. Actually, I'm pretty terrified to make any changes, in case all of my website pages are removed from the index.
-
The Invisionpower forum is a good choice, as far as I know. Duplicate content pages turn up there too; it happens with most forums. But it can be resolved with a canonical tag, or by using robots.txt to stop crawling of the pages you don't want.
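As a rough illustration of the robots.txt route, here is a minimal Python sketch that checks which URLs a set of rules would block before you publish them. The /forum/edit/, /forum/quote/ and /forum/new/ paths and the example.com URLs are assumptions for illustration only, not the real URL layout of any forum package.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: these paths are assumptions, so swap in the URL
# patterns your own forum actually generates.
ROBOTS_TXT = """\
User-agent: *
Disallow: /forum/edit/
Disallow: /forum/quote/
Disallow: /forum/new/
"""


def is_crawlable(url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if a generic crawler may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch("*", url)


if __name__ == "__main__":
    for url in (
        "https://www.example.com/forum/topic/123",
        "https://www.example.com/forum/edit/123",
    ):
        print(url, "crawlable" if is_crawlable(url) else "blocked")
```

Python's standard robots.txt parser matches rules by path prefix, so this only helps if the duplicate pages share a common path prefix.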
-
Hey
Just found some valuable info which has solved a lot of my problems: I modified .htaccess, which has removed all of the non-www links, so that's part of it solved.
I think what I'm going to do is purchase the new forum software, set up a robots.txt to not crawl the new forums, and then, once I have all of the posts transferred to the new forum, remove the old forums from the index.
That should solve it. The new forum is very well established and, if need be, I can pay one of their developers to incorporate some SEO feature that will stop the duplicate content issue. Once that's solved I can remove the robots.txt.
Many thanks, Lee
-
Hi, one more thing, because you sound lost, and that's not the purpose of this Q&A. Contact the forum service and ask them for the canonical tag; this is a very simple and common thing to do.
-
Ok thanks Leonie,
Still no nearer to knowing what to do. I know there's a problem and generally what I need to do, but I'm still at a complete loss as to how to do it.
But thank you for trying to help,
Best, Lee
-
SEO stuff isn't that difficult, though it can be complicated.
Anyway I wish you luck with the forum and hope you'll manage to get it the way you wanted.
Grtz, Leonie
-
Mmmm, this is becoming more problematic by the minute; I will need to implement robots.txt before removing the URLs.
And there was me thinking this SEO stuff would be easy.
-
Lol, that's also an option.
It's under the URLs index, though the pages have to be removed first, otherwise Google will crawl them again. But there you can remove complete directories at once.
-
Or go somewhere else, lol. I have been considering buying the Invisionpower forum for a while; this might be the final push I needed.
Is there a way of mass deleting pages from the index in Webmaster Tools, i.e. all links that contain edit, quote and new, etc.?
-
Ah okay, that will be difficult then.
Maybe you can ask them to implement the canonical URL. It will help you avoid duplicates.
-
It's a paid forum service, but they give you the ability to add code snippets in the head.
-
Hi Lee,
What kind of CMS are you using?
In WMT (Webmaster Tools), under parameters, you can configure how URL parameters are handled. If the duplicate URLs contain the same parameter, that is a possibility.
-
Hi Leonie,
Thanks for taking the time to answer.
Not sure if that will work: there is only one head (hope that makes sense). Forums work pretty much the same as content management systems. I have the ability to change what's contained in the head, but adding rel="canonical" will have the same effect on all pages, even the duplicate ones (I think).
Is there a way of removing all pages from the index (in Webmaster Tools) that contain the word edit, or quote, or new? Kind of like using a wildcard?
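To get a feel for how many URLs such a wildcard would catch, here is a minimal Python sketch that sorts a list of crawled URLs into action pages and everything else. The keyword list comes from the question above; the example.com URLs are made up, and the function names are hypothetical.

```python
import re
from urllib.parse import urlsplit

# Keywords from the question; adjust to whatever your forum really uses.
ACTION_WORDS = {"edit", "quote", "new"}


def is_action_url(url: str) -> bool:
    """Return True if the path or query contains an action keyword as a whole token."""
    parts = urlsplit(url)
    # Split on anything that is not a letter or digit so that "news" does
    # not count as a match for "new".
    tokens = re.split(r"[^a-z0-9]+", f"{parts.path} {parts.query}".lower())
    return any(token in ACTION_WORDS for token in tokens)


def split_urls(urls):
    """Partition urls into (action_pages, other_pages), preserving order."""
    action_pages = [u for u in urls if is_action_url(u)]
    other_pages = [u for u in urls if not is_action_url(u)]
    return action_pages, other_pages


if __name__ == "__main__":
    crawled = [
        "http://www.example.com/forum/post?action=edit&id=5",
        "http://www.example.com/forum/new-post/12",
        "http://www.example.com/forum/topic/news-today",
    ]
    duplicates, keep = split_urls(crawled)
    print(len(duplicates), "action pages,", len(keep), "other pages")
```

Whole-token matching will still flag a legitimate topic whose URL contains one of these words on its own (a thread slug starting with "new-", say), so the output is worth checking by eye before acting on it.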
-
Hi Lee,
I'm not very familiar with forums, but I think you can solve it by putting a canonical in the head.
If you have a main page, which is the post, put a canonical link element (rel="canonical") in its head, pointing to that post's own URL.
The duplicate pages (edit, quote, new post) need to have the same canonical URL as the main post.
I hope this works for you.
Grtz, Leonie
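A minimal sketch of that idea in Python, assuming the duplicates differ from the main post only by query parameters. The parameter names in ACTION_PARAMS and the example.com URLs are hypothetical, so check what your forum actually appends before relying on this.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical parameter names that only select an action on the same post.
ACTION_PARAMS = {"action", "do", "edit", "quote"}


def canonical_url(url: str) -> str:
    """Strip action parameters and the fragment so duplicates map to one URL."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in ACTION_PARAMS
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build the link element to place in the head of the page at url."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}" />'


if __name__ == "__main__":
    print(canonical_link_tag("http://www.example.com/forum/topic.php?id=42&action=quote#post7"))
```

Because the tag is computed from each page's own URL, the same head snippet can produce the main post's address on the post itself and on its edit and quote variants.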