Duplicate Forum Content
-
HI everyone,
great to be here, absolutely loving everything, please go easy on me I'm quite a noob when it comes to seo, but hopefully my question isn't too basic.
After running the initial checks on my websites, I found there are 7,646 duplicate pages? Some are easy fixes but the majority are not, being forum pages, the edit, quote and new post links are coming up as duplicates of the main post?
Does anyone know how to fix this?
Best Lee
-
How delightful that you refer to Invisionpower as I recently took the software for a test drive and plan to purchase it in January
A question though that has been troubling me. I use a paid for hosting service, which forwards to one of my sub-domains. The problem is I have nearly 7k pages of duplicate content (all from the forum), I really want to add a no follow and then remove the forum pages from the search index and thus remove all of the duplicate content. Though I'm worried that adding no follow to the sub domain may impact on the main site.
I have a lot of page one search positions, many long tails which combined result in a lot of visits, but I also have a couple of short phrases that get 50% of the visits alone, so I can't risk making any changes, actually I'm pretty terrified to make any changes, just in case all of my website pages are removed from the index.
-
invisionpower forum is good choice, as i knew. Found duplicated content pages here, it happens with most of forums, but there is full provision that this can resolved by canonical or stop crawling one you don't want in robots.txt.
-
Hey
just found some valuable info which has solved a lot of my problems, modified .htaccess which has removed all of the non www links, so thats a part solve.
I think what I'm going to do is purchase the new forum software, setup a robot.txt to not crawl the new forums and then once I have all of the posts transfered to the new forum, remove the old forums from the index.
That should solve it, the new forum is very well established and if need be I can pay one of their developers to incorporate some seo feature that will stop the duplicate content issue. Once thats solved I can remove the robot.txt
Many thanks, Lee
-
Hi, still one more thing, because you sound loss, and that's not the meaning of this Q&A. Contact the forumservice and ask for the canonical, this is a very simple and common thing to do.
-
Ok thanks Leonie,
still no nearer to knowing what to do, I know theres a problem and generally what I need to do, but I'm still at a complete loss as to how to do it
But thank you for trying to help,
Best, Lee
-
Seo stuff is'nt that difficult, though it can be comlicated
Anyway I wish you luck with the forum and hope you'll manage to get it the way you wanted.
Grtz, Leonie
-
Mmmm this is becoming more problematic by the minute, will need to implement robot.txt before removing the urls.
And there was me thinking this seo stuff would be easy.
-
Lol, tha'ts also an option
under url's index, though they have to be removed first, otherwise google crawl them again. but there you can remove complete directories at once.
-
or go somewhere else lol, have been considering buying invisionpower forum for a while, this might be the final push I needed.
Is there a way of mass deleting pages from the index in webmaster tools, i.e. all links that contain edit, quote and new ect?
-
Ah okay, that will be difficult than.
Maybe you can ask them to implement the canonical url. It will be helpful for you to avoid duplicates
-
is a paid forum service, but they give you the ability to add code snippets in the head.
-
Hi Lee,
What kind of CMS are you using?
In wmt under parameters you can configure parameters. If the ur's contain the same parameter it is a posibility.
-
Hi Leonie,
thanks for taking the time to answer
Not sure if that will work, there is only one head (hope that makes sense). Forums work pretty much the same as content management systems. I have the ability to change what's contained in the head, but adding rel="canonical" will have the same effect on all pages, even the duplicate one (I think).
Is there a way of removing all pages from the index (in webmaster tools) that contain the word edit, or quote or new? Kind of like using a wild card?
-
Hi Lee,
I'm not very familiar with forums. but i think you can solve it by putting a canonical in the head.
If you have a main page, which is the post, put a canonical url in it:
the duplicate pages need to have the same canonical.
I hope this work for you.
Grtz, Leonie
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is there harm in publishing too much content, at once?
We are working on catching up with some competition, who has done a much better job over the last few years with content creation. We normally publish 1 article a day but are looking at scaling that up to 3 to 4 articles a day so we can get our content out there to compete with the other websites, in keyword space we currently aren't in. Is there a negative impact on publishing too much content at once? Is there a negative impact on publishing too much content at once? Besides the probable inability to market all of it, I don't see any issues since we will be getting our articles indexed and out there for Google to start building value on them. However, some team members are apprehensive about doing too much at once. What are your thoughts Moz Community?
Content Development | | GoAbroadKP0 -
Use old content from Archive
Hello all, I just were looking on Archive.org and though what if i use the content from a really old website thats already been un-indexed. Could i just use this content on my new website and will it not be duplicate content? Thank you for your comment! Kind regards,
Content Development | | MennoO0 -
Content Syndication Service
Am curious to get the forum's opinion on content syndication services like SYNND . Has anyone tried them or any other content syndication networks. Does the forum have any recommendations for good quality syndication networks that can be used to distribute content to quality sources.
Content Development | | SEO5Team0 -
How much content is needed
I have two clients whose websites have landing pages that feature a number of product links. In order to meet SEO/Google best practices, do I need to have additional content on these specific pages or will the links suffice? (Getpaper is an ecommerce; inpak is not) Any thoughts would be appreciated. http://www.getpaper.com/find-paper/inkjet-plotter-paper/color-bond-21-lb http://www.inpaksystems.com/bag-closing/bag-sewing
Content Development | | TopFloor0 -
Fresh content ideas for a static site?
I have an ecommerce site. My home page is set-up just as I want it. I'm not looking to redo it or change my site to a blog. Just looking for some new, different, SEO friendly ideas or concepts to keep it "fresh".
Content Development | | VictorVC0 -
How can i solve duplicate problem with different url needed?
My client is a big international firm with 10 websites with different url (.co.uk, .com, .com.au, .pl... etc). All websites are exactly the same except the price. I suggested them to only use .com and use region as a sub domain like au.xxx.com instead of xxx.com.au. However they cannot do that for some reason. I am trying to solve the duplicate issue. I dont think i can use 301 redirect or canonial link because all regions are making even traffics. Any suggestions?
Content Development | | ringochan0 -
Forum Site: Content Value Post Panda
I run a forum website built on Wordpress. We're about two years old. The theme of the site is a directory of attorneys, with each directory listing having its own blog account on our site. Through this platform, we receive 75-80 blog posts now every month of varying quality from our users. QUESTION: A good number of the blogs published on our site are also published on the attorney's law firm site as well (they're syndicating on our site). Will this hurt our site in light of Panda? A lot of the syndicated content is very well written and insightful. By contrast, Will non-syndicated but average to below average posts hurt our site? The authors almost always link back to their firm site. Would love some feedback on whether we should be happy about the syndicated content or whether we should potentially ban it?
Content Development | | JSOC0 -
Displaying archive content articles in a writers bio page
My site has writers, and each has their own profile page (accessible when you click their name inside an article). We set up the code in a way that the bios, in addition to the actual writer photo/bio, would dynamically generate links to each article he/she produces. Figured that someone reading something by Bob Smith, might want to read other stuff by him. Which was fine, initially. Fast forward, and some of these writers have 3,4, even 15 pages of archives, as the archive system paginates every 10 articles (so www.example.com/bob-smith/archive-page3, etc) My thinking is that this is a bad thing. The articles are likely already found elsewhere in the site (under the content landing page it was written for, for example) and I visualize spiders getting sucked into these archive black holes, never to return. I also assume that it is just more internal mass linking (yech) and probably doesnt help the overall TOS/bounce/exit, etc. Thoughts?
Content Development | | EricPacifico0