Duplicate Forum Content
-
HI everyone,
great to be here, absolutely loving everything, please go easy on me I'm quite a noob when it comes to seo, but hopefully my question isn't too basic.
After running the initial checks on my websites, I found there are 7,646 duplicate pages? Some are easy fixes but the majority are not, being forum pages, the edit, quote and new post links are coming up as duplicates of the main post?
Does anyone know how to fix this?
Best Lee
-
How delightful that you refer to Invisionpower as I recently took the software for a test drive and plan to purchase it in January
A question though that has been troubling me. I use a paid for hosting service, which forwards to one of my sub-domains. The problem is I have nearly 7k pages of duplicate content (all from the forum), I really want to add a no follow and then remove the forum pages from the search index and thus remove all of the duplicate content. Though I'm worried that adding no follow to the sub domain may impact on the main site.
I have a lot of page one search positions, many long tails which combined result in a lot of visits, but I also have a couple of short phrases that get 50% of the visits alone, so I can't risk making any changes, actually I'm pretty terrified to make any changes, just in case all of my website pages are removed from the index.
-
invisionpower forum is good choice, as i knew. Found duplicated content pages here, it happens with most of forums, but there is full provision that this can resolved by canonical or stop crawling one you don't want in robots.txt.
-
Hey
just found some valuable info which has solved a lot of my problems, modified .htaccess which has removed all of the non www links, so thats a part solve.
I think what I'm going to do is purchase the new forum software, setup a robot.txt to not crawl the new forums and then once I have all of the posts transfered to the new forum, remove the old forums from the index.
That should solve it, the new forum is very well established and if need be I can pay one of their developers to incorporate some seo feature that will stop the duplicate content issue. Once thats solved I can remove the robot.txt
Many thanks, Lee
-
Hi, still one more thing, because you sound loss, and that's not the meaning of this Q&A. Contact the forumservice and ask for the canonical, this is a very simple and common thing to do.
-
Ok thanks Leonie,
still no nearer to knowing what to do, I know theres a problem and generally what I need to do, but I'm still at a complete loss as to how to do it
But thank you for trying to help,
Best, Lee
-
Seo stuff is'nt that difficult, though it can be comlicated
Anyway I wish you luck with the forum and hope you'll manage to get it the way you wanted.
Grtz, Leonie
-
Mmmm this is becoming more problematic by the minute, will need to implement robot.txt before removing the urls.
And there was me thinking this seo stuff would be easy.
-
Lol, tha'ts also an option
under url's index, though they have to be removed first, otherwise google crawl them again. but there you can remove complete directories at once.
-
or go somewhere else lol, have been considering buying invisionpower forum for a while, this might be the final push I needed.
Is there a way of mass deleting pages from the index in webmaster tools, i.e. all links that contain edit, quote and new ect?
-
Ah okay, that will be difficult than.
Maybe you can ask them to implement the canonical url. It will be helpful for you to avoid duplicates
-
is a paid forum service, but they give you the ability to add code snippets in the head.
-
Hi Lee,
What kind of CMS are you using?
In wmt under parameters you can configure parameters. If the ur's contain the same parameter it is a posibility.
-
Hi Leonie,
thanks for taking the time to answer
Not sure if that will work, there is only one head (hope that makes sense). Forums work pretty much the same as content management systems. I have the ability to change what's contained in the head, but adding rel="canonical" will have the same effect on all pages, even the duplicate one (I think).
Is there a way of removing all pages from the index (in webmaster tools) that contain the word edit, or quote or new? Kind of like using a wild card?
-
Hi Lee,
I'm not very familiar with forums. but i think you can solve it by putting a canonical in the head.
If you have a main page, which is the post, put a canonical url in it:
the duplicate pages need to have the same canonical.
I hope this work for you.
Grtz, Leonie
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
The etiquette of reproducing someone else's content
Hello - Here is a scenario, representative of something that I just saw play out. Site A is a new blog about travel (as an example topic) Site B is an older, established blog about travel Site C is a new blog launched and owned by Site B that focuses on a particular travel niche (luxury travel, for example) Here is what happens next Site A writes an original piece of content Site C then republishes Site A's content, paraphrasing all of the text, but giving Site A credit with a link Site B (the established site) publishes a blurb about the article, directing readers with a link to "read more" on Site C. It credits Site A as the original author, but does not link to it. If you were able to follow that, here is what I would like to know. Did Site C do anything wrong by republishing a paraphrased version of Site A's content, even though it gave credit with a link? Did Site B do anything wrong by linking to Site C (which is for all intents and purposes the same website), but not linking to Site A (the original source)? My sense is that the established blog (Site B) is trying to get it's new publication (Site C) to outrank the original author (Site A) using its own content. In general though, I am curious to get some thoughts on this situation because it raises a few ethical questions that I am not sure about, namely: Is there anything wrong with publishing "spun" content, if it is done well and links back to the source? Is there anything wrong with linking to a republished version of an article on a sister website, rather than linking to the original article. Thanks
Content Development | | timsegraves1 -
What Content to Write - Hot Topic or More Niche Related?
Hi, Just for example, say you've got a shoe store but shoes are a non-searched-for topic on the informational side. Say fashion models or teenybopper shoppers are both hot topics. Would you recommend writing an article - one of the site's five >2000 word articles on a hot area of the hot topics? Or would you just stick to shoes topics? If you do write on the hotter topics, how does the shoe store owner write on these - they're out of his area of expertise? Does he need a content writer?
Content Development | | BobGW0 -
Any freelance writers with viral content / linkbait experience?
Looking for a great freelance writer to assist in creating linkbait and viral content pieces. Please contact me if you are, or know of, such a person. 🙂
Content Development | | AdamThompson0 -
Stolen Content and a Panda Penalty
Hey Folks Question for those folks that have spent some time helping people with the recent penalties and the like. I have a client who has a clear Panda Penalty, huge drop in traffic on the initial Panda date and a further drop on the second date. Much smaller incremental drops on subsequent recent updates as well. From digging in it seems fairly cut and dry - copyscape shows another 250 or so sites with content from this site and there are nearly 2000 external URLs with duplicate content across these sites. We are talking complete, shameless copies of all of the text, sometimes the images as well. The client claims the content is all 100% unique and is his content and that the other blogs must have stolen his content resulting in the penalty - which, if it is true, and I have no reason to suspect otherwise, kind of sucks. Now, many moons ago, way before Penguin or Panda (maybe around 2006) I had a client that had suddenly lost all traffic and their historical rankings. No funny business, it was a small company, had been online since around 2000 and they were pretty much the first of their kind and always did very well from organic search. As it turned out, the content from the site had not really changed since it was set up and as lots of companies had sprung up offering a similar service they had seen their content copied wholesale, across many sites, all over the world. We attempted to contact many of these sites and got some results but many were just old, abandoned copy cat sites on advert supported hosting that had ceased to trade so we maybe got rid of about 20%. Well, in the end we just decided to rewrite the content, we did this and sure enough, the site bounced back to it's previous standing and has been pretty much there ever since. Now that was kind of easy, the site had maybe 20 pages, and it needed a sprucing up but in this case the site has around 500 pages so doing a rewrite is not going to be so easy. Problem is, I don't see removal requests being particularly successful either. So, I see the options and steps as being. Contact all the sites and request the removal of the content use the Google content removal facility:
Content Development | | Marcus_Miller
https://www.google.com/webmasters/tools/removals File a DMCA takedown for anything remaining Report Scraped Pages to Google:
https://docs.google.com/spreadsheet/viewform?formkey=dGM4TXhIOFd3c1hZR2NHUDN1NmllU0E6MQ&ndplr=1 Submit a spam report for all sites involved ? Submit a reconsideration request to let Google know what we have been doing (unlikely In a nutshell, do everything we can to get this content removed and then documenting this to Google in the hope we catch hold of someone who hears our plight. Interestingly enough, this is a sensitive one, so no URL but I would welcome any thoughts or experiences any of you may have had with similar problems. There is a little extra info here from Matt Cutts + Barry Schwartz that kind of tallies with my approach above but would really like to hear any feedback. http://www.seroundtable.com/google-stolen-content-13243.html Cheers all Marcus0 -
How does one write different pages of their website that are very similar in nature with using too much duplicate content?
We are a service provider and we have different links on our website to each of our services. The problem is the content that we would have for each is very similar. How can I ensure that it is not deemed duplicate content and ranked poorly because of it. Thanks
Content Development | | JayTurner0 -
Content Estimates
Are there any SEO Content writers that can create several pages for me on a post- graduate level for a target audience of physicians? Writing should be informative regarding our company and also optimized for keywords. I only have one quote so far from a company outside of SEOMoz and I was hoping to get someone on this site that had some experience in this area. Thank you, Utah Tiger
Content Development | | Boodreaux0 -
How quickly should one add content?
I'm building a content site (the model is AdSense revenue) around a certain niche, and I'm currently paying for about 6 articles to be contributed per week. I have the capacity to be paying for a lot more articles, however, so I'm wondering what, if any, factors exist to recommend building the site up slowly as opposed to throwing on e.g. 100 articles over the next week? Those I can think of are: 1. Going slowly leaves room for better keyword optimization etc. 2. Google seems to favor aged domains/content, so 100 good articles now certainly isn't as advantageous as 100 articles 2 years from now. All that being said, I still feel like the benefit in terms of traffic of adding more content now - since I can - might outweigh these considerations. Does anyone have any thoughts?
Content Development | | ZakGottlieb710 -
Does anyone know if a large forum can impact your seo rating on specific terms?
I have a large forum and I'm trying to figure out how to leverage it for SEO. Hopefully it'll help. Any advice? Does google ignore forums?
Content Development | | RamseySolutions0