Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Knowledge base for seo, announcing new articles on blog (dupe content)
Hi all, Im thinking of creating a knowledge base with all many asked questions in my company. This could be a great Link-bait source but also nice ranking opportunities i think. But sometimes some new articles are so actual that i also want to blog them.
Content Development | | mdkay
Can i for example double post them (or post a big excerpt) on the blog and canonicalise it to the KB article?
Will links to the blog have equal value to KB links? And will this work?0 -
Content Curation & Duplicate Content
Hi, I have a client that wants to do content curation but it has been my understanding that adding external content that is already live on another website to your website, you get penalized for duplicate content. I have read that you can create an excerpt and then Google won't penalizes you for duplicate content. Can anyone shed more light on this topic. Thanks
Content Development | | M_80 -
How much duplicate content counts as duplicate content to Google?
Hi everyone! I've had a look through some duplicate content posts and I can't see the answer to this query, so I thought I'd ask in case someone could help. I've been looking at a website that competes with the site that I work on. They have profile pages containing content that has been copied and pasted straight from the suppliers' websites. Their pages have all their own code framing the content, which is diluting the concentration of duplicate copy. How much duplicate content can a page have before it gets penalised or ignored by Google? Any suggestions very gratefully received 🐵
Content Development | | ceecee0 -
How Google judge about duplicate content?
With recent Search engines updates one thing is clear we cannot ignore content. Content marketing definitely going to be most important part of our SEO strategy. I have few doubts about content marketing (circulation of content over web) where I want suggestions of community members. There would be different thoughts so I would like to have as many as responses to know what majority thinks: When we are writing guest posts, does article needs to be unique with each and every blog we are writing or we can safely circulate one good piece of content to 10-15 blogs who are interested in our creative. We have written a good blog post for our own domain. Apart from social sharing should it be posted to other related blogs too or it should be unique to our domain only. Social sharing, mentions, like of blog matters in rankings?Seems yes they do but need to know what majority thinks. Finally what is the safe number to circulate your content over web.
Content Development | | EG0CENTRIX0 -
What are the best content writer sites?
Hi, I'm doing some work on a new blog and wondered if anyone could recommend some low cost content writers? I have only justed started researching this service, so any advice the SEOmoz community could give would be grately appreciated. Thanks in advance.
Content Development | | RBH0 -
Duplicate content on the homepage
Hello SEOMOZ Is giving me an error on duplicated content on my site. When viewing the details it is showing the following as duplicated content domain.co.uk/ domain.co.uk domain.co.uk/index.html Obviously these are the same pages. Why is it seeing them as seperate. Does anyone know how I can resolve this issue? Many thanks
Content Development | | lcdesign0 -
Onsite Content - Word Count & KW Density
Does the word count of a webpage make a difference to search engines? Are longer word counts on pages indexed higher or given higher priority? For example,say you have 300 words of copy packed with 20 keywords, and say you also have 700 words of copy that have the same 20 keywords worked in, does Google have a preference over which one it ranks higher?
Content Development | | greentent0 -
Duplicate content for manually setup blog and wordpress blog
We have a website where the ecommerce will not allow us to host blog. So we created our own manual blog page setup. Will this flag duplicate content on Google? http://www.homesupershops.com/blog and http://www.homesupershops.com/blog-july have same content. How come on a word press the same content on http://www.vizionseo.com/blog/ and http://www.vizionseo.com/blog/2011/05/how-can-your-business-rank-high-on-google-maps/ does not flag duplicate content?
Content Development | | VizionSEO990