Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
I want to use some content that I sent out in a newsletter and post as a blog, but will this count as duplicate content?
I want to use some content that I sent out in a newsletter a while ago - adding it as a blog to my website. The newsletter exists on a http://myemail.constantcontact.com URL and is being indexed by Google. Will this count as duplicate content?
Content Development | | Wagada0 -
Removing old content
Ahoy! Variously I have heard the opinion that content which does not generate regular search traffic (let's ballpark it at >10 views in any given month) should be noindexed or even removed. Allegedly this would improve the overall quality of the site, rankings and traffic. I remain doubtful. What would you do if the interest in a given matter goes down over time for any (most) given topics of your content and is replaced by "newer" specific interest? Concrete example: I have a website about (book) reviews. Naturally, there will always be new books; old books are not in the media as much and "forgotten". Nevertheless, the reviews (all unique, based on really having read the books, no trace of the standard back cover copy) are obviously still there. Personally I feel that they do not really lose any value - they are still reviews of that one book, even though it is not the most recent one. So, what would you do: Deindex "older" book reviews after a certain time? Even remove them completely? Just let them run? I am looking forward to your opinions - and even your experience if you have done something like this! Nico
Content Development | | netzkern_AG0 -
Duplicate Content From Huffington Post Blog
A client who writes blog posts for Huffington Post also wants an identical version of the blog posted to his personal site. Do you think there could be a problem of being punished for duplicate content? Would a better SEO practice be to have the client do an on-site blog just linking to the Huffington Post blog and providing information about it?
Content Development | | EmarketedTeam0 -
Does the duplicate content on the crawl errors report test content on external websites?
Hello, Can you tell me if this is just duplicate content within my site or if it also recognises duplicate content on external sites as well? Thanks
Content Development | | stuarta600 -
Onsite Content - Word Count & KW Density
Does the word count of a webpage make a difference to search engines? Are longer word counts on pages indexed higher or given higher priority? For example,say you have 300 words of copy packed with 20 keywords, and say you also have 700 words of copy that have the same 20 keywords worked in, does Google have a preference over which one it ranks higher?
Content Development | | greentent0 -
Duplicate content for manually setup blog and wordpress blog
We have a website where the ecommerce will not allow us to host blog. So we created our own manual blog page setup. Will this flag duplicate content on Google? http://www.homesupershops.com/blog and http://www.homesupershops.com/blog-july have same content. How come on a word press the same content on http://www.vizionseo.com/blog/ and http://www.vizionseo.com/blog/2011/05/how-can-your-business-rank-high-on-google-maps/ does not flag duplicate content?
Content Development | | VizionSEO990 -
Getting Duplicated Content Removed
So I recently took over an in-house SEO role and began some house cleaning. I found a few places that had copied or duplicated our homepage content. Naturally I reached out to them to ask them to remove or change the content. Today I come to the office and one of the sites I had requested to remove the content, had fired back at me that I was being rude, threatening and I should go f**k myself. As if this wasn't enough this guys was an upper level manager (managing director) and knows the upper management where I work. Now I have people in my company pissed at me, when I thought I was doing the right thing. Am I in the wrong or what? I had simply asked to remove or change the content and that failure to do so could result in legal action. I understand that it could have been misconstrued as a threat (which it wasn't intended to be) but its seems like a pretty immature response from a higher level person. Any thoughts or advice on what to do next?
Content Development | | Gerad0 -
Can un-unique content damage my rankings?
Hi there, I run a blog @ http://ablemagazine.co.uk We produce our own editorial content for our print magazine. Which means I have a great bank of uniquely written content. I can usually afford to post 1-2 completely 100% unique articles a day. I've also been copy/pasting 2-3 articles from the BBC or The Guardian a day to keep up activity. Should I continue doing what I'm doing? Should I post exclusively unique articles? Thanks
Content Development | | craven220