Remove Scraped Content?
-
I work for a site whose content does not come up as the top result when you search Google for a snippet of its text. I believe what happened is that they wrote blogs and articles and added them to their site and to article directories at the same time, and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update, which Google can see. I would not leave anything up to guesswork, though: make sure you have links, and I would even put the date it was posted onto the post on your site, like news articles do. It is just another indicator.
I would not remove the content if in fact, it did originate from you.
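As a rough illustration of the link check suggested above, here is a minimal Python sketch that looks through the HTML of a syndicated copy for links pointing back to your own domain. It uses only the standard library. The names `LinkCollector` and `links_back_to`, the sample markup, and the `example.com` domain are all hypothetical placeholders, and the sketch says nothing about how Google itself evaluates such links.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_back_to(html, own_domain):
    """Return the links in `html` whose host is `own_domain` or a subdomain of it."""
    collector = LinkCollector()
    collector.feed(html)
    matches = []
    for href in collector.hrefs:
        # Relative links have no hostname; treat them as not pointing at our domain.
        host = (urlparse(href).hostname or "").lower()
        if host == own_domain or host.endswith("." + own_domain):
            matches.append(href)
    return matches


# Hypothetical syndicated copy of an article (example.com is a placeholder).
sample = '<p>Originally published at <a href="https://www.example.com/blog/post">our blog</a>.</p>'
print(links_back_to(sample, "example.com"))
```

An empty result for a directory copy would suggest that copy carries no link back to the original, which is the case worth fixing first.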
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
"If we're not coming up first for our article, that means we are not believed to be the original author, correct?"
Search engines cannot identify original authors (unless you use the rel="author" attribute, and then they are merely taking your word for it). They only know which page with the content was discovered first. The content could have been on other pages first, or it could have been published first offline. Search engines don't have divine powers.
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. It has nothing to do with authorship.
"Should I remove all content from our site where this is happening, even though we actually did create these articles?"
I would not do that if the content is valuable for your visitors, has acquired links from other sites, or is pulling traffic from search.
The takeaway from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.