Remove Scraped Content?
-
I work for a site whose content is not the top result when you search Google for a snippet of its own text. I believe what happened is that they wrote blogs and articles, added them to their site and to article directories at the same time, and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, where possible, back to your site. If not, then the page should carry its creation and last-update dates, which Google can see. I would not leave anything to guesswork, though: make sure you have links, and I would even put the posting date on the post on your site, like news articles do. It's just another indicator.
I would not remove the content if in fact, it did originate from you.
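If you want to audit whether the directory copies actually link back to you, here is a minimal sketch using only the Python standard library. Everything specific in it is an assumption for illustration: the `links_back_to` helper, the `example.com` domain, and the sample HTML are hypothetical, and you would paste in the real HTML of each syndicated copy yourself.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_back_to(html, domain):
    """Return the hrefs in `html` whose host is `domain` or a subdomain of it."""
    collector = LinkCollector()
    collector.feed(html)
    matches = []
    for href in collector.hrefs:
        host = (urlparse(href).hostname or "").lower()
        if host == domain or host.endswith("." + domain):
            matches.append(href)
    return matches


# Hypothetical syndicated copy of an article, as it might appear in a directory.
syndicated_copy = """
<article>
  <p>Some article text.</p>
  <p>Originally published at
     <a href="https://www.example.com/blog/my-article">example.com</a>.</p>
  <a href="https://directory.example.net/other">More articles</a>
</article>
"""

found = links_back_to(syndicated_copy, "example.com")
print(found)
```

An empty list for a given copy would mean that copy carries no link back to your domain, which is the case worth fixing first. This only checks for the presence of a link; it says nothing about how Google treats the page.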
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
"If we're not coming up first for our article, that means we are not believed to be the original author, correct?"
Search engines cannot identify original authors (unless you use the rel="author" attribute, and then they are merely taking your word for it). They only know which page with the content was discovered first. The content could have been on other pages first, or it could have been published first offline. Search engines don't have divine powers.
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. It has nothing to do with authorship.
"Should I remove all content from our site where this is happening, even though we actually did create these articles?"
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.