Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to deal with lot of old content that doesn't drive traffic - delete?
Hi community, i hope someone can help me with this, We are migrating our e-commerce site next februari. I'm preparing the content migration. For a large part exact copies of our product listing and product detail pages will be migrated.
Content Development | | Marketing-Omoda
However, we also have a lot of old blog content, which is, because of seasonality and trendiness, outdated and doesn't drive traffic anymore. It actually is just worthless content. (Not only as a traffic driver, this also counts for extremely low to none internal driven traffic (both internal search and internal navigation). We have about 4.000+ blogs of which about 100 drive the most traffic (mostly incited by e-mail and social campaigns and internal navigation promoted on important category landing pages during some period. Is it a bad signal to search engines to delete these old content pages? I.a.: going from a content-rich to a content-poor site?
Off course I will migrate the top 100 traffic earning content and provide proper redirects to them0 -
Developing supporting content for main ideas
I attended Mozcon this year and the session by Joe Hall "Rethinking Information Architecture for SEO and Content Marketing" has me considering some changes to our site structure and architecture. I'm currently putting together a landing page for our webinars but could things like webinars and case studies be considered supporting content to our main ideas? For example instead of my architecture being: home > webinars > webinar about an idea It could be: home > main idea 1 > webinar about an idea So my webinar landing page would link out to all of the different webinar pages on the site instead of being contained in this bucket. Just wanted to get some thoughts on this.
Content Development | | Brando160 -
Content Spinner Tool??? Worth? Recommendations?
Hi all !! We have different websites in the english language (UK, IE, US, etc...), but we focus our content strategy (landing pages) in just one of them. We were thinking about using a Content spin tool to use this content in the other websites and give them an extra push in terms of SEO (content is the king :)). Did you have any idea if they work fine? Did you have any experience with them? Can you recommend any? Are they tools really worth or they are not working fine and after the spin a full review (and probably some re-write) is necessary? Of course I´m talking about the paid ones.... Not even thinking about the free tools 😉 Thanks for your help in advance !!!
Content Development | | AutoEurope0 -
How to Submit a Sponsored Content Submission?
Hi, I have some content related to our services and I want to know how to submit or post this content to some online publishers like yahoo, business insiders, etc.. I would appreciate your suggestion on this. Thank you.
Content Development | | Lry880 -
Content Writing Service Recommendations
I am looking to hire a content writer for our sites. Anyone familiar with a service where the manage the content on your site? Basically, come up with topics & content ideas, then writing the content. Please give me an idea of the pricing if possible. Greatly appreciate any help.
Content Development | | inhouseseo0 -
Is Publishing Content from a Book to your Site Considered Duplicate Content?
It is a book we don't own, either. Would you need to somehow find the original and rel=canonical it? Or is this just all around bad to do? Thanks.
Content Development | | ThridHour0 -
Content: Best Blogs Article
Hello, For an Ecommerce site, I think a good way to get known is to write a "Best X Blogs" article, where X is a topic in your industry, and then letting the people you link to know about the article. I got the idea from a Mozinar. My question is, how close does the X from above have to be in your niche? For example, if your product is running shoes can you write a "best athlete blogs" article? I'm worried about them reading the article, then leaving. In some smaller niches the topics closest to the product don't have much in the way of blogs out there. So how close to your niche does the Best X Blogs topic have to be?
Content Development | | BobGW0 -
On page content and PDF - Dup?
Hi We are writing a useful article which we want to put on our site, but we also want to add it as a pdf which people can download - will this be classed as dup copy?
Content Development | | jj34340