Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
The blog section of my website just got deleted, Would it get my website penalized if I posted the same content again?
The blog section of my website just got deleted, Would it get my website penalized if I posted the same content again?
Content Development | | DustChasersToronto0 -
Could this be an issue with duplicate content?
Hi everyone, I am working with a business consultant in the HVAC industry and doing SEO for 8 of his clients (all HVAC businesses from around the US and Canada). Each website is essentially a mirror of the business consultant's website with really the same information-- it applies perfectly well to each individual website, but it IS nearly, if not, identical. I'm getting ready to implement a blog on the original HVAC page and have been considering using the same content (customized to reflect each business-- but still the same information) for blogs for my other 8 clients. My questions are: 1. Is the mirroring of the website a duplicate content problem? Example if you're interested: http://www.mcair.com (original) and http://www.jpsheating.ca/ (client). 2. Is using the same blog across 8 different website (customized for each client but the same basic information) a duplicate content issue? For example-- a blog about getting your air ducts cleaned... the information is going to be the same (and relevant) with each business and each business could benefit from sharing that information with their customers. Thanks so much for your help and explanation
Content Development | | KaitlinNS0 -
Can I post my MailChimp articles on my blog without getting hit for duplicate content?
I would like to post my newsletters on my blog, but am afraid of duplicate content since you can click a link on the MailChimp email blast to view the Newsletter online. Is this considered dup content?
Content Development | | RoxBrock0 -
Are Duplicate Bio's Duplicate Content?
I'm wondering if I need to go through all the various bio's our firm has on all the various legal directory and client review sites to make sure they have unique bio's? I really, really don't want to do that, but if they are going to flagged as duplicate content, I will. I'm hoping sites like "findlaw," "avvo," etc, have some built in rel:cannonical or something that says these bio's aren't to be seen as unique content and therefore conflict with what's on our site. Anybody know? Just to clarify, any of the sties that have asked for unique content/bio/about us, I have complied with that. However, a lot don't specifically state it has to be unique, so I've just copy and pasted from our site in those cases. Thanks, Ruben
Content Development | | KempRugeLawGroup0 -
What makes high quality content?
Content is becoming more and more important in rankings and I was wondering what exactly Google defines a good content. Any ideas?
Content Development | | EJDekkers0 -
Can you add RSS feed content?
Buongiorno from the digital epicentre of forward digital thinking that is Wetherby UK 😉 Ok i have afeeling this is a big NO No but i just need to banish all doubt so here goes.. Am i right in saying you cannot subscrive to an RSS feed with the objective of pubkishing the linked contnet on another site.
Content Development | | Nightwing
Put another way if i built a news web page and subscribed to a BBC news RSS feed can i make that content appear in a site i administer? Grazie tanto,
David0 -
Ideas for content
Where do you get your ideas to create content for a blog? I have been using the new keyword planner and am not coming up with any ideas. We are an e-commerce site that deals with commercial equipment. It is hard for me to show anything interesting with products because the company does not own it.
Content Development | | EcommerceSite0 -
Quality content distribution service?
I want to distribute my articles without having to go multiple sites to get the job done. What is the best quality and most reputable company to help me distribute my content?
Content Development | | photoseo10