Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Community Discussion: Can 10x content be short-form content?
In his (intentionally very short) post on Tuesday, Rand makes the case that long-form content isn't necessarily great content: "Rather than applying a tactic like long-form content universally or setting length as the bar (or even a metric) for greatness, we instead match our content to our audience's needs and our business/personal goals. 700 more words will not help you reach your goals any more than 7 more words. Create content that helps people. Do it efficiently. Never write an ultimate guide where a single image could more powerfully convey the same value. Trust me; your audience and your bottom line will thank you." I think this is something we all struggle with as online marketers, in one way or another. As someone who casually consumes online content on a regular basis, this also resonates with me on a personal level. I'm curious, what are your hesitations with focusing on shorter-form content that packs a wallop, and what excites you about it? Can you think of any examples of content you've come across that you consider 10x short-form content?
Content Development | | Christy-Correll7 -
Any freelance writers with viral content / linkbait experience?
Looking for a great freelance writer to assist in creating linkbait and viral content pieces. Please contact me if you are, or know of, such a person. 🙂
Content Development | | AdamThompson0 -
Duplicate Content From Huffington Post Blog
A client who writes blog posts for Huffington Post also wants an identical version of the blog posted to his personal site. Do you think there could be a problem of being punished for duplicate content? Would a better SEO practice be to have the client do an on-site blog just linking to the Huffington Post blog and providing information about it?
Content Development | | EmarketedTeam0 -
How much content is needed
I have two clients whose websites have landing pages that feature a number of product links. In order to meet SEO/Google best practices, do I need to have additional content on these specific pages or will the links suffice? (Getpaper is an ecommerce; inpak is not) Any thoughts would be appreciated. http://www.getpaper.com/find-paper/inkjet-plotter-paper/color-bond-21-lb http://www.inpaksystems.com/bag-closing/bag-sewing
Content Development | | TopFloor0 -
RSS feeds with dup content and titles
Hi, For my Buddypress site I use a tool to create sites with RSS feeds. Each site is for a different feed, but the number of dup tiles and content is running in the thousands. I've been trying to reduce the dups, but have begun to think there is more trouble from such content than benefit. Should I dump the content or ignore the errors flagged by SEOMOZ? Any ideas if thes RSS feed dups are hurting my BuddyPress site? Any suggestions in general about how to eliminate such dupe for a Buddypress Site, eg. the activity log. Larry
Content Development | | tishimself0 -
I have created 2 blogs for a client as they have 2 domains (1 for their core business, and 1 for a product). I want to use the same content on both blogs. What is the best way to set this up so there are no ranking or duplicate content issues?
We are pushing SEO for only one of the domains, therefore I would like one to be dominant. We will be sending the blog post via email to their database, therefore each blog needs to have the same content. Thank you!
Content Development | | MarketingResults0 -
Displaying archive content articles in a writers bio page
My site has writers, and each has their own profile page (accessible when you click their name inside an article). We set up the code in a way that the bios, in addition to the actual writer photo/bio, would dynamically generate links to each article he/she produces. Figured that someone reading something by Bob Smith, might want to read other stuff by him. Which was fine, initially. Fast forward, and some of these writers have 3,4, even 15 pages of archives, as the archive system paginates every 10 articles (so www.example.com/bob-smith/archive-page3, etc) My thinking is that this is a bad thing. The articles are likely already found elsewhere in the site (under the content landing page it was written for, for example) and I visualize spiders getting sucked into these archive black holes, never to return. I also assume that it is just more internal mass linking (yech) and probably doesnt help the overall TOS/bounce/exit, etc. Thoughts?
Content Development | | EricPacifico0 -
Using testimonials to build quality content
I don't see a lot of talk on here about using testimonials to build good content. From what I have seen, the search engines love client testimonials. I haven't found a good service provider for this yet. I'm thinking something combined with Twitter or FB that also allows you to show the content on your site would be great. Can anyone recommend a service like this? I'm checking out a service called, "Quick Vouch" but I haven't tested it yet.
Content Development | | BradBorst0