Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Add a Search box (content hub) for my website?
Hello We would like to introduce a search area in our website, to help our users to find all information regarding a specific topic (landing pages, infographics, blogs, videos, etc...). Before we decide to build everything internally, we were wondering if there is any widget or plugging to make this in a smooth way and that works fine. I have also seen that Google offers a custom search option to make this happens. I would really appreciate advice about what to do regarding this topic: Is there any company that offers a really good solution for this? Is worth to use Google custom search option? Or the best option is build it internally? PS: I have seen that there are many plugins for wordpress, but our site is not a wordpress blog. Just to clarify. Many thanks for your help 🙂
Content Development | | AutoEurope0 -
Reviving a (very) old blog - is it worth shifting the content onto a new blog?
I look after a few ecommerce sites, one of them doesn't currently have a blog, we are setting up a wordpress blog now for the site. Going way back in time the site did have a blog which was on a separate Typepad domain. What I'm wondering is whether it is worth redirecting this whole blog to the new blog section of the site and copying some of the content over to the new blog as historical posts? I don't think it will be possible to redirect each individual post to a new one so it will just be a straight redirect of the old blog domain to the new one with the same (most of anyway) content. Do you think it is worth doing this for the value of this content which is relevant but dated (many of the links are now expired)? Doing this will take some time to do so it's not 'free' content we'd be getting We have a lot of new content planned out so we won't be short of content, just would be nice to have some historical content on there too Thanks
Content Development | | PeterLeatherland0 -
Thoughts on Evan Carmichael for Content Marketing & SEO?
We used to have a lot of success repurposing content through EvanCarmichael.com, but it seems like our articles are being de-indexed there very frequently now. I'd love to hear some others' opinions on Evan Carmichael and how worthwhile it is to keep publishing there. Thanks!
Content Development | | ScottImageWorks0 -
Blog Content
I keep reading that a steady stream of new blogs from my site is a great way for getting inbound links to my site. My question is... Does the content of my blogs have to be relevant to my site? My site is www.marblerenovation.com. If the blog should stay relevant, I am finding it pretty hard to create engaging content around cleaning marble floors. Also, does anyone know of a good place to find bloggers to help create this content? Thanks in advance everyone Dave
Content Development | | david.smith.segarra0 -
Duplicate Content
I have a service based client that is interested in optimizing his website for all the services that he provides in all the locations that he provides them in. For example: Service 1, location 1 Service 1, location 2 Service 2, location 1 Service 2, location 2 He wants to essentially create an individual page for each of the above, but i'm concerned that he will be penalized for duplicate content. Each of the pages would have the keyword in the url, page title and within the main body of content. We would certainly alter the content somewhat, but not sure how much a difference this would make. Any thoughts or advice would be greatly appreciated.
Content Development | | embracedarrenhughes1 -
Site Content Review Please!
I m looking for someone who can review my site and let me about quality of content on my site. Can anyone suggest / know who I can talk to about this ? Nick
Content Development | | orion680 -
Duplicate Page Content WordPress blog with categories?
Just got a crawl report back from SEOmoz and it gives me lots of errors for "duplicate page content". Upon investigating, I notice this is because my WP blog is setup into categories so the home page is almost identical to one of the category pages. None of my actually posts are the same but the category pages have some overlap since the same post could show up in two or more categories. Is this a problem or can I just ignore this error? Any thing I should be doing differently? Thanks!
Content Development | | frankthetank20 -
Press Releases and Duplicate Content on Event Related Site
I have a site that lists events. I ask those submitting events to submit original content if possible, but frequently they submit press releases which are already published elsewhere. I rewrite some of the press releases, but do not have time to rewrite every press release that comes my way. I want my users to get a comprehensive list of events, but I don't want get a penalty for duplicate content. What is the best solution?
Content Development | | andywozhere0