Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Clarification on duplicate content
Hi, if I have a page that unintentionally ranks for a term that I want to create a page for - say "atlanta apartments" - should I still create a page specifically intended to rank for "atlanta apartments"? Will canonical tags be crucial in this case? Hoping to avoid creating duplicate content and instead create the correct content for a specific term.
Content Development | | smiller760 -
What is the Best Content Spinner to Use?
I'm looking for a good article spinner. I used to use Spin Doc but it's not as intuitive anymore.
Content Development | | 01023451 -
How to Get Rid of Duplicate Content Captured on Article Lists
We have a ton of articles and blog posts on our site. Currently, we display summary lists of articles that contain the first paragraph of the article in the summary list. However, in my reports, this is coming back as duplicate content with the full article itself. How do I fix this? Ex: article main page- http://www.robots.com/articles/10 First article on that page- http://www.robots.com/articles/viewing/grippers-for-robots (which shows up as duplicate content with the main artilce page). With our blogs, we have the most recent 5 blogs (in the same summary format) listed on our main blog page. We then have categories that people can sort by. But again, this is causing us duplicate content because those pages show the first paragraph of the blogs related to that category. Ex: blog main page- http://www.robots.com/blog. First blog listed on that page- http://www.robots.com/blog/viewing/robots-and-automation-bringing-jobs-back-to-the-united-states (which then shows as duplicate content with the main blog page). And then you can also select categories to see related topics: http://www.robots.com/blog/category/buying-a-robot which is showing as duplicate content also. Help! How can I prevent this? Thanks! JWanner
Content Development | | jwanner0 -
Thumbs up or thumbs down to content rotators
Hi there - Our team is in the process of a website redesign. We're currently using a content rotator and are wondering if any folks have data to support whether this is actually a good practice despite it's popularity? Overall, I'm not impressed by the click throughs as a percentage of site traffic and most of our visitors are not repeat visitors so this may not really be necessary. Thoughts and experiences appreciated!
Content Development | | pasware0 -
Promoting blog content
I've created a pinterest board, which kind of serves as a blog also as it tells people how they can save money on their heating bills this winter. Does anyone have any suggestions on how to promote it as a backlinking opportunity? Would it be good to also feature the content on our website? Guest blogging? Any suggestions would be welcomed. Thank you.
Content Development | | AAttias0 -
Does this count as Copied Content ?
Hi, we are publishing news on our website blog. In the news we use excerpts from other websites but do mention the source like "according to XYZ news source" etc. Does it count as copied content as sometime copyscape shows alsmot 30% duplicated content due to inclusion of references from different sources in our news stories ? Regards, shahzad
Content Development | | shaz_lhr0 -
Best way to avoid duplicate content issues here.
I am planning to write an article that refutes some claims made in another article. The original article is a 20 page pdf. What I plan to do is to take quotes from this PDF and then under each quote write my arguments for or against the quote. If I take direct quotes from the article, is Google likely to see this as duplicate content?
Content Development | | MarieHaynes0 -
Duplicate content
Hello Seomoz team, i'm french and so my english is not very good ;-). I work for a brand site and we publish content about our products. The problem is : as a brand site, many sites that sell our products, copy our content. And we have duplicate content. And since these sites have worked SEO, they put in place rel canonical tag. as a brand, how to avoid being accused by Google duplicate content? tanks for you answer. I hope it's clear. Take care Denis
Content Development | | android_lyon0