Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Content - Similar but not exactly the same content - Duplicate or Spammy?
Hey, so I have been wondering for some time now as some pages will get indexed and others won't appear at all. That makes me think that I am either creating to similar content or it is becoming too spammy. Take these two pages I created for example. The body content is very similar but h tags, meta tags and title are different. So my questions is; would pages not be displaying due possibly being too similar and spammy or duplicate? I have linked two pages that are very similar below and would love to hear any thoughts about it. https://www.dlmremovals.com.au/queensland/interstate/removalists/gold-coast-to-ballarat-removalist-backloads-and-moving-service.html https://www.dlmremovals.com.au/queensland/interstate/removalists/gold-coast-to-bendigo-removalist-backloads-and-moving-service.html Any feedback would be greatly appreciated. Thanks in advance.
Content Development | | Niclasfa0 -
Can I post my MailChimp articles on my blog without getting hit for duplicate content?
I would like to post my newsletters on my blog, but am afraid of duplicate content since you can click a link on the MailChimp email blast to view the Newsletter online. Is this considered dup content?
Content Development | | RoxBrock0 -
What if your content is getting social shares but no links?
Suppose you have a weekly blog article and sometimes your articles earn social shares (e.g. 23 +1's on Google Plus on one article but normally 3-5 social shares). One out of 10 earns an organic link from a random blog. Would you continue publishing these blog posts?
Content Development | | ProjectLabs0 -
Need creative content idea suggestion for travel business
Hi everyone It's glad for me to be a part of moz community. I'm really enjoyed. I would love to ask if anyone is creative content expert here can share some suggestions on how to produce high quality content for travel business. As the travel industry we are focusing on is selling tours in southern asia markets such as vietnam, laos & cambodia Currently i already come up with some ideas here Trip Interview Articles - with commissioned writers & paid bloggers Trip Experience/Report Articles - outsourcing to elance writers who have visited the destinations Any unique idea better than these which can set us apart from competitors and having high ROI on SEO? Thank you
Content Development | | dklongpro0 -
Filling Up Content For A New News Publishing Site
Hello, SEO Gurus. I have a client whom I've been working with for a few months now, and part of our service offering is to publish and promote fresh, daily content on his site's blog. This strategy has been a huge success thus far, he is very happy with the content, etc. Now, he is getting ready to launch a second site, which will be a news publishing site for his industry niche, and we will once again be providing the content on a daily basis: we're going to be producing 10 to 15 articles a day. It's a big operation for us. The client, however, is concerned that he doesn't want the site to appear "thin" on content in the early going, and asked if it would be possible to populate the new site with the articles we wrote on the other site's blog. My gut reaction to this is that it would be an exceedingly bad idea to do this. While we are the ones who authored the original content (and we've used author tags and publishing markup), the best bet is to simply start fresh. Besides that, seeing as we'll be pumping out tons of content on a daily basis, it won't take long to fill up the content coffers. That being said, I just wanted to run this past you all and see if anyone had any alternative ideas on how to use the old content without it being duplicate content. I was thinking that maybe designating all of the old articles with noindex, nofollow could be an option? Many thanks in advance for your time and attention. Sincerely, Mike
Content Development | | RCNOnlineMarketing0 -
Do comments count as page content, as it relates to the length of content on a page?
I understand Google likes long content, and I make all my pages at least 500 words of unique and good content. But there is something I am curious about. Do they also count comments as content? The reason I'm asking is that I'm considering creating a Q&A site, where I'd control the questions, making sure they would be good ones and not duplicates, and then have people add answers. In reality, I'd be populating most the questions as first, and most definitely supplying a very good and long answer to questions. The answers would likely be in the form of comments, with highest ranked answers at top. So, I'm wondering what Google would think of a 100 word question, with a several hundred word answer in a comment, often followed by some other comments after that. Would it be a 100 word page or a 500+ word page?
Content Development | | bizzer0 -
My WebSite has two sections with overlapping, or redundant articles on the same topics. Google is only listing one or the other article in Search Results. What should I do to have both pages (similiar but unique content ) to be listed?
My Web Site has two sections with overlapping, or redundant articles on the same topics. Google is only listing one or the other article in Search Results. What should I do to have both pages (similar but unique content ) to be listed? Example: http://www.womenshealthcaretopics.com/pregnancy_week_12.htm http://www.womenshealthcaretopics.com/pregnancy_12_weeks.html
Content Development | | docjamesmd0 -
How can I use my unique content to my advantage?
Hi, I run http://ablemagazine.co.uk - we also put out a print magazine every 2 months (the biggest disability magazine in Britain) This means we have loads of unique content (around 30 feature stories and 30 news stories every 2 months) Just wondering how I can use this to my advantage? I've been social bookmarking the feature stories (reddit, etc) and a link to all my unique stuff on facebook/twitter. Just wondering if there's anything else I should be doing? Thanks
Content Development | | craven220