Remove Scraped Content?
-
I work for a site whose content is not the top result when you search Google for a snippet of its text. I believe what happened is that they wrote blog posts and articles and added them to their own site and to article directories at the same time, and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, where possible, back to your site. If not, the page should still carry its creation date and last-update date, which Google can see. I would not leave anything to guesswork, though: make sure you have links, and I would even put the date it was posted on the post itself, as news articles do. It's just another indicator.
I would not remove the content if it did, in fact, originate from you.
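If you want to check whether a syndicated copy actually links back to you, something like the following rough sketch could help. It only uses Python's standard library; the sample HTML, the `example.com` domain, and the names `LinkCollector` and `links_back_to` are made up for illustration, and you would feed it the saved HTML of the real directory page instead.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_back_to(html, original_domain):
    """Return links in `html` whose host is `original_domain` or a subdomain of it."""
    collector = LinkCollector()
    collector.feed(html)
    matches = []
    for href in collector.hrefs:
        # Relative links have no hostname, so they never match.
        host = (urlparse(href).hostname or "").lower()
        if host == original_domain or host.endswith("." + original_domain):
            matches.append(href)
    return matches


# Hypothetical HTML of a syndicated copy on an article directory.
directory_copy = """
<article>
  <p>How to choose a widget...</p>
  <p>Originally published at
     <a href="https://www.example.com/blog/choose-a-widget">example.com</a>.</p>
  <a href="https://directory.example.net/author/42">More by this author</a>
</article>
"""

print(links_back_to(directory_copy, "example.com"))
```

An empty list would mean the copy carries no link back to your domain. That tells you nothing about how Google treats the page; it is only a quick way to audit which copies credit you.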
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
"If we're not coming up first for our article, that means we are not believed to be the original author, correct?"
Search engines cannot identify original authors (unless you use the rel="author" attribute, and then they are merely taking your word for it). They only know which page with the content was discovered first. The content could have been on other pages first, or it could have been published offline first. Search engines don't have divine powers.
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. It has nothing to do with authorship.
"Should I remove all content from our site where this is happening, even though we actually did create these articles?"
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.