Is Google able to determine duplicate content every day/ month?
-
A while ago I talked to somebody who used to work for MSN a couple of years ago within their engineering department. We talked about a recent dip we had with one of our sites.We argued this could be caused by the large amount of duplicate content we have on this particular website (+80% of our site).
Then he said, quoted: "Google seems only to be able to determine every couple of months instead of every day if the content is actually duplicate content". I clearly don't doubt that duplicate content is a ranking factor. But I would like to know you guys opinions about Google being only able to determine this every couple of X months instead of everyday.
Have you seen or heard something similar?
-
Sorting out Google's timelines is tricky these days, because they aren't the same for every process and every site. In the early days, the "Google dance" happened about once a month, and that was the whole mess (index, algo updates, etc.). Over time, index updates have gotten a lot faster, and ranking and indexation are more real-time (especially since the "Caffeine" update), but that varies wildly across sites and pages.
I think you also have to separate a couple of impacts of duplicate content. When it comes to filtering - Google excluding a piece of duplicate content from rankings (but not necessarily penalizing the site), I don't see any evidence that this takes a couple of months. It can Google days or weeks to re-cache any given page, and to detect a duplicate they would have to re-cache both copies, so that may take a month in some cases, realistically. I strongly suspect, though, that the filter itself happens in real-time. There's no good way to store a filter for every scenario, and some filters are query-specific. Computationally, some filters almost have to happen on the fly.
On the other hand, you have updates like Panda, where duplicate content can cause something close to a penalty. Panda data was originally updated outside of the main algorithm, to the best of our knowledge, and probably about once/month. Over the more than a year since Panda 1.0 rolled out, though, it seems that this timeline accelerated. I don't think it's real-time, but it may be closer to 2 weeks (that's speculation, I admit).
So, the short answer is "It's complicated" I don't have any evidence to suggest that filtering duplicates takes Google months (and, actually, have anecdotal evidence that it can happen much faster). It is possible that it could take weeks or months to see the impact of duplicates on some sites and in some situations, though.
-
Hi Donnie,
Thanks for your reply, but I was already aware of the fact that Google had/ has a sandbox. I had to mention this within my question. I'm looking more for an answer around the fact if Google is able to determine on what basis if pages are duplicate.
Because I saw dozens of cases where our content was indexed and we linked/ linked not back to the 'original' source.
Also want to make clear that in all of these cases the duplicate content was in agreement with the original sources just to be sure.
-
In the past google had a sandbox period before any page (content) would rank. However, now everything is instant. (just learned this today @seomoz)
If you release something, Google will index it as fast as possible. If that info gets duplicated Google will only count the first one indexed. Everyone else loses brownie points unless they trackback/link back to the main article (first indexed).
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How To Stop Google's "Fetch & Render" From Showing Up In Google Analytics
Hi all, Within Google's "Fetch & Render" (found in Google Search Console) is the ability to index certain pages from my website on-demand. Unfortunately, every time I ask Google to index a page, it registers as a bounce in Google Analytics. Also, if it means anything, my website (www.knowtro.com) is a single-page application, functioning similarly to Google. If you guys know of any solution to this problem, please help! I originally thought that Google would know to block its own Fetch & Render crawler from Google Analytics but that doesn't seem to be the case. Thanks, Austin
Reporting & Analytics | | A_Krauss0 -
Google analytics
hello there, While Adding My website in Google Analytics,It is not Showing India Country in list. What i can do to add My account in it?? Have requested Google, But Got No response.any body can help please . Thanx in advance,
Reporting & Analytics | | iepl5
Falguni0 -
Google Analytics goals by source report?
Hello everybody. Is there way in Google analytics to create report on what goals have been completed per each source? Example: Lets say I have 3 goals: Subscription, Purchase, Quote. How can I get report, saying something like this: google / organic - Subscription - 5 conversions
Reporting & Analytics | | DmitriiK
Purchase - 3 conversions
Quote - 10 conversions and so on. P.S. Basically, I want the reverse of standard Google Analytics goal completions report, where you can click on goal and see which sources/mediums completions came from. I'd like to do the opposite - "click" on source/medium and see which goals have been completed. Thanks0 -
Any issues with Google impressions dropping in Webmaster Tools?
I'm seeing a drop in impressions across all my websites that are hosted at a certain location. Just wanted to make sure that it is not some reporting issue that others are seeing.
Reporting & Analytics | | tdawson090 -
If I install google Analytics, that can affect my SEO.?
I purchased a site that has a lot of Backlinks and when I started to make changes like installing google analytics! i suspect the performance in search engine was getting worst because of that, so I removed.
Reporting & Analytics | | pparma
Now I'm trying MOZ and needs GA, should I install it again??
Maybe it analyces the backlinks and tells google, I don't know, is that possible?0 -
Google Analytics shows wrong traffic source
I was reviewing out traffic sources on GA for this past week's traffic history, and noticed a couple of visits & conversions that GA cited as coming from campaigns from emails that were sent out in 2012 and early 2013. it seems odd that we would have traffic & conversions from these older campaigns - on a regular basis. it's not even from one campaign, but from multiple campaigns. is it possible that it is a cookie issue? - i wonder if these visits were really from the email campaigns - or if they were from organic searches or other sources. Thoughts? Thanks!
Reporting & Analytics | | S.S.N0 -
Google Analytics - multiple counters
Hey there Mozzers! One of our customers wants to seperate one Google Analytics account into multiple accounts. The website is divided in three parts: Main: www.website.nl Sub1: www.website.nl/sub1 Sub2: http://www.website.nl/sub2 And they would like 4 different reports under one account. R1: Total count R2: Website.nl (without Sub 1 & Sub2) R3: Sub1 R4: Sub2 I know multiple counters will get in conflict with each other, so I have to implement some filters. E.g: We can configure a filter for R3 on "astmakids" in URL. My question is: is there a safe way to implement multiple Analytics filters on one website? And how will R3 see visitors that come from the root domain astmafonds.nl? Are they referrals? Thanks a lot in advance!! Partouter
Reporting & Analytics | | Partouter0 -
Use SEOmoz API in Google Docs Spreadsheet
Hi, Has anyone made functions to get link metrics from SEOmoz API in Google docs spreadsheet? Page Authority Domain Authority Linking Root Domains Total Links _Thanks to Tom Critchlow I got the social metrics, now I just need the ones from SEOmoz api. _ Any help would be sweet.
Reporting & Analytics | | EdgySEO0