Is it possible that Google may have erroneous indexing dates?
-
I am consulting someone for a problem related to copied content. Both sites in question are WordPress (self hosted) sites. The "good" site publishes a post. The "bad" site copies the post (without even removing all internal links to the "good" site) a few days after.
On both websites it is obvious the publishing date of the posts, and it is clear that the "bad" site publishes the posts days later. The content thief doesn't even bother to fake the publishing date.
The owner of the "good" site wants to have all the proofs needed before acting against the content thief. So I suggested him to also check in Google the dates the various pages were indexed using Search Tools -> Custom Range in order to have the indexing date displayed next to the search results.
For all of the copied pages the indexing dates also prove the "bad" site published the content days after the "good" site, but there are 2 exceptions for the very 2 first posts copied.
First post:
On the "good" website it was published on 30 January 2013
On the "bad" website it was published on 26 February 2013
In Google search both show up indexed on 30 January 2013!Second post:
On the "good" website it was published on 20 March 2013
On the "bad" website it was published on 10 May 2013
In Google search both show up indexed on 20 March 2013!Is it possible to be an error in the date shown in Google search results?
I also asked for help on Google Webmaster forums but there the discussion shifted to "who copied the content" and "file a DMCA complain". So I want to be sure my question is better understood here.
It is not about who published the content first or how to take down the copied content, I am just asking if anybody else noticed this strange thing with Google indexing dates.How is it possible for Google search results to display an indexing date previous to the date the article copy was published and exactly the same date that the original article was published and indexed?
-
Thanks Doug. Really an eye-opener.
-
Thanks Doug for your response. It really cleared up the questions I had about that date Google shows next to the search results.
I was not able to find official details about it, all I was able to find was different referencing as the indexing date of a page.
But I knoew here in the MOZ community there are people who really know things, that's why I asked.
So that date is just Google's estimation of the publishing date, not the date Google indexed the content!
Thanks again for taking the time to answer me!
-
Hiya Sorina,
When you use the custom date range, Google isn't listing results based on the date they were indexed. Google is using an estimated publication date.
Google tries to estimate the the publication date based on meta-data and other features of the page such as dates in the content, title and URL. The date Google first indexed the page is just one of the things that Google can use to estimate the publication date.
I also suspect that dates in any sitemap.xml files will also be taken into consideration.
But, given that even Google can't guarantee that it'll crawl and index articles on the day they've been published the crawl data may not be an accurate estimate.
Also, if the scraped content is being re-published with intact internal links (are these the full URL - do you they resolve to your original website?) then it's pretty obvious where the content came from.
Hope this help answer your question.
-
Hi Sorina,
I can tell you that the index dates shown by Google are accurate but is not the case with the Cache date sometimes as the date shown in the Cache and the copy shown in the cache don't match many times but the index dates are accurate. Send me a private message with the actual URLs under discussion and I will be able to comment with more clarity.
Best,
Devanur Rafi
-
Thank you for your response Devanur Rafi, but the "good" site doesn't have problems getting indexed.
Actually all posts on the "good" site are indexed the very same day they are published.My question was more about the indexing date shown in Google search results
How come, for a post from the "bad" site, Google is displaying an indexing date previous to the actual date the post was published on that site?!
And how come this date is exactly the same as the date Google says it indexed the post from the "good" site?
-
Hi Sorina,
This is a common thing and it all depends on a site's crawlability (how easy is it to crawl for the bot) and crawl frequency for that site. Google would have picked up that post first on the bad site and then from the good site. However, just because one or two posts were picked up late does not mean that the good site is not crawler friendly. It also depends on how far the resource is from the root. Let us take an example:
A page on a good site: abc.com/folder1/folder2/folder3/page.html
Now a bad site copies that page: xyz.com/page.html
In this case, Google might first pickup the copied page from the bad site as it is just a click away from the root which is not the case with the good site where the page is nested deep inside multiple folders.
You can also give the way back machine (archive.org) a try to find which website published the post first. Sometimes this might work out pretty well. You can also try to look at the cache dates of the posts on both the sites in Google to get some info in this regard.
Hope those help. I wish you good luck.
Best,
Devanur Rafi.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rel canonical on other page instead of duplicate page. How Google responds?
Hi all, We have 3 pages for same topics. We decided to use rel canonical and remove old pages from search to avoid duplicate content. Out of these 3 pages....1 and 2 type of pages have more similar content where 3 type don't have. Generally we must use rel canonical between 1 and 2. But I am wondering what happens if I canonical between 1 and 3 while 2 has more similar content? Will Google respects it or penalise as we left the most similar page and used other page for canonical. Thanks
Algorithm Updates | | vtmoz0 -
Training - How Google Crawls & Indexes Websites
Hi Does anyone know of any online training resources/webinars/training UK based that will cover the following for SEO: Why monitoring how search engines crawl and index content is important and how this can improve your SEO performance Using Google advanced operators to evaluate website indexation How to use log file data to gain insight into how search engines crawl and index content Techniques to control how search engines crawl and index content How search engines deal with JavaScript, common frameworks and SEO considerations I'm trying to develop my technical knowledge - I have always been more focused on content/KWD research/optimisation. Thank you
Algorithm Updates | | BeckyKey0 -
Google & Tabbed Content
Hi I wondered if anyone had a case study or more info on how Google treats content under tabs? We have an ecommerce site & I know it is common to put product content under tabs, but will Google ignore this? Becky
Algorithm Updates | | BeckyKey1 -
What are tips for ranking on Google Maps?
I have another thread going where everyone is saying to keep both the Places profile as well as the Google Plus Local profile I have for my company. I have another person telling me that it has a negative effect to have both accounts at the same time so I'm assuming thats why the listing never comes up on places unless you zoom all the way into the map to the address of the storefront. With that being said, can anyone provide some good tips for ranking first page on google maps? Goole Plus Local - https://plus.google.com/114370561649922317296/about?gl=us&hl=en Google Places - https://plus.google.com/103220086647895058915/about?gl=us&hl=en
Algorithm Updates | | jonnyholt1 -
Profile Picture Display Next to Map on Google Search
On a recent Google search, I noticed that the top result also displayed the Google map of the location next to it but aside from that, it also displayed that business' Google+ Profile picture. I'm wondering how that is done. We have a Google+ account as well with an image associated with our business. At this point, the map doesn't even display for our business, but how does one get the map and the profile picture to appear on search results? Here's a link to the search result in question:
Algorithm Updates | | atuomala
http://www.google.com/search?q=tech+squad&rlz=1C1CHFX_enUS511US511&aq=f&oq=tech+squad&sourceid=chrome&ie=UTF-8#hl=en&rlz=1C1CHFX_enUS511US511&sclient=psy-ab&q=tech+squad+wi&oq=tech+squad+wi&gs_l=serp.3..0l2j0i30l2.6041.6590.0.6592.3.3.0.0.0.0.79.158.2.2.0.les%3B..0.0...1c.1.4.psy-ab.DoA-PSlyt-A&pbx=1&bav=on.2,or.r_gc.r_pw.r_qf.&bvm=bv.42768644,d.cGE&fp=b5d841bb1ab219dc&biw=1440&bih=785 tech_squad_serp.jpg0 -
Decrease in Organic Traffic Due to Google Places
Hello there, we are national junk removal company and have franchises in most major cities in the US. We wanted to check to see if anyone else has seen a drop in organic traffic with the changes that Google has done with the amalgamation of Google Places with the organic rankings. All our places pages are ranking quite well and we are ranking higher organically but it appears that people go to the Google Places page and then either leaving or picking up the phone and calling our 1800 number to book a job instead of going to our website to make the booking. The interesting thing is that although Google started these changes back in October 2010 we have seen the drop in organic traffic mostly starting in April, even though we have seen a steady increase in organic ranking across the board. Has any other franchise based company seen this happen as well? Your feedback is greatly appreciated!
Algorithm Updates | | imspecialistgotjunk0 -
Google personalize search results ...
Hi cant find the right term or word for it but google seems to personalize my search results according to my previous searches so that the rankings i get for a certain term isnt correct. Can i turn that off somehow ?
Algorithm Updates | | danlae0 -
Was I Kicked Off Google Page One by Panda/Farmer?
Took over this site in March. Got a Panicked call from client Mid-March that all of a sudden keywords that put the site on Page One weren't working. There are still 9 that work, but apparently there were more. A large percentage of the backlinks are from Article Directories and Link Farms. Is this my problem? Also, a large percentage of the 149 pages suffer from keyword stuffing and were obviously written for Search Engines and not people. How much of a difference does that make?
Algorithm Updates | | reeljerc0