Xpath to scrape date from google serp
-
Anyone managed to scrape the date from a google serp? I can get title, link etc. but the date just eludes me...please help! Here's an example of the kind of code google is returning:
-
Latest UK News Headlines - Mirror.co.uk
<cite>www.mirror.co.uk/news/</cite>
Cached
-
Similar
You +1'd this publicly. Undo
13 Jan 2011 – Get the latest News and Headlines from the Daily Mirror newspaper. Read breaking bulletins, front page reports, daily articles and celebrity ...
-
-
thanks, but the 'scrape similar' extension in chrome doesn't seem to behave in the same way as importxml on google docs or xpathonurl in niels bosma's seotools for excel
-
I'm no xpath expert, but using the scraper extension for Google Chrome, I'm able to scrape the date just fine. Here's what the xpath output is showing:
//li/div/div/span/span
Seems like all of the dates are the only span with class="f", you should be able to drill down on that I think.
Are you scraping in Google Docs?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Html language deprecated by Google?
Hi Mates, Currently we are using on our site two tags for language (we are targeting english ) .... and these are defined on the head section, my question is it is required by Google in order to rank well or it is deprecated. Thank you Claudio
Intermediate & Advanced SEO | | ClayRey0 -
Why is google truncating my title tag?
We are trying to figure out why the search result for the term "au pair" is not matching our designated title tag or anything on our page. If you search "au pair", please see the result for the domain interexchange.org. We do not see this problem with other search terms.
Intermediate & Advanced SEO | | jrjames830 -
Apps content Google indexation ?
I read some months back that Google was indexing the apps content to display it into its SERP. Does anyone got any update on this recently ? I'll be very interesting to know more on it 🙂
Intermediate & Advanced SEO | | JoomGeek0 -
Weird Google SERPs after New Domain Transfer 301
Hi, I have some very weird results in the SERPS - We did not do a complete 301 of the entire domain, but rather individual pages. We did the transfer back on 10th of June, and I was checking to see if there were any results on old domain of pages that were transferred to new domain via 301. There were, but... Now I have the following occurring in the search results: Title of Page (Links to old domain!! )
Intermediate & Advanced SEO | | bjs2010
www.oldomain.com › ... › Figures & Sculptures › Tall Sculptures (these last 2 breadcrumbs link to NEW domain??!!)
Bla bla bla (meta description from new domain meta description I know it's Monday, but this one has got me quite concerned! - Any insight appreciated! Am I going nuts?0 -
Control Over SERPs
I launched a content branch of over 1200 guides for my company's website one year ago. This content branch now receives 180,000 visits per month. Unfortunately, my company has needed to cut back on several of the vertical services that it offers. As a result, many of the top article search results accounting for the majority of the traffic to this branch are now obsolete. Are there any feasible ways to shift the authority of these obsolete top rated pages (located in the top 20 results) to other articles that cover services that we still do offer? For reference, please see the attached image containing an overview of the top 20x results accounting for 40% of all traffic to this content branch. Green are still valid content and but all other colors are not. Does anyone have suggestions for moving more green articles up and into the top 20x results while pushing the other ones down without loosing out on overall site traffic? Appreciate any and all feedback! Thanks!!! 1qMyYEJ.jpg
Intermediate & Advanced SEO | | TQContent0 -
Google Indexing Feedburner Links???
I just noticed that for lots of the articles on my website, there are two results in Google's index. For instance: http://www.thewebhostinghero.com/articles/tools-for-creating-wordpress-plugins.html and http://www.thewebhostinghero.com/articles/tools-for-creating-wordpress-plugins.html?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+thewebhostinghero+(TheWebHostingHero.com) Now my Feedburner feed is set to "noindex" and it's always been that way. The canonical tag on the webpage is set to: rel='canonical' href='http://www.thewebhostinghero.com/articles/tools-for-creating-wordpress-plugins.html' /> The robots tag is set to: name="robots" content="index,follow,noodp" /> I found out that there are scrapper sites that are linking to my content using the Feedburner link. So should the robots tag be set to "noindex" when the requested URL is different from the canonical URL? If so, is there an easy way to do this in Wordpress?
Intermediate & Advanced SEO | | sbrault740 -
Google + under Google business domain email account
Hello there, I have a quick and straight question and I am hoping to find answer here. What do we do with a G+ profile that was set up through a business domain's email account that is used by more than one person? We want to use the company name, but we can't as it is considered personal email account although it is under business domain verified by Google. Is there a way that we ask Google to change it and allow us to use the name of the company or should we just deactivate it? Thanks in advance!
Intermediate & Advanced SEO | | montauto0