Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Scraping Pinterest

Intermediate & Advanced SEO

2124

Cyle last edited by

I am new to creating agile tools with google docs and I was wondering if I could get a little help. I am trying to make a Google Doc that will scrape Pinterest. The problem I run into is the importxml reports "oops, looks like we ran into a problem...." and goes on to tell me more about how it ran into a error. I was wondering if someone might know why the importxml formula can't scrape Pinterest.

Here is a link to my Google Doc if that helps:

https://docs.google.com/spreadsheet/ccc?key=0Al9sXyLp1ZLsdFcxTVd6THlka09kMXBvNWJfeE1Ucmc
1 Reply Last reply
Reply Quote 0
Sean_Dawes last edited by

I just use this one tool to export a site's data to csv and then play with it in excel

http://digitalhighrise.com/how-to-capture-your-pinterest-audience
1 Reply Last reply
Reply Quote 0

Got a burning SEO question?

Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.

Start my free trial

Browse Questions

View

From

Sorted by

With category

Explore more categories

Related Questions

Large number of links form Pinterest

Could unusually large number of links from Pinterest cause issues? Would Google categorise them as spammy links or site wide links? I have a small site with Urls around 800-1000. But webmaster shows 5321 links from Pinterest.com and 1467 from Pinterest.se. Please see attachment. ffNLF
Intermediate & Advanced SEO | | riyaaaz

0
Scraped content ranking above the original source content in Google.

I need insights on how “scraped” content (exact copy-pasted version) rank above the original content in Google. 4 original, in-depth articles published by my client (an online publisher) are republished by another company (which happens to be briefly mentioned in all four of those articles). We reckon the articles were re-published at least a day or two after the original articles were published (exact gap is not known). We find that all four of the “copied” articles rank at the top of Google search results whereas the original content i.e. my client website does not show up in the even in the top 50 or 60 results. We have looked at numerous factors such as Domain authority, Page authority, in-bound links to both the original source as well as the URLs of the copied pages, social metrics etc. All of the metrics, as shown by tools like Moz, are better for the source website than for the re-publisher. We have also compared results in different geographies to see if any geographical bias was affecting results, reason being our client’s website is hosted in the UK and the ‘re-publisher’ is from another country--- but we found the same results. We are also not aware of any manual actions taken against our client website (at least based on messages on Search Console). Any other factors that can explain this serious anomaly--- which seems to be a disincentive for somebody creating highly relevant original content. We recognize that our client has the option to submit a ‘Scraper Content’ form to Google--- but we are less keen to go down that route and more keen to understand why this problem could arise in the first place. Please suggest.
Intermediate & Advanced SEO | | ontarget-media

0
ROI on Policing Scraped Content

Over the years, tons of original content from my website (written by me) has been scraped by 200-300 external sites. I've been using Copyscape to identify the offenders. It is EXTREMELY time consuming to identify the site owners, prepare an email with supporting evidence (screen shots), and following up 2, 3, 15 times until they remove the scraped content. Filing DMCA takedowns are a final option for sites hosted in the US, but quite a few of the offenders are in China, India, Nigeria, and other places not subject to DMCA. Sometimes, when a site owner takes down scraped content, it reappears a few months or years later. It's exasperating. My site already performs well in the SERPs - I'm not aware of a third party site's scraped content outperforming my site for any search phrase. Given my circumstances, how much effort do you think I should continue to put into policing scraped content?
Intermediate & Advanced SEO | | ahirai

1
Penguin 3.0 hit caused by Pinterest?

I've read the FAQs and searched the help center. My URL is: http://www.weplann.comHello there,Since 10/24 we've seen a critical impact in organic results in our site. At first, only taking a look at the date of when it all started made us think about a Penguin 3.0 hit and then we went to take a look at our linking data to find we had more than 14,000 links from Pinterest (significantly more than any other source). Then, another thing that made us think about that dramatic organic drop is the anchor text: an exact match (www.weplann.com) is at #1 in our anchor text ranking but we're not 100% sure if this could affect our organic results because of the Penguin update.So our questions are:- A lot of Pinterest links could affect dramatically our organic results (understanding that Google may find that huge difference between linking sources as a bad practice)?- The anchor text exact match could really affect our ranking? If none of this points could cause that drop, then what would be causing it? We're sure it has to do with the Penguin update.Thank you very much in advance for your help!!
Intermediate & Advanced SEO | | WePlann

0
Ranking sites in vertical markets with 90% scraped content

Hi, Hoping to get advice about ranking sites (a vertical market search engine/portal like a car site for example) that gets its content from scraping car sites. For various reasons (mostly scale eg cant get car dealers to push their listings to us) content was scraped. The startup has received great press, TV interviews, incubator programs etc, and has also secured very significant investment. I feel if this site was launched pre-panda it would be ranking much better. We have invested significantly in our tech, our search tools and site innovation place us easily as market leader in this space. Anyone with experience in ranking sites with legitimate reasons for using scraped content?
Intermediate & Advanced SEO | | edthomasnp

0
Xpath to scrape date from google serp

Anyone managed to scrape the date from a google serp? I can get title, link etc. but the date just eludes me...please help! Here's an example of the kind of code google is returning: Latest UK News Headlines - Mirror.co.uk <cite>www.mirror.co.uk/news/</cite>
Cached
-
Similar

You +1'd this publicly. Undo
13 Jan 2011 – Get the latest News and Headlines from the Daily Mirror newspaper. Read breaking bulletins, front page reports, daily articles and celebrity ...
Intermediate & Advanced SEO | | JaspalX

1
What is the best way to scrape serps for targeted keyword research?

Wanting to use search operators such as "KEYWORD inurl:blog" to identify potential link targets, then download target url, domain and keyword into an excel file. Then use SEOTools to evaluate the urls from the list. I see the link aquisition assistant in the Moz lab, but the listed operators are limited. Appreciate any suggestions on doing this at scale, thanks!
Intermediate & Advanced SEO | | Qualbe-Marketing-Group

0
Competitior 'scraped' entire site - pretty much - what to do?

I just discovered a competitor in the insurance lead generation space has completely copied my client's site's architecture, page names, titles, even the form, tweaking a word or two here or there to prevent 100% 'scraping'. We put a lot of time into the site, only to have everything 'stolen'. What can we do about this? My client is very upset. I looked into filing a 'scraper' report through Google but the slight modifications to content technically don't make it a 'scraped' site. Please advise to what course of action we can take, if any. Thanks,
Greg
Intermediate & Advanced SEO | | seagreen

0