Any tools for scraping blogroll URLs from sites?
-
This question is entirely in the whitehat realm...
Let's say you've encountered a great blog - with a strong blogroll of 40 sites.
The 40-site blogroll is interesting to you for any number of reasons, from link building targets to simply subscribing in your feedreader. Right now, it's tedious to extract the URLs from the site. There are some "save all links" tools, but they are also messy.
Are there any good tools that will
a) allow you to grab the blogroll (only) of any site into a list of URLs (yeah, ok, it might not be perfect since some sites call it "sites I like" etc.)
b) same, but export as OPML so you can subscribe.
Thanks!
Scott
-
Not at all. I guess my feeling here is that there is a sort of untapped social graph defined by blogrolls. If it were simple to harvest them upon visiting a blog (e.g. this blogger recommends...) one could do a stumble-on-steroids approach to a niche.
-
I thought you might be able to use the outbound link scraper to grab the outbound link onto the page. Pop in your URLS of the pages you want to scrape and it will spit out our a list of those domaind and urls. You can take those urls and put them into the contact finder and it will return the contact details for those sites. Combine the two spreadsheets for an epiuc list of blogs to contact for your outreach.
This is obviously for link building rather than subscribing - sorry if I have misunderstood what you were trying to do
-
Hi Keri,
That is a very cool tool, but is overkill for this. It takes far too many steps to accomplish only part of the desired goal of grabbing all blogroll URLs (within the blogroll DIV tag) and exporting the list to a valid OMPL file or URL list.
thanks!
-
nothing I saw there would do this. It looks like it could manage to list all external links, and I suppose you could manually pick the blogroll out of it.
-
Hi there,
Well, Keris response reminded me of this question and the fact that I found a tool for scraping these kind of lists:
Here it is (with some other cool tools) , have fun:
-
Hi Scott,
I'm going through older questions. Did you ever find a tool to do what you wanted to do here?
-
One thing to look at is Outwit Hub for Firefox. It might be able to help with that. It can scrape data from a page and do a lot with it. http://www.outwit.com/products/hub/. Don't know that it meets all of your needs, but I also haven't seen a response with anything better at the moment.
-
Hey Scott,
What a great question and <sigh>I don't have the answer. I am going to back to find out what people come up with here. Surely there is someone that lurks these parts that can throw something together?</sigh>
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Site Crawl Stalled and Can't Restart
In my GreenSeed campaign, the site crawl continues to say "in progress." I can't figure out how to stop it or how to restart the site crawl. Can you please help?
Moz Pro | | Winger1 -
Is the on page optimization tool not working?
i received a grade f for one of my keywords/page. i corrected some of the points but when i tried to submit the form again, it doesn't check off those corrected items. is there something wrong with the tool right now? also, how does the tool work if i'm targeting 2 different keywords for one page? e.g. digital marketing philippines and digital marketing agency philippines I'm pretty sure one of the keywords will have problems with at least 3 critical and high importance on page factors (broad keyword usage in page title, exact keyword usage in page title, etc.) is there an effect if there's a critical factor left unchecked because using both keywords in the title might look redundant?
Moz Pro | | optimind0 -
Is Keyword Difficulty an absolute measure, or relative to my site?
We were able to rank very well for a specific keyword. After signing seomoz I've figured out that this keyword has a difficulty of 1%. All the other similar keywords I've researched have difficulties greater than 20%. Is the Difficulty related to my site? Or is it absolute?
Moz Pro | | BrunoReis0 -
Link buying: finding sites based on criteria
Hi. I'm looking to get links to my site from blogs/sites/pages that fit this criteria: niche: IT, electronics, product reviews Minimum PR: 2 Maximum 20 external links Is there an automated tool that can help me discover these site?
Moz Pro | | seo_marker0 -
Recommend SEOmoz PRO Tools on LinkedIn
Hey Moz Community, Because honest user reviews are the best way to inform people about SEOmoz PRO tools and benefits, we'd love for those of you who are on LinkedIn to leave a recommendation for our product: http://mz.cm/urHa1e If you do choose to leave a review, please be honest in what you say. Even if it's not 100% hearts & flowers, we'd rather you keep it real. Thanks!
Moz Pro | | EricaMcGillivray0 -
Use of the tilde in URLs
I just signed up for SEOMoz and sent my site through the first crawl. I use the tilde in my rewritten URLs. This threw my entire site into the Notice section 301 (permanent redirect) since each page redirects to the exact URL with the ~, not the %7e. I find conflicting information on the web - you can use the tilde in more recent coding guidelines where you couldn't in the old. It would be a huge thing to change every page in my site to use an underscore instead of a tilde int he URL. If Google is like SEOMoz and is 301 redirecting every page on the site, then I'll do it, but is it just an SEOMoz thing? I ran my site through Firebug and and all my pages show the 200 response header, not the 301 redirect. Thanks for any help you can provide.
Moz Pro | | fdb0 -
SEOMOZ tools are not helping!
So, I have a website www.mobikwik.com which provides mobile phone topup in India. I signed up for SEOMOZ Pro membership a little less than a month ago. For all our competitors , which rank above us in google as of today , I checked out two things a) competitive link analysis and b) on-page optimization. On b) , our home page had the best optimization for targeted keywords and the best score amongst all competitors who ranked above us. On a) , linkscape tools showed our moztrust and mozrank to be the best. Except for 2 competitors, our domain authority was also better than the rest. Inspite of all this, our website is ranked no more than 6 on google for all targeted keywords. We have the best quality links amongst our competitors. I am at a loss to understand this and how to improve our ranking. I have to make a decision whether to renew my SEOMOZ membership in 2 days time. Please help me decide!
Moz Pro | | mobikwik0 -
Keyword Difficulty Tool. Why my rank is low?
Keyword Difficulty Tool. Why my rank is lower than competitors even though I have higher numbers? Can somebody help me understand this... take a look at screenshot. http://www.traxnyc.com/images/keyword_difficulty.jpg What should I do to get to number 1 position? Keyword: Hip Hop Jewelry My Website: traxnyc.com
Moz Pro | | DiamondJewelryEmpire0