Any tools for scraping blogroll URLs from sites?
-
This question is entirely in the whitehat realm...
Let's say you've encountered a great blog - with a strong blogroll of 40 sites.
The 40-site blogroll is interesting to you for any number of reasons, from link building targets to simply subscribing in your feedreader. Right now, it's tedious to extract the URLs from the site. There are some "save all links" tools, but they are also messy.
Are there any good tools that will
a) allow you to grab the blogroll (only) of any site into a list of URLs (yeah, ok, it might not be perfect since some sites call it "sites I like" etc.)
b) same, but export as OPML so you can subscribe.
Thanks!
Scott
-
Not at all. I guess my feeling here is that there is a sort of untapped social graph defined by blogrolls. If it were simple to harvest them upon visiting a blog (e.g. this blogger recommends...) one could do a stumble-on-steroids approach to a niche.
-
I thought you might be able to use the outbound link scraper to grab the outbound links on the page. Pop in the URLs of the pages you want to scrape and it will spit out a list of those domains and URLs. You can take those URLs and put them into the contact finder, and it will return the contact details for those sites. Combine the two spreadsheets for an epic list of blogs to contact for your outreach.
This is obviously for link building rather than subscribing. Sorry if I have misunderstood what you were trying to do.
-
Hi Keri,
That is a very cool tool, but it is overkill for this. It takes far too many steps to accomplish only part of the desired goal: grabbing all blogroll URLs (within the blogroll DIV tag) and exporting the list to a valid OPML file or URL list.
thanks!
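For anyone who wants to script this, here is a minimal sketch of the "links inside the blogroll DIV only" idea, using just the Python standard library. It assumes the container can be recognised by the word "blogroll" in its id or class attribute (the `markers` parameter is a made-up name; pass other words such as "sites-i-like" for blogs that label it differently) and that the markup is reasonably well formed, with closing tags present. The sample HTML is invented for illustration.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag; they must not affect depth tracking.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class BlogrollParser(HTMLParser):
    """Collect (title, url) pairs for links inside a blogroll-like container."""

    def __init__(self, markers=("blogroll",)):
        super().__init__()
        self.markers = markers  # assumed substrings of the container's id/class
        self.depth = 0          # > 0 while inside the matched container
        self.href = None        # href of the <a> currently being read
        self.text = []          # text fragments of that <a>
        self.links = []         # collected (title, url) pairs

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        if self.depth:
            # Already inside the container: track nesting and watch for links.
            self.depth += 1
            if tag == "a" and attrs.get("href"):
                self.href, self.text = attrs["href"], []
        else:
            # Outside: check whether this element looks like the blogroll.
            ident = f"{attrs.get('id') or ''} {attrs.get('class') or ''}".lower()
            if any(m in ident for m in self.markers):
                self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        if tag == "a" and self.href:
            self.links.append(("".join(self.text).strip(), self.href))
            self.href = None
        self.depth -= 1  # reaches 0 when the container itself closes

    def handle_data(self, data):
        if self.href:
            self.text.append(data)


def extract_blogroll(html, markers=("blogroll",)):
    """Return the (title, url) pairs found inside the first matching container."""
    parser = BlogrollParser(markers)
    parser.feed(html)
    return parser.links


# Invented sample page: only the two links in the blogroll widget should match.
SAMPLE_HTML = """
<div id="content"><a href="http://example.com/post">A post</a></div>
<div class="widget blogroll"><h3>Sites I like</h3><ul>
<li><a href="http://example.org/">Example Org</a></li>
<li><a href="http://example.net/"><img src="x.png">Example Net</a></li>
</ul></div>
<div id="footer"><a href="http://example.com/about">About</a></div>
"""

if __name__ == "__main__":
    for title, url in extract_blogroll(SAMPLE_HTML):
        print(title, url)
```

Sloppy real-world HTML (unclosed list items, for instance) would throw off the depth counting, so a more forgiving parser may be needed in practice.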
-
Nothing I saw there would do this. It looks like it could manage to list all external links, and I suppose you could manually pick the blogroll out of that list.
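Once the blogroll links have been picked out, by hand or otherwise, writing them to OPML is the easy part. A rough sketch, assuming the links are available as (title, url) pairs; `links_to_opml` is a hypothetical helper name. Note that a blogroll only gives the blog's home page, so the outlines below are written as plain links (`type="link"`). To subscribe in a feed reader, each entry would also need the feed address in an `xmlUrl` attribute, which has to be discovered separately and is not shown here.

```python
import xml.etree.ElementTree as ET


def links_to_opml(links, title="Blogroll"):
    """Build an OPML 2.0 document from (title, url) pairs; return it as a string."""
    opml = ET.Element("opml", version="2.0")
    head = ET.SubElement(opml, "head")
    ET.SubElement(head, "title").text = title
    body = ET.SubElement(opml, "body")
    for text, url in links:
        # type="link" with a url attribute marks a plain web link in OPML 2.0.
        # Fall back to the URL itself when the link had no visible text.
        ET.SubElement(body, "outline", text=text or url, type="link", url=url)
    return ET.tostring(opml, encoding="unicode")


if __name__ == "__main__":
    example = [("Example Org", "http://example.org/"),
               ("Example Net", "http://example.net/")]
    print(links_to_opml(example))
```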
-
Hi there,
Well, Keri's response reminded me of this question and the fact that I found a tool for scraping these kinds of lists.
Here it is (with some other cool tools), have fun:
-
Hi Scott,
I'm going through older questions. Did you ever find a tool to do what you wanted to do here?
-
One thing to look at is Outwit Hub for Firefox. It might be able to help with that. It can scrape data from a page and do a lot with it: http://www.outwit.com/products/hub/. I don't know that it meets all of your needs, but I also haven't seen a response with anything better at the moment.
-
Hey Scott,
What a great question and, sigh, I don't have the answer. I am going to check back to find out what people come up with here. Surely there is someone who lurks in these parts who can throw something together?