Any tools for scraping blogroll URLs from sites?
-
This question is entirely in the whitehat realm...
Let's say you've encountered a great blog - with a strong blogroll of 40 sites.
The 40-site blogroll is interesting to you for any number of reasons, from link building targets to simply subscribing in your feedreader. Right now, it's tedious to extract the URLs from the site. There are some "save all links" tools, but they are also messy.
Are there any good tools that will
a) allow you to grab the blogroll (only) of any site into a list of URLs (yeah, ok, it might not be perfect since some sites call it "sites I like" etc.)
b) same, but export as OPML so you can subscribe.
Thanks!
Scott
-
Not at all. I guess my feeling here is that there is a sort of untapped social graph defined by blogrolls. If it were simple to harvest them upon visiting a blog (e.g. this blogger recommends...) one could do a stumble-on-steroids approach to a niche.
-
I thought you might be able to use the outbound link scraper to grab the outbound links on the page. Pop in the URLs of the pages you want to scrape and it will spit out a list of those domains and URLs. You can take those URLs and put them into the contact finder, and it will return the contact details for those sites. Combine the two spreadsheets for an epic list of blogs to contact for your outreach.
This is obviously for link building rather than subscribing. Sorry if I have misunderstood what you were trying to do.
-
Hi Keri,
That is a very cool tool, but it is overkill for this. It takes far too many steps to accomplish only part of the desired goal: grabbing all blogroll URLs (within the blogroll DIV tag) and exporting the list to a valid OPML file or URL list.
thanks!
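For anyone who wants to script the first half of this (links inside the blogroll DIV only), here is a minimal sketch using only the Python standard library. It assumes the blogroll container carries the word "blogroll" in its id or class, which will not hold on every site ("sites I like" etc.), and the names BlogrollParser and extract_blogroll are made up for illustration. It also assumes reasonably well-formed HTML; unclosed tags will throw off the nesting count.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class BlogrollParser(HTMLParser):
    """Collects external links found inside a container marked as a blogroll."""

    def __init__(self, base_url, marker="blogroll"):
        super().__init__()
        self.base_url = base_url
        self.marker = marker  # assumption: appears in the container's id/class
        self.depth = 0        # > 0 while we are inside the blogroll container
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        if self.depth:
            self.depth += 1
            if tag == "a" and attrs.get("href"):
                self._add(attrs["href"])
        else:
            label = f"{attrs.get('id') or ''} {attrs.get('class') or ''}".lower()
            if self.marker in label:
                self.depth = 1

    def handle_endtag(self, tag):
        if self.depth and tag not in VOID_TAGS:
            self.depth -= 1

    def _add(self, href):
        # Resolve relative links, then keep only external http(s) URLs once each.
        url = urljoin(self.base_url, href)
        external = urlparse(url).netloc != urlparse(self.base_url).netloc
        if url.startswith("http") and external and url not in self.urls:
            self.urls.append(url)


def extract_blogroll(html, base_url, marker="blogroll"):
    parser = BlogrollParser(base_url, marker)
    parser.feed(html)
    return parser.urls
```

You would feed it the page source and the page's own URL, e.g. extract_blogroll(page_html, "http://blog.example/"), and pass a different marker for sites that label the list some other way.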
-
Nothing I saw there would do this. It looks like it could manage to list all external links, and I suppose you could manually pick the blogroll out of it.
-
Hi there,
Well, Keri's response reminded me of this question and the fact that I found a tool for scraping these kinds of lists.
Here it is (with some other cool tools), have fun:
-
Hi Scott,
I'm going through older questions. Did you ever find a tool to do what you wanted to do here?
-
One thing to look at is Outwit Hub for Firefox. It might be able to help with that. It can scrape data from a page and do a lot with it. http://www.outwit.com/products/hub/. Don't know that it meets all of your needs, but I also haven't seen a response with anything better at the moment.
-
Hey Scott,
What a great question and, sigh, I don't have the answer. I am going to check back to find out what people come up with here. Surely there is someone who lurks in these parts who can throw something together?
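On the "throw something together" front, the OPML half of the original request (b) is the easy part once you have a list of URLs. Below is a rough sketch using the Python standard library; urls_to_opml is a made-up name, and it writes each blog as a plain link outline. To actually subscribe in a feed reader you would most likely need each site's feed address as well (an outline of type "rss" with an xmlUrl attribute), and discovering those feeds is not covered here.

```python
import xml.etree.ElementTree as ET


def urls_to_opml(urls, title="Blogroll"):
    """Build an OPML 2.0 document with one link outline per URL."""
    opml = ET.Element("opml", version="2.0")
    head = ET.SubElement(opml, "head")
    ET.SubElement(head, "title").text = title
    body = ET.SubElement(opml, "body")
    for url in urls:
        # type="link" only records the site address; it is not a feed
        # subscription. Feed readers expect type="rss" plus xmlUrl.
        ET.SubElement(body, "outline", type="link", text=url, url=url)
    return ET.tostring(opml, encoding="unicode")


if __name__ == "__main__":
    # Placeholder URLs for illustration only.
    print(urls_to_opml(["http://b.example/", "http://d.example/"]))
```

ElementTree takes care of escaping characters such as & in the URLs, which is where hand-rolled OPML usually goes wrong.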