Any tools for scraping blogroll URLs from sites?
-
This question is entirely in the whitehat realm...
Let's say you've encountered a great blog - with a strong blogroll of 40 sites.
The 40-site blogroll is interesting to you for any number of reasons, from link building targets to simply subscribing in your feedreader. Right now, it's tedious to extract the URLs from the site. There are some "save all links" tools, but they are also messy.
Are there any good tools that will
a) allow you to grab the blogroll (only) of any site into a list of URLs (yeah, ok, it might not be perfect since some sites call it "sites I like" etc.)
b) same, but export as OPML so you can subscribe.
Thanks!
Scott
-
Not at all. I guess my feeling here is that there is a sort of untapped social graph defined by blogrolls. If it were simple to harvest them upon visiting a blog (e.g. this blogger recommends...) one could do a stumble-on-steroids approach to a niche.
-
I thought you might be able to use the outbound link scraper to grab the outbound links on the page. Pop in the URLs of the pages you want to scrape and it will spit out a list of those domains and URLs. You can take those URLs and put them into the contact finder, and it will return the contact details for those sites. Combine the two spreadsheets for an epic list of blogs to contact for your outreach.
This is obviously for link building rather than subscribing. Sorry if I have misunderstood what you were trying to do.
-
Hi Keri,
That is a very cool tool, but it is overkill for this. It takes far too many steps to accomplish only part of the desired goal: grabbing all blogroll URLs (within the blogroll DIV tag) and exporting the list to a valid OPML file or URL list.
thanks!
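For anyone landing here later, the goal above can be roughed out in a few lines of standard-library Python. This is only a sketch under assumptions, not an existing tool: it assumes the blogroll sits in a container whose id or class contains the word "blogroll" (many sites name it differently), that the markup is reasonably well balanced, and the names `BlogrollParser`, `extract_blogroll` and `to_opml` are made up for illustration. The OPML it writes uses plain link outlines; a feed reader would normally want each outline to carry the feed address (`xmlUrl`), which needs a feed-discovery step that is not shown here.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
import xml.etree.ElementTree as ET

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class BlogrollParser(HTMLParser):
    """Collect (title, url) pairs for links inside a blogroll container."""

    def __init__(self, base_url, marker="blogroll"):
        super().__init__()
        self.base_url = base_url
        self.marker = marker   # assumed substring of the container's id/class
        self.depth = 0         # greater than 0 while inside the container
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        if self.depth:
            self.depth += 1
            if tag == "a" and attrs.get("href"):
                # Resolve relative links against the page URL.
                self._href = urljoin(self.base_url, attrs["href"])
                self._text = []
        else:
            label = f"{attrs.get('id') or ''} {attrs.get('class') or ''}".lower()
            if self.marker in label:
                self.depth = 1

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip() or self._href
            self.links.append((title, self._href))
            self._href = None
        self.depth -= 1


def extract_blogroll(html, base_url, marker="blogroll"):
    """Return the (title, url) pairs found inside the blogroll container."""
    parser = BlogrollParser(base_url, marker)
    parser.feed(html)
    return parser.links


def to_opml(links, title="Blogroll"):
    """Serialize (title, url) pairs as OPML 2.0 link outlines."""
    opml = ET.Element("opml", {"version": "2.0"})
    head = ET.SubElement(opml, "head")
    ET.SubElement(head, "title").text = title
    body = ET.SubElement(opml, "body")
    for text, url in links:
        ET.SubElement(body, "outline", {"text": text, "type": "link", "url": url})
    return ET.tostring(opml, encoding="unicode")
```

To try it, fetch a page however you like (for example with `urllib.request.urlopen`), pass the HTML and the page URL to `extract_blogroll`, then either print the URLs one per line or write the string from `to_opml` to a file. For sites that label the section "sites I like" or similar, the `marker` argument would have to be changed by hand.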
-
Nothing I saw there would do this. It looks like it could manage to list all external links, and I suppose you could manually pick the blogroll out of it.
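To make the "list all external links, then pick by hand" fallback concrete, here is a rough standard-library Python sketch. It is an illustration only: `LinkCollector` and `external_links` are hypothetical names, and "external" is assumed to mean "the hostname differs from the page's hostname, ignoring a leading www.", which is a simplification (it treats other subdomains of the same site as external).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect every <a href> on a page as an absolute URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.urls.append(urljoin(self.base_url, href))


def _host(url):
    """Lower-cased hostname with a leading 'www.' removed."""
    host = (urlparse(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def external_links(html, page_url):
    """Return unique http(s) links that point away from page_url's host."""
    collector = LinkCollector(page_url)
    collector.feed(html)
    own = _host(page_url)
    seen, result = set(), []
    for url in collector.urls:
        if urlparse(url).scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, and similar
        host = _host(url)
        if host and host != own and url not in seen:
            seen.add(url)
            result.append(url)
    return result
```

The output keeps page order, so a blogroll in a sidebar tends to show up as one contiguous run of links that is easy to cut out manually.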
-
Hi there,
Well, Keri's response reminded me of this question and of the fact that I found a tool for scraping these kinds of lists.
Here it is (with some other cool tools), have fun:
-
Hi Scott,
I'm going through older questions. Did you ever find a tool to do what you wanted to do here?
-
One thing to look at is Outwit Hub for Firefox. It might be able to help with that. It can scrape data from a page and do a lot with it. http://www.outwit.com/products/hub/. Don't know that it meets all of your needs, but I also haven't seen a response with anything better at the moment.
-
Hey Scott,
What a great question and, sigh, I don't have the answer. I am going to check back to find out what people come up with here. Surely there is someone who lurks in these parts and can throw something together?