Anyways to pull anchor text?
-
Hi guys,
So basically i have a list of URLs/Domains and there backlinks (example: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg) but i'm missing anchor text. Can anyone recommend any tools which can scan a backlink, locate the URL/Domain on the page and then pull the anchor text?
Cheers, Chris
<colgroup><col width="548"><col width="884"></colgroup>
| | | -
Hi Matt!
No i have not yet found a tool which can do this.
The _ScrapeBox Anchor Text plugin _CleverPhD mentioned can only do this for one domain at a time. I need it for multiple domains.
Any other suggestions?
-
Hi Jay! Did you get this worked out?
-
Thanks Jay. If I look on the backlinks side, they all seem to have the same subdomain in some form or another. You would just need to setup the regex in Screaming Frog to look for just that keyword in the subdomain so it should match all the variants of it.
That said, ignore everything I just posted. I was thinking earlier, "Surely there is scraper software out there that does this already." I did not take the time to look. Your mention of Scrapebox reminded me of that.
Scrapebox has a separate addon that does this
http://www.scrapebox.com/anchor-text-checker
The ScrapeBox Anchor Text Checker allows you to enter your domain and then load a list of URL’s that contain your backlink. It will scan all the URL’s containing your link and extract the anchor text used by the websites that link to you.
-
Basically want the anchor text, so I can easily identify the location of the link on the page without needing to view source and search for the URL.
This export is directly from: http://s29.postimg.org/ujxm0c4lj/screenshot_677.jpg
Scrapebox backlink checker which doesn't give you anchor text.
-
Ok. Can you be more specific on what you are trying to accomplish with this data? I think that may help my understanding of what you are trying to do.
-
Thanks CleverPhD, sorry should had mentioned i'm looking to do this for multiple domain names not just one. So the method you describe works great for a single domain.
-
Screaming Frog can do this with custom extraction and list mode. If I am reading your question correctly, you have a list of URLs and what pages on your site that they link to.
You would upload the list of URLs into Screaming Frog so it knows what pages to scan and run it in list mode
http://www.screamingfrog.co.uk/seo-spider/user-guide/configuration/#15
You would then use the custom extraction tool to grep for the ahref code that has a link to your domain
http://www.screamingfrog.co.uk/web-scraper/
You would need to plug in a regular expression to look for your domain (or versions of it) and then include the rest of the HTML tag that include the anchor text all the way through the ending .
You should then be able to import that data into a spreadsheet and use text to columns to split the anchor text into it's own column.
It is a little tricky as the regular expression may have to be tweaked depending on how other sites link to your site. Run the Frog on a test group of 10 or so to make sure it works. If you have a bunch of errors, take the error examples and tweak the regular expression based on those.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Will aggressive use of branded keywords in anchor text attract Penguin’s wrath?
I'm working on a site for a serviced apartment site http://www.alcove.co.in/ which offers apartments in 9 cities in India. Site was ranking in 1st page of Google for “serviced apartment + city” for 7 cities until sometime in Jan 2013. However organic traffic has been gradually falling since sometime in September 2012 (40% fall this month over same period last year). There’s been no sudden fall in traffic which we may link with any Penguin update. There have been no warning messages in Google WMT. Even today the site ranks in 1st page for 3 cities; however ‘Serviced apartments bangalore’ which was the biggest revenue earner, is not ranked in first 5 pages. My questions are whether will aggressive use of branded keywords in anchor text will attract Penguin’s wrath, does Google makes allowance for case when company's name includes keywords. In our case, company name is Alcove Service apartments, could there be some other reason for fall in ranking/traffic? The distribution of anchors (external links, multiple links from same domain are counted) is : percent
Intermediate & Advanced SEO | | anand53
Keywords 34%
brand+keywords 43%
Natural 4%
only brand 11%
URL 7% For the above, Brand = ‘Alcove Service apartments’ or ‘Alcove Serviced apartments’ brand+keywords = various combinations of ‘alcove’ + [‘guest houses’ or ‘hotels’ or ‘accommodation’] + city1 + city2… Intriguingly, Open Site Explorer analysis of domain metrics (Domain Authority, Followed Linking Root Domains, etc) ranks Alcove higher than all but one site appearing in 1st page of Google for ‘Serviced apartments bangalore’. Most of alcove’s links are from article directories (no spun articles were used), directories and link exchanges with relevant sites. Any suggestions and guidance on what we could do to remedy the situation would be greatly appreciated! Thanks0 -
What's the news on sitwide nofollow links and anchor text penalties
Is it possible to be penalized for sitewide nofollow links because of anchor text penalties, even if you use branded anchor text?
Intermediate & Advanced SEO | | BobGW0 -
Large scale change of incoming anchor text/alt tag image links
I provide SEO services in-house and for clients for a ecommerce and web design company. For every client site we create or host we provide a image link on the bottom linking back to our company website. I started researching competitors that offer same services for our industry that have top Google rankings for all the terms we are targeting and i just realized they rank that well because their image/anchor text link they place on client sites is alt tagged for that specific keyword. We have not been optimizing or utilizing this easy keyword backlink method. I am now wanting to go to all of our clients sites and change our backlinks to target the keyword we are optimizing for but my concern is will that number of incoming anchor text/image alt tag links cause us to get penalized from google for either over optimization or them seeing 100's of backlinks keyword specific just change overnight. What is the best way to go about this change in a safe way to avoid or risk penalty from Google? 99% of all of our client backlinks are in the footer so they show up on every single page and they are all images. Would it have a different affect if i add a alt tag to those images so that we get the oncoming link juice of that specific keyword? One of my concerns is over optimization, since some of our clients have 1000's of pages on their website. so that is 1000's of incoming exact match keyword links. I feel like the danger is low for being penalized but i would rather be safe then sorry and get additional feedback. Thanks, Stephen
Intermediate & Advanced SEO | | VITALBGS0 -
Hiding Text in an SEO friendly way - is it possible?
Hello, I have a client who has very little (practically no) text content on his ecommerce website, on the home page and category / sub cat pages. We have drafted some text for him - but the designer has fought back against this as he feels it will break the design. Our proposed solution is to have some text visible - and the rest will be text that is hidden but can be revealed by clicking Read More. We are planning to follow these recommendations : http://www.shimonsandler.com/collapsible-div-seo-friendly/ We are not hiding text for the sake of it - but more to improve the UX. We of course want the text to be accessible - i.e. readable by screen readers. Does anyone have any experience or opinions in respect to taking this course of action, and is there anything we should make sure we either do or not do to stay on the side of the BIG G? Kind Regs, Rich
Intermediate & Advanced SEO | | RichBestSEO1 -
Anchor text
my website hit by penguin update on april; i check my anchor text and found that i had 1790 anchor links for " air conditioning nyc" i erase them almost a month ago but it still shows on the website report. how long it take seomoz see the changes and what about google. if someone have any idea how can i bring my site back to the top pages. thank you
Intermediate & Advanced SEO | | eoberlender0 -
Internal Anchor Text Penalty Clarification
I believe we may be seeing the initial stages of a penalty for over-using internal anchor text on our ecommerce site. Per Rand and other training, we added related product links and popular category links to our product and category pages. At the time, we did not have an html sitemap in the footer. We're a small to medium sized site with 1,700+ products. We have since added an html sitemap of our categories to our footer. Now we have category links in the sitemap and category pages and product pages with targeted anchor text. I'm beginning to see downward movement on some of those targeted categories. If I have an html sitemap in the footer (category index) should I get rid of the popular category links throughout the rest of the site? Also, with more frequency, I'm seeing a "product index" and "category index" in footers. Is this a best practice? Thanks.
Intermediate & Advanced SEO | | AWCthreads0 -
Site views messy in a text browser, but can see all text, is that a problem?
In Google's webmaster guidelines, they mention to view your site in a text browser to ensure all text is visible. All of our text is visible, but is very messy and is all jumbled on the page. I've noticed most sites text browser layout is clean. H How important is it to SEO that the site views cleanly in a text browser? Does anyone know of any feedback from Google engineers about this point?
Intermediate & Advanced SEO | | nicole.healthline0 -
Convert keyword rich PDFs to web pages (text & images)
SteriPEN is a portable water purifier that kills viruses, protozoa, e-coli, etc. Because of the technical and safety requirements nature of the product, our website has much documentation of testing, organisms affected, and more. These are in pdf form and can often be found through google search (and through links on specific pages). Because of the keyword-richness of these documents pertaining to microbes SteriPEN kills, etc. does it make sense to convert these pdf's into html text and images? Then I was thinking perhaps writing a blog post AND generating key links on important landing pages to these documents (as html). Removing pdfs may be harmful? Not a clue as to the cost/benefit.
Intermediate & Advanced SEO | | Timmmmy0