Why don't Moz OSE, Ahrefs, Majestic and so on change their user agent while crawling?
-
Some blackhat websites, PBNs and other "cheaters" use various methods to effectively block third-party backlink checker bots (OSE, Ahrefs, Majestic...): robots.txt rules, IP blocking and such.
A simple solution for those bots would be to mimic Google by using its user agent string for example.
Or, if that is not legally permitted (which I doubt), they could use some kind of randomness in user agent strings, URLs and IPs in order to prevent blocking. This should not be a big deal IMHO. Am I missing something obvious?
-
The ethics of the Internet dictate that you
- crawl politely,
- obey robots.txt and
- properly identify yourself
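As a rough illustration of the second and third points, here is a minimal Python sketch of how a crawler that identifies itself honestly ends up obeying a site's block. The robots.txt content and the `can_crawl` helper are made up for this example; `rogerbot` and `AhrefsBot` are the user agent tokens those crawlers are commonly known by, and a real crawler would fetch the file from the site rather than use a hard-coded string.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, of the kind a site might publish to keep
# link-index crawlers out while still letting other bots in.
ROBOTS_TXT = """\
User-agent: rogerbot
Disallow: /

User-agent: AhrefsBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def can_crawl(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt allows user_agent to fetch url.

    can_crawl is an illustrative helper, not part of any crawler's real code.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("rogerbot", "AhrefsBot", "Googlebot"):
        print(agent, can_crawl(agent, "https://example.com/page"))
```

With this file, a crawler announcing itself as `rogerbot` or `AhrefsBot` is refused everywhere, while one announcing itself as `Googlebot` is only kept out of `/private/`. That gap is exactly why spoofing the user agent would work, and why it would break the "properly identify yourself" rule.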
This isn't a new issue. Link networks and sites have blocked crawlers and manipulated Google for years. Fortunately, it's only a small fraction of the web. Also, it's unlikely that links from those networks have much value, so crawl priority would be super low anyway.
Actually, it could be viewed as beneficial when blackhat sites block OSE and Ahrefs, because those sites often get penalized by Google, but third-party crawlers have no way to know this, so blocking effectively keeps them out of the indexes.
-
Well, I think bot blocking is an obvious problem even now, and it will matter more in the future as private networks spread, as you can imagine.
Moz (and others) should find and implement the best possible solution. I see no problem with TAGFEE as long as you are transparent about the fact that your bots are undetectable.
I understand that what I'm proposing may be neither the best nor a welcome solution, but the problem must be addressed or OSE will soon have no value at all.
What do you propose?
-
I agree with George here -- we'd hear a huge outcry if we pretended to be Googlebot or a different bot. We'd also likely get blocked, as sometimes people only let in a certain few known bots/IPs to crawl their site. If we changed user agents and IPs regularly, it would not be cool or TAGFEE.
-
What about using different user agents and IPs regularly in order to avoid detection?
Is there any other acceptable solution?
-
The reputation and integrity of the major players would be at stake here. If they changed their user agent identification (to spoof Googlebot or Bing or whatever) that could be detected, and they would be castigated. The crawler IP address and its user agent ID would be out of sync...
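To make the "out of sync" point concrete, here is a hedged Python sketch of the kind of check a site owner could run: a reverse DNS lookup on the visiting IP, followed by a forward lookup to confirm the hostname maps back to that IP. The function name `is_genuine_googlebot` and the injectable `reverse` and `forward` parameters are choices made for this example; the hostname suffixes reflect the domains Google documents for its crawlers.

```python
import socket
from typing import Callable, List

# Domains that genuine Googlebot hosts are documented to resolve to.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def _reverse(ip: str) -> str:
    """Reverse DNS: IP address -> primary hostname."""
    return socket.gethostbyaddr(ip)[0]


def _forward(host: str) -> List[str]:
    """Forward DNS: hostname -> list of IPv4 addresses."""
    return socket.gethostbyname_ex(host)[2]


def is_genuine_googlebot(
    ip: str,
    user_agent: str,
    reverse: Callable[[str], str] = _reverse,
    forward: Callable[[str], List[str]] = _forward,
) -> bool:
    """Return True only if the user agent claims Googlebot AND the IP checks out.

    Illustrative helper: the resolvers are injectable so the logic can be
    exercised without network access.
    """
    if "googlebot" not in user_agent.lower():
        return False  # does not claim to be Googlebot at all
    try:
        host = reverse(ip)
        # The hostname must be in a Google domain and map back to the same IP.
        return host.endswith(GOOGLE_SUFFIXES) and ip in forward(host)
    except OSError:  # covers socket.herror and socket.gaierror
        return False
```

A third-party crawler sending a Googlebot user agent from its own servers would fail this check, because its IP would not reverse-resolve to a Google hostname. That mismatch is what would expose the spoofing.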