How to download full list of internal links properly with OSE?
-
Hi Moz Community,
I am having issues getting a full list of my internal links when I go to the Open Site Explorer. When I export the internal links into excel, I am getting half of my URLs and 80% of what I am getting is absolutely crazy URLs that are super super old.
Plus, these old URLs are not even showing in my campaign crawls. So the crawls themselves are different.
Any help on this would be greatly appreciated, since I am trying to get a full picture on the internal link structure of my site.
Thanks!
-
Agree, I love Screaming Frog!
-
Hi Ryan,
It would be best to use Screaming Frog if you want a true indication of your internal link structure.
https://www.screamingfrog.co.uk/seo-spider/
Cheers,
David
-
Hey there! Tawny from Moz's Help Team here. I think I can help explain why you're not seeing all the links you might expect, and why you're seeing some older links that you're not seeing in your main Campaign Site Crawl.
Open Site Explorer and the Link Analysis page of Moz Pro Campaigns are both tied to our Mozscape index, which tends to update roughly once a month.
Just a few points on how we compile our index:
-
We grab the most recent index.
-
We take the top 10 billion URLs with the highest MozRank (with a fixed limit on some of the larger domains).
-
We start crawling from the top down until we've crawled ~130 billion URLs
The idea here is that we're focusing on the highest-quality links we can find, coming from the most prominent pages of authoritative sites. So, while you may not see every link for a site within our index, we're aiming to report the most valuable ones available!
Most new sites and links will be indexed by our spiders and available in Mozscape and Open Site Explorer within 60 days, but some take even longer for many reasons, including the crawl-ability of sites, the number of inbound links to them, and the depth of pages in subdirectories. This tends to bias our index in favor of newer links. Linking data is only stored in the index for about 180 days, so unless the crawler has a compelling reason to return to your site and the sites linking to yours and rediscover those links, they can fall back out of our index again. That doesn't mean they're not out there affecting your SEO, just that our tools don't see them anymore.
You can see our most recently updated schedule here as well as some more technical metrics on our Mozscape API Updates page. You can also see when the last and next updates happened on the Open Site Explorer (OSE) homepage at any time.
Since Moz focuses on quality of links over quantity, we are always focused on the most relevant links to display to our users. It's possible that Moz's index will leave out some of the lower-quality (non-link juice providing) links out of our index because of this. So, that might explain why you may see some discrepancies with what other tools may be showing.
You can read more about how we build our index in our guide here.
I know this is a ton of information, so if you have any questions or if I didn't make anything clear enough, please don't hesitate to ask! You can always drop us a line at help@moz.com and we'll do our best to clear up any questions you might have.
Cheers! -
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Large discrepancy in number of links using compare links tool
Hi there, I’m trying to understand the large discrepancy in number of links among websites. For instance, I am comparing landing pages among two different colleges. One college has over 7 million links, while the other has only 72,000 links. Meanwhile, these colleges are fairly comparable in terms of population and prestige. Could anyone help me understand why this metric is wide ranging, and how a college could have so many links? Specifically looking at total links, internal links, external links, and linking domains.
Link Explorer | | HanoverianHE0 -
Same domain, different DA? (OSE)
Bit of an odd one here, anyone seen this before: https://i.gyazo.com/aa083d07c58623e101fc84b8b569a0cc.png
Link Explorer | | ThomasHarvey0 -
Not existing domains linking to my website (spam)
Hello, When i run my platform www.taobao.nl through https://moz.com/researchtools/ose/ i see a lot of nasty adult domains that link to my platform. These give a very negative (spam)score to my platform. I already disavowed quite a few via Webmastertools, but they keep coming. When i check the names in the Whois, they don't even seem to exist! (anymore) What could cause this and how can i end it?? Thanks for your help! Sander
Link Explorer | | benhond1 -
Why don't links from Reddit show up in Site Explorer?
Some are follow and some are no follow. They show up in webmaster tools and not in site explorer. In addition why do some other links show up in webmaster tools but not site explorer?
Link Explorer | | RafeTLouis0 -
OSE - Link Opportunities
I'm checking out this new Link Opportunities feature for one of our sites and what I'm seeing right now is pretty disappointing. For Reclaim Links, everything listed is an internal link on our site. It's got tons of URLs from our old Iciniti structure (replatformed to Magento mid-March). It's crawling tons of stuff that's blocked in robots. There are no links from external domains in the first 5 pages. For Unlinked Mentions, it's showing tons of mentions - 33,566 to be exact, a ridiculous number - mostly from news sites like Forbes, WSJ, Guardian, CNN, etc. These sites are not mentioning us. It's set up to look only for our brand name or domain name, so I don't know how it's thinking there are all these nonexistent mentions. What's going on with Link Opportunities?
Link Explorer | | Kingof50 -
Drastic Monthly Fluctuations in Page Link Metrics
We have experienced very drastic changes in our root domain numbers and as a result, have seen an odd impact on our DA. The data does not seem at all reliable at all. We went one month with over 50 recorded root domains and the next month that dropped over 75%. It doesn't make sense to be paying for a monthly pro account when the data is so clearly unreliable. What is going on? Looking for a good answer before closing our account!
Link Explorer | | TVape0 -
Discrepancies between Mozbar data and OSE?
Hey everyone, I've been doing some research into keyword competition, and i've come across a few instances where the data that Mozbar provides differs pretty greatly from what open site explorer is telling me. An example: For this site: http://www.bcwsupplies.com When I have it returned in the serps, Mozbar is telling me there are 21,306 links & 218 linking root domains. But, when I check the backlink profile via OSE, it's only showing 3,020 links & 88 root domains. Can anyone shed some insight into this? Thanks!
Link Explorer | | RCDesign740 -
Is there an efficient way to use Open Site Explorer to find unnatural or harmful links
We have a new client with 8 sites that we would like to 301 to a new site. However, before doing so we want to make sure that the backlinks are not unnatural or harmful in any way. Using open site explorer (other than just looking at exact anchor text vs brand anchor text ratio) is there a way to determine low quality inbound sites linking in? or any type of links google will find manipulative?
Link Explorer | | Bryan_Loconto0