How to download full list of internal links properly with OSE?
-
Hi Moz Community,
I am having trouble getting a full list of my internal links from Open Site Explorer. When I export the internal links to Excel, I get only about half of my URLs, and 80% of what I do get is bizarre, very old URLs.
Plus, these old URLs are not even showing in my campaign crawls. So the crawls themselves are different.
Any help on this would be greatly appreciated, since I am trying to get a full picture of the internal link structure of my site.
Thanks!
-
Agree, I love Screaming Frog!
-
Hi Ryan,
It would be best to use Screaming Frog if you want a true indication of your internal link structure.
https://www.screamingfrog.co.uk/seo-spider/
Cheers,
David
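Once you have a URL list from a crawler and the OSE export side by side, a small script can show where they disagree. This is only a rough sketch: the function names are made up for illustration, and it assumes you have already pulled the URL column out of each export into a plain list.

```python
from urllib.parse import urlsplit


def normalize(url):
    """Reduce a URL to host + path so trivial variants compare equal.

    Assumption: scheme, query string, and trailing slash do not matter
    for this comparison. Adjust if they matter on your site.
    """
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/") or "/"
    return f"{parts.netloc.lower()}{path}"


def compare_url_lists(index_urls, crawl_urls):
    """Split two URL lists into index-only, crawl-only, and shared URLs."""
    index_set = {normalize(u) for u in index_urls}
    crawl_set = {normalize(u) for u in crawl_urls}
    return {
        "only_in_index": sorted(index_set - crawl_set),
        "only_in_crawl": sorted(crawl_set - index_set),
        "in_both": sorted(index_set & crawl_set),
    }


if __name__ == "__main__":
    # Hypothetical sample data, not real export contents.
    ose_export = ["http://Example.com/old-page/", "http://example.com/about"]
    site_crawl = ["https://example.com/about/", "https://example.com/new"]
    for bucket, urls in compare_url_lists(ose_export, site_crawl).items():
        print(bucket, urls)
```

The "only_in_index" bucket is where the very old URLs from the export would be expected to show up, and "only_in_crawl" lists live pages the export is missing.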
-
Hey there! Tawny from Moz's Help Team here. I think I can help explain why you're not seeing all the links you might expect, and why you're seeing some older links that you're not seeing in your main Campaign Site Crawl.
Open Site Explorer and the Link Analysis page of Moz Pro Campaigns are both tied to our Mozscape index, which tends to update roughly once a month.
Just a few points on how we compile our index:
- We grab the most recent index.
- We take the top 10 billion URLs with the highest MozRank (with a fixed limit on some of the larger domains).
- We start crawling from the top down until we've crawled ~130 billion URLs.
The idea here is that we're focusing on the highest-quality links we can find, coming from the most prominent pages of authoritative sites. So, while you may not see every link for a site within our index, we're aiming to report the most valuable ones available!
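To make the selection step a bit more concrete, here is a toy sketch of "take the highest-scoring URLs, with a fixed limit per domain". It is not Moz's actual code: the function name, the plain-number scores, and the idea of applying the same cap to every host are all assumptions made for illustration.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def select_seed_urls(scored_urls, top_n, per_domain_cap):
    """Pick up to top_n URLs by score, keeping at most per_domain_cap per host.

    scored_urls maps URL -> score (a stand-in for MozRank; hypothetical input).
    """
    taken_per_host = defaultdict(int)
    seeds = []
    # Highest score first; ties are broken by URL so the result is deterministic.
    ranked = sorted(scored_urls.items(), key=lambda kv: (-kv[1], kv[0]))
    for url, _score in ranked:
        if len(seeds) >= top_n:
            break
        host = urlsplit(url).netloc.lower()
        if taken_per_host[host] >= per_domain_cap:
            continue  # this host already reached its fixed limit
        taken_per_host[host] += 1
        seeds.append(url)
    return seeds


if __name__ == "__main__":
    # Made-up scores for illustration only.
    scores = {
        "http://a.com/1": 9.0,
        "http://a.com/2": 8.0,
        "http://a.com/3": 7.0,
        "http://b.com/1": 6.0,
        "http://c.com/1": 5.0,
    }
    print(select_seed_urls(scores, top_n=3, per_domain_cap=2))
```

In this toy run, the third a.com page is skipped even though it outscores b.com, which is one way a lower-ranked internal page can end up missing from a link export.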
Most new sites and links will be indexed by our spiders and available in Mozscape and Open Site Explorer within 60 days, but some take even longer for many reasons, including the crawl-ability of sites, the number of inbound links to them, and the depth of pages in subdirectories. This tends to bias our index in favor of newer links. Linking data is only stored in the index for about 180 days, so unless the crawler has a compelling reason to return to your site and the sites linking to yours and rediscover those links, they can fall back out of our index again. That doesn't mean they're not out there affecting your SEO, just that our tools don't see them anymore.
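As a rough illustration of the retention window, the sketch below keeps only links that were seen within the last 180 days. The function name and the "last seen" dates are hypothetical, and treating "about 180 days" as an exact cutoff is an assumption; the real index may age links out differently.

```python
from datetime import date, timedelta

# "About 180 days" per the explanation above; the exact value is an assumption.
RETENTION_DAYS = 180


def links_still_in_index(last_seen_by_link, today):
    """Return the links whose last-seen date falls inside the retention window."""
    cutoff = today - timedelta(days=RETENTION_DAYS)
    return {link for link, seen in last_seen_by_link.items() if seen >= cutoff}


if __name__ == "__main__":
    # Hypothetical crawl history, not real index data.
    history = {
        "http://example.com/recent": date(2024, 6, 1),
        "http://example.com/stale": date(2023, 12, 1),
    }
    print(links_still_in_index(history, today=date(2024, 7, 1)))
```

A link that has not been rediscovered inside the window simply drops out of the result, even though it may still exist on the live site.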
You can see our most recent update schedule here, as well as some more technical metrics on our Mozscape API Updates page. You can also see when the last update happened and when the next one is scheduled on the Open Site Explorer (OSE) homepage at any time.
Since Moz focuses on quality of links over quantity, we are always focused on the most relevant links to display to our users. Because of this, it's possible that our index will leave out some of the lower-quality links (those that don't pass link juice). That might explain any discrepancies you see with what other tools are showing.
You can read more about how we build our index in our guide here.
I know this is a ton of information, so if you have any questions or if I didn't make anything clear enough, please don't hesitate to ask! You can always drop us a line at help@moz.com and we'll do our best to clear up any questions you might have.
Cheers!