10/14 Mozscape Index Update Details
-
Howdy gang,
As you might have seen, we've finally been able to update the Mozscape index after many challenging technical problems in the last 40 days. However, this index has some unique qualities (most of them not ideal) that I should describe.
First, this index still contains data crawled up to 100 days ago. We try to make sure that what we've crawled recently is stuff that we believe has been updated/changed, but there may be sites and pages that have changed significantly in that period that we didn't update (due to issues I've described here previously with our crawlers & schedulers).
Second, many PA/DA and other metric scores will look very similar to the last index because we lost and had problems with some metrics in processing (and believe that much of what we calculated may have been erroneous). We're using metrics from the prior index (which had good correlations with Google, etc) until we can feel confident that the new ones we're calculating are correct. That should be finished by the next index, which, also, should be out much faster than this one (more on that below). Long story short on this one - if your link counts went up and you're seeing much better/new links pointing to you, but DA/PA remain unchanged, don't panic - that's due to problems on our end with calculations and will be remedied in the next index.
Third - the good news is that we've found and fixed a vast array of issues (many of them hiding behind false problems we thought we had), and we now believe we'll be able to ship the next index with greater quality, greater speed, and better coverage. One thing we're now doing is taking every URL we've ever seen in Google's SERPs (via all our rank tracking, SERPscape, the corpus for the upcoming KW Explorer product, etc) and prioritizing them in Mozscape's crawl, so we expect to be matching what Google sees a bit more closely in future indices.
My apologies for the delay in getting this post up - I was on a plane to London for Searchlove - should have got it up before I left.
-
Thank you Jennita and Rand for your quick responses.
Great, lets keep our fingers crossed all goes well and I'm confident the Moz team can deliver it.
We all take a ride on the ebb and flow roller coaster from time to time, its what makes us learn more and overcome challenges.
Have a great day
Cheers,
Joseph -
Hi Joseph - yes, I can answer that. We took ~14 days to process this latest index, which is very good news. However, we are having some trouble with the uploading process again - our technical operations team is working with the big data team to try and uncover the source of these problems. If we can get it fixed and working (in the past, the upload step took ~12 hours, now it's taking us 3-4 days), we should have much more regular index releases.
Right now, we are feeling confident about Nov. 17th, and once we complete the upload we'll have a good picture about data quality and whether we might be able to release early (which we think is quite possible IF quality looks good and these upload issues get sorted).
-
Hi Joseph! I'm sure Rand will chime in as well, but I know our engineers are currently working on a write-up that explains the future of the index, plus some of the issues of the past. They're trying to get all the details in there, and hopefully we can get that published by early next week. What I know so far, is that they've fixed some issues and this index is looking much better. I'll let the engineers explain what that means though.
-
Hi Rand,
I hope you're well and life is good.
I was wondering if you can shed some light on the upcoming OSE update scheduled for the 17th of November.
In an earlier post you said "The good news is that we pad every estimate by nearly 2X. In a normal, problem-free index cycle, we can get it done in 12-14 days."
This would indicate you have potentially / already run an update and reviewing the data to ensure it it correct and relevant to the masses, before making a general release.
Can you advise if the trial update has been run, a success or not, if not do you think you'll have the issue solved for the 17th of November 2015?
I'm very eager to report back to my clients with credible insight using the data you provide.
Cheers,
Joseph Gourvenec -
That doesn't surprise me - Majestic has a larger index than Moz (theirs is actually the largest among active 3rd party indices, then Ahrefs, then us).
https://moz.com/blog/big-data-big-problems-link-indexes-compared this is a pretty good resource comparing the strengths and weaknesses of the various indices, and https://builtvisible.com/comparing-link-data-tools/ is also a good, third-party review of the three. There are strengths and weaknesses to each, but if raw link coverage is your goal, I recommend Majestic.
-
Rand, I've sent info and s/s to Kevin (at Moz) in an attempt to find some commonality between GSC, MOZ and HubSpot. 3rd party tools are showing more links that OSE, and are more in line with GSC. For example, Majestic shows 5x the linking domains that OSE does (on the root, not the www). Kevin points at this thread and cites the present OSE data. I'm trying to figure out why redirected/GWT 'moved' domains don't add up links, or even if they are supposed to? I suspect he's powerless. Who/what can I trust?
-
There are a variety of reasons that include:
- This index is somewhat smaller in total links crawled and URLs included
- We may have biased the crawlers towards sites/pages that are less likely to feature links from your site (this is particularly possible if the linking sites were on Chinese, Palau, or several other TLD extensions that we had previously over-indexed)
- The links we previously crawled may have been on relatively low MozRank pages that this index didn't crawl b/c we found fewer links to them (and thus lower MozRank - we tend to crawl in roughly descending MozRank order across the web).
As noted, the next index should see better coverage, fresher data, and better metrics, too. Please let us know if the problem persists and maybe we can compare your WM Tools links vs. our index to see what could be happening. Thanks and apologies.
-
Hey There,
Sorry this wasn't called out, but the number of links in this index is quite a bit lower than our previous releases, due to some of the technical issues Rand calls out here and in the September post (https://moz.com/community/q/september-s-mozscape-update-broke-we-re-building-a-new-index). The next release will remedy the issues you are currently seeing, as we have identified and fixed the bugs that caused the issues. Sorry again for any confusion here.
-
Hi Rand,
We have noticed among many of our clients that there has been a drastic drop in inbound links since the update to the Mozscape index. Our DA/PA have stayed the same, but links have dropped approximatey 60-70% across the boards. We have looked into the issue by looking in Google Webmaster Tools and have found that we have quite a few more links than are showing up in Moz. Could you explain why this might be happening?
Thanks!
-
We are both in the same boat there. I, too, desperately need this next index to get us on track and provide excellent value. If not, I think we're going to lose a lot of customers, and I'm not sure people will trust us for a long time on link data.
You have my deep and sincere apologies for the frustration and professional challenge Moz has caused. We have an obligation to do better, and I damn sure hope the team is up to delivering on that obligation.
-
I have read the comments and the frustrations associated with the recent issues and would like to suggest that with any software such as this there will be glitches from time to time. We currently use roughly 30 to 40 SaaS providers of many different types and I cannot think of one that hasn't had an issue at one time or another. Having been with Moz for over 5 years I will say that the issues are few and the response is always transparent with frequent updates. (I cannot say that for most other providers).
I would suggest to anyone who is doing client work that as soon as you can afford to have more than one service provider you do so. There are two basic reasons: if there is an issue you always have back up and second, you get the benefit of being able to compare data. Personally, I find this invaluable for client work. I do not feel disloyal to Moz, I just know that every piece of software has its own limitations.
Good luck to all with the current travails.
-
Hi Rand
Firstly I have considered MOZ to be the best SEO information gathering tool of its kind for some years now; and whenever I have taken on a new client or role have recommended to those companies not already using it, to set up an account with MOZ.
I started a new digital marketing role in July which comes with six month KPIs. I've been given a budget to improve SERPs & external inbound links; and right now I'm feeling pretty frustrated and embarrassed as, similar to Joseph I feel the lack of quality data from MOZ over the past few months has left me with egg on my face and having to prove my worth.
I need the next index, which is suggested to be 14 November, to feel me with confidence and my reports with valid data or I will be forced to look elsewhere. I hope it doesn't come to that.
-
Hi Joseph - you'll get no arguments from me on any of these fronts. I think if you've been using Moz exclusively or primarily for the link data component, you should request a refund by emailing help@moz.com (they'll be happy to provide one). Totally concur that our service the past 60 days on the link data front has not been acceptable.
-
Hi Team Moz,
I have a lot of respect for Team Moz and all the efforts behind the senses you must do to deliver a what has been a great service.
The only issue I have now, is that I've paid for two months subscription and still nothing new to report to my clients and it is:
1, Making me look foolish "like i have the wrong provider"
2. You look not so credible "because I tell clients where we get our data from and explained last month, the last time an index update didn't happen in full was years ago"
3. Me out of pocket
a. From subscription
b. Staffing for data analysts to ensure we maximise the new data gained from Moz each "4 to 6 weeks" update.At the end of the day we pay a fee for the data in OSE expecting a reasonable level information to be delivered for the fee we pay.
Rand sometimes honesty is good but "Then things took a turn for the worse and we've been struggling ever since." and your other comments isn't filling me with confidence that the next OSE release is going to be an improvement on the last.
As much as I respect you, the Moz team, the service, historic data, past efforts and overall community. I pay my subscription for credible, relevant and up to data data from OSE to support my digital activities and strategies which I've not received for the last two months.
I'm sure things will improve because up is the only way from here I feel and it needs to be too, you can't expect customers to continue to pay for the core service Moz is known for and offer; that the SEO community rely on to perform and deliver on their client expectations.
I purposely don't use other suppliers for this type of data from starting out in SEO to this day running my own company, because I feel a certain amount of loyalty to Moz just as Donna does and the rest of the community most likely do. But there does become a point when options need to be revised.
I hope this message hasn't come across in a disrespectful manner to you or the team at Moz, I just want the best data and to deliver on the expectations I have set my clients; based on that I tell them we only use the best data provider and market leader in the world which is your company.
I look forward to seeing an improvement on the next update.
Cheers,
Joseph -
Hi Donna - you are most certainly not alone in your frustration. I would call my own feelings bordering on desperation. I'm frustrated, angry, nervous, guilty, and overwhelmed with a sense of powerlessness. It seems that every time we think we've identified a problem at the root of our Mozscape issues, things just get worse and new problems we never imagined arise.
On the padding issue, I have good news and depressing news. The good news is that we pad every estimate by nearly 2X. In a normal, problem-free index cycle, we can get it done in 12-14 days.... And yet, we never estimate less than 30-31 days for an index release. In the early part of this year, you might recall that we had a number of indices released back to back in that 2-3 week window. Then things took a turn for the worse and we've been struggling ever since.
I want to be honest - my belief is that we are going to get better, but the evidence of the last 6 months is against me. I want to believe my team and I know they are trying hard and doing everything they can to get this fixed. However, I think it's wise to have skepticism given the trajectory of the recent past.
Hope that's helpful and thank you for the comment.
-
Hi Donna,
I think it's fair to feel that way, it's definitely frustrating on all ends. While we do try hard to be as open and upfront as we can with information, we can most definitely work harder on getting it out sooner. I also very much appreciate the kind words about the community, and that you took the time to leave your thoughts. I'm sure many folks feel the same way. I'll let Rand (or others) jump in and respond as well, but I wanted to say thanks!
Jen
-
Before I comment, I want to say I am a loyal member of and contributor to the Moz community. In my opinion, it's unprecedented in it's openness, honesty and willingness to help others.
But I also want to express frustration cause I think I'm probably not alone in feeling it. While I understand and appreciate there are issues and Moz is doing everything it possibly can to address them, updates keep slipping and notifications are only given after the fact. We only know if there's going to be an update delay when the deadline passes and there's no change to API data.
Remember Scotty on the original Star Trek series? Chief engineer Montgomery "Scotty" Scott had a reputation for being a miracle worker because he routinely padded his estimates. I see the next updated scheduled for November 14 and am worried.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How frequently is the Search Volume update for each keyword? API for Search Volume?
Subject pretty much says it all... How frequently is the Search Volume update for a given keyword? Is there an API call that would include keyword-specific Search Volume for one or more keywords? Thank you.
API | | ToddLevy0 -
Sitemaps and Indexed Pages
Hi guys, I created an XML sitemap and submitted it for my client last month. Now the developer of the site has also been messing around with a few things. I've noticed on my Moz site crawl that indexed pages have dropped significantly. Before I put my foot in it, I need to figure out if submitting the sitemap has caused this.. can a sitemap reduce the pages indexed? Thanks David.
API | | Slumberjac0 -
Why does OSE show old data (Previous update results)?
Moz api started to show July 13 update results for my website. I checked it 2 days ago and saw all new established links and updated DA PA for July 27 update. But last 2 days both Moz Api and OSE main page show July 13 update results. Is there a maintenance or mismatching error between old and new databases?
API | | cozmic0 -
The April Index Update is Here!
Don’t adjust your monitors, or think this is an elaborate April Fool’s joke, we are actually releasing our April Index Update EARLY! We had planned to release our April Index Update on the 6th, but processing went incredibly smoothly and left us the ability to get it up today. Let’s dig into the details of the April Index Release: 138,919,156,028 (139 billion) URLs. 746,834,537 (747 million) subdomains. 190,170,132 (190 million) root domains. 1,116,945,451,603 (1.1 Trillion) links. Followed vs nofollowed links 3.02% of all links found were nofollowed 61.79% of nofollowed links are internal 38.21% are external Rel canonical: 28.14% of all pages employ the rel=canonical tag The average page has 90 links on it 73 internal links on average. 17 external links on average. Don’t let me hold you up, go dive into the data! PS - For any questions about DA/PA fluctuations (or non-fluctuations) check out this Q&A thread from Rand:https://moz.com/community/q/da-pa-fluctuations-how-to-interpret-apply-understand-these-ml-based-scores
API | | IanWatson9 -
What happens if I go over my Mozscape api free limits?
Hello,
API | | FPK
I just started using the free version of Mozscape and I fully understand there are limits and charges under this category. However to avoid any costly surprises, I like to know:
What happens when I get near my usage limit?
What happens when I just hit the limit?
What happens when I past the limit? Along with my questions, Is there any alert systems to let me know when I get to the range of said limit, like an email?0 -
Two days since the supposed update
And still no update. Less a question, more a comment. Feeling a bit of deja vu here.
API | | pfrance3 -
August 3rd Mozscape Index Update (our largest index, but nearly a monthly late)
Update 5:27pm 8/4 - the data in Open Site Explorer is up-to-date, as is the API and Mozbar. Moz Analytics campaigns are currently loading in the new data, and all campaigns should be fully up-to-date by 4-10pm tomorrow (8/5). However, your campaign may have the new data much earlier as it depends on where that campaign falls in the update ordering. Hey gang, I wanted to provide some transparency into the latest index update, as well as give some information about our plans going forward with future indices. The Good News: This index, now that it's delivered, is pretty impressive. Mozscape's August index is 407 Billion URLs in size, nearly 100 Billion (~25%) bigger than our last record index size. We indexed 2.18 trillion links for the first time ever (prior record was 1.54 trillion). Correlations for Page Authority have gone up from 0.319 to 0.333 in the latest index, suggesting that we're getting a slightly more accurate representation of Google's use of links in rankings from this data (DA correlations remain constant at 0.185) Our hit ratio for URLs in Google's SERPs has gone up considerably, from 69.97% in our previous index to 78.66% in the August update. This indicates we are crawling and indexing more of what Google shows in the search results (a good benchmark for us). Note that a large portion of what's missing will be things published in the last 30-60 days while we were processing the index (after crawling had stopped). The Bad News: August's index was late by ~25 days. We know that reliable, consistent, on-time Mozscape updates are critically important to everyone who uses Moz's products. We've been working hard for years to get these to a better place, but have struggled mightily. Our latest string of failures was completely new to the team - a bunch of problems and issues we've never seen before (some due to the index size, but many due to odd things like a massive group of what appear to be spam domains using the Palau TLD extension clogging up crawl/processing, large chunks of pages we crawled with 10s of thousands of links which slow down the MozRank calculations, etc). While there's no excuse for delays, and we don't want to pass these off as such, we do want to be transparent about why we were so late. Our future plans include scaling back the index sizes a bit, dealing with the issues around spam domains, large link-list pages, some of the odd patterns we see in .pl and .cn domains, and taking one extra person from the Big Data team off of work on the new index system (which will be much larger and real-time rather than updated every 30 days) to help with Mozscape indices. We believe these efforts, and the new monitoring systems we've got will help us get better at producing high quality, consistent indices. Question everyone always asks: Why did my PA/DA change?! There are tons of reasons why these can change, and they don't necessarily mean anything bad about your site, your SEO efforts, or whether your links are helping you rank. PA and DA are predictive, correlated metrics that say nothing about how you're actually performing. They merely map better than most metrics to Google's global rankings across large SERP sets (but not necessarily your SERPs, which is what you should care about). That said, here's some of the reasons PA/DA do shift: The domains/pages with the highest PA/DA scores gain even faster than most of the domains below them, making it harder each index to get higher scores (since PA/DA are on a logarithmic scale, this is smoothed out somewhat - it would be much worse on a conventional scale, e.g. Facebook.com 100, everyone else 0.0003). Google's ranking algorithm introduces new elements, changes, modifies what they care about, etc. Moz crawls a set of the web that does or doesn't include the pages that are more likely to point to a given domain than another. Although our crawl tends to be representative, if you've got lots of links from deep pages on less popular domains in a part of the web far from the mainstream, we may not consistently crawl those well (or, we could overcrawl your sector because it recently received powerful links from the center of the web). My advice, as always, is to use PA/DA as relative scores. If your scores are falling, but your competitors' are falling more, that's not a bad thing. If your scores are rising, but your competitors' are rising faster, they're probably gaining ground on you. And, if you're talking about score changes in the 1-4 points range, that's not necessarily anything but noise. PA/DA scores often shift 1-4 points up or down in a new index so don't sweat it! Let me know if you've got more questions and I'll do my best to answer. You can also refer to the API update page here: https://moz.com/products/api/updates
API | | randfish8 -
Is it possible to send a string/id to an api call and return it?
I am integrating your mozscape API into a project of mine and I am in need of seeing if I can send and return a variable with an associated call. For instance when I am sending some batch URL's I am requesting the [upa] and [pda] back. I am also requesting the [uu] (canonical URL) but it always seems to come back blank. So the issue I am having is if I send a batch of urls to the API how do I know how to associate a [upa] or [pda] with a link? My thoughts are if I could send a ID with a link and return that ID back then I know how to associate the [upa] or [pda] with that particular link. Anyone know?
API | | cbielich1