Open Site Explorer.. Bit of a let down?
-
Hi, not too sure if this is a discussion or rant!?
I’ve been following SEOMoz for a couple of years now. Testing their tools, reading their blogs, sharing their content, and watching the Whiteboard Friday videos (massive thumbs up to that one!). They are at the top of the game, no argument there.
Although I should have done so much earlier, I have eventually signed up to the pro version and am about to migrate all my clients over. But there is one major caveat which I’m not sure should exist.. Open Site Explorer.
Don’t get me wrong, OSE provides invaluable metrics, some that no other ‘crawler’ provides, but it is far from perfect. There are things that we SEOs need, and are hoping to get!
Some features SEOMoz could add:
- Ability to view a chart of ‘link types’ (i.e. blog post, social media, Press Release etc..) – Linkdex do this!
- Utilise a ‘fresh’ backlink index as Majestic SEO do (we do use Majestic alongside SEOMoz).
- Crawl more frequently – enough said!
- Index all ‘not so good’ backlinks – this will help identify what backlinks AREN’T helping.
I realise this is a lot easier said than done, and I’m sure SEOMoz are working on solutions.. just can’t wait till they launch!
What do you think? Is OSE all it’s cracked up to be? Could it be improved? Let me know.
Lee
-
Wow Rand, thanks for such a detailed and frank answer. I now have a better understanding of the issues you face.. it's much more complex than I imagined. Great to see that you're doing all you can, look forward to the improvements.
Have a great week! Lee
-
Hi Lee - thanks for bringing these up! I'll try to answer with regards to each of the items you've mentioned:
#1 - This is very tough at our scale (literally 1 trillion+ links, and in the latest index about to launch, 150 billion+ pages across nearly 200 million domains). However, you're correct that we're working towards a classification system that will lean on some seeding + machine learning + user input. I'd suspect 12-18 months before we can launch it, though.
#2 - This is not currently in the plans, at least not at full web scale. We will have Blogscape back up and running soon, and that crawls ~10mm+ fresh sources daily, so if you have a link/mention on a blog/forum/news site with any notoriety, we should be catching those and updating hopefully every 6-8 hours. Mozscape (aka Linkscape, our full web index), will continue to be at least 2-4 weeks between updates and will require 3-4 weeks of processing. We're trying some new things on the technology side, but it's a huge challenge to get to Google's scale and keep our metrics, sorting, views, etc. Majestic can do this with their index because they simply export links directly into the consumable portion of the app, but it limits the sorting/views/filters and ability to generate high quality metrics like PA/DA/mozRank/etc.
#3 - Yup. Definitely agree and working on it. We're trying some new hardware stuff, new parallelization of processing and everything under the sun to make it work.
#4 - Our next index will probably have a lot more of this (and we'll be watching carefully to see how this performs for our customers). We're also working to build a spam score that maps against what we've seen Google penalize/ban over time (currently doing a bunch of research there), so you can get a good sense of what things are likely to be hit over time.
Mozscape/OSE is a huge priority for us and we have 7 extremely talented folks working night and day (and a lot of weekends) to improve this. In the next 3-6 months, it will get massively better than it is today - the next new index is only a few days away now, and after that, we shouldn't ever have such a long period without an update (this round was an impossibly hard-to-fathom confluence of problems that we've taken many steps to prevent from ever happening again).
Thanks for your patience and the good questions.
-
Ignore the last reply John, have just tested Link Detective on a site that I know the backlink profile of.. it's massively inaccurate! The tool simply isn't able to identify correct link types as they suggest. Disappointing!
-
I've read they would be indexing 2 x the data Anthony, not 3 x. Suppose there must be some confusion out there, fingers crossed it's 3 x the data!
-
Cheers John, will take a look when I get the chance.. have you used it before? Is it accurate?
Enjoy the weekend! Lee
-
Thanks for all the feedback guys, would be good to hear SEOMoz's official stance on this!
-
You might find this tool helpful to analyse your downloaded report from OSE http://www.linkdetective.com/
-
I have to agree with Kevin on this one...
Items 2, 3, and 4 are days away.
Does it suck that OSE's Linkscape Index hasn't been updated since the end of February? YEP!
Will it be worth the wait next week when it is updated and contains three times the data as the previous crawls? YEP!
I consider SEOMoz a community, and though Linkscape's stagnant data is causing some issues, I realize that once those issues are resolved that I'll be getting more bang for my buck than I did before, and I'm totally willing to struggle for a month in order to have better tools at my disposal for many months to come.
Rand and SEOMoz are drastically improving the volume of data that we will have at our disposal. That takes time and resources. They aren't raising prices or making excuses, they're working out bugs and improving the service. I'll take it.
Anthony
-
Lee,
First, I need to say I love OSE - it's invaluable in my forensic audit work. Having said that, I would LOVE to see the "link types" chart. The biggest challenge I expect they'd face is properly detecting where a link sits on a page, or what class to put it in, given how inconsistent code is across the vast web. I've seen others attempt this and fail miserably: what should be identified as a sidebar (blogroll-type) link is often seen as something else, and what should be seen as a main content area link is often seen as a sidebar link (too many WP themes, for example, label their main content area divs as "sidebar2").
And that just gets worse / more challenging when the need comes in identifying if a site is really a blog, or some other site that happens to use WordPress as its platform.
Having said all that, I would be in heaven if the Moz team could get that data into OSE
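To make the "sidebar2" problem concrete, here is a minimal Python sketch of the kind of naive classifier that would trip over it. This is only an illustration under stated assumptions, not how OSE, Linkdex, or Link Detective actually work: the `LinkPlacementParser` class, the `SIDEBAR_HINTS` list, and the sample markup are all hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical hint list: substrings a naive classifier might treat as "sidebar".
SIDEBAR_HINTS = ("sidebar", "blogroll", "widget")

# Void elements never get a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class LinkPlacementParser(HTMLParser):
    """Label each <a href> as 'sidebar' or 'content' from its ancestors' id/class."""

    def __init__(self):
        super().__init__()
        self.stack = []   # one (tag, looks_like_sidebar) entry per open element
        self.links = []   # collected (href, label) pairs

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if tag == "a" and attr_map.get("href"):
            in_sidebar = any(flag for _, flag in self.stack)
            label = "sidebar" if in_sidebar else "content"
            self.links.append((attr_map["href"], label))
        if tag in VOID_TAGS:
            return
        names = f"{attr_map.get('id') or ''} {attr_map.get('class') or ''}".lower()
        self.stack.append((tag, any(hint in names for hint in SIDEBAR_HINTS)))

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating sloppy markup.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break


def classify_links(html):
    parser = LinkPlacementParser()
    parser.feed(html)
    parser.close()
    return parser.links


# Made-up theme markup: the main content div is (mis)named "sidebar2".
SAMPLE = """
<div id="sidebar2"><p>Great post about <a href="http://example.com/a">widgets</a></p></div>
<div id="sidebar"><ul><li><a href="http://example.com/b">Blogroll link</a></li></ul></div>
<div id="footer"><a href="http://example.com/c">Credits</a></div>
"""

for href, label in classify_links(SAMPLE):
    print(label, href)
```

The editorial link inside the "sidebar2" div comes back labelled as a sidebar link, which is exactly the kind of misclassification described above. Anything that relies on container names alone will inherit whatever naming habits each theme author had.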
-
From following all the talk, I'm pretty sure they are working on points 2, 3 & 4.
While all that is good, I personally feel they should concentrate on updating more often first, then move towards giving those extra features. The two month stretch between updates has been a real pain in the ass. But I'm sure they know that.
Related Questions
-
Why did Moz crawl our development site?
In our Moz Pro account we have one campaign set up to track our main domain. This week Moz threw up around 400 new crawl errors, 99% of which were meta noindex issues. What happened was that somehow Moz found the development/staging site and decided to crawl that. I have no idea how it was able to do this - the robots.txt is set to disallow all and there is password protection on the site. It looks like Moz ignored the robots.txt, but I still don't have any idea how it was able to do a crawl - it should have received a 401 Unauthorized and not gone any further. How do I a) clean this up without going through and manually ignoring each issue, and b) stop this from happening again? Thanks!
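One quick sanity check for a situation like this is to confirm what the robots.txt actually says to a given crawler. The sketch below uses Python's standard `urllib.robotparser`; the robots.txt contents and the `staging.example.com` host are assumptions for illustration, and `rogerbot` is the user-agent token Moz's crawler identifies itself with. It only tests the rules in the file. It cannot tell you whether a crawler honoured them, or whether the password protection really returned a 401.

```python
from urllib.robotparser import RobotFileParser

# Assumed staging robots.txt: a blanket disallow for every crawler.
STAGING_ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# staging.example.com is a placeholder host, not a real site.
for path in ("/", "/any/page"):
    url = "https://staging.example.com" + path
    print(path, is_allowed(STAGING_ROBOTS_TXT, "rogerbot", url))
```

If this reports that the URLs are disallowed and the crawl still happened, the next places to look are whether the staging host really serves that file at `/robots.txt` and whether the password prompt covers every path.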
Moz Pro | | MultiTimeMachine0 -
How to sift "site search" data from Google Analytics for trends
I apologize in advance if this has been asked a million times but I'm just not able to find anything on it for some reason. Probably the words "site" and "search" come up a lot in this area... Anyhow, my question: How do I find trends in "site search" data from Google Analytics? I set up "site search" a long time ago. I have thousands and thousands of searches people have made on my site logged and squirreled away. The plan was to review them on a weekly basis, find the trends and start writing content to address interests people seem to be having but not finding on our site. Sounded great at the time. The problem I have, of course, is that among my 10,000 searches (many shown in Google Analytics as "no-results:cats and dogs", etc), there are slight differences that make it difficult to total up search trends. Let's say the list is like this:
Term | Search Count
Cats | 500
Dogs | 500
Cat | 250
Dog | 250
Cat food | 5
Dog food | 5
Birds | 1
Bird | 1
Cats are great | 1
Cats are really great | 1
Dogs are great | 1
I like birds | 1
Seriously, I like Cats | 1
Turtles | 1
... 10,000 more entries, every single one only 1 search per term.
OK, so it looks like people like Cats and Dogs a lot, but also Birds and Turtles. But maybe there are snake searches. Maybe there are "cat pajamas" searches and variations on all of the above. Who knows what else is really trending in there??? The review of this data is MIND-NUMBING. Especially when you get into plurality and misspellings, this rabbit hole has no bottom. Is there a tool people in the SEO jam use to take a big ole CSV dump and have it magically sorted by at least potential trends? I mean, there's gotta be, right? And I'm silly for not already knowing what it is.
Moz Pro | | rtkl0 -
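For the site search question above, one rough approach is to break every search term into words, fold plurals together, and add up the search counts per word. The Python sketch below does that for a CSV export. It is a starting point under assumptions, not a known tool: the `term,count` column names, the `STOPWORDS` list, and the crude plural stripping in `normalize` are all made up for illustration, and misspellings are not handled at all.

```python
import csv
import io
import re
from collections import Counter

# Hypothetical export: one row per distinct search term with its search count.
SAMPLE_CSV = """\
term,count
Cats,500
Dogs,500
Cat,250
Dog,250
Cat food,5
Dog food,5
Birds,1
Bird,1
Cats are great,1
no-results:cats and dogs,1
"Seriously, I like Cats",1
"""

# Filler words that carry no topic signal (assumed list; extend as needed).
STOPWORDS = {"a", "and", "are", "i", "like", "really", "seriously", "the"}


def normalize(word):
    """Lowercase a word and crudely strip a plural 's' ('Cats' -> 'cat')."""
    word = word.lower()
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        word = word[:-1]
    return word


def topic_counts(rows):
    """Sum search counts per normalized word across all search terms."""
    totals = Counter()
    for term, count in rows:
        # Drop the "no-results:" prefix seen in the Google Analytics export.
        term = re.sub(r"^no-results:", "", term.lower())
        words = {normalize(w) for w in re.findall(r"[a-z]+", term)}
        for word in words - STOPWORDS:
            totals[word] += int(count)
    return totals


reader = csv.DictReader(io.StringIO(SAMPLE_CSV))
rows = [(row["term"], row["count"]) for row in reader]

for word, total in topic_counts(rows).most_common(5):
    print(word, total)
```

Counting by word means "Cat food" contributes to both "cat" and "food", so the totals are a ranking of topics rather than an exact tally of searches. A proper stemmer or fuzzy matching would be the next step for plurals like "puppies" and for misspellings.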
Is it possible to block Moz from crawling sites?
Hi, is it possible to stop Moz from crawling a site at the server level? Not that I am looking to do this or anything, but here's why I'm asking. I have been crawling a site that is managed (currently by 2 parties), and I noticed that this week pages crawled went from 80 (last week) to 1 page!! I know, what? See my image attached... and the issues all went to zero "0"....! So is it possible that someone can prevent Moz from crawling the site at the server level? I checked the robots.txt file on the site, but nothing there. I'm curious. dYNUwjd.jpg
Moz Pro | | co.mc0 -
Why doesn't OSE show results from sites like Wikipedia, YouTube, Twitter, etc.?
I know OSE used to provide link data from these domains. But I have been doing link profile lookups on sites that I know have links from these domains - and they don't show up in my results. Just to make sure, they don't even show up when I sort the sites by domain authority.
Moz Pro | | ProspectMX0 -
Strange nothing site ranking
Hi there. If you check who ranks for "credit cards", there is a website, https://www.woolworthsmoney.com.au/, that is in position #5. This is a highly competitive keyword, but OpenSiteExplorer.org cannot give me any backlinks for it; it says "No Data Available for this URL". The same thing happens in Market Samurai - no data. 1. What are these guys doing that the others are not? 2. How come OSE can't pull any data for it?
Moz Pro | | SearchProduct0 -
Alexa Ranking Sites
I found these two sites giving my competitor link juice: http://www.webnamelist.com/alexa/Alexa_186.html http://www.list-of-domains.org/alexa/Alexa_185.html I have seen these sites before and I just don't get why they are authoritative. The funny thing is I did a search for my competitor's link on the page and it's not showing up. Is this a problem in Site Explorer? Why is Site Explorer listing these sites as my competitor's best links when these links do not exist on their site?
Moz Pro | | SEODinosaur0 -
SEOmoz PRO campaign for HTTPS site
Hi all, I'm trying to configure a PRO campaign for an https website. But it won't work. The software says it found one redirect (from http to https, I guess), and that's it. So now I don't have any data.... Can anybody help me? Thnx! Martijn
Moz Pro | | Men4Media0 -
Certain Domains no longer recognised by open site explorer
Afternoon everyone (well, it is for me), We've been tracking the linking root domains to our domain for around 6 months now, and alongside tracking these domains we have also been engaging in link building activities. Our initial activities worked quite well, with linking domains rising from around 620 to 720 in 3 months. However, recently we have seen those numbers begin to fall away. In many cases it is because certain domains have stopped linking to us, have become nofollow sites or have been archived. But in some cases we can see the link is still there, and is being registered by other tools such as Yahoo or Webmaster Tools. My question is really: does anyone have a way of working out why a link that was in the past being registered by Open Site Explorer is no longer registering, and presumably no longer passing over juice to help with domain authority? What kind of signals should I be looking for to tackle a 'decaying' link? Looking forward to hearing your thoughts!
Moz Pro | | NigelJ0