Open Site Explorer.. Bit of a let down?
-
Hi, not too sure if this is a discussion or rant!?
I’ve been following SEOMoz for a couple of years now. Testing their tools, reading their blogs, sharing their content, and watching the whiteboard Friday videos (massive thumbs up to that one!). They are at the top of the game, no argument there.
Although I should have done so much earlier, I have eventually signed up to the pro version and am about to migrate all my clients over. But there is one major caveat which I’m not sure should exist.. Open Site Explorer.
Don’t get me wrong, OSE provides invaluable metrics, some that no other ‘crawler’ provides, but it is far from perfect, there are things that we SEO’s need, and are hoping to get!
Some feature SEOMoz could add;
- Ability to view a chart of ‘link types’ (i.e. blog post, social media, Press Release etc..) – Linkdex do this!
- Utilise a ‘fresh‘ backlink index as Majestic SEO do. ( we do use majestic alongside SEOMoz)
- Crawl more frequently – enough said!
- Index all ‘not so good’ backlinks – this will help identify what backlinks AREN’T helping.
I realise this is a lot easier said than done, and I’m sure SEOMoz are working on solutions.. just can’t wait till they launch!
What do you think? IS OSE more than it’s cracked up to be? Could it be improved? Let me know
Lee
-
Wow Rand, thanks for such a detailed and frank answer, I now have a better understanding of the issues you face.. it's much more complex than I imagined. Great to see that you''re doing all you can, look forward to the improvements.
Have a great week! Lee
-
Hi Lee - thanks for bringing these up! I'll try to answer with regards to each of the items you've mentioned:
#1 - This is very tough at our scale (literally 1 trillion+ links, and in the latest index about to launch, 150 billion+ pages across nearly 200 million domains). However, you're correct that we're working towards a classification system that will lean on some seeding + machine learning + user input. I'd suspect 12-18 months before we can launch it, though.
#2 - This is not currently in the plans, at least not at full web scale. We will have Blogscape back up and running soon, and that crawls ~10mm+ fresh sources daily, so if you have a link/mention on a blog/forum/news site with any notoriety, we should be catching those and updating hopefully every 6-8 hours. Mozscape (aka Linkscape, our full web index), will continue to be at least 2-4 weeks between updates and will require 3-4 weeks of processing. We're trying some new things on the technology side, but it's a huge challenge to get to Google's scale and keep our metrics, sorting, views, etc. Majestic can do this with their index because they simply export links directly into the consummable portion of the app, but it limits the sorting/views/filters and ability to generate high quality metrics like PA/DA/mozRank/etc.
#3 - Yup. Definitely agree and working on it. We're trying some new hardware stuff, new parallelization of processing and everything under the sun to make it work.
#4 - Our next index will probably have a lot more of this (and we'll be watching carefully to see how this performs for our customers). We're also working to build a spam score that maps against what we've seen Google penalize/ban over time (currently doing a bunch of research there), so you can get a good sense of what things are likely to be hit over time.
Mozscape/OSE is a huge priority for us and we have 7 extremely talented folks working night and day (and a lot of weekends) to improve this. In the next 3-6 months, it will get massively better than it is today - the next new index is only a few days away now, and after that, we shouldn't ever have such a long period without an update (this round was an impossibly hard-to-fathom confluence of problems that we've taken many steps to prevent from ever happening again).
Thanks for your patience and the good questions.
-
Ignore the last reply John, have just tested Link Detective on a site that I know the link backlink profile of.. it's massively inaccurate! The tool simply isn't able to identify correct link types as they suggest. Dissapointing!
-
I've read they would be indexing 2 x the data Anthony, not 3 x. Suppose there must be some confusion out there, fingers crossed it's 3 x the data!
-
Cheers John, will take a look when I get the chance.. have you used it before? Is it accurate?
Enjoy the weekend! Lee
-
Thanks for all the feedback guys, would be good to hear SEOMoz's official stance on this!
-
You might find this tool helpful to analyse your downloaded report from OSE http://www.linkdetective.com/
-
I have to agree with Kevin on this one...
Items 2,3,and 4 are days away.
Does it suck that OSE's Linkscape Index hasn't been updated since the end of February? YEP!
Will it be worth the wait next week when it is updated and contains three times the data as the previous crawls? YEP!
I consider SEOMoz a community, and though Linkscape's stagnant data is causing some issues, I realize that once those issues are resolved that I'll be getting more bang for my buck than I did before, and I'm totally willing to struggle for a month in order to have better tools at my disposal for many months to come.
Rand and SEOMoz are drastically improving the volume of data that we will have at our disposal. That takes time and resources. They aren't raising prices or making excuses, they're working out bugs and improving the service. I'll take it.
Anthony
-
Lee,
First, I need to say I OSE - it's invaluable in my forensic audit work. Having said that, I would LOVE to see the "link types" chart. The biggest challenge I expect they'd face in that is properly detecting where a link is on a page, or what class to put it in due to how unspectacular code consistency really is across the vast web. For example, I've seen others attempt this and fail miserably ( what should be identified as a sidebar (blogroll type) link is often seen as something else, and what should be seen as a main content area link is often seen as a sidebar link (too many WP themes for example, label their main content area divs as "sidebar2" for example).
And that just gets worse / more challenging when the need comes in identifying if a site is really a blog, or some other site that happens to use WordPress as it's platform.
Having said all that, I would be in heaven if the Moz team could get that data into OSE
-
From following all the talk, I'm pretty sure they are working on points 2, 3 & 4.
While all that is good, I personally feel they should concentrate on updating more often first, then move towards giving those extra features. The two month stretch between updates has been a real pain in the ass. But I'm sure they know that.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Syntax for canonical tag for a default page in a sub directory (not subdomain) of a web site?
I'm getting two "no canonical tag" errors for the default page of a sub-directory default page (www and root) - again NOT a subdomain. Since the page is not the root of its own site, I tagged it as -- I have tried without the default.asp, but the error remains. Been doing this for 24 years and don't remember running across this before.
Moz Pro | | dcmike0 -
Site Crawl Error
In moz crawling error this message is appears: MOST COMMON ISSUES 1Search Engine Blocked by robots.txt Error Code 612: Error response for robots.txt i asked help staff but they crawled again and nothing changed. there's only robots.XML (not TXT) in root of my webpage it contains: User-agent: *
Moz Pro | | nopsts
Allow: /
Allow: /sitemap.htm anyone please help me? thank you0 -
Why is my site not ranking?
I have done various link building strategies, and used all white hat seo methods. SEOMoz even shows that my site is better than all my competition, but yet it doesn't rank ANYWHERE on Google. Can someone point me in the right direction as to why this is happening? I have attached the opensitexplorer screenshot showing my competition. 2013-02-13_1456_zpsb86ab0e1.png
Moz Pro | | locallyrank0 -
Tool Request - What keywords does a site rank for?
Hi folks, Something I've never had to do before so I'm not sure which tool to use, but is there a way to determine the keywords that a website currently ranks for? Hope someone can assist 🙂
Moz Pro | | ChristopherM1 -
Why site is getting ranked in top three positions on Bing and Yahoo but not in Google?
My site Business-Training-Schools.com is getting descent traffic from Bing and Yahoo but very little from Google. Can you please help me in finding out the root cause of it?
Moz Pro | | HQP0 -
Open Site Explorer vs. Linkscape... without mozRank passed how do you measure link value?
Hi all, Since Linkscape was retired we've been struggling without the mozRank passed data for our competitor research. However, I've read that it wasn't a particularly reliable metric to go on so my question is: what makes a link more valuable, in your opinion? I'm trying to work out the relation between Page Authority / Domain Authority / Number of Links etc., ideally to come up with some form of ranking system where we can say: Overall link value = ( (A x Page Authority) x (B x Domain Authority) x (C x Number of Links) ) / Number of Linking Domains or something like that. Is this a good way to go about it? What do you guys look at when using the OSE data? Thanks in advance for your help!
Moz Pro | | OmarKattan0 -
Site not showing data in Open Site Explorer
I have run open site explorer for a couple of sites but I am getting the "no data available for this URL" message. I have read the reasons listed on the error message but none of these apply to the particular sites I am looking for data from. Any thoughts as to why this might be? Thanks, Kwan
Moz Pro | | SocialKwan0