Open Site Explorer.. Bit of a let down?
-
Hi, not too sure if this is a discussion or rant!?
I’ve been following SEOMoz for a couple of years now. Testing their tools, reading their blogs, sharing their content, and watching the whiteboard Friday videos (massive thumbs up to that one!). They are at the top of the game, no argument there.
Although I should have done so much earlier, I have eventually signed up to the pro version and am about to migrate all my clients over. But there is one major caveat which I’m not sure should exist.. Open Site Explorer.
Don’t get me wrong, OSE provides invaluable metrics, some that no other ‘crawler’ provides. But it is far from perfect, and there are things that we SEOs need and are hoping to get!
Some features SEOMoz could add:
- Ability to view a chart of ‘link types’ (i.e. blog post, social media, Press Release etc..) – Linkdex do this!
- Utilise a ‘fresh’ backlink index as Majestic SEO do (we do use Majestic alongside SEOMoz).
- Crawl more frequently – enough said!
- Index all ‘not so good’ backlinks – this will help identify what backlinks AREN’T helping.
I realise this is a lot easier said than done, and I’m sure SEOMoz are working on solutions.. just can’t wait till they launch!
What do you think? Is OSE all it’s cracked up to be? Could it be improved? Let me know.
Lee
-
Wow Rand, thanks for such a detailed and frank answer. I now have a better understanding of the issues you face.. it's much more complex than I imagined. Great to see that you're doing all you can, look forward to the improvements.
Have a great week! Lee
-
Hi Lee - thanks for bringing these up! I'll try to answer with regards to each of the items you've mentioned:
#1 - This is very tough at our scale (literally 1 trillion+ links, and in the latest index about to launch, 150 billion+ pages across nearly 200 million domains). However, you're correct that we're working towards a classification system that will lean on some seeding + machine learning + user input. I'd suspect 12-18 months before we can launch it, though.
#2 - This is not currently in the plans, at least not at full web scale. We will have Blogscape back up and running soon, and that crawls ~10mm+ fresh sources daily, so if you have a link/mention on a blog/forum/news site with any notoriety, we should be catching those and updating hopefully every 6-8 hours. Mozscape (aka Linkscape, our full web index), will continue to be at least 2-4 weeks between updates and will require 3-4 weeks of processing. We're trying some new things on the technology side, but it's a huge challenge to get to Google's scale and keep our metrics, sorting, views, etc. Majestic can do this with their index because they simply export links directly into the consumable portion of the app, but it limits the sorting/views/filters and ability to generate high quality metrics like PA/DA/mozRank/etc.
#3 - Yup. Definitely agree and working on it. We're trying some new hardware stuff, new parallelization of processing and everything under the sun to make it work.
#4 - Our next index will probably have a lot more of this (and we'll be watching carefully to see how this performs for our customers). We're also working to build a spam score that maps against what we've seen Google penalize/ban over time (currently doing a bunch of research there), so you can get a good sense of what things are likely to be hit over time.
Mozscape/OSE is a huge priority for us and we have 7 extremely talented folks working night and day (and a lot of weekends) to improve this. In the next 3-6 months, it will get massively better than it is today - the next new index is only a few days away now, and after that, we shouldn't ever have such a long period without an update (this round was an impossibly hard-to-fathom confluence of problems that we've taken many steps to prevent from ever happening again).
Thanks for your patience and the good questions.
-
Ignore the last reply John, have just tested Link Detective on a site that I know the backlink profile of.. it's massively inaccurate! The tool simply isn't able to identify correct link types as they suggest. Disappointing!
-
I've read they would be indexing 2 x the data Anthony, not 3 x. Suppose there must be some confusion out there, fingers crossed it's 3 x the data!
-
Cheers John, will take a look when I get the chance.. have you used it before? Is it accurate?
Enjoy the weekend! Lee
-
Thanks for all the feedback guys, would be good to hear SEOMoz's official stance on this!
-
You might find this tool helpful to analyse your downloaded report from OSE http://www.linkdetective.com/
-
I have to agree with Kevin on this one...
Items 2, 3, and 4 are days away.
Does it suck that OSE's Linkscape Index hasn't been updated since the end of February? YEP!
Will it be worth the wait next week when it is updated and contains three times the data as the previous crawls? YEP!
I consider SEOMoz a community, and though Linkscape's stagnant data is causing some issues, I realize that once those issues are resolved I'll be getting more bang for my buck than I did before, and I'm totally willing to struggle for a month in order to have better tools at my disposal for many months to come.
Rand and SEOMoz are drastically improving the volume of data that we will have at our disposal. That takes time and resources. They aren't raising prices or making excuses, they're working out bugs and improving the service. I'll take it.
Anthony
-
Lee,
First, I need to say I love OSE - it's invaluable in my forensic audit work. Having said that, I would LOVE to see the "link types" chart. The biggest challenge I expect they'd face in that is properly detecting where a link is on a page, or what class to put it in, due to how unspectacular code consistency really is across the vast web. I've seen others attempt this and fail miserably: what should be identified as a sidebar (blogroll type) link is often seen as something else, and what should be seen as a main content area link is often seen as a sidebar link (too many WP themes, for example, label their main content area divs as "sidebar2").
And that just gets worse / more challenging when the need comes in identifying if a site is really a blog, or some other site that happens to use WordPress as its platform.
Having said all that, I would be in heaven if the Moz team could get that data into OSE.
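To make the "sidebar2" problem concrete, here is a minimal Python sketch of the naive approach: label each link by the nearest ancestor whose id or class contains a hint word. The hint lists, `label_for`, `LinkClassifier`, and `classify_links` are all hypothetical names made up for illustration; this is not how OSE, Linkdex, or any other tool actually classifies links.

```python
from html.parser import HTMLParser

# Hypothetical hint words; real themes name their containers inconsistently.
SIDEBAR_HINTS = ("sidebar", "blogroll", "widget")
CONTENT_HINTS = ("content", "post", "entry", "article")

# Elements that never have a closing tag, so they are never pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


def label_for(attrs):
    """Guess a region label from an element's id/class, or None if no hint matches."""
    name = f"{attrs.get('id') or ''} {attrs.get('class') or ''}".lower()
    if any(hint in name for hint in SIDEBAR_HINTS):
        return "sidebar"
    if any(hint in name for hint in CONTENT_HINTS):
        return "content"
    return None


class LinkClassifier(HTMLParser):
    def __init__(self):
        super().__init__()
        self.stack = []  # (tag, label) for each currently open element
        self.links = []  # (href, label) for each link found

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            # Use the label of the nearest labelled ancestor, if any.
            label = next((lbl for _, lbl in reversed(self.stack) if lbl), "unknown")
            self.links.append((attrs["href"], label))
        if tag not in VOID_TAGS:
            self.stack.append((tag, label_for(attrs)))

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating unclosed children.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break


def classify_links(html):
    parser = LinkClassifier()
    parser.feed(html)
    return parser.links
```

Fed a theme that wraps its main article in `<div id="sidebar2">`, this labels every editorial link inside it as "sidebar", which is exactly the kind of misclassification described above. Markup names alone are not a reliable signal.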
-
From following all the talk, I'm pretty sure they are working on points 2, 3 & 4.
While all that is good, I personally feel they should concentrate on updating more often first, then move towards giving those extra features. The two month stretch between updates has been a real pain in the ass. But I'm sure they know that.