Why does OSE (Open Site Explorer) have so little backlink data on Russian sites in the google.ru index?
-
OK, this seems very strange, but google.ru is indexing far more backlinks in its SERPs for a widget than OSE reports. Very little data is found in OSE for Russian-based sites.
Is this intentional, for marketing reasons?
(I could send raw data if needed!)
What is filtering out this vast google.ru data list? Does OSE cater only to the US/UK?
-
I don't know the exact composition of seed sites that SEOmoz uses, but it's believed to be similar, at least in theory, to the process that search engines use.
This would include seed sites such as universities, government agencies, and other highly respected sites. For example, the Red Cross would be considered a high-trust site that would make a good seed.
The set is updated frequently, as sites can be gamed. But I wouldn't go so far as to say that all newspapers and media sites go in the "bad" category. Some do, some don't.
It's funny that you mention a "cynical" algorithm. It's believed that Google does use something similar to a "SpamRank" algorithm.
-
Thanks for your response (last May - a while back I know)
I re-read your answer and it got me thinking: does OSE only pick seed sites that are trusted in the eyes of Google? Or does it begin its 'trust' journey from what other engines, or even humans, consider trustworthy?
If Google turned on a 'cynical' algorithm, surely it would treat most of the newspaper and media links as 'gamed', turn these off, reverse the polarities, and go looking for the remote, honest, and unbiased independent citations in deeper parts of the web. How does one judge what a seed site should be in each country? Is this a manual choice?
-
Hi Turkey,
Good observation. It's certainly not an intentional bias. OSE is designed as a world-wide link index. That said, there are a couple of forces at play here.
First, because of the vast number of links on the web, the OSE index is designed to find the most significant links. These are the links most likely to influence rankings. With this in mind, OSE usually contains only 40-60% of the links recorded by Google Webmaster Tools. This is true in all regions, including Europe and the United States.
The good news is that, most of the time, the missing links are the same ones that pass little or no value.
It is possible that certain "pockets" of the web get less bandwidth from Linkscape crawlers than other areas, due to natural selection. Linkscape starts with a "seed" of trusted sites, which includes the top sites from the last index, and then crawls "out" from there. This method means sites that are well linked to from the "seed" sites and other top-ranked sites have a higher likelihood of being crawled in each index. If Linkscape doesn't have good metrics for a particular site, it is most likely distant from the seeds.
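To make the "crawl out from the seeds" idea concrete, here is a minimal sketch, assuming a plain breadth-first crawl with a fixed page budget. This is not Linkscape's actual crawler, which isn't described in detail here; the function name, the toy link graph, the `.example` domains, and the budget value are all hypothetical and chosen only for illustration.

```python
from collections import deque


def crawl_from_seeds(link_graph, seeds, budget):
    """Breadth-first crawl outward from seed pages (illustrative only).

    link_graph: dict mapping a page to the list of pages it links to.
    seeds: list of trusted starting pages (hypothetical).
    budget: maximum number of pages to crawl in this index run.
    Returns {page: link distance from the nearest seed} for crawled pages.
    """
    distance = {}
    queue = deque()
    for seed in seeds:
        if seed not in distance:
            distance[seed] = 0
            queue.append(seed)

    crawled = {}
    # Pages closest to the seeds are dequeued first, so when the budget
    # runs out it is the most distant pages that go uncrawled.
    while queue and len(crawled) < budget:
        page = queue.popleft()
        crawled[page] = distance[page]
        for target in link_graph.get(page, []):
            if target not in distance:
                distance[target] = distance[page] + 1
                queue.append(target)
    return crawled


# Hypothetical toy web: one seed, with a chain leading to a remote site.
toy_graph = {
    "seed.example": ["news.example", "uni.example"],
    "news.example": ["blog.example"],
    "uni.example": ["blog.example"],
    "blog.example": ["regional.example"],
    "regional.example": ["remote.example"],
    "remote.example": [],
}

result = crawl_from_seeds(toy_graph, ["seed.example"], budget=4)
for page, hops in result.items():
    print(page, hops)
```

With a budget of 4, the crawl stops before it reaches `regional.example` and `remote.example`, even though both are reachable. In this simplified model, a site several hops from any seed can end up with little or no data in the index, which matches the "distant from the seeds" explanation above.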
Every so often the seed sites are adjusted, and there is a natural selection, in order to keep the index fresh and relevant. It's one of the best indexes of its kind in the world, but of course there is always room for improvement.
Hope this explanation helps. Thanks again for the feedback.