How effective is OSE in crawling press release links?
-
How effective is OSE in crawling press release links?
We have released a few press releases recently (over the last couple of months) and OSE doesn't seem to have found them.
-
Hey There,
That could be a possibility. It is hard to say definitively given the nature of web crawlers. They just crawl links as they see them in random succession, so a lot of factors come into play.
Best,
Nick
SEOmoz -
Our releases have appeared on big sites like the Financial Post.
Is it possible they just get buried under other news so OSE can't find them? I know Google indexes these pages, we get the alerts.
-
Hey there,
Just so you know, here's how we compile our index: - We grab the most recent index. - We take the top 10 billion URLs with the highest MozRank (with a fixed limit on some of the larger domains). - We start crawling from the top down until we've crawled 59,000,000,000 pages (which is about 25% the amount in Google's index).
Therefore, if the site is not linked to by one of these seed URLs (or one of the URLs linked to by them in the next update) then it won't show up in our index. Sorry!
We update our Linkscape Index every 4 weeks. Crawling the entire Internet to look for links takes 2-3 weeks, but our crawlers are always in motion. When we need to start processing, we grab all the data they have collected and start processing which can take up to 3 weeks to determine which of those links are the most important. You can see our most recently updated schedule here: http://seomoz.zendesk.com/entries/345964-linkscape-update-schedule
Linkscape focuses on a breadth-first approach. Therefore we almost always have content from the homepage of websites, externally linked-to pages, and pages higher up in a site's information hierarchy. However, deep pages that are buried beneath many layers of navigation are sometimes missed and it may be several index updates before we catch all of these.
If our crawlers or data sources are blocked from reaching those URLs, they may not be included in our index (though links that point to those pages will still be available). Finally, the URLs seen by Linkscape must be linked-to by other documents on the web or our index will not include them.
For now, the best thing you can do to help your domain become indexed is to work on link building for links from sites with high mozrank.
Best,
Nick
SEOmoz -
This all depends on where the press releases have been posted.
If you've got the urls of the sites they're on it may be worth looking at these in OSE to see if SEOmoz has them indexed. However, don't forget that the SEOmoz index is not the same as google's. Just because it's not showing on OSE doesn't mean that G hasn't seen it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Link Activity to site keeps decreasing
Recently I have started seeing daily declines in the total inbound links to my site. 4 of my top 5 sources are all experiencing pretty significant declines and in fact, Pinterest has declined from a high of 16,295 links in Nov '15 to a current number that is 8,479 on 1/6/16. Any ideas on what could be causing these declines? I did upload a new sitemap in Nov, could this be the cause?
Reporting & Analytics | | ctripp10100 -
Ecommerce site product link. How to handle a link that doesn't exist.
Suppose we have this product A, and we just have a single item for this. When the item is sold out we do not want to show it on the website saying "out of stock". Instead we would like to remove the product from out store which will now result in a url that doesn't exist. And google webmaster tool and Moz analytic will show them as page not found after they crawl over the site. Should i be generating a new sitemap.xml and update ? How do i handle those pages that don't exist anymore ? Thanks
Reporting & Analytics | | MindlessWizard0 -
Pinterest links have been growing in Webmaster Tools
I've been using Pinterest for a couple of years now. It has been great in promoting our brand and generating referral traffic. My question is this. I've noticed in Webmaster Tools, in the "Links to your site" section, that my links from Pinterest has really shot up.It's gone from 3,500 links, 6 months ago, to over 23,000 recently. Those of you using Pinterest, have you seen a similar increase? Just wondering if Google is recognizing more of the links, even though they are "no-follow" links.
Reporting & Analytics | | tdawson090 -
2 days in the past week Google has crawled 10x the average pages crawled per day. What does this mean?
For the past 3 months my site www.dlawlesshardware.com has had an average of about 400 pages crawled per day by google. We have just over 6,000 indexed pages. However, twice in the last week, Google crawled an enormous percentage of my site. After averaging 400 pages crawled for the last 3 months, the last 4 days of crawl stats say the following. 2/1 - 4,373 pages crawled 2/2 - 367 pages crawled 2/3 - 4,777 pages crawled 2/4 - 437 pages crawled What is the deal with these enormous spike in pages crawled per day? Of course, there are also corresponding spikes in kilobytes downloaded per day. Essentially, Google averages crawling about 6% of my site a day. But twice in the last week, Google decided to crawl just under 80% of my site. Has this happened to anyone else? Any ideas? I have literally no idea what this means and I haven't found anyone else with the same problem. Only people complaining about massive DROPS in pages crawled per day. Here is a screenshot from Webmaster Tools: http://imgur.com/kpnQ8EP The drop in time spent downloading a page corresponded exactly to an improvement in our CSS. So that probably doesn't need to be considered, although I'm up for any theories from anyone about anything.
Reporting & Analytics | | dellcos0 -
What tools are people using to analyse clicked links
Hi, What tools do you use/recommend to analyse what/where links are being clicked on a page. I have seen a few mentions about CrazyEgg but are there any free (but good) tools around worth using?
Reporting & Analytics | | NeilD0 -
Subdomain and relative link paths cause crawl errors
I have a Wordpress blog on our subdomain and we use relative paths on our domain. It appears as though Google bot is crawling from the subdomain categories back to the domain relative paths. This of course results in hundreds of 404 pages. Any suggestions as to how to resolve this issue without changing the relative path structure of our domain? I can provide more information if need be. While I realize these issues are not that pressing, I'd obviously like to remove as many errors as possible. If anyone has encountered this problem, especially in Wordpress I'd really like to hear your solution or lack there of. Thank you in advance.
Reporting & Analytics | | BethA0 -
Unique root linking domains - clarification
Hi guys, In SEOMoz Search Ranking Factors, one of the the top ranking factors is number of unique root domains linking to the page: http://www.seomoz.org/article/search-ranking-factors#metrics-5 My question is: do these unique root domains need to be unique root domains liking to my domain also? E.g. www.mydomain.com/landingpage1/ already got a link from www.externaldomain.com If www.externaldomain.com has another link pointing to www.mydomain.com/landingpage2/ will this link be counted? If yes will the value be diluted as www.externaldomain.com has already linked to www.mydomain.com/ Many thanks. David
Reporting & Analytics | | sssrpm0 -
Tracking Effects of Internal Linking
Can anyone suggest the best way to track the effectiveness of internal linking on a website? Thanks!
Reporting & Analytics | | RishadShaikh590