Googlebot encountered extremely large numbers of links on your site??? How Do I resolve this?
-
I am working on a site with over 30 million pages. Every time I get about One Million indexed I get a Message in the Google Webmasters Tools saying "Googlebot encountered extremely large numbers of links on your site"
The indexing then starts dropping like a Rock. I need to get the site indexed. Please Help!
-
Kenneth
I work with extremely large sites quite often. There's no single answer to this because it depends on what's going on as to why the Googlebot is breaking down in its crawl. For example - how many links exist on any single page? Is it 100, 300, 1000 or more? The more links on every page the more likely the bot will choke, though it's a lot better than it used to be.
Does the site validate for markup? Or could there be choke-points due to validation errors?
Is the content organized in an intelligent funnel structure, or is everything one level off the root domain?
Is there only one way for the bot to navigate deep into the site, or are there multiple methods to get down deep?
Is some of the content only linked from within in a way that many of those links are not discovered until the bot has to first go through six or eight other layers, some of which could be timing out just when the bot gets there?
How many quality inbound links point to pages deep within the site?
These are all questions that need to be asked and answered and that's just scratching the surface of the problem potential.
The most important thing is to try and think like the bot - if I go here, will I become overwhelmed? if I go here, will I hit a road-block?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should Google Trends Match Organic Traffic to My Site?
When looking at Google Trends and my Organic Traffic (using GA) as percentages of their total yearly values I have a correlation of .47. This correlation doesn't seem right when you consider that Google Trends (which is showing relative search traffic data) should match up pretty strongly to your Organic Traffic. Any thoughts on what might be going on? Why isn't Google Trends correlating with Organic Traffic? Shouldn't they be pulling from the same data set? Thanks, Jacob
Reporting & Analytics | | jacob.young.cricut0 -
How do sites without access to a site's analytical data, determine a site's organic traffic?
I've recently used a organic traffic checker that showed you your traffic compared to each google algo update. I was interested in how they derived the organic traffic totals for each month, without having access to our site's google analytics? I've since compared the data to historical google analytics data and it's not wrong, isn't 100% match either but isn't far from fact. So if they're predicting or making a guess, it's rather spot on, site crawlers and SERPs snapshots only provide so much info, I'm just wondering where they get the rest from and how?
Reporting & Analytics | | Deacyde0 -
Www.googleadservices.com/pagead/conversion_async.js what is this url doing on my site?
Hello Guys, I am using google tagmanager and i have configured adwords in tag manager now what i find is that this link - www.googleadservices.com/pagead/conversion_async.js showing on my homepage not in view source but when i do inspect element at that time it appears. So do you think after using google tag manager still i need to use the given link? Thanks, Raghu
Reporting & Analytics | | raghuvinder0 -
If Links not in GWT does that mean they havent been Indexed yet?
Hi we have had some success recently with increased rank positions, so I am trying to find our what's caused it? Am I correct in thinking that if google hasnt listed any new links in my GWT account that it hasnt indexed them yet and therefore not impacting my rankings? Thanks Ash
Reporting & Analytics | | AshShep10 -
Subdomain and relative link paths cause crawl errors
I have a Wordpress blog on our subdomain and we use relative paths on our domain. It appears as though Google bot is crawling from the subdomain categories back to the domain relative paths. This of course results in hundreds of 404 pages. Any suggestions as to how to resolve this issue without changing the relative path structure of our domain? I can provide more information if need be. While I realize these issues are not that pressing, I'd obviously like to remove as many errors as possible. If anyone has encountered this problem, especially in Wordpress I'd really like to hear your solution or lack there of. Thank you in advance.
Reporting & Analytics | | BethA0 -
Analytics tagging parameters effect on site SEO
One of the effective tools used in analytics tagging is the use of analytics parameters that starts with '?' or '#'. Example on site tagging: Main link: www.domainname.com./category/sub-category/ www.domainname.com./category/sub-category/?lid=topnav www.domainname.com./category/sub-category/?lid=sidenav All three links link to the same landing page, with an extra parameter. Using email or campaign tagging: www.domainname.com./category/sub-category/ www.domainname.com./category/sub-category/?utm_source=launch&utm_medium=email&utm_term=html&utm_content=getscoop&utm_campaign=hwdyrwm2012 With that we create many tagged links based on the campaign internal strategy. How do these effect indexing, and link juice? How do thy effect SEO in general?
Reporting & Analytics | | RAPPLA0 -
.com version and .org version of site
So i just discovered that a site I now managae has a .com version - as well as the .org version that is the one everyone knows about! I'm guessing this is not a good thing... So the whole site eg www.abc.org/example has a mirror page www.abc.com/example.... What should I do about this? Is it really bad to have 2 versions out there? Thanks!
Reporting & Analytics | | inhouseninja0 -
High percentage of nofollow links
Hi, I've just created my first campaign and noticed that on the competitive analysis our website is having a lot of nofollow links: more than 50% I did some research on the web to learn more about nofollow links, but I don't understand why this percentage is so big especially compared to the other websites in the analysis (less than 10%)? Anyone any ideas? Thanks!
Reporting & Analytics | | poupette0