How can Google index a page that it can't crawl completely?

danatanseo

I recently posted a question regarding a product page that appeared to have no content. [http://www.seomoz.org/q/why-is-ose-showing-now-data-for-this-url]

What puzzles me is that this page got indexed anyway. Was it indexed based on Google knowing that there was once content on the page? Was it indexed based on the trust level of our root domain?

What are your thoughts? I'm asking not only because I don't know the answer, but because I know the argument is going to be made that if Google indexed the page then it must have been crawlable...therefore we didn't really have a crawlability problem.

Why would Google index a page it can't crawl?

OlegKorneitchouk

Yep. If you had links to that page from other authority pages, the PageRank/authority would transfer over, even with the indexing issue.

danatanseo

Awesome explanation, Oleg. We had some other product pages (128, to be exact) that fell victim to the same coding error. I found it interesting that not only were most of them indexed, some of them actually had PageAuthority and/or PageRank.

I am thinking Google may have allocated authority to some of these product pages because they had decent link profiles, despite Googlebot not being able to access the whole page. Is that possible?

OlegKorneitchouk

It has crawled and indexed the page - check out the cached copy.

If you view the source, you can see that there is some HTML code, but it seems to get cut off prematurely (perhaps due to a coding error). That HTML was enough to get the page indexed, but I would be surprised if it ranks for any terms. For example, a search for the page's title, "Shure SLX24/SM58 | Wireless Microphone System - CCI Solutions", does not return the correct URL.

So Google recognizes that a page is there but thinks it is blank, which is why it is indexed but won't rank for anything.
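
A quick way to spot markup that gets cut off like this is to parse the page source and list the tags that were opened but never closed. The following is only a minimal sketch in Python using the standard library's `html.parser`. The names `OpenTagTracker` and `unclosed_tags` and the sample markup are hypothetical and not taken from the actual page.

```python
from html.parser import HTMLParser

# Void elements never take a closing tag, so they are not tracked.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class OpenTagTracker(HTMLParser):
    """Keeps a stack of tags that have been opened but not yet closed."""

    def __init__(self):
        super().__init__()
        self.open_tags = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.open_tags.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, if there is one.
        if tag in self.open_tags:
            while self.open_tags.pop() != tag:
                pass


def unclosed_tags(html):
    """Return the tags still open once the whole document has been read."""
    tracker = OpenTagTracker()
    tracker.feed(html)
    tracker.close()
    return tracker.open_tags


# Hypothetical sample markup: one complete page, one cut off mid-document.
complete = "<html><head><title>T</title></head><body><p>Hi</p></body></html>"
truncated = "<html><head><title>T</title></head><body><div><h1>Product"

print(unclosed_tags(complete))   # no tags left open
print(unclosed_tags(truncated))  # html, body, div and h1 are still open
```

If `html` or `body` shows up in the result, the source most likely ends before the document does. Tags such as `p` or `li` can legitimately be left unclosed in valid HTML, so on their own they are not a sign of truncation.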
