Website pages missing from seomoz crawl
-
Hi!
I just added a website and the crawling result output has only 42 pages but my website has about 75 pages. What am i missing?
Thanks!
-
Is there any chance you could email me your sitemap as produced by wordpress? info[at]pathfindermedia[dot]co[dot]uk I'll take a closer look at whats being excluded.
-
I'll do that Keri. Thanks!
-
I don't think so:
User-agent: * Disallow: /cgi-bin Disallow: /wp-admin Disallow: /wp-includes Disallow: /wp-content/plugins Disallow: /wp-content/cache Disallow: /wp-content/themes Disallow: /trackback Disallow: /feed Disallow: /comments Disallow: /category/*/* Disallow: */trackback Disallow: */feed Disallow: */comments Disallow: /*?* Disallow: /*? Allow: /wp-content/uploads # Goole Bot User-agent: Googlebot Disallow: /*/feed/$ Disallow: /*/feed/rss/$ Disallow: /*/trackback/$ # Google Image User-agent: Googlebot-Image Disallow: Allow: /*
Regards.
-
Another possibility could be your robots.txt file, is it blocking some directories?
-
Hi! It's probably best to email help@seomoz.org about this. You can give them your full URL and they can help figure out why Roger isn't crawling everything. Thanks!
-
Anyone?
-
Hi!
One thing i figured out is that the crawling on both seomoz and xml-sitemaps.com returm the same 42 pages.Here's my website homepage URL - http://bit.ly/TGjpVx
And a couple of missing pages from 36 at total - http://bit.ly/WM3Rwe and http://bit.ly/VpHJ9H.
Regards,
OV -
Hi!
My website has a xml sitemap, generated by Google XML sitemap Wordpress plugin, with all the 75 pages.The crawler http://www.xml-sitemaps.com/ also outputs just 42 pages. I think it has someting to do with the blogs being archived (?).
I need to solve this and don't know how?!
Thanks for your help do.
Regards.
-
Actually, the SEOmoz crawler should be crawling all of the pages -- it's OSE that doesn't crawl everything, but the crawler from your campaign should show all that it could find. If you email help@seomoz.org they'd be happy to help you figure it out, or if you want to share your URL here along with some pages that are missing, the Q&A people could help diagnose things too.
-
Hi Ovieira,
This is not necessarily an indication that there are pages that are hidden from crawlers and the missing pages could simply be low priority for the moment. Or could have been created after the initial crawl had taken place.
The best way to check is to run a crawler like http://www.xml-sitemaps.com/ and that will give you a better idea. If the sitemap generates a complement of your pages then it's probably just a case of waiting until the next Moz crawl.
Mulith
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Home page cannibal
I was wondering if others had the same problem I have. It appears Google loves that home page too much and I'm having a difficult time getting it to rank the page I really want. And that happens if a keyword I want to rank for only appears on the home page one time with a keyword density of .1%. Take vanillaqueen.com for example. The home page ranks on the first page for "bulk vanilla beans" and not http://vanillaqueen.com/shop/category/vanilla-beans/ or http://vanillaqueen.com/five-reasons-why-buying-bulk-vanilla-makes-good-sense/ And I'll add another one that I recently took on. This is a personal injury attorney in a large city so there is a ton of competition who have been doing SEO for a very long time. (Fortunately he also does business and civil litigation law to keep the business going). Last month, according to webmaster tools, he got a couple of clicks (hey, it's something!) on "personal injury attorney [his city]" on page 2 in the SERPS, but it was his home page. http://bit.ly/1Gvumlm **In this case I don't mind people landing on the home page, but does the fact that another page that is much better optimized for those keywords indicate a penalty on that page? And is his rank lower because the better page is not ranking and Google has to find the next best thing in the home page? ** Has anyone else experienced that and what have you done to get Google to not go home? P.S. The law site is a huge challenge because of the competition. Any help you pros out there can offer to get this underdog out of hiding will be much appreciated. We're starting a smart, strategic content marketing plan now that I'm very excited about.
On-Page Optimization | | katandmouse1 -
Is it better to have an hreflang go to the home page in a different language if there's no corresponding page
If I have some pages in English, but not in Spanish on my website: Should my hreflang go to the home page on the Spanish site? Or should I not have an "es-MX" hreflang for that page? Ideally I would have all the pages translated, but this has not all been done yet.
On-Page Optimization | | RoxBrock0 -
Can you canonical from one domain page to a different domain page
We are a boating site and have our main site with all it's products. We have an engine section within our main site. But we also have an outside domain, specific to a certain manufacturer of engines. So we want our customers to still find the engine information for this manufacturer within our main site, as well as find the manufacturer targeted engine site in the SERPS. My question is this: Can I canonical those pages within our main site to pages on the outside domain? Or does are canonicals to be used only within the same domain? Thanks,
On-Page Optimization | | tdawson090 -
Category listing page coming above product pages
A new SEO client we have taken on seem to be hitting most of the points right on with their site and SEO. However one thing that is bugging me is that their category pages i.e. "Footwear" which title tag includes the brands they stock. Is almost always coming up above (if they are ever even found) the product individual pages. Anyone seen this sort of things happening? Very frustrating.
On-Page Optimization | | iboxsecurityltd0 -
Why is the seomoz showing it crawled 3 pages when i only have 2 pages?
I had seomoz crawl my site. I only have 2 pages. The site url is www.autoinsurancefremontca.com.
On-Page Optimization | | Greenpeak0 -
Is reported duplication on the pages or their canonical pages?
There are several sections getting flagged for duplication on one of our sites: http://mysite.com/section-1/?something=X&confirmed=true
On-Page Optimization | | Safelincs
http://mysite.com/section-2/?something=X&confirmed=true
http://mysite.com/section-3/?something=X&confirmed=true Each of the above are showing as having duplicates of the other sections. Indeed, these pages are exactly the same (it's just an SMS confirmation page you enter your code in), however, they all have canonical links back to the section (without the query string), i.e. section-1, section-2 and section-3 respectively. These three sections have unique content and aren't flagged up for duplications themselves, so my questions are: Are the pages with the query strings the duplicates, and if so why are the canonical links being ignored? or Are the canonical pages without the query strings the duplicates, and if so why don't they appear as URLs in their own right in the duplicate content report? I am guessing it's the former, but I can't figure out why it would ignore the canonical links. Any ideas? Thanks0 -
Why a page with an On Page A grade has a less good rank than a page with a F grade?
Why a page with an On Page A grade is ranked 17 in Google when the home page with a F grade is ranked 9 ? Thanks
On-Page Optimization | | Amadeus_eBC0