Crawl test. Bot crawled only 200 or so links when it should have crawled thousands
-
Hi everyone,
I just received my crawl test report and it has only given me 200 or so URLs when my site has thousands. Any thoughts?
-
Hi Ryan,
I am the site owner, and this is the precise reason I'm trying to take matters into my own hands.
<meta name="keywords" content="E60,Rear,lamp,set" /> I see what you mean, because this is actually ridiculous. I'm not quite sure how it got into this state either. What's that saying, "if you want something done, you have to do it yourself"? Looks like I have to take a crash course in SEO to sort it all out. Thanks very much for all your help.
-
I realize you may not have full control over the site. What I would share is:
"That's how the site is" is not an acceptable response, unless the site owner is satisfied with their current SEO ranking.
The keywords have NOTHING to do with the product being displayed on the page. The link I offered is for a Hella e60 Rear Lamp. The only related term in the keyword section is "rear". I am quite certain that is by coincidence.
Your keywords are not dynamically generated to vary with the page's content, nor were they manually altered to fit the page's content. The keyword selection is awful. The numbers "3", "5", and "7" are listed as three of the keywords.
I want to help you, so don't take this the wrong way. The best thing about that site is it probably qualifies as a textbook case of what NOT to do from an SEO perspective. Perhaps you can appeal to an SEO company to use the site in a case study and turn it around.
-
Thank you very much, Ryan. The columns on the two sides of the page can't be helped, as that's how the site is; only the central content changes. However, the duplicate keywords are for the products themselves. For example, I sell 50 different BMW oil filters. There's not much I can do about duplicating keywords, as all of the products are very, very similar.
I think you might be right about the site redesign...
-
A few notes about your site:
-
You are using meta keywords in your header. The tag offers no benefit and I would suggest removing it. It's not related to your inquiry but is something I noticed.
-
Your site has a 50-keyword tag block with the same keywords on every page. This isn't good from an SEO perspective on many levels. You want your keywords to focus on the unique content of a given page.
-
Your site's pages are likely viewed as all duplicates. I can recognize that the main item in the center of the page changes, but would a crawler? Your left and right sidebars are identical on all pages, along with most of your header. The actual content you offer is only a small percentage of the total page.
The large image of the various car parts is not considered part of the content, aside from its ALT text.
Look at a random page from your site: http://www.incarmotorfactors.co.uk/content/16-hella-bmw-e60-rear-lamp-set
According to the Analyze tool there are 5975 words on the page. I estimate about 100 of them are unique words addressing your Rear Lamp product, and the remaining 5800+ words are exactly the same as every other product page.
A crawler will see your pages as 98% duplicated data, and the likely result is that your site won't be listed. I would recommend a site redesign. Before taking that advice, it would probably be best to hear from others who have a lot more SEO expertise than I do.
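For anyone who wants to check the arithmetic behind that 98% figure, here is a rough sketch. The word counts are the ones quoted above; the function names and the two sample strings are made up for illustration, and a real crawler's duplicate detection is almost certainly more sophisticated than a plain word count.

```python
from collections import Counter


def duplicate_share(total_words: int, unique_words: int) -> float:
    """Fraction of a page's words that are repeated on every other page."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    if not 0 <= unique_words <= total_words:
        raise ValueError("unique_words must be between 0 and total_words")
    return (total_words - unique_words) / total_words


def shared_word_share(page_text: str, other_text: str) -> float:
    """Fraction of the words in page_text that also appear in other_text.

    Words are compared as a multiset, so a word repeated three times on one
    page and once on the other only counts once as shared.
    """
    words = Counter(page_text.lower().split())
    other = Counter(other_text.lower().split())
    total = sum(words.values())
    if total == 0:
        raise ValueError("page_text contains no words")
    shared = sum((words & other).values())
    return shared / total


# Figures from the post: 5975 words on the page, about 100 of them unique.
print(f"{duplicate_share(5975, 100):.1%}")  # 98.3%

# Hypothetical page texts: shared sidebar wording plus one product name each.
page_a = "free delivery on all orders hella e60 rear lamp set"
page_b = "free delivery on all orders mann oil filter"
print(f"{shared_word_share(page_a, page_b):.1%}")  # 50.0%
```

The second function is only a way to compare two saved pages yourself; the smaller the share it reports, the more of the page is genuinely about that product.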
-
What is different about your site? Is it Flash or JavaScript based? Can you share your site URL?
-
Hi Ryan,
I used On-Page Optimization Tools: Crawl Test. However, this problem may be deeper than I first thought, as SEOmoz is not able to read any of my site info properly.
Open Site Explorer can't read it.
Linkscape can't read the links. Crawl Test isn't reading it properly either. However, my server and robots.txt file are fine, and there are no blocking attempts from the server. Very strange.
-
What Crawl Test tool did you use?
Depending on the crawl tool, it may not look at content blocked by your robots.txt file. You may want to ensure the file is configured correctly.
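One way to test a robots.txt file without waiting for another crawl is Python's standard `urllib.robotparser` module. This is only a sketch: the rules, the `examplebot` user agent, and the example.com URLs below are hypothetical, so substitute your own file's contents and the user-agent string your crawl tool documents.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical rules: every crawler is kept out of /content/.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /content/
"""

blocked = is_allowed(
    SAMPLE_ROBOTS, "examplebot", "http://www.example.com/content/16-some-product"
)
home = is_allowed(SAMPLE_ROBOTS, "examplebot", "http://www.example.com/index.html")
print(blocked)  # False
print(home)     # True
```

If your product pages come back as False here, a crawler that respects robots.txt will skip them, which could account for a crawl that stops at a couple of hundred URLs.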
Are there any permission issues? The crawler will look at your site the way a guest would. Any content which requires users to log in would be hidden to the crawler.
Are there any other issues regarding your site's accessibility? Connection or firewall issues? Could a server admin have seen a server performance issue and kicked the crawler before it finished? You can check server logs for this information.
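If you do have access to the raw access logs, a short script can show how far the crawler got and what status codes it received. The sketch below assumes the common "combined" log format used by Apache and nginx; the log lines and the `examplebot` user agent are invented for illustration, so adjust the pattern and the agent substring to match your own server and crawl tool.

```python
import re
from collections import Counter

# Combined log format: ip, identity, user, [time], "request", status, bytes,
# "referrer", "user agent".
LOG_PATTERN = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"$'
)


def crawler_status_counts(lines, agent_substring):
    """Count HTTP status codes for requests whose user agent contains agent_substring."""
    counts = Counter()
    needle = agent_substring.lower()
    for line in lines:
        match = LOG_PATTERN.match(line.strip())
        if match and needle in match.group("agent").lower():
            counts[match.group("status")] += 1
    return counts


# Hypothetical log excerpt: three crawler requests and one ordinary visitor.
SAMPLE_LOG = [
    '203.0.113.5 - - [01/Jan/2011:10:00:00 +0000] "GET / HTTP/1.1" 200 5120 "-" "examplebot/1.0"',
    '203.0.113.5 - - [01/Jan/2011:10:00:01 +0000] "GET /content/16 HTTP/1.1" 200 7300 "-" "examplebot/1.0"',
    '203.0.113.5 - - [01/Jan/2011:10:00:02 +0000] "GET /content/17 HTTP/1.1" 403 210 "-" "examplebot/1.0"',
    '198.51.100.7 - - [01/Jan/2011:10:00:03 +0000] "GET / HTTP/1.1" 200 5120 "-" "Mozilla/5.0"',
]

counts = crawler_status_counts(SAMPLE_LOG, "examplebot")
print(dict(counts))  # {'200': 2, '403': 1}
```

A run of 403 or 5xx responses partway through the crawl would suggest the server or a firewall cut the crawler off, whereas all 200s would point back at the site's link structure or robots.txt rules.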
If you check everything and do not locate a definitive cause, I would suggest running the crawl once more and checking the results before pursuing the matter further.