Crawl diagnostic how important is these 2 types of errors and what to do?
-
Hi,
I am trying to SEO optimized my webpage dreamesatehuahin.comWhen I saw SEO Moz webpage crawl diagnostic I kind of got a big surprise due to the high no. of errors. I don’t know if this is the kind of errors that need to be taken very serious i my paticular case,
When I am looking at the details I can see the errors are cause by the way my wordpress theme is put together. I don’t know how to resolve this. But If important I might hire a programmer.
- DUPLICATE ERRORS (40 ISSUES HIGH PRIORITY ACCORDING TO MOZ)
They are all the same as this one.
http://www.dreamestatehuahin.com/property-feature/restaurent/page/2/
is eaqual to this one
http://www.dreamestatehuahin.com/property-feature/restaurent/page/2/?view=list
This one exsist
http://www.dreamestatehuahin.com/property-feature/car-park/
while a level down don’t exsit
http://www.dreamestatehuahin.com/property-feature/- DUPLICATE PAGE TITLE (806 ISSUES MEDIUM PRIORITY ACCORDING TO MOZ)
This is related to search results and pagination.
Etc. Title for each of these pages is the same
http://www.dreamestatehuahin.com/property-search/page/1
http://www.dreamestatehuahin.com/property-search/page/2
http://www.dreamestatehuahin.com/property-search/page/3
http://www.dreamestatehuahin.com/property-search/page/4
- Title element is to long (405)
http://www.dreamestatehuahin.com/property-feature/fitness/?view=list
this is not what I consider real pages but maybe its actually is a page for google.
The title from souce code is auto generated and in this case it not makes sense
<title>Fitness Archives - Dream Estate Hua Hin | Property For Sale And RentDream Estate Hua Hin | Property For Sale And Rent</title>I know at the moment there are properly more important things for our website like content, title, meta descriptions, intern and extern links and are looking into this and taking the whole optimization seriously. Have for instance just hired a content writer rewrite and create new content based on keywords research.
I WOULD REALLY APPRICIATE SOME EXPERIENCE PEOPLE FEEDBACK ON HOW IMPORTANT IS IT THAT I FIX THIS ISSUES IF AT ALL POSSIBLE?
best regards,
Nicolaj
- DUPLICATE ERRORS (40 ISSUES HIGH PRIORITY ACCORDING TO MOZ)
-
Hi Nicolaj,
I am happy I could be of help. by the way, GetFlywheel Can put you in a Singapore data center.
the crawl links you can click on the arrow for more information for instance your pointing Google at a non-indexed page with the canonical tag this is just an example of what you can see
All best,
Thomas
-
Thanks Thomas this was so helpful I really appriciate it.
You have shared some good knowledge and pointed me in the right direction. great tips on articles and tools as well.
best regards,
Nicolaj
-
Hi Nicolaj,
I have done a separate crawl on your site and I have posted information and links below. The answers to your questions.
#1
In terms of duplicate content Google knows that you are trying to use the page with the canonical tag pointing to it you can see that here.
many of the issues you are having are answered by Dan Shure in this excellent post summing up best practices for WordPress SEO
http://moz.com/blog/setup-wordpress-for-seo-success
http://www.dreamestatehuahin.com/property-feature/restaurent/page/2/?view=list
Change in your .htaccess file:
RewriteRule ^(.*)$ /index.php?/$1 [L]
To:
RewriteRule ^(.*)$ /index.php/$1 [L]
http://www.webconfs.com/url-rewriting-tool.php
This will fix huge problems in large sites that can be caused by having that?.
I am talking about very large sites
if using Nginx a faster alternative in my opinion to Apache you would be able to use this tool to rewrite any http://winginx.com/en/htaccess
#1)B
This one exists
http://www.dreamestatehuahin.com/property-feature/car-park/
while a level down don’t exist
http://www.dreamestatehuahin.com/property-feature/
This is an issue where your /property–feature/ is showing me a 404
#2
2_) DUPLICATE PAGE TITLE (806 ISSUES MEDIUM PRIORITY ACCORDING TO MOZ)_
This is related to search results and pagination.
Etc. Title for each of these pages is the same
http://www.seobythesea.com/2011/11/google-granted-patent-hostname-mirrors/
"Paginated pages aren’t pages that contain duplicate content, but will sometimes contain duplicated titles and duplicated meta descriptions based upon things like a content management system that you might be using." Bill Slawski
However they can become a huge issue on larger sites.
Yes they can be a very large problem on big sites if you think about it Google does not get the right signals I have two clients with sites over half a million pages this is one of the largest issues I have ever run across for very big sites.
http://www.slideshare.net/ericenge/pagination-and-seo-making-it-easy
This is a very complicated issue if it is something affecting your site and your crawl budget than I create secular non-pagination pages.
http://googlewebmastercentral.blogspot.com/2011/09/pagination-with-relnext-and-relprev.html
Many times it is better to not use pagination and create a single page with a secular title. It depends on your website.
#3
3) Title element is to long (405)
http://www.dreamestatehuahin.com/property-feature/fitness/?view=list
this is not what I consider real pages but maybe its actually is a page for google.
_ The title from source code is auto generated and in this case it not makes sense_
_<title>Fitness Archives - Dream Estate Hua Hin | Property For Sale And RentDream Estate Hua Hin | Property For Sale And Rent</title> _
You should handwrite your title tags and take extreme care in their creation. There are very strong signal to Google.
You have too many words and you have the word property in their and estate as well twice this is spammy
For better results use this guide to writing title tags. Do not allow them to be auto generated.
Please read this http://moz.com/learn/seo/title-tag
Your title tag is too long regardless it is not a wise practice to use as long of a title as you have in there. You use the word property way too much in the title tag you are okay as far as the URL
http://www.dreamestatehuahin.com/property-feature/fitness/?view=list
as it does use a canonical tag it is okay
As far as hiring a programmer it is up to you your site is deeply in need of better coding and hosting your site speed is over 10 seconds. It took me a long time to do any research on your site all. This will kill your conversion if I were browsing this for any other reason other than to help
Just trying to troubleshoot your page takes forever and I thought it was my browser and then my second browser then I did a speed test not to get off subject but get that fixed ASAP
http://tools.pingdom.com/fpt/#!/dMSPsX/http://www.dreamestatehuahin.com/property-feature/fitness/
I would use guides take this to clean up a lot of what is wrong
http://www.feedthebot.com/pagespeed/
In addition I would post it with a managed WordPress hosting company
GetFlywheel, WP engine, Pagely, Pressable, PressLabs & WebSynthesis are all great companies.
GetFlywheel is a fantastic deal at USD15 a site and every site has its very own fully WordPress optimized SSD VPS I have accounts with every company above and that is my opinion.
How are you doing overall getting traffic? How are you doing in converting that traffic into leads?
It would be wise in my opinion to hire a company to help you with the development / programming and SEO.
Let me know if you have any other questions.
Please remember page speed is about pleasing the end-user if they click the back key because your site will not load under 15 seconds ( something it has yet to do in my testing below)
http://tools.pingdom.com/fpt/#!/dugdXo/http://www.dreamestatehuahin.com/
I know speed is a small part of Google's algorithm the reason I am bringing it up is the site is far too slow for normal users to actually browse without leaving. I am certain if you spread your site up your conversion results would be a lot better.
I know at the moment there are properly more important things for our website like content, title, meta descriptions, intern and extern links and are looking into this and taking the whole optimization seriously. Have for instance just hired a content writer rewrite and create new content based on keywords research.
I believe in taking care of the entire site obviously you want to start at the most critical and do things a section at a time so you can see the results. It is fantastic that you have somebody creating content that is very important. What methods did you use to obtain these keywords?
External and internal links are extremely serious without a proper back link profile your site simply will not be seen as important by Google.
Part of what you are talking about is your "title" things like title tags that are too long will affect you because Google will only show a certain amount of pixels I would pay close attention to what this page says along with what the tool shows you in this photo that is larger here http://imgur.com/KFtAY6m.png
title tags are critical, back links are critical, the content you create must be something that people will Share, and like enough to +1 and more importantly link to in order to have real value.
I would suggest a complete site audit
http://www.feedthebot.com/titleandalttags.html
I understand this is twice but use this.
http://moz.com/learn/seo/title-tag
I would recommend using these fantastic tools
Deep Crawl
an incredible tool for finding issues with sites like yours or any site. Up to any size great for huge websites. Because it is hosted on Deep crawls cloud server and not local computer you do not have to worry about your computers RAM ( this only becomes a problem with extremely large sites over 1 million in my experience but it is all depending on your computer of course)
Starts at USD80 a month and will crawl 100,000 URI's however they must be crawled within the term of that month. Packages go up in price by quite a bit after however you get a lot more crawls as well.
Another extremely similar but not cloud-based tool called screaming frog has a free version as well as paid version the free version will crawl up to 500 pages for free and the tool is able to be used on Mac, PC & Ubuntu
The cost is one time cost of 100 British pounds approximately 170 US you do have to renew the license to update, but that is only once a year and it is worth every cent.
The only thing you have to worry about with local installation is your computers specifications most importantly RAM
( this only becomes a problem with extremely large sites but it is all depending on your computer of course)
You can crawl unlimited URI's and your license to update the tool expires after 365 days it is a true bargain.
this is a fantastic guide to doing almost anything with screaming frog guide by SEER Interactive it is valuable in fact using both tools because of their similarities I found this guide applies to both.
http://www.seerinteractive.com/blog/screaming-frog-guide
http://www.screamingfrog.co.uk/seo-spider/
I use that in combination of the tools below Before people think I am a tool only type of person believe me I am not.
They are not designed to do the work for you simply make some of it easier most of the work is done by learning. you can get much more out of using Moz learning, Distilled U, and other great resources than you can by hitting a button but the combination is a synergy.
For complete audits I recommend
Ahrefs, Moz ( all tools), MajesticSEO AuthorityLabs, SERPS,Deep Crawl Screaming Frog Brightedge, AnalyticsSEO, Searchmetrics, Raven, SEMRush & Marin Software.
kind of overkill but I believe they all add something
http://deepcrawl.co.uk/use-cases/architecture-optimisation
Make sure that the keyword research is done by somebody who knows what they are doing. You have a site that needs a lot of love and care, but definitely is salvageable.
you have to fix your XML site map considering you are using Yoast I would use that site map over the one you are using. You are very few links in your XML site map
shown here
http://www.dreamestatehuahin.com/sitemap.xml
this is a summary of the crawl
https://blueprintseo.sharefile.com/d/sb3882a2f46646d49
all links below are to give you insight into the crawl.
I hope I have been of help,
Thomas
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Crawl Stats Decline After Site Launch (Pages Crawled Per Day, KB Downloaded Per Day)
Hi all, I have been looking into this for about a month and haven't been able to figure out what is going on with this situation. We recently did a website re-design and moved from a separate mobile site to responsive. After the launch, I immediately noticed a decline in pages crawled per day and KB downloaded per day in the crawl stats. I expected the opposite to happen as I figured Google would be crawling more pages for a while to figure out the new site. There was also an increase in time spent downloading a page. This has went back down but the pages crawled has never went back up. Some notes about the re-design: URLs did not change Mobile URLs were redirected Images were moved from a subdomain (images.sitename.com) to Amazon S3 Had an immediate decline in both organic and paid traffic (roughly 20-30% for each channel) I have not been able to find any glaring issues in search console as indexation looks good, no spike in 404s, or mobile usability issues. Just wondering if anyone has an idea or insight into what caused the drop in pages crawled? Here is the robots.txt and attaching a photo of the crawl stats. User-agent: ShopWiki Disallow: / User-agent: deepcrawl Disallow: / User-agent: Speedy Disallow: / User-agent: SLI_Systems_Indexer Disallow: / User-agent: Yandex Disallow: / User-agent: MJ12bot Disallow: / User-agent: BrightEdge Crawler/1.0 (crawler@brightedge.com) Disallow: / User-agent: * Crawl-delay: 5 Disallow: /cart/ Disallow: /compare/ ```[fSAOL0](https://ibb.co/fSAOL0)
Intermediate & Advanced SEO | | BandG0 -
Google only indexing the top 2/3 of my page?
HI, I have a page that is about 5000 lines of code total. I was having difficulty figuring out why the addition of a lot of targeted, quality content to the bottom of the pages was not helping with rankings. Then, when fetching as Google, I noticed that only about 3300 lines were getting indexed for some reason. So naturally, that content wasn't going to have any effect if Google in not seeing it. Has anyone seen this before? Thoughts on what may be happening? I'm not seeing any errors begin thrown by the page....and I'm not aware of a limit of lines of code Google will crawl. Pages load under 5 seconds so loading speed shouldn't be the issue. Thanks, Kevin
Intermediate & Advanced SEO | | yandl1 -
How can I stop my facets being crawled?
Hi If my facets are being crawled, how can I stop this? Or set them up so they are SEO friendly - this is new to me as I haven't had to deal with lots of facets in the past. Here's an example of a page on the site - https://www.key.co.uk/en/key/lift-tables Here's an example of a facet URL - https://www.key.co.uk/en/key/lift-tables#facet:-1002779711011711697110,-700000000000001001651484832107103,-700000000000001057452564832109109&productBeginIndex:0&orderBy:5&pageView:list& I've been trying to read up on URL parameters etc, I'm new to it so it's taking some time to understand Any advice would be great!
Intermediate & Advanced SEO | | BeckyKey0 -
How does Tripadviser ensure all their user reviews get crawled?
Tripadvisor has a LOT of user generated content. Searching for a random hotel always seems to return a paginated list of 90+ pages. However once the first page is clicked and "#REVIEWS" is appended to the URL, the URL never changes with any subsequent clicks of the paginated links. How do they ensure that all this review content gets crawled? Thanks, linklater
Intermediate & Advanced SEO | | linklater0 -
Crawl budget
I am a believer in this concept, showing google less pages will increase their importance. here is my question: I manage a website with millions of pages, high organic traffic (lower than before). I do believe that too many pages are crawled. there are pages that I do not need google to crawl and followed. noindex follow does not save on the mentioned crawl budget. deleting those pages is not possible. any advice will be appreciated. If I disallow those pages I am missing on pages that help my important pages.
Intermediate & Advanced SEO | | ciznerguy2 -
VisitSweden indexing error
Hi all Just got a new site up about weekend travel for VisitSweden, the official tourism office of Sweden. Everything went just fine except som issues with indexing. The site can be found here at weekend.visitsweden.com/no/ For some weird reason the "frontpage" of the site does not get indexed. What I have done myself to find the issue: Added sitemaps.xml Configured and added site to webmaster tools Checked 301s so they are not faulty By doing a simple site:weekend.visitsweden.com/no/ you can see that the frontpage is simple not in the index. Also by doing a cache:weekend.visitsweden.com/no/ I see that Google tries to index the page without the trailing /no/ for some reason. http://webcache.googleusercontent.com/search?q=cache:http://weekend.visitsweden.com/no/ Any smart ideas to get this fixed or where to start looking? All help greatly appreciated Kind regards Fredrik
Intermediate & Advanced SEO | | Resultify0 -
2-websites focused on different markets but similar content
Hi all! I have a client who wants to branch out to another market (currently in Northern California and wants to open an office in Southern California), what would happen if we put up a second website that has similar content, but is exclusively for Southern California, with a different office address, and all the content geared towards Southern California market? There would be NO linking between the sites. Would that generate a penalty? Thanks! BB
Intermediate & Advanced SEO | | BBuck0 -
Need help with huge spike in duplicate content and page title errors.
Hi Mozzers, I come asking for help. I've had a client who's reported a staggering increase in errors of over 18,000! The errors include duplicate content and page titles. I think I've found the culprit and it's the News & Events calender on the following page: http://www.newmanshs.wa.edu.au/news-events/events/07-2013 Essentially each day of the week is an individual link, and events stretching over a few days get reported as duplicate content. Do you have any ideas how to fix this issue? Any help is much appreciated. Cheers
Intermediate & Advanced SEO | | bamcreative0