Crawl diagnostic how important is these 2 types of errors and what to do?
-
Hi,
I am trying to SEO optimized my webpage dreamesatehuahin.comWhen I saw SEO Moz webpage crawl diagnostic I kind of got a big surprise due to the high no. of errors. I don’t know if this is the kind of errors that need to be taken very serious i my paticular case,
When I am looking at the details I can see the errors are cause by the way my wordpress theme is put together. I don’t know how to resolve this. But If important I might hire a programmer.
- DUPLICATE ERRORS (40 ISSUES HIGH PRIORITY ACCORDING TO MOZ)
They are all the same as this one.
http://www.dreamestatehuahin.com/property-feature/restaurent/page/2/
is eaqual to this one
http://www.dreamestatehuahin.com/property-feature/restaurent/page/2/?view=list
This one exsist
http://www.dreamestatehuahin.com/property-feature/car-park/
while a level down don’t exsit
http://www.dreamestatehuahin.com/property-feature/- DUPLICATE PAGE TITLE (806 ISSUES MEDIUM PRIORITY ACCORDING TO MOZ)
This is related to search results and pagination.
Etc. Title for each of these pages is the same
http://www.dreamestatehuahin.com/property-search/page/1
http://www.dreamestatehuahin.com/property-search/page/2
http://www.dreamestatehuahin.com/property-search/page/3
http://www.dreamestatehuahin.com/property-search/page/4
- Title element is to long (405)
http://www.dreamestatehuahin.com/property-feature/fitness/?view=list
this is not what I consider real pages but maybe its actually is a page for google.
The title from souce code is auto generated and in this case it not makes sense
<title>Fitness Archives - Dream Estate Hua Hin | Property For Sale And RentDream Estate Hua Hin | Property For Sale And Rent</title>I know at the moment there are properly more important things for our website like content, title, meta descriptions, intern and extern links and are looking into this and taking the whole optimization seriously. Have for instance just hired a content writer rewrite and create new content based on keywords research.
I WOULD REALLY APPRICIATE SOME EXPERIENCE PEOPLE FEEDBACK ON HOW IMPORTANT IS IT THAT I FIX THIS ISSUES IF AT ALL POSSIBLE?
best regards,
Nicolaj
- DUPLICATE ERRORS (40 ISSUES HIGH PRIORITY ACCORDING TO MOZ)
-
Hi Nicolaj,
I am happy I could be of help. by the way, GetFlywheel Can put you in a Singapore data center.
the crawl links you can click on the arrow for more information for instance your pointing Google at a non-indexed page with the canonical tag this is just an example of what you can see
All best,
Thomas
-
Thanks Thomas this was so helpful I really appriciate it.
You have shared some good knowledge and pointed me in the right direction. great tips on articles and tools as well.
best regards,
Nicolaj
-
Hi Nicolaj,
I have done a separate crawl on your site and I have posted information and links below. The answers to your questions.
#1
In terms of duplicate content Google knows that you are trying to use the page with the canonical tag pointing to it you can see that here.
many of the issues you are having are answered by Dan Shure in this excellent post summing up best practices for WordPress SEO
http://moz.com/blog/setup-wordpress-for-seo-success
http://www.dreamestatehuahin.com/property-feature/restaurent/page/2/?view=list
Change in your .htaccess file:
RewriteRule ^(.*)$ /index.php?/$1 [L]
To:
RewriteRule ^(.*)$ /index.php/$1 [L]
http://www.webconfs.com/url-rewriting-tool.php
This will fix huge problems in large sites that can be caused by having that?.
I am talking about very large sites
if using Nginx a faster alternative in my opinion to Apache you would be able to use this tool to rewrite any http://winginx.com/en/htaccess
#1)B
This one exists
http://www.dreamestatehuahin.com/property-feature/car-park/
while a level down don’t exist
http://www.dreamestatehuahin.com/property-feature/
This is an issue where your /property–feature/ is showing me a 404
#2
2_) DUPLICATE PAGE TITLE (806 ISSUES MEDIUM PRIORITY ACCORDING TO MOZ)_
This is related to search results and pagination.
Etc. Title for each of these pages is the same
http://www.seobythesea.com/2011/11/google-granted-patent-hostname-mirrors/
"Paginated pages aren’t pages that contain duplicate content, but will sometimes contain duplicated titles and duplicated meta descriptions based upon things like a content management system that you might be using." Bill Slawski
However they can become a huge issue on larger sites.
Yes they can be a very large problem on big sites if you think about it Google does not get the right signals I have two clients with sites over half a million pages this is one of the largest issues I have ever run across for very big sites.
http://www.slideshare.net/ericenge/pagination-and-seo-making-it-easy
This is a very complicated issue if it is something affecting your site and your crawl budget than I create secular non-pagination pages.
http://googlewebmastercentral.blogspot.com/2011/09/pagination-with-relnext-and-relprev.html
Many times it is better to not use pagination and create a single page with a secular title. It depends on your website.
#3
3) Title element is to long (405)
http://www.dreamestatehuahin.com/property-feature/fitness/?view=list
this is not what I consider real pages but maybe its actually is a page for google.
_ The title from source code is auto generated and in this case it not makes sense_
_<title>Fitness Archives - Dream Estate Hua Hin | Property For Sale And RentDream Estate Hua Hin | Property For Sale And Rent</title> _
You should handwrite your title tags and take extreme care in their creation. There are very strong signal to Google.
You have too many words and you have the word property in their and estate as well twice this is spammy
For better results use this guide to writing title tags. Do not allow them to be auto generated.
Please read this http://moz.com/learn/seo/title-tag
Your title tag is too long regardless it is not a wise practice to use as long of a title as you have in there. You use the word property way too much in the title tag you are okay as far as the URL
http://www.dreamestatehuahin.com/property-feature/fitness/?view=list
as it does use a canonical tag it is okay
As far as hiring a programmer it is up to you your site is deeply in need of better coding and hosting your site speed is over 10 seconds. It took me a long time to do any research on your site all. This will kill your conversion if I were browsing this for any other reason other than to help
Just trying to troubleshoot your page takes forever and I thought it was my browser and then my second browser then I did a speed test not to get off subject but get that fixed ASAP
http://tools.pingdom.com/fpt/#!/dMSPsX/http://www.dreamestatehuahin.com/property-feature/fitness/
I would use guides take this to clean up a lot of what is wrong
http://www.feedthebot.com/pagespeed/
In addition I would post it with a managed WordPress hosting company
GetFlywheel, WP engine, Pagely, Pressable, PressLabs & WebSynthesis are all great companies.
GetFlywheel is a fantastic deal at USD15 a site and every site has its very own fully WordPress optimized SSD VPS I have accounts with every company above and that is my opinion.
How are you doing overall getting traffic? How are you doing in converting that traffic into leads?
It would be wise in my opinion to hire a company to help you with the development / programming and SEO.
Let me know if you have any other questions.
Please remember page speed is about pleasing the end-user if they click the back key because your site will not load under 15 seconds ( something it has yet to do in my testing below)
http://tools.pingdom.com/fpt/#!/dugdXo/http://www.dreamestatehuahin.com/
I know speed is a small part of Google's algorithm the reason I am bringing it up is the site is far too slow for normal users to actually browse without leaving. I am certain if you spread your site up your conversion results would be a lot better.
I know at the moment there are properly more important things for our website like content, title, meta descriptions, intern and extern links and are looking into this and taking the whole optimization seriously. Have for instance just hired a content writer rewrite and create new content based on keywords research.
I believe in taking care of the entire site obviously you want to start at the most critical and do things a section at a time so you can see the results. It is fantastic that you have somebody creating content that is very important. What methods did you use to obtain these keywords?
External and internal links are extremely serious without a proper back link profile your site simply will not be seen as important by Google.
Part of what you are talking about is your "title" things like title tags that are too long will affect you because Google will only show a certain amount of pixels I would pay close attention to what this page says along with what the tool shows you in this photo that is larger here http://imgur.com/KFtAY6m.png
title tags are critical, back links are critical, the content you create must be something that people will Share, and like enough to +1 and more importantly link to in order to have real value.
I would suggest a complete site audit
http://www.feedthebot.com/titleandalttags.html
I understand this is twice but use this.
http://moz.com/learn/seo/title-tag
I would recommend using these fantastic tools
Deep Crawl
an incredible tool for finding issues with sites like yours or any site. Up to any size great for huge websites. Because it is hosted on Deep crawls cloud server and not local computer you do not have to worry about your computers RAM ( this only becomes a problem with extremely large sites over 1 million in my experience but it is all depending on your computer of course)
Starts at USD80 a month and will crawl 100,000 URI's however they must be crawled within the term of that month. Packages go up in price by quite a bit after however you get a lot more crawls as well.
Another extremely similar but not cloud-based tool called screaming frog has a free version as well as paid version the free version will crawl up to 500 pages for free and the tool is able to be used on Mac, PC & Ubuntu
The cost is one time cost of 100 British pounds approximately 170 US you do have to renew the license to update, but that is only once a year and it is worth every cent.
The only thing you have to worry about with local installation is your computers specifications most importantly RAM
( this only becomes a problem with extremely large sites but it is all depending on your computer of course)
You can crawl unlimited URI's and your license to update the tool expires after 365 days it is a true bargain.
this is a fantastic guide to doing almost anything with screaming frog guide by SEER Interactive it is valuable in fact using both tools because of their similarities I found this guide applies to both.
http://www.seerinteractive.com/blog/screaming-frog-guide
http://www.screamingfrog.co.uk/seo-spider/
I use that in combination of the tools below Before people think I am a tool only type of person believe me I am not.
They are not designed to do the work for you simply make some of it easier most of the work is done by learning. you can get much more out of using Moz learning, Distilled U, and other great resources than you can by hitting a button but the combination is a synergy.
For complete audits I recommend
Ahrefs, Moz ( all tools), MajesticSEO AuthorityLabs, SERPS,Deep Crawl Screaming Frog Brightedge, AnalyticsSEO, Searchmetrics, Raven, SEMRush & Marin Software.
kind of overkill but I believe they all add something
http://deepcrawl.co.uk/use-cases/architecture-optimisation
Make sure that the keyword research is done by somebody who knows what they are doing. You have a site that needs a lot of love and care, but definitely is salvageable.
you have to fix your XML site map considering you are using Yoast I would use that site map over the one you are using. You are very few links in your XML site map
shown here
http://www.dreamestatehuahin.com/sitemap.xml
this is a summary of the crawl
https://blueprintseo.sharefile.com/d/sb3882a2f46646d49
all links below are to give you insight into the crawl.
I hope I have been of help,
Thomas
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What happens to crawled URLs subsequently blocked by robots.txt?
We have a very large store with 278,146 individual product pages. Since these are all various sizes and packaging quantities of less than 200 product categories my feeling is that Google would be better off making sure our category pages are indexed. I would like to block all product pages via robots.txt until we are sure all category pages are indexed, then unblock them. Our product pages rarely change, no ratings or product reviews so there is little reason for a search engine to revisit a product page. The sales team is afraid blocking a previously indexed product page will result in in it being removed from the Google index and would prefer to submit the categories by hand, 10 per day via requested crawling. Which is the better practice?
Intermediate & Advanced SEO | | AspenFasteners1 -
When I crawl my website I have urls with (#!162738372878) at the end of my urls
When I crawl my website I have urls with (#!162738372878) at the end of my urls. I used screaming frog to look check my website and I seen these. My normal urls are in there too, but each of them have a copy with this strange symbol and number at the end. I used a website builder called homestead to make the website and I seen a bunch of there urls in my crawl as well - http://editor.homestead.com/faq is an example I recently created a new website with their new website builder and transferred it to my old domain. However, I didnt know they didnt offer 301 redirects or canonical tags(learned about those afterwards) and I changed my page names. So they recommended I leave the old website published along with the new website. So if I search my website name on google, sometimes both will show in the results. I just want to sort this all out somehow. My website is www.coastlinetvinstalls.com Any feedback is greatly appreciated. Thanks, Matt
Intermediate & Advanced SEO | | Matt160 -
Having a Keyword in # is not that important in 2018, Do you agree?
Earlier having a Keyword in was one of the important ranking factor or at least every SEO guru use to suggest this. But, of late, we are noticing that Google is not giving much weightage to it. What are your thoughts on this?
Intermediate & Advanced SEO | | SameerBhatia3 -
Intermittent DNS errors. IP team not able to diagnose
Intermittent DNS errors showing up in GSC for our fashion portal www.AJIO.com. Our IP team doesn't find any issues at our end. Everytime i write to them, they come back saying 'DNS is resolving fine in all servers'. How do we resolve this? Pl help
Intermediate & Advanced SEO | | AJIOreliance0 -
Client wants to show 2 different types of content based on cookie usage - potential cloaking issue?
Hi, A client of mine has compliance issues in their industry and has to show two different types of content to visitors: domain.com/customer-a/about-us domain.com/customer-b/about-us Next year, they have to increase that to three different types of customer. Rather than creating a third section (customer-c), because it's very similar to one of the types of customers already (customer-b), their web development agency is suggesting changing the content based on cookies, so if a user has indentified themselves as customer-b, they'll be shown /customer-b/, but if they've identified themselves as customer-c, they'll see a different version of /customer-b/ - in other words, the URL won't change, but the content on the page will change, based on their cookie selection. I'm uneasy about this from an SEO POV because: Google will only be able to see one version (/customer-b/ presumably), so it might miss out on indexing valuable /customer-c/ content, It makes sense to separate them into three URL paths so that Google can index them all, It feels like a form of cloaking - i.e. Google only sees one version, when two versions are actually available. I've done some research but everything I'm seeing is saying that it's fine, that it's not a form of cloaking. I can't find any examples specific to this situation though. Any input/advice would be appreciated. Note: The content isn't shown differently based on geography - i.e. these three customers would be within one country (e.g. the UK), which means that hreflang/geo-targeting won't be a workaround unfortunately.
Intermediate & Advanced SEO | | steviephil0 -
Can't crawl website with Screaming frog... what is wrong?
Hello all - I've just been trying to crawl a site with Screaming Frog and can't get beyond the homepage - have done the usual stuff (turn off JS and so on) and no problems there with nav and so on- the site's other pages have indexed in Google btw. Now I'm wondering whether there's a problem with this robots.txt file, which I think may be auto-generated by Joomla (I'm not familiar with Joomla...) - are there any issues here? [just checked... and there isn't!] If the Joomla site is installed within a folder such as at e.g. www.example.com/joomla/ the robots.txt file MUST be moved to the site root at e.g. www.example.com/robots.txt AND the joomla folder name MUST be prefixed to the disallowed path, e.g. the Disallow rule for the /administrator/ folder MUST be changed to read Disallow: /joomla/administrator/ For more information about the robots.txt standard, see: http://www.robotstxt.org/orig.html For syntax checking, see: http://tool.motoricerca.info/robots-checker.phtml User-agent: *
Intermediate & Advanced SEO | | McTaggart
Disallow: /administrator/
Disallow: /bin/
Disallow: /cache/
Disallow: /cli/
Disallow: /components/
Disallow: /includes/
Disallow: /installation/
Disallow: /language/
Disallow: /layouts/
Disallow: /libraries/
Disallow: /logs/
Disallow: /modules/
Disallow: /plugins/
Disallow: /tmp/0 -
Are footer links important?
We currently display a list of links in the footer of our site to help boost SEO. They were put in place years ago and in a recent discuss with our UX team they requested we remove them from the site. Do footer links have any value? Or is this an old dated practice that no longer works? If we remove the footer links should we expect to see if have an impact on our SEO traffic?
Intermediate & Advanced SEO | | Mivito0 -
Is DOCTYPE important for SEO?
Hello fellow Mozzers. I am just having a brief look at a potential clients website before speaking to them tomorrow and whilst looking at the source I noticed that they don't appear to have a clear definition for their Doctype. All the have at the top of each page is I have to admit that Doctypes aren't my strong point but I know that they are normally slightly more descriptive than this. Can this have any effect on rankings? or is this just an issue for W3C validation? Thanks 🙂
Intermediate & Advanced SEO | | AdeLewis0