Historic issue with incomplete indexing
-
Hi there
We run quite a big site in the UK in the commercial real-estate space.
Historically we have always had a challenge getting our "primary" landing pages indexed, which are location based property result pages.
e.g. https://realla.co/to-rent/commercial-property/oxford
For example, for the "towns" category we have 8,549 submitted in our xml sitemap, with only 3,171 indexed. This is a general issue across all our sitemaps. 120k submitted, 80k indexed. Our pages are linked through breadcrumbs, and nearby links.
In the new search console these pages are reported as "crawled - currently not indexed"
These all sit under the folder:
site:https://realla.co/to-rent/commercial-property/*
site:https://realla.co/to-rent/office/*
We have done extensive work to optimise performance, including AMP pages.
Each location page has many details pages for individual properties e.g.
https://realla.co/to-rent/details/0ffbbd0a1a1147edb8847c5ce6179509
One action we have remaining is to nest the details under the locations pages, which may help. These details pages are indexed fully.
Any feedback much appreciated
-
Hi Ian,
The details URL should ideally have keywords in it, getting property name in details page URL would be of great help, like : https://realla.co/to-rent/details/Office-to-let-John-Eccles-House-Robert Robinson-Avenue-Oxford-Science-Park-Oxford-OX4-4GP
About the category (locations in your case), you are submitting too many of them, your URL structure needs to re-structured, there is work to be done there and sitemap updated according to that. For example:
https://realla.co/to-rent/commercial-property/
can be changed to
https://realla.co/commercial-property-to-rent/
I hope this helps, let me know if you have further queries.
Regards,
Vijay
-
Thanks for your reply
We are just about to nest the "details" pages under the results path e.g. /to-rent/commercial-property/newbury/details/1294321739712973129 etc so it sits under the right location.
I think this is in line with your recommendation.
We have alot of individual sitemap files, should these be consolidated?
-
Hi Ian,
I have analyzed the website in detail, the problem seems to be that you are not giving any differentiation to search engine bots between important category/sub-category(in your case different locations) pages compared to product pages (in your case property details page). The location pages URL structure and their sitemap submission strategy can be re-worked to get the desired results.
Another scope of improvement is in URL structure for property details page **For example, **
https://realla.co/to-rent/details/0ffbbd0a1a1147edb8847c5ce6179509 should be https://realla.co/to-rent/details/Office-to-let-John-Eccles-House-Robert Robinson-Avenue-Oxford-Science-Park-Oxford-OX4-4GP
Your site structure is huge, and it must be getting dynamic links generated or removed, you need to be careful with the site structure and how often to submit sitemap.
I hope this helps. Let me know if you have further queries, I will be happy to help.
Regards,
Vijay
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Pages not indexable?
Hello, I've been trying to find out why Google Search Console finds these pages non-indexable: https://www.visitflorida.com/en-us/eat-drink.html https://www.visitflorida.com/en-us/florida-beaches/beach-finder.html Moz and SEMrush both crawl the pages and show no errors but GSC comes back with, "blocked by robots.txt" but I've confirmed it is not. Anyone have any thoughts? 6AYn1TL
Technical SEO | | KenSchaefer0 -
Is my page being indexed?
To put you all in context, here is the situation, I have pages that are only accessible via an intern search tool that shows the best results for the request. Let's say i want to see the result on page 2, the page 2 will have a request in the url like this: ?p=2&s=12&lang=1&seed=3688 The situation is that we've disallowed every URL's that contains a "?" in the robots.txt file which means that Google doesn't crawl the page 2,3,4 and so on. If a page is only accessible via page 2, do you think Google will be able to access it? The url of the page is included in the sitemap. Thank you in advance for the help!
Technical SEO | | alexrbrg0 -
Pages Not Getting Indexed
Hey there I have a website with pretty much 3-4 pages. All of them had a canonical pointing to one page and the same content ( which happened by mistake ) I removed the canonical URL and added one pointing to its page. Also, I added the original content that was supposed to be there to begin with. It's been weeks but those pages are not getting indexed on the SERPS while the one that they use to point with the canonical does.
Technical SEO | | AngelosS0 -
Indexing a catalogue
A client of mine has a large printed product catalogue that they post on their website as a pdf. Should I take a different approach of posting this catalogue in order to gain SEO value?
Technical SEO | | garymeld0 -
Image Indexing Issue by Google
Hello All,My URL is: www.thesalebox.comI have Submitted my image Sitemap in google webmaster tool on 10th Oct 2013,Still google could not indexing any of my web images,Please refer my sitemap - www.thesalebox.com/AppliancesHomeEntertainment.xml and www.thesalebox.com/Hardware.xmland my webmaster status and image indexing status are below, Can you please help me, why my images are not indexing in google yet? is there any issue? please give me suggestions?Thanks!
Technical SEO | | CommercePundit0 -
Interesting indexing issue - any input would be greatly appreciated!
A few months ago we did SEO for a website, just like any other website. However, we did not see crawl/indexing results that we have with all of our other SEO projects - the Google webmaster tool was indicating that only 1 page of the site (although only 20 pages) was indexed. The site was older & originally developed in Dreamweaver, so although that shouldn't have been an issue, we were desperate to solve the problem & ended up rebuilding the site in WordPress. While this actually helped increase the number of pages on the site that Google indexed (now all 20) - we are still seeing strange things in the search results. For example, when we check rankings manually for a particular term, the new description is showing, however, it is displaying the old title text. Does anyone know what the problem could be? Thank you so much!!
Technical SEO | | ZAG0 -
How to change noindex to index?
Hey, I've recently upgraded to a pro SEOmoz account and have realised i have 14574 issues to do with 'blocked by meta-robot' and that 'This page is being kept out of the search engine indexes by the meta tag , which may have a value of "noindex", keeping this page out of the index.' How can i change this so my pages get indexed? I read somewhere that i need to change my privacy settings but that thread was 3 years old and now the WP Dashboard has updated.. Please let me know Many thanks, Jamie P.s Im using WordPress 3.5 And i have the XML sitemap plugin And i have no idea where to look for this robots.txt file..
Technical SEO | | markgreggs0 -
Rel=canonical issue
Re. http://www.appetise.com. We have been alerted that we are "not making appropriate use of the rel=canonical tag". Please could someone just clarify this for us and let us know the recommended remedial action we need to take to rectify the issue? Many Thanks, RB
Technical SEO | | E-resistible0