Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
URLs dropping from index (Crawled, currently not indexed)
-
I've noticed that some of our URLs have recently dropped completely out of Google's index.
When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'.
Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case.
I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people.
Here are a few examples of the URLs that have gone missing:
https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training
https://www.ihasco.co.uk/courses/detail/conflict-resolution-training
https://www.ihasco.co.uk/courses/detail/prevent-duty-training
Any help here would be massively appreciated!
-
The same issue facing my website
-
It seems like this issue is quite common lately. I have experienced something similar with some pages on my site InstPro.net which are not getting indexed properly either. any advice would be appreciated.
-
It seems like this issue is quite common lately. I have experienced something similar with some pages on my site InstPro.net which are not getting indexed properly either. any advice would be appreciated.
-
@Philljones22 said in URLs dropping from index (Crawled, currently not indexed):
Same issue here, my most of the URLs are getting de indexed after indexing the search console.
I'm experiencing the same problem. Most of my URLs are getting de-indexed after being indexed by the search console.
https://www.stardewvalleyapk.me/
https://www.stardewvalleyapk.me/stardew-valley-mod-apk/ -
I don't know why but I am facing the same issue from past 3 monts.
My most of the URLs are getting de indexed after indexing the search console. -
@kingshah001 said in URLs dropping from index (Crawled, currently not indexed):
Same issue here, my most of the URLs are getting de indexed after indexing the search console.
Same issue here, my most of the URLs are getting de indexed after indexing the search console.
https://inshotproapps.com
https://instoproapps.com/inshot-for-pc/ -
Thanks for sharing details.
-
Same issue here, my most of the URLs are getting de indexed after indexing the search console.
Some of the URLs are below:
https://apkcroc.com/
https://apkcroc.com/vn-mod-apk/
https://apkcroc.com/kinemaster-mod-apk/
https://apkcroc.com/terragenesis-mod-apk/
https://apkcroc.com/sky-fighters-3d-mod-apk/ -
My site is also a victim of same the issue, collecting bits and actionable advice. I'm planning to post my experience on Moz forum soon.
-
Hello,
since the beginning of ladykiller.nl I am having the same issues with Google to crawl sitemap(s) and index urls. I am using Yoast as a plugin for the sitemap.
For the moment +3620 urls are indexed, but my website has +10.000 urls :(.
Also from time to time I get a notice in GSC that Google can not fetch certain sitemap urls f.e. https://ladykiller.nl/post-sitemap.xml. Mostly the issue is fixed after a week or so. Please find print screen here: https://prnt.sc/Pm_h2Arjxu-kAlready asked on numerous forums for help, as I can not find a solution to get this problem fixed. However, without any good results so far.
Therefore, I am trying it here again in the hope maybe some of you guys have some better understanding of what the issue might be and how it can be fixed. All help is highly appreciated!
Thanks in advance for having a look into it :)!
Warm regards,
John -
Hi there,
The third URL you are referencing, is actually indexed:
https://dmitrii-regexseo.tinytake.com/tt/NDY4NDY4N18xNDgzNjgzMA
As for "crawled, not indexed" - in most cases it happens because of one and only reason - Google is seeing your page as thin content, not worth being indexed. Typically it happens on bigger sites with a lot of similar pages. In your case, you got many courses, with exactly same structure. So, if the content is not completely different, then Google might deem it not worthy.
As for the bug you referenced - did your URLs drop off the index exactly at the time when this issue has been discovered? (aka within the last week?).
Do you have any cannibalization happening?
To me it looks like that's the case. If I do this search: "site:https://www.ihasco.co.uk/ Sexual Harassment Training course"
There are many pages that are indexed and are ranking: https://dmitrii-regexseo.tinytake.com/tt/NDY4NDcwN18xNDgzNjg4Mg
So, basically, you have pages that are more authoritative with similar content. Therefore your courses pages are dropping as thin content.
I would recommend doing some internal linking optimization to tell Google what is actually important. Look in GSC for internal links metrics.
Hope this helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Japanese URL-structured sitemap (pages) not being indexed by Bing Webmaster Tools
Hello everyone, I am facing an issue with the sitemap submission feature in Bing Webmaster Tools for a Japanese language subdirectory domain project. Just to outline the key points: The website is based on a subdirectory URL ( example.com/ja/ ) The Japanese URLs (when pages are published in WordPress) are not being encoded. They are entered in pure Kanji. Google Webmaster Tools, for instance, has no issues reading and indexing the page's URLs in its sitemap submission area (all pages are being indexed). When it comes to Bing Webmaster Tools it's a different story, though. Basically, after the sitemap has been submitted ( example.com/ja/sitemap.xml ), it does report an error that it failed to download this part of the sitemap: "page-sitemap.xml" (basically the sitemap featuring all the sites pages). That means that no URLs have been submitted to Bing either. My apprehension is that Bing Webmaster Tools does not understand the Japanese URLs (or the Kanji for that matter). Therefore, I generally wonder what the correct way is to go on about this. When viewing the sitemap ( example.com/ja/page-sitemap.xml ) in a web browser, though, the Japanese URL's characters are already displayed as encoded. I am not sure if submitting the Kanji style URLs separately is a solution. In Bing Webmaster Tools this can only be done on the root domain level ( example.com ). However, surely there must be a way to make Bing's sitemap submission understand Japanese style sitemaps? Many thanks everyone for any advice!
Technical SEO | | Hermski0 -
Google tries to index non existing language URLs. Why?
Hi, I am working for a SAAS client. He uses two different language versions by using two different subdomains.
Technical SEO | | TheHecksler
de.domain.com/company for german and en.domain.com for english. Many thousands URLs has been indexed correctly. But Google Search Console tries to index URLs which were never existing before and are still not existing. de.domain.com**/en/company
en.domain.com/de/**company ... and an thousand more using the /en/ or /de/ in between. We never use this variant and calling these URLs will throw up a 404 Page correctly (but with wrong respond code - we`re fixing that 😉 ). But Google tries to index these kind of URLs again and again. And, I couldnt find any source of these URLs. No Website is using this as an out going link, etc.
We do see in our logfiles, that a Screaming Frog Installation and moz.com w opensiteexplorer were trying to access this earlier. My Question: How does Google comes up with that? From where did they get these URLs, that (to our knowledge) never existed? Any ideas? Thanks 🙂0 -
Google is indexing bad URLS
Hi All, The site I am working on is built on Wordpress. The plugin Revolution Slider was downloaded. While no longer utilized, it still remained on the site for some time. This plugin began creating hundreds of URLs containing nothing but code on the page. I noticed these URLs were being indexed by Google. The URLs follow the structure: www.mysite.com/wp-content/uploads/revslider/templates/this-part-changes/ I have done the following to prevent these URLs from being created & indexed: 1. Added a directive in my Htaccess to 404 all of these URLs 2. Blocked /wp-content/uploads/revslider/ in my robots.txt 3. Manually de-inedex each URL using the GSC tool 4. Deleted the plugin However, new URLs still appear in Google's index, despite being blocked by robots.txt and resolving to a 404. Can anyone suggest any next steps? I Thanks!
Technical SEO | | Tom3_150 -
How google crawls images and which url shows as source?
Hi, I noticed that some websites host their images to a different url than the one their actually website is hosted but in the end google link to the one that the site is hosted. Here is an example: This is a page of a hotel in booking.com: http://www.booking.com/hotel/us/harrah-s-caesars-palace.en-gb.html When I try a search for this hotel in google images it shows up one of the images of the slideshow. When I click on the image on Google search, if I choose the Visit Page button it links to the url above but the actual image is located in a totally different url: http://r-ec.bstatic.com/images/hotel/840x460/135/13526198.jpg My question is can you host your images to one site but show it to another site and in the end google will lead to the second one?
Technical SEO | | Tz_Seo0 -
Category URL Pagination where URLs don't change between pages
Hello, I am working on an e-commerce site where there are categories with multiple pages. In order to avoid pagination issues I was thinking of using rel=next and rel=prev and cannonical tags. I noticed a site where the URL doesn't change between pages, so whether you're on page 1,2, or 3 of the same category, the URL doesn't change. Would this be a cleaner way of dealing with pagination?
Technical SEO | | whiteonlySEO0 -
CDN Being Crawled and Indexed by Google
I'm doing a SEO site audit, and I've discovered that the site uses a Content Delivery Network (CDN) that's being crawled and indexed by Google. There are two sub-domains from the CDN that are being crawled and indexed. A small number of organic search visitors have come through these two sub domains. So the CDN based content is out-ranking the root domain, in a small number of cases. It's a huge duplicate content issue (tens of thousands of URLs being crawled) - what's the best way to prevent the crawling and indexing of a CDN like this? Exclude via robots.txt? Additionally, the use of relative canonical tags (instead of absolute) appear to be contributing to this problem as well. As I understand it, these canonical tags are telling the SEs that each sub domain is the "home" of the content/URL. Thanks! Scott
Technical SEO | | Scott-Thomas0 -
Google is indexing my directories
I'm sure this has been asked before, but I was looking at all of Google's results for my site and I found dozens of results for directories such as: Index of /scouting/blog/wp-includes/js/swfupload/plugins Obviously I don't want those indexed. How do I prevent Google from indexing those? Also, it only seems to be doing it with Wordpress, not any of the directories on my main site. (We have a wordpress blog, which is only a portion of the site)
Technical SEO | | UnderRugSwept0 -
No indexing url including query string with Robots txt
Dear all, how can I block url/pages with query strings like page.html?dir=asc&order=name with robots txt? Thanks!
Technical SEO | | HMK-NL0