How can I make a list of all URLs indexed by Google?
-
I have a large site with over 6000 pages indexed but only 600 actual pages and need to clean up with 301 redirects. Haven't had this need since Google stopped displaying the url's in the results.
-
very depressed of being not indexing URL - https://www.wrightleatherworks.com/collections/gun-holsters-for-women please someone help
-
Open the sitemap in a text editor or XML viewer. Look at the <loc> tags in the sitemap file. These tags contain the URL of each page on the website. You can copy the URLs from the <loc> tags into a spreadsheet or text document.
-
Open the sitemap in a text editor or XML viewer. Look at the <loc> tags in the sitemap file. These tags contain the URL of each page on the website. You can copy the URLs from the <loc> tags into a spreadsheet or text document.
-
@aplusnetsolutions create a seperate sitemap for not indexed pages and upload to google web master tools. i had a website had same issues. Also remember it will take some time also
-
@aplusnetsolutions
Make sure all these things are selected in screaming frog and -
Going to Google search console it will tell you how many inches are inside Google search index there are a few other ways to do this if you want
#1
By far the easiest is to going to search console and select coverage and all known pages
number 2
For the quick and dirty method, simply perform a simple site search in your Google search bar with “site:yourdomain.com”
#3
You can crawl your site using screaming frog (it will 500 pages for free but every SEO should have it) use it to index your site and connect it to Google search console API as well as Google analytics and you will have a easy-to-read CSV export of your index and non-index pages
more on how this is done![ you can see the non-indexable URLs on the the right click that]( image url)
video https://youtu.be/iYeXSdUt_hg
#4
or indexer site with any thing that can export a CSV download the CSV uploaded here and it will tell you which pages are in Google's index
https://www.rankwatch.com/free-tools/google-index-checkerhttps://searchengineland.com/check-urls-indexed-without-upsetting-google-follow-267472
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
google webmaster tools Indexing request rejected
when i try to index my posts in google webmaster tools i see this eror : Indexing request rejected
SEO Tactics | | sasansasyino
During live testing, indexing issues were detected with the URL
Crawl
Time
Sep 23, 2023, 11:05:05 PM
Crawled as
Google Inspection Tool desktop
Crawl allowed?
Yes
Page fetch
error
Failed: Hostload exceeded
Indexing allowed?
N/A
Indexing
User-declared canonical
N/A
Google-selected canonical
Only determined after indexing my website : http://123select.ir/0 -
Multiple URLs from One Domain on Page 1
Is it possible (and if it is possible, how difficult is it) to get multiple URLs from the same domain name to be indexed on the first page of Google for the same keyword or phrase? Assuming each URL has significantly different content of course... Context: A large real estate brand has 6 franchise offices in the same city. Each office has its own listing/landing page on the parent brand website. Each franchise owner wants their page on the website to rank for the term 'Christchurch real estate'. The homepage of the parent brand website currently ranks on the first page of Google for this search term, but none of the franchisee pages do. So my question is: with the right unique content on each franchisee page, supported by quality backlinks to each of the different offices, is it possible to get multiple franchisees listed on the first page of Google given that ultimately the 6 URLs are all attached to the same domain name? (And, if so, do you have any hot tips you can share to assist me on this uphill battle?)
SEO Tactics | | BeKonstructive0 -
When to re-write and redirect a blog url?
What are best practices for rewriting (and then redirecting) blog URLs? I refresh old blog posts on our blog every month and many of them have URLs that are too long or could be improved. However, many of them also already get decent organic traffic and I don't want to lose traffic due to a URL redirect. Are there any best practices or "rules" I can follow when deciding whether to re-write and redirect blog URLs?
Content Development | | Emily.R.Monrovia
Thanks!0 -
Plagiarized Site Effecting Google Rankings
Can someone provides insights on a de-indexing example? I have gone through the depths of Google lack of support and requesting duplicate content flags, so no avail. Here's the scenario: Client had a competing SEO provider try to earn his business. In doing so, he copied word for word our blog that we have been producing content on over the last 5 years. He also integrated Google reviews in the structured data on this new URL. Well, fast forward 1-2 months later, our rankings started to drop. We found this 100% plagiarized site is taking away from our keyword rankings on GMB, and is no and Google search, and our site GMB is now only displaying on a branded name search as well as our search traffic has dropped. I have identified the plagiarized, duplicated content, being tied to our GMB as well, as the source of the problem. Well, I finally obtain ed control of the plagarized domain and shut down the hosted, and forwarded the URL to our URL. Well, Google still has the HTTS version of the site indexed. And it is in my professional opinion, that since the site is still indexed and is associated with the physician GMB that was ranking for our target keyword and no longer does, that this is the barrier to ranking again. Since its the HTTPS version, it is not forwarded to our domain. Its a 504 error but is still ranking in the google index. The hosting and SSL was canceled circa December 10th. I have been waiting for Google to de-index this site, therefore allowing our primary site to climb the rankings and GMB rankings once again. But it has been 6 weeks and Google is still indexing this spam site. I am incredibly frustrated with google support (as a google partner) and disappointed that this spam site is still indexed. Again, my conclusion that when this SPAM site is de-indexed, we will return back to #1. But when? and at this point, ever? Highlighted below is the spam site. Any suggestions? Capture.PNG
SEO Tactics | | WebMarkets0 -
Google Search Console - Excluded Pages and Multiple Properties
I have used Moz to identify keywords that are ideal for my website and then I optimized different pages for those keywords, but unfortunately rankings for some of the pages have declined. Since I am working with an ecommerce site, I read that having a lot of Excluded pages on the Google Search Console was to be expected so I initially ignored them. However, some of the pages I was trying to optimize are listed there, especially under the 'Crawled - currently not indexed' and the 'Discovered - currently not indexed' sections. I have read this page (link: https://moz.com/blog/crawled-currently-not-indexed-coverage-status ) and plan on focusing on Steps 5 & 7, but wanted to ask if anyone else has had experience with these issues. Also, does anyone know if having multiple properties (https vs http, www vs no www) can negatively affect a site? For example, could a sitemap from one property overwrite another? Would removing one property from the Console have any negative impact on the site? I plan on asking these questions on a Google forum, but I wanted to add it to this post in case anyone here had any insights. Thank you very much for your time,
SEO Tactics | | ForestGT
Forest0 -
Solve Redirect Chains
I've gotten a few Redirect Issues that involve Redirect Chains, with the https:// version redirecting to the www. version and then redirecting to the right URL. Here is an example:
On-Page Optimization | | Billywig
Schermafbeelding 2021-12-07 om 11.04.32.png I've tried setting a direct redirect between the first and the last URL, but WordPress doesn't seem to allow that (it's overwritten). I've also tried checking the internal links to make sure that none of the links are the first one. They don't seem to be there. Does anyone have any tips on solving these Redirect Chains?0