Skip to content
    Moz logo Menu open Menu close
    • Products
      • Moz Pro
      • Moz Pro Home
      • Moz Local
      • Moz Local Home
      • STAT
      • Moz API
      • Moz API Home
      • Compare SEO Products
      • Moz Data
    • Free SEO Tools
      • Domain Analysis
      • Keyword Explorer
      • Link Explorer
      • Competitive Research
      • MozBar
      • More Free SEO Tools
    • Learn SEO
      • Beginner's Guide to SEO
      • SEO Learning Center
      • Moz Academy
      • SEO Q&A
      • Webinars, Whitepapers, & Guides
    • Blog
    • Why Moz
      • Agency Solutions
      • Enterprise Solutions
      • Small Business Solutions
      • Case Studies
      • The Moz Story
      • New Releases
    • Log in
    • Log out
    • Products
      • Moz Pro

        Your all-in-one suite of SEO essentials.

      • Moz Local

        Raise your local SEO visibility with complete local SEO management.

      • STAT

        SERP tracking and analytics for enterprise SEO experts.

      • Moz API

        Power your SEO with our index of over 44 trillion links.

      • Compare SEO Products

        See which Moz SEO solution best meets your business needs.

      • Moz Data

        Power your SEO strategy & AI models with custom data solutions.

      NEW Keyword Suggestions by Topic
      Moz Pro

      NEW Keyword Suggestions by Topic

      Learn more
    • Free SEO Tools
      • Domain Analysis

        Get top competitive SEO metrics like DA, top pages and more.

      • Keyword Explorer

        Find traffic-driving keywords with our 1.25 billion+ keyword index.

      • Link Explorer

        Explore over 40 trillion links for powerful backlink data.

      • Competitive Research

        Uncover valuable insights on your organic search competitors.

      • MozBar

        See top SEO metrics for free as you browse the web.

      • More Free SEO Tools

        Explore all the free SEO tools Moz has to offer.

      What is your Brand Authority?
      Moz

      What is your Brand Authority?

      Check yours now
    • Learn SEO
      • Beginner's Guide to SEO

        The #1 most popular introduction to SEO, trusted by millions.

      • SEO Learning Center

        Broaden your knowledge with SEO resources for all skill levels.

      • On-Demand Webinars

        Learn modern SEO best practices from industry experts.

      • How-To Guides

        Step-by-step guides to search success from the authority on SEO.

      • Moz Academy

        Upskill and get certified with on-demand courses & certifications.

      • SEO Q&A

        Insights & discussions from an SEO community of 500,000+.

      Unlock flexible pricing & new endpoints
      Moz API

      Unlock flexible pricing & new endpoints

      Find your plan
    • Blog
    • Why Moz
      • Small Business Solutions

        Uncover insights to make smarter marketing decisions in less time.

      • Agency Solutions

        Earn & keep valuable clients with unparalleled data & insights.

      • Enterprise Solutions

        Gain a competitive edge in the ever-changing world of search.

      • The Moz Story

        Moz was the first & remains the most trusted SEO company.

      • Case Studies

        Explore how Moz drives ROI with a proven track record of success.

      • New Releases

        Get the scoop on the latest and greatest from Moz.

      Surface actionable competitive intel
      New Feature

      Surface actionable competitive intel

      Learn More
    • Log in
      • Moz Pro
      • Moz Local
      • Moz Local Dashboard
      • Moz API
      • Moz API Dashboard
      • Moz Academy
    • Avatar
      • Moz Home
      • Notifications
      • Account & Billing
      • Manage Users
      • Community Profile
      • My Q&A
      • My Videos
      • Log Out

    The Moz Q&A Forum

    • Forum
    • Questions
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. Home
    2. SEO Tactics
    3. Technical SEO
    4. How Does Google's "index" find the location of pages in the "page directory" to return?

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    How Does Google's "index" find the location of pages in the "page directory" to return?

    Technical SEO
    3
    9
    1765
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with question management privileges can see it.
    • reidsteven75
      reidsteven75 last edited by

      This is my understanding of how Google's search works, and I am unsure about one thing in specific:

      1. Google continuously crawls websites and stores each page it finds (let's call it "page directory")
      2. Google's "page directory" is a cache so it isn't the "live" version of the page
      3. Google has separate storage called "the index" which contains all the keywords searched.  These keywords in "the index" point to the pages in the "page directory" that contain the same keywords.
      4. When someone searches a keyword, that keyword is accessed in the "index" and returns all relevant pages in the "page directory"
      5. These returned pages are given ranks based on the algorithm

      The one part I'm unsure of is how Google's "index" knows the location of relevant pages in the "page directory".  The keyword entries in the "index" point to the "page directory" somehow. I'm thinking each page has a url in the "page directory", and the entries in the "index" contain these urls.   Since Google's "page directory" is a cache, would the urls be the same as the live website (and would the keywords in the "index" point to these urls)?

      For example if webpage is found at wwww.website.com/page1, would the "page directory" store this page under that url in Google's cache?

      The reason I want to discuss this is to know the effects of changing a pages url by understanding how the search process works better.

      1 Reply Last reply Reply Quote 0
      • reidsteven75
        reidsteven75 @cbielich last edited by

        Yeah that makes sense.  I also have a lot of experience with databases and the back ends of websites so I know your language.

        I'm wondering how Google correlates the url with the page entries then. Maybe each page entry would have a url field so Google knows the location of the live version to constantly update that entry in the "page directory" database?

        1 Reply Last reply Reply Quote 0
        • cbielich
          cbielich @reidsteven75 last edited by

          That is a question that no one here can answer. We cant speak for how Google does things internally.

          but.... as a web / database programmer for 14+ years let me tell you how its "generally" done

          Usually when you have to link to separate sets of data together (ie. database or tables) there is usually a unique_id created to link them which usually is never changed. So when a new record is created that record will live with that ID for its life, also known as a (unique identifier which tends to be an auto-incremented number that is dynamically generated and can not be repeated).

          Since records tend to be linked this way, any other fields that exist in the record (firstName, lastName, Url, blah blah) then can be changed without the original ID being disturbed.

          So to answer your question from my experience I would assume Google links from a unique identifier of some sort and not the URL directly.

          Hope I didn't lose you, its my favorite subject...but no one here speaks that language to much 🙂

          reidsteven75 1 Reply Last reply Reply Quote 1
          • reidsteven75
            reidsteven75 @TakeshiYoung last edited by

            That makes sense, thanks for getting back to me so fast!

            Perhaps you can help answer my next question.  I have a client who used to host his domain at "www.oldurl.com", and has migrated his website to "www.newurl.com".  He wants to use his old domain "www.oldurl.com", so he setup forwarding/masking so that when someone tries to access "www.oldurl.com" they are forwarded to "www.newurl.com" but the url shown to the user is "www.oldurl.com".

            My client want his old url "www.oldurl.com" to be ranked in Google, but from what I understand his new url will be ranked.  I know masking is really bad for SEO, and I want to educate my client as to why on the technical side.  I have read Google see's all the content as duplicate with masking.  Do you know the details as to why?

            1 Reply Last reply Reply Quote 0
            • reidsteven75
              reidsteven75 last edited by

              Hey Cesar,

              Thanks for the links!  Really useful info there.

              Unfortunately they I couldn't find the answer I was looking for so I'll be more specific in what I'm asking.

              From what I understand Google uses two database systems.   One contains keywords and the other contains cached pages.  How does a keyword entry point to a page entry?  Does it use a unique id number, or does it use the url that page is using in the "live" vesion on the web?

              cbielich 1 Reply Last reply Reply Quote 0
              • TakeshiYoung
                TakeshiYoung @reidsteven75 last edited by

                Just because you create a new page and delete the old one, Google won't know immediately about it. So if Google crawls the new page before it's had a chance to crawl the old one, then it will indeed consider the new page to be duplicate content. Then when it tries to crawl the old page, it will discover that it no longer exists. However, as long as links to the old page exist, it will continue to try to crawl that page. Eventually it may de-index the old page if it keeps returning an error.

                Bottom line, if you are moving content to a new URL, be sure to include a 301 redirect on the old page so that Google (and other search engines) know that the piece of content has moved. You can also do this with canonical tags, but 301s are more effective.

                reidsteven75 1 Reply Last reply Reply Quote 1
                • reidsteven75
                  reidsteven75 @TakeshiYoung last edited by

                  Thanks for the response and links Takeshi.  Maybe I can rephrase the question to be more clear. Let's say a piece of content (or page) is at the url "www.oldurl.com/page".  During a migration this same piece of content now at the url "www.newurl.com/page".   The "www.oldurl.com" doesn't exist anymore so there isn't duplicate content in the live web.

                  Would Google create a new entry in it's "page directory" (what is the industry standard name for this directory?) and give it the url "www.newurl.com/page"?

                  If it does create a new entry, would Google keep the old entry "www.oldurl.com/page" although the old url doesn't exist in the "live" web anymore?

                  TakeshiYoung 1 Reply Last reply Reply Quote 0
                  • cbielich
                    cbielich last edited by

                    Wow you just asked questions that would require about 10,000,000,000 answers 😉

                    Lets start here

                    1. Video from the man himself Mr. Matt Cutts - Matt Cutts (Works for Google)
                    2. Great Web 2.0 Page create from Google themself - (Google Them self)
                    3. Older but still relevant description about how "backlinks" affect PR - (Google Them self)
                    1 Reply Last reply Reply Quote 2
                    • TakeshiYoung
                      TakeshiYoung last edited by

                      This a pretty confusing question, and the terminology you use is different from industry standard. Check out these links for a quick overview of how Google works:

                      • http://www.google.com/insidesearch/howsearchworks/thestory/
                      • http://www.googleguide.com/google_works.html

                      If you are just worried about changing a page's url, just be sure to put in a 301 redirect from the old page to the new page. That way, even if Google has an older version of the page indexed, it will automatically redirect the user to the new page as well as help Google discover the new location of the page.

                      reidsteven75 1 Reply Last reply Reply Quote 1
                      • 1 / 1
                      • First post
                        Last post

                      Got a burning SEO question?

                      Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


                      Start my free trial


                      Browse Questions

                      Explore more categories

                      • Moz Tools

                        Chat with the community about the Moz tools.

                      • SEO Tactics

                        Discuss the SEO process with fellow marketers

                      • Community

                        Discuss industry events, jobs, and news!

                      • Digital Marketing

                        Chat about tactics outside of SEO

                      • Research & Trends

                        Dive into research and trends in the search industry.

                      • Support

                        Connect on product support and feature requests.

                      • See all categories

                      Related Questions

                      • Chophel

                        My WP website got attack by malware & now my website site:www.example.ca shows about 43000 indexed page in google.

                        Hi All My wordpress website got attack by malware last week. It affected my index page in google badly. my typical site:example.ca shows about 130 indexed pages on google. Now it shows about 43000 indexed pages.  I had my server company tech support scan my site and clean the malware yesterday. But it still shows the same number of indexed page on google. Does anybody had ever experience such situation and how did you fixed it. Looking for help. Thanks FILE HIT LIST:
                        {YARA}Spam_PHP_WPVCD_ContentInjection : /home/example/public_html/wp-includes/wp-tmp.php
                        {YARA}Backdoor_PHP_WPVCD_Deployer : /home/example/public_html/wp-includes/wp-vcd.php
                        {YARA}Backdoor_PHP_WPVCD_Deployer : /home/example/public_html/wp-content/themes/oceanwp.zip
                        {YARA}webshell_webshell_cnseay02_1 : /home/example2/public_html/content.php
                        {YARA}eval_post : /home/example2/public_html/wp-includes/63292236.php
                        {YARA}webshell_webshell_cnseay02_1 : /home/example3/public_html/content.php
                        {YARA}eval_post : /home/example4/public_html/wp-admin/28855846.php
                        {HEX}php.generic.malware.442 : /home/example5/public_html/wp-22.php
                        {HEX}php.generic.cav7.421 : /home/example5/public_html/SEUN.php
                        {HEX}php.generic.malware.442 : /home/example5/public_html/Webhook.php

                        Technical SEO | | Chophel
                        0
                      • BrianAlpert78

                        Does a no-indexed parent page impact its child pages?

                        If I have a page* in WordPress that is set as private and is no-indexed with Yoast, will that negatively affect the visibility of other pages that are set as children of that first page? *The context is that I want to organize some of the pages on a business's WordPress site into silos/directories. For example, if the business was a home remodeling company, it'd be convenient to keep all the pages about bathrooms, kitchens, additions, basements, etc. bundled together under a "services" parent page (/services/kitchens/, /services/bathrooms/, etc.). The thing is that the child pages will all be directly accessible from the menus, so there doesn't need to be anything on the parent /services/ page itself. Another such parent page/directory/category might be used to keep different photo gallery pages together (/galleries/kitchen-photos/, /galleries/bathroom-photos/, etc.). So again, would it be safe for pages like /services/kitchens/ and /galleries/addition-photos/ if the /services/ and /galleries/ pages (but not /galleries/* or anything like that) are no-indexed? Thanks!

                        Technical SEO | | BrianAlpert78
                        1
                      • netzkern_AG

                        Does Google index internal anchors as separate pages?

                        Hi, Back in September, I added a function that sets an anchor on each subheading (h[2-6]) and creates a Table of content that links to each of those anchors. These anchors did show up in the SERPs as JumpTo Links. Fine. Back then I also changed the canonicals to a slightly different structur and meanwhile there was some massive increase in the number of indexed pages - WAY over the top - which has since been fixed by removing (410) a complete section of the site. However ... there are still ~34.000 pages indexed to what really are more like 4.000 plus (all properly canonicalised). Naturally I am wondering, what google thinks it is indexing. The number is just way of and quite inexplainable. So I was wondering: Does Google save JumpTo links as unique pages? Also, does anybody know any method of actually getting all the pages in the google index? (Not actually existing sites via Screaming Frog etc, but actual pages in the index - all methods I found sadly do not work.) Finally: Does somebody have any other explanation for the incongruency in indexed vs. actual pages? Thanks for your replies! Nico

                        Technical SEO | | netzkern_AG
                        0
                      • KatherineWatierOng

                        How do I "undo" or remove a Google Search Console change of address?

                        I have a client that set a change of address in Google Search Console where they informed Google that their preferred domain was a subdomain, and now they want Google to also consider their base domain (without the change of address). How do I get the change of address in Google search console removed?

                        Technical SEO | | KatherineWatierOng
                        0
                      • jim_shook

                        Best way to handle pages with iframes that I don't want indexed? Noindex in the header?

                        I am doing a bit of SEO work for a friend, and the situation is the following: The site is a place to discuss articles on the web. When clicking on a link that has been posted, it sends the user to a URL on the main site that is URL.com/article/view. This page has a large iframe that contains the article itself, and a small bar at the top containing the article with various links to get back to the original site. I'd like to make sure that the comment pages (URL.com/article) are indexed instead of all of the URL.com/article/view pages, which won't really do much for SEO. However, all of these pages are indexed. What would be the best approach to make sure the iframe pages aren't indexed? My intuition is to just have a "noindex" in the header of those pages, and just make sure that the conversation pages themselves are properly linked throughout the site, so that they get indexed properly. Does this seem right? Thanks for the help...

                        Technical SEO | | jim_shook
                        0
                      • UIPL

                        How to stop my webmail pages not to be indexed on Google ??

                        when i did a search in google for Site:mywebsite.com , for a list of pages indexed. Surprisingly the following come up " Webmail - Login " Although this is associated with the domain , this is a completely different server , this the rackspace email server browser interface  I am sure that there is nothing on the website that links or points to this.
                        So why is Google indexing it ? & how do I get it out of there. I tried in webmaster tool but I could not , as it seems like a sub-domain. Any ideas ? Thanks Naresh Sadasivan

                        Technical SEO | | UIPL
                        0
                      • TomLondon

                        Pages removed from Google index?

                        Hi All, I had around 2,300 pages in the google index until a week ago. The index removed a load and left me with 152 submitted, 152 indexed? I have just re-submitted my sitemap and will wait to see what happens. Any idea why it has done this? I have seen a drop in my rankings since. Thanks

                        Technical SEO | | TomLondon
                        0
                      • priceseo

                        How to determine which pages are not indexed

                        Is there a way to determine which pages of a website are not being indexed by the search engines? I know Google Webmasters has a sitemap area where it tells you how many urls have been submitted and how many are indexed out of those submitted. However, it doesn't necessarily show which urls aren't being indexed.

                        Technical SEO | | priceseo
                        1

                      Get started with Moz Pro!

                      Unlock the power of advanced SEO tools and data-driven insights.

                      Start my free trial
                      Products
                      • Moz Pro
                      • Moz Local
                      • Moz API
                      • Moz Data
                      • STAT
                      • Product Updates
                      Moz Solutions
                      • SMB Solutions
                      • Agency Solutions
                      • Enterprise Solutions
                      Free SEO Tools
                      • Domain Authority Checker
                      • Link Explorer
                      • Keyword Explorer
                      • Competitive Research
                      • Brand Authority Checker
                      • MozBar Extension
                      • MozCast
                      Resources
                      • Blog
                      • SEO Learning Center
                      • Help Hub
                      • Beginner's Guide to SEO
                      • How-to Guides
                      • Moz Academy
                      • API Docs
                      About Moz
                      • About
                      • Team
                      • Careers
                      • Contact
                      Why Moz
                      • Case Studies
                      • Testimonials
                      Get Involved
                      • Become an Affiliate
                      • MozCon
                      • Webinars
                      • Practical Marketer Series
                      • MozPod
                      Connect with us

                      Contact the Help team

                      Join our newsletter
                      Moz logo
                      © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                      • Accessibility
                      • Terms of Use
                      • Privacy

                      Looks like your connection to Moz was lost, please wait while we try to reconnect.