The Moz Q&A Forum

    • Forum
    • Questions
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Forum
    2. Categories
    3. Moz Tools
    4. Moz Pro
    5. What to do with a site of >50,000 pages vs. crawl limit?
    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    What to do with a site of >50,000 pages vs. crawl limit?

    Moz Pro
    5 3 2.5k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • scienceisrad
      scienceisrad last edited by

      What happens if you have a site in your Moz Pro campaign that has more than 50,000 pages?

      Would it be better to choose a sub-folder of the site to get a thorough look at that sub-folder?

      I have a few different large government websites that I'm tracking to see how they are fairing in rankings and SEO.  They are not my own websites.  I want to see how these agencies are doing compared to what the public searches for on technical topics and social issues that the agencies manage.  I'm an academic looking at science communication.  I am in the process of re-setting up my campaigns to get better data than I have been getting -- I am a newbie to SEO and the campaigns I slapped together a few months ago need to be set up better, such as all on the same day, making sure I've set it to include www or not for what ranks, refining my keywords, etc.

      I am stumped on what to do about the agency websites being really huge, and what all the options are to get good data in light of the 50,000 page crawl limit.  Here is an example of what I mean:

      To see how EPA is doing in searches related to air quality, ideally I'd track all of EPA's web presence.

      www.epa.gov has 560,000 pages -- if I put in www.epa.gov for a campaign, what happens with the site having so many more pages than the 50,000 crawl limit?  What do I miss out on?  Can I "trust" what I get?

      www.epa.gov/air has only 1450 pages, so if I choose this for what I track in a campaign, the crawl will cover that subfolder completely, and I am getting a complete picture of this air-focused sub-folder ... but (1) I'll miss out on air-related pages in other sub-folders of www.epa.gov, and (2) it seems like I have so much of the 50,000-page crawl limit that I'm not using and could be using.  (However, maybe that's not quite true - I'd also be tracking other sites as competitors - e.g. non-profits that advocate in air quality, industry air quality sites - and maybe those competitors count towards the 50,000-page crawl limit and would get me up to the limit? How do the competitors you choose figure into the crawl limit?)

      Any opinions on which I should do in general on this kind of situation?  The small sub-folder vs. the full humongous site vs. is there some other way to go here that I'm not thinking of?

      1 Reply Last reply Reply Quote 0
      • scienceisrad
        scienceisrad last edited by

        Hi Sean -- Can you clarify for me how competitors in a campaign figure in to the 50,000 page limit?  Does the main page in the campaign get thoroughly crawled first and then competitors are crawled up to the limit?

        Some examples:

        If the main site is 100 pages, and I pick 2 competitors that are 100 to 1000 pages and a 3rd gargantuan competitor of 300,000 pages, what happens?  Does it matter in what order I enter competitors in this situation as to whether the 100-page and 1000-page competitors get crawled vs. whether the limit maxes out on the 300K competitor before crawling the smaller competitors?

        If the main site is 300,000 pages, do any competitors in the campaign just not get crawled at all because the 50,000 limit gets all used up on the  main site?

        What if the main site is 20,000 pages and a competitor is 45,000 pages?  Thorough crawl of main site and then partial crawl of competitor?

        I feel like I have a direction to go in based on our previous discussion for the main site in the campaign, but now I'm still a little stumped and confused about how competitors operate within the crawl limit.

        1 Reply Last reply Reply Quote 0
        • Sean_Peerenboom
          Sean_Peerenboom last edited by

          Hi There,

          Thanks for writing us and this is a tricky one because it is difficult to say if there is an objectively right answer. 😞 In this case your best bet would be to create a sub folder that is under the standard subscription campaign limit and attempting to pick up what you miss using the other research tools. Although, our research tools are predominantly designed for one off interactions, you could probably use them to capture information that is a bit outside of the campaigns purview. Here is a link to our research tools for your reference: moz.com/researchtools/ose/

          If you do decide to enter a website that far surpasses the crawl limits then, what will be cut off is determined by the existing site structure. 😞 The way that our crawler works is that it will go from the link provided and use the existing link structure to keep crawling the site or until we run into a dead end.

          Both approaches may present issues so it will be more of a judgement call. One thing that I will say is that we have a much easier time crawling fewer pages so that may be something to keep in mind.

          Hope this helps and if you have any questions for me please let me know.

          Have a fantastic day!

          1 Reply Last reply Reply Quote 0
          • scienceisrad
            scienceisrad last edited by

            Thanks Patrick for the tip about ScreamingFrog!  I checked out the link you shared, and it looks like a powerful tool.  I'm going to put it on my list of additional tools I need to get going on using.

            In the meantime, though, I still need a strategy for what to do in Moz.  Any opinions on whether I should set my Moz campaigns to the smaller sub-folders of a few thousand pages vs. the humongous full sites of 100,000+ pages?  I guess I'm leaning towards setting them to the smaller sub-folders.  Or maybe I should do a small sub-folder for one of the huge sites and do the full site for another campaign, and see what kind of results I get.

            1 Reply Last reply Reply Quote 0
            • PatrickDelehanty
              PatrickDelehanty last edited by

              Hi there

              I would look into ScreamingFrog - you can crawl 500 URIs for free, otherwise, if you have a license, you can crawl as many pages as you'd like.

              Let me know if this helps! Good luck!

              1 Reply Last reply Reply Quote 2
              • 1 / 1
              • First post
                Last post

              Got a burning SEO question?

              Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


              Start my free trial


              Explore more categories

              • Moz Tools

                Chat with the community about the Moz tools.

                Getting Started
                Moz Pro
                Moz Local
                Moz Bar
                API
                What's New

              • SEO Tactics

                Discuss the SEO process with fellow marketers

                Content Development
                Competitive Research
                Keyword Research
                Link Building
                On-Page Optimization
                Technical SEO
                Reporting & Analytics
                Intermediate & Advanced SEO
                Image & Video Optimization
                International SEO
                Local SEO

              • Community

                Discuss industry events, jobs, and news!

                Moz Blog
                Moz News
                Industry News
                Jobs and Opportunities
                SEO Learn Center
                Whiteboard Friday

              • Digital Marketing

                Chat about tactics outside of SEO

                Affiliate Marketing
                Branding
                Conversion Rate Optimization
                Web Design
                Paid Search Marketing
                Social Media

              • Research & Trends

                Dive into research and trends in the search industry.

                SERP Trends
                Search Behavior
                Algorithm Updates
                White Hat / Black Hat SEO
                Other SEO Tools

              • Support

                Connect on product support and feature requests.

                Product Support
                Feature Requests
                Participate in User Research

              • See all categories

              Get started with Moz Pro!

              Unlock the power of advanced SEO tools and data-driven insights.

              Start my free trial
              Products
              • Moz Pro
              • Moz Local
              • Moz API
              • Moz Data
              • STAT
              • Product Updates
              Moz Solutions
              • SMB Solutions
              • Agency Solutions
              • Enterprise Solutions
              • Digital Marketers
              Free SEO Tools
              • Domain Authority Checker
              • Link Explorer
              • Keyword Explorer
              • Competitive Research
              • Brand Authority Checker
              • Local Citation Checker
              • MozBar Extension
              • MozCast
              Resources
              • Blog
              • SEO Learning Center
              • Help Hub
              • Beginner's Guide to SEO
              • How-to Guides
              • Moz Academy
              • API Docs
              About Moz
              • About
              • Team
              • Careers
              • Contact
              Why Moz
              • Case Studies
              • Testimonials
              Get Involved
              • Become an Affiliate
              • MozCon
              • Webinars
              • Practical Marketer Series
              • MozPod
              Connect with us

              Contact the Help team

              Join our newsletter
              Moz logo
              © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
              • Accessibility
              • Terms of Use
              • Privacy

              Looks like your connection to Moz was lost, please wait while we try to reconnect.