The Moz Q&A Forum

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    Sitemap use for very large forum-based community site

    Technical SEO
    • CommManager

      I work on a very large site with two main types of content: static landing pages for products, and user-created forums and blogs under each product. The site has roughly 500k to 1 million pages. We do not have a sitemap at this time.
      Our SEO discoverability is generally good: Google indexes new forum threads within roughly 1-5 days. However, some of the "static" landing pages for our smaller, less-visited products do not perform well in search.
      The question is: could our SEO be improved by creating a sitemap, and if so, how should it be implemented? I see two ways to go about it:

      1. Sitemap includes "static" product category landing pages only - i.e., the product home pages, the forum landing pages, and blog list pages. This would probably end up being 100-200 URLs.
      2. Sitemap contains the above but is also dynamically updated with new threads & blog posts.

      Option 2 seems like it would make the sitemap unmanageably long (hundreds of thousands of forum URLs). Would a crawler even parse something that size? And with Option 1, could our organically ranked pages change ranking because Google re-prioritizes the pages listed in the sitemap?
      There is not a lot of information out there on this topic, so I appreciate any input. Thanks in advance.

      • GFD_Chris

        Agreed, you'll likely want to go with option #2. Dynamic sitemaps are a must when you're dealing with a site this large, and we recommend them to all of our clients with larger sites. If your forum content is important for search, it is definitely worth including, as that content likely changes often and may sit naturally deeper in the site architecture.

        In general, I'd think of sitemaps from a discoverability perspective rather than a ranking one. The primary goal is to give Googlebot an avenue to crawl your site's content regardless of internal linking structure.
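
To make the "dynamic sitemap" idea concrete, here is a minimal sketch of generating sitemap XML from a list of pages. The function name `build_sitemap` and the `(url, last_modified)` input shape are assumptions for illustration; a real site would pull these records from its own forum and blog database.

```python
from datetime import datetime, timezone
from xml.etree import ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(pages):
    """Return sitemap XML (bytes) for an iterable of (url, last_modified) pairs.

    `last_modified` is assumed to be a timezone-aware datetime.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url, last_modified in pages:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
        # <lastmod> uses W3C datetime format; normalise to UTC first.
        stamp = last_modified.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
        ET.SubElement(entry, "lastmod").text = stamp
    return ET.tostring(urlset, encoding="utf-8", xml_declaration=True)


if __name__ == "__main__":
    # Hypothetical thread record, for illustration only.
    threads = [
        ("https://example.com/forum/thread-1",
         datetime(2020, 1, 2, 3, 4, 5, tzinfo=timezone.utc)),
    ]
    print(build_sitemap(threads).decode("utf-8"))
```

Regenerating this output on a schedule, or whenever a thread is created or updated, is what keeps the sitemap "dynamic".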

        • Martijn_Scheijbeler

          Hi

          Go with option 2; there is no scaling issue here. I have worked with and for sites that submit a very large number of sitemaps and pages, in some cases up to 100M pages. In all cases, Google was totally fine crawling and processing the data. As long as you follow the guidelines (a maximum of 50,000 URLs per sitemap file, and no more than 50MB uncompressed), you're fine: each sitemap is just another file, and it rarely gets close to the size limit unless you also add images to it. If you have an engineering team build the right infrastructure, you can easily deal with thousands of these files and regenerate them automatically every day or week.
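
As a rough sketch of that infrastructure, the snippet below splits a URL list into files of at most 50,000 URLs and builds a sitemap index that points to them. The file names (`sitemap-1.xml`, `sitemap-index.xml`), the helper names, and the `base_url` parameter are assumptions for illustration, not a prescribed layout. It does not check the 50MB size limit.

```python
from itertools import islice
from xml.etree import ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_FILE = 50_000  # per-file URL limit from the sitemap protocol


def chunked(urls, size=MAX_URLS_PER_FILE):
    """Yield successive lists of at most `size` URLs."""
    iterator = iter(urls)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


def build_urlset(urls):
    """Serialise one sitemap file containing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    return ET.tostring(urlset, encoding="utf-8", xml_declaration=True)


def build_index(sitemap_urls):
    """Serialise a sitemap index listing the given sitemap file URLs."""
    index = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for sitemap_url in sitemap_urls:
        ET.SubElement(ET.SubElement(index, "sitemap"), "loc").text = sitemap_url
    return ET.tostring(index, encoding="utf-8", xml_declaration=True)


def build_all(urls, base_url):
    """Return {file name: XML bytes} for every sitemap file plus the index."""
    files = {}
    for number, batch in enumerate(chunked(urls), start=1):
        files[f"sitemap-{number}.xml"] = build_urlset(batch)
    index_entries = [f"{base_url}/{name}" for name in files]
    files["sitemap-index.xml"] = build_index(index_entries)
    return files
```

With this layout only the index URL needs to be submitted; the individual files are discovered through it.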

          My main focus on big sites is also to streamline their sitemaps: one sitemap with just the last 50,000 pages added, and another with the last 50,000 pages updated. This way you can also monitor the indexation level of these pages. If you can, for example, combine this with data from log file analysis, you can say: we added 50K pages, and over the last few days Google crawled X percent of them.
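
The "X percent" figure can be estimated along these lines. This is a sketch under stated assumptions: the access log is in the common combined format, with the request line in the first quoted field, and matching on the `Googlebot` token in the line is treated as good enough, even though user-agent strings can be spoofed. The function name `crawl_coverage` is hypothetical.

```python
from urllib.parse import urlsplit


def crawl_coverage(sitemap_urls, log_lines, bot_token="Googlebot"):
    """Return the percentage of sitemap URLs requested by the bot in the logs."""
    wanted_paths = {urlsplit(url).path for url in sitemap_urls}
    crawled = set()
    for line in log_lines:
        if bot_token not in line:
            continue
        try:
            # First quoted field looks like: GET /path HTTP/1.1
            request = line.split('"')[1]
            target = request.split()[1]
        except IndexError:
            continue  # skip lines that do not match the assumed format
        path = urlsplit(target).path  # ignore any query string
        if path in wanted_paths:
            crawled.add(path)
    if not wanted_paths:
        return 0.0
    return 100.0 * len(crawled) / len(wanted_paths)
```

Running this against the "last 50,000 added" sitemap and a few days of logs gives a number you can track over time.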

          Hope this gives you some extra insights.

          Martijn.
