The Moz Q&A Forum

    • Forum
    • Questions
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Forum
    2. Categories
    3. Research & Trends
    4. SERP Trends
    5. Getting indexed in Google Scholar

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    Getting indexed in Google Scholar

    SERP Trends
    seo googlescholar
    4 4 612
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • SimpleSearch
      SimpleSearch last edited by

      Hi all! We have a client who publishes scholarly research as a highly regarded non-profit. Their Publications aren't being indexed in Google Scholar 50% of the time and when they are, Google is pulling random stuff from a PDF vs. from the html page. Any advice on best practices is enormously appreciated

      Ehjay LydiaShammi mec0011 3 Replies Last reply Reply Quote 1
      • mec0011
        mec0011 @SimpleSearch last edited by

        @SimpleSearch I understand your client's frustration with Google Scholar not indexing their research consistently and accurately. Here are some steps you can take to improve their visibility:

        1. Optimize content for Google Scholar:

        Structured data: Implement schema.org markup for scholarly articles. This helps Google Scholar understand the content and context of the research.
        Metadata accuracy: Ensure all metadata (title, authors, keywords, publication date, etc.) is accurate and consistent across all platforms where the research is published.
        Open access: Consider publishing research openly whenever possible. Google Scholar favors open access publications.
        Backlinks: Encourage citations from other reputable research publications.
        Internal linking: Link to the research from other relevant pages on the client's website.
        2. Troubleshoot indexing issues:

        Check for technical errors: Use Google Search Console to identify any technical issues that might prevent indexing.
        Use the "Fetch as Google" tool: Submit the research page to Google to expedite indexing.
        Check Google Scholar's guidelines: Review Google Scholar's guidelines for publishers to ensure compliance.
        Contact Google Scholar support: If the issue persists, contact Google Scholar support for further assistance.
        3. Address content issues:

        PDF vs. HTML: Ensure the HTML version of the research is high quality, well-formatted, and text-based. Google Scholar prioritizes HTML over PDF content.
        Avoid irrelevant content: Remove any irrelevant content from the PDF that might confuse Google Scholar.
        4. Additional resources:

        Google Scholar Publisher Guidelines: https://scholar.google.com/intl/en/scholar/publishers.html
        Google Search Console: https://search.google.com/search-console/about
        Schema.org: https://schema.org/
        Remember: Getting indexed in Google Scholar takes time and effort. Implement these best practices consistently and be patient.

        1 Reply Last reply Reply Quote 0
        • LydiaShammi
          LydiaShammi @SimpleSearch last edited by

          @SimpleSearch

          I understand your concern about inconsistent indexing on Google Scholar. Let's address the issues step by step:

          Issue 1: Publications not being indexed in Google Scholar 50% of the time

          Solution:

          1. Ensure Proper Metadata: Check if your client's publications have accurate and comprehensive metadata. This includes title, author(s), abstract, keywords, publication date, and references. Metadata helps Google Scholar understand the content and index it correctly.

          2. Consistent Citation Format: Ensure that the citation format used in the publications is consistent and follows standard academic conventions. This helps Google Scholar accurately identify and index the content.

          3. Robots.txt File: Make sure the website's robots.txt file allows Google Scholar's bots to crawl and index the publications. Check for any disallow rules that might be preventing indexing.

          4. Sitemap Submission: Submit a sitemap to Google Search Console containing URLs of the publications. This helps Google Scholar discover and index the content more efficiently.

          5. Canonical URLs: Ensure that canonical URLs are implemented correctly for each publication. This helps Google understand the preferred version of the content and avoid duplicate indexing issues.

          6. Quality Content: Ensure that the content of the publications is of high quality and relevance. Google Scholar prioritizes scholarly and authoritative content.

          7. Indexing Frequency: Understand that Google Scholar's indexing process may not be immediate. It may take some time for new publications to be indexed. Patience is key.

          Issue 2: Google pulling random content from PDF instead of HTML page

          Solution:

          1. Structured HTML: Ensure that the HTML version of the publications is properly structured using semantic markup. This helps Google Scholar understand the hierarchy and content of the page more accurately.

          2. Text Accessibility: Make sure the text content of the publications is accessible within the HTML page and not embedded within images or other non-text formats. Google Scholar primarily indexes text-based content.

          3. Metadata Alignment: Check if the metadata provided in the HTML version aligns with the content of the publication. Discrepancies between metadata and actual content may confuse Google Scholar's indexing process.

          4. PDF Optimization: If the publications are available in both HTML and PDF formats, optimize the PDFs for indexing by adding proper metadata, text layer, and bookmarks. This can improve the chances of Google Scholar correctly extracting content from PDFs.

          5. PDF-to-HTML Conversion: Consider converting PDF publications to HTML format using tools or services that preserve the structure and formatting. This ensures better compatibility with Google Scholar's indexing algorithms.

          6. Reporting Issues: If the problem persists despite following these steps, consider reporting the issue to Google Scholar through their support channels or forums. Provide detailed information about the specific publications and indexing issues encountered.

          By implementing these best practices and troubleshooting steps, your client can improve the indexing and visibility of their scholarly publications on Google Scholar. Regularly monitor indexing status and make necessary adjustments to ensure ongoing visibility.

          Visit our webpage lydiadigitalacademy and stay connected!

          1 Reply Last reply Reply Quote 0
          • Ehjay
            Ehjay @SimpleSearch last edited by

            @SimpleSearch This happens sometimes if following are not done properly.

            • Ensure Structured Data: Implement Schema.org markup for scholarly articles including title, author names and affiliations, publication date, keywords, and abstract. This provides clear and consistent information for Google Scholar bots to understand.
            • Format Author Names Consistently: Use the same format (e.g., first name, last name) across all publications and platforms. Consider using ORCID identifiers for authors.
            • Use High-Quality PDFs: Create searchable PDFs with full text and avoid image-based PDFs. Embed metadata within the PDF document properties.
            • Optimize HTML Pages: Make sure the HTML pages for each publication are well-structured with clear titles, headings, and body content. Include the full text of the research and relevant keywords naturally throughout the text.
            • Link to Related Publications: Use internal links to connect related research on your website. This helps Google Scholar understand the context and subject matter of your publications.

            This comes is with experience, working for exam guide about nursing licensing related content on a website.

            Hope this helps you out. 😊

            1 Reply Last reply Reply Quote 0
            • 1 / 1
            • First post
              Last post

            Got a burning SEO question?

            Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


            Start my free trial


            Explore more categories

            • Moz Tools

              Chat with the community about the Moz tools.

              Getting Started
              Moz Pro
              Moz Local
              Moz Bar
              API
              What's New

            • SEO Tactics

              Discuss the SEO process with fellow marketers

              Content Development
              Competitive Research
              Keyword Research
              Link Building
              On-Page Optimization
              Technical SEO
              Reporting & Analytics
              Intermediate & Advanced SEO
              Image & Video Optimization
              International SEO
              Local SEO

            • Community

              Discuss industry events, jobs, and news!

              Moz Blog
              Moz News
              Industry News
              Jobs and Opportunities
              SEO Learn Center
              Whiteboard Friday

            • Digital Marketing

              Chat about tactics outside of SEO

              Affiliate Marketing
              Branding
              Conversion Rate Optimization
              Web Design
              Paid Search Marketing
              Social Media

            • Research & Trends

              Dive into research and trends in the search industry.

              SERP Trends
              Search Behavior
              Algorithm Updates
              White Hat / Black Hat SEO
              Other SEO Tools

            • Support

              Connect on product support and feature requests.

              Product Support
              Feature Requests
              Participate in User Research

            • See all categories

            • Getting indexed by Google scholar
              newwhy
              newwhy
              2
              3
              1.9k

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            • Digital Marketers
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy

            Looks like your connection to Moz was lost, please wait while we try to reconnect.