Getting indexed in Google Scholar
-
Hi all! We have a client, a highly regarded non-profit, that publishes scholarly research. Their publications are indexed in Google Scholar only about 50% of the time, and when they are, Google pulls seemingly random text from the PDF rather than from the HTML page. Any advice on best practices is enormously appreciated.
-
@SimpleSearch I understand your client's frustration with Google Scholar not indexing their research consistently and accurately. Here are some steps you can take to improve their visibility:
1. Optimize content for Google Scholar:
Structured data: Implement schema.org markup for scholarly articles (the ScholarlyArticle type). This helps Google understand the content and context of the research. Google Scholar's own inclusion guidelines also ask for bibliographic meta tags such as citation_title and citation_author on each article page, so add those as well.
Metadata accuracy: Ensure all metadata (title, authors, keywords, publication date, etc.) is accurate and consistent across all platforms where the research is published.
Open access: Consider publishing research openly whenever possible. Openly accessible full text is easier for Google Scholar's crawler to reach and index.
Backlinks: Encourage citations from other reputable research publications.
Internal linking: Link to the research from other relevant pages on the client's website.
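As a rough illustration of the structured data point, here is a minimal Python sketch that builds schema.org ScholarlyArticle markup as JSON-LD. The function names and all example values are made up for illustration, and the choice of properties is an assumption about what the client's pages would need, not a list Google Scholar has confirmed.

```python
import json


def scholarly_article_jsonld(title, authors, date_published, abstract, keywords, url):
    """Build a schema.org ScholarlyArticle description as a JSON-LD string."""
    data = {
        "@context": "https://schema.org",
        "@type": "ScholarlyArticle",
        "headline": title,
        "author": [{"@type": "Person", "name": name} for name in authors],
        "datePublished": date_published,  # ISO 8601 date, e.g. "2024-01-15"
        "abstract": abstract,
        "keywords": keywords,
        "url": url,
    }
    return json.dumps(data, indent=2)


def as_script_tag(jsonld):
    """Wrap the JSON-LD in the script element that goes in the page's <head>."""
    return '<script type="application/ld+json">\n' + jsonld + "\n</script>"


if __name__ == "__main__":
    # Placeholder values only; not taken from any real publication.
    markup = scholarly_article_jsonld(
        title="An Example Study",
        authors=["Jane Doe", "Sam Roe"],
        date_published="2024-01-15",
        abstract="A one-paragraph summary of the study.",
        keywords=["example", "research"],
        url="https://example.org/publications/an-example-study",
    )
    print(as_script_tag(markup))
```

Generating the markup from the same record that feeds the page title and byline is one way to keep the metadata consistent everywhere it appears.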
2. Troubleshoot indexing issues:
Check for technical errors: Use Google Search Console to identify any technical issues that might prevent indexing.
Use the URL Inspection tool: The old "Fetch as Google" tool has been retired. Use URL Inspection in Search Console to test a research page and request indexing. This covers Google web search, and Google Scholar may still pick the page up on its own schedule.
Check Google Scholar's guidelines: Review Google Scholar's guidelines for publishers to ensure compliance.
Contact Google Scholar support: If the issue persists, contact Google Scholar support for further assistance.
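Alongside Search Console, a quick script can flag the most common technical blockers on a publication page. The sketch below is only an illustration: it assumes you have already fetched the page and have its status code, response headers, and HTML in hand, and the function name indexing_blockers is hypothetical.

```python
from html.parser import HTMLParser


class _RobotsMetaParser(HTMLParser):
    """Collect the content of <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key.lower(): (value or "") for key, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            self.directives.append(attr.get("content", "").lower())


def indexing_blockers(status_code, headers, html):
    """Return human-readable reasons why the page may not get indexed."""
    problems = []
    if status_code != 200:
        problems.append(f"HTTP status is {status_code}, expected 200")

    # Header names are case-insensitive, so compare in lowercase.
    for key, value in headers.items():
        if key.lower() == "x-robots-tag" and "noindex" in value.lower():
            problems.append("X-Robots-Tag header contains noindex")

    parser = _RobotsMetaParser()
    parser.feed(html)
    if any("noindex" in directive for directive in parser.directives):
        problems.append("robots meta tag contains noindex")
    return problems
```

An empty list means none of these three checks fired. It does not prove the page will be indexed.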
3. Address content issues:
PDF vs. HTML: Ensure the HTML version of the research is high quality, well-formatted, and text-based. Bibliographic meta tags on the HTML page give Google Scholar a more reliable source for the title and authors than text extracted from the PDF.
Avoid irrelevant content: Remove any irrelevant content from the PDF that might confuse Google Scholar, for example cover pages placed before the actual title.
4. Additional resources:
Google Scholar Publisher Guidelines: https://scholar.google.com/intl/en/scholar/publishers.html
Google Search Console: https://search.google.com/search-console/about
Schema.org: https://schema.org/
Remember: Getting indexed in Google Scholar takes time and effort. Implement these best practices consistently and be patient.
-
I understand your concern about inconsistent indexing on Google Scholar. Let's address the issues step by step:
Issue 1: Publications not being indexed in Google Scholar 50% of the time
Solution:
- Ensure Proper Metadata: Check if your client's publications have accurate and comprehensive metadata. This includes title, author(s), abstract, keywords, publication date, and references. Metadata helps Google Scholar understand the content and index it correctly.
- Consistent Citation Format: Ensure that the citation format used in the publications is consistent and follows standard academic conventions. This helps Google Scholar accurately identify and index the content.
- Robots.txt File: Make sure the website's robots.txt file allows Google's crawlers to crawl and index the publications. Check for any disallow rules that might be preventing indexing.
- Sitemap Submission: Submit a sitemap to Google Search Console containing the URLs of the publications. This helps Google discover the content more efficiently.
- Canonical URLs: Ensure that canonical URLs are implemented correctly for each publication. This helps Google understand the preferred version of the content and avoid duplicate indexing issues.
- Quality Content: Ensure that the content of the publications is of high quality and relevance. Google Scholar prioritizes scholarly and authoritative content.
- Indexing Frequency: Understand that Google Scholar's indexing process is not immediate. It may take some time for new publications to be indexed. Patience is key.
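For the robots.txt check above, Python's standard library can test a list of publication URLs against the site's rules. This is a minimal sketch: the helper name blocked_urls and the example URLs are made up, and using "Googlebot" as the user agent is an assumption you should confirm against Google Scholar's current guidelines.

```python
from urllib.robotparser import RobotFileParser


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


if __name__ == "__main__":
    # Hypothetical robots.txt content and URLs, for illustration only.
    robots_txt = "User-agent: *\nDisallow: /pdfs/\n"
    publication_urls = [
        "https://example.org/publications/report-1",
        "https://example.org/pdfs/report-1.pdf",
    ]
    for url in blocked_urls(robots_txt, publication_urls):
        print("Blocked by robots.txt:", url)
```

A rule that blocks the PDF directory while leaving the HTML pages open, as in this example, is one plausible way to end up with publications that are only partly indexed.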
Issue 2: Google pulling random content from PDF instead of HTML page
Solution:
- Structured HTML: Ensure that the HTML version of the publications is properly structured using semantic markup. This helps Google Scholar understand the hierarchy and content of the page more accurately.
- Text Accessibility: Make sure the text content of the publications is accessible within the HTML page and not embedded within images or other non-text formats. Google Scholar primarily indexes text-based content.
- Metadata Alignment: Check that the metadata provided in the HTML version matches the content of the publication. Discrepancies between metadata and actual content may confuse Google Scholar's indexing process.
- PDF Optimization: If the publications are available in both HTML and PDF formats, optimize the PDFs for indexing by adding proper metadata, a text layer, and bookmarks. This can improve the chances of Google Scholar correctly extracting content from PDFs.
- PDF-to-HTML Conversion: Consider converting PDF publications to HTML format using tools or services that preserve the structure and formatting. This makes the content easier for Google Scholar to parse.
- Reporting Issues: If the problem persists despite following these steps, consider reporting the issue to Google Scholar through their support channels or forums. Provide detailed information about the specific publications and indexing issues encountered.
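To check metadata alignment in practice, you can pull the bibliographic meta tags out of each HTML page and see what is missing. The sketch below assumes the pages use Highwire Press style tags (citation_title, citation_author, citation_publication_date). Treat the REQUIRED_TAGS list as an assumption to verify against Google Scholar's inclusion guidelines, and the function names as hypothetical.

```python
from html.parser import HTMLParser

# Assumed minimum set of tags; verify against the current guidelines.
REQUIRED_TAGS = ("citation_title", "citation_author", "citation_publication_date")


class _MetaCollector(HTMLParser):
    """Collect every <meta name="citation_*"> tag, keeping repeated names."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = (attr.get("name") or "").lower()
        if name.startswith("citation_"):
            self.meta.setdefault(name, []).append(attr.get("content") or "")


def citation_meta(html):
    """Return a dict mapping each citation_* tag name to its list of values."""
    collector = _MetaCollector()
    collector.feed(html)
    return collector.meta


def missing_citation_tags(html):
    """Return the required tag names that are absent or have only empty values."""
    meta = citation_meta(html)
    return [
        name
        for name in REQUIRED_TAGS
        if not any(value.strip() for value in meta.get(name, []))
    ]
```

Running missing_citation_tags over every publication page gives a quick list of pages where Google Scholar would have to fall back on guessing the title or authors from the PDF.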
By implementing these best practices and troubleshooting steps, your client can improve the indexing and visibility of their scholarly publications on Google Scholar. Regularly monitor indexing status and make necessary adjustments to ensure ongoing visibility.
-
@SimpleSearch This sometimes happens when the following are not done properly:
- Ensure Structured Data: Implement Schema.org markup for scholarly articles including title, author names and affiliations, publication date, keywords, and abstract. This provides clear and consistent information for Google Scholar bots to understand.
- Format Author Names Consistently: Use the same format (e.g., first name, last name) across all publications and platforms. Consider using ORCID identifiers for authors.
- Use High-Quality PDFs: Create searchable PDFs with full text and avoid image-based PDFs. Embed metadata within the PDF document properties.
- Optimize HTML Pages: Make sure the HTML pages for each publication are well-structured with clear titles, headings, and body content. Include the full text of the research and relevant keywords naturally throughout the text.
- Link to Related Publications: Use internal links to connect related research on your website. This helps Google Scholar understand the context and subject matter of your publications.
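One way to audit author-name consistency is to normalize every spelling and look for authors who appear in more than one form. This is only a rough sketch: the function names are made up, and it handles just the "Last, First" and "First Last" patterns, so initials and suffixes would need extra rules.

```python
def normalize_author(name):
    """Convert 'Last, First' to 'First Last' and collapse extra whitespace."""
    name = " ".join(name.split())
    if "," in name:
        last, first = [part.strip() for part in name.split(",", 1)]
        name = f"{first} {last}".strip()
    return name


def inconsistent_authors(raw_names):
    """Return authors that appear under more than one raw spelling.

    Keys are the normalized, case-folded names; values are the sorted spellings.
    """
    groups = {}
    for raw in raw_names:
        key = normalize_author(raw).casefold()
        groups.setdefault(key, set()).add(raw)
    return {
        key: sorted(spellings)
        for key, spellings in groups.items()
        if len(spellings) > 1
    }


if __name__ == "__main__":
    # Placeholder names, for illustration only.
    names = ["Jane Doe", "Doe, Jane", "Sam Roe"]
    print(inconsistent_authors(names))
```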
This comes from experience working on a website with exam-guide content about nursing licensing.
Hope this helps you out.