Getting indexed in Google Scholar
-
Hi all! We have a client, a highly regarded non-profit, that publishes scholarly research. Their publications fail to get indexed in Google Scholar about 50% of the time, and when they are indexed, Google pulls random text from the PDF instead of from the HTML page. Any advice on best practices is enormously appreciated.
-
@SimpleSearch I understand your client's frustration with Google Scholar not indexing their research consistently and accurately. Here are some steps you can take to improve their visibility:
1. Optimize content for Google Scholar:
- Structured data: Implement schema.org markup for scholarly articles. This helps search engines understand the content and context of the research.
- Metadata accuracy: Ensure all metadata (title, authors, keywords, publication date, etc.) is accurate and consistent across all platforms where the research is published.
- Open access: Consider publishing research openly whenever possible. Full text that is freely readable is easier for Google Scholar to crawl and index.
- Backlinks: Encourage citations from other reputable research publications.
- Internal linking: Link to the research from other relevant pages on the client's website.
2. Troubleshoot indexing issues:
- Check for technical errors: Use Google Search Console to identify any technical issues that might prevent indexing.
- Use the URL Inspection tool (the replacement for the retired "Fetch as Google" tool): Request indexing of the research page in Google Search Console. This covers Google web search; Google Scholar may still pick the page up on its own schedule.
- Check Google Scholar's guidelines: Review Google Scholar's guidelines for publishers to ensure compliance.
- Contact Google Scholar support: If the issue persists, contact Google Scholar support for further assistance.
3. Address content issues:
- PDF vs. HTML: Ensure the HTML version of the research is high quality, well-formatted, and text-based. Google Scholar tends to read bibliographic details more reliably from a clean HTML page than from PDF text.
- Avoid irrelevant content: Remove any irrelevant content from the PDF that might confuse Google Scholar.
4. Additional resources:
- Google Scholar Publisher Guidelines: https://scholar.google.com/intl/en/scholar/publishers.html
- Google Search Console: https://search.google.com/search-console/about
- Schema.org: https://schema.org/
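The metadata checks above can be partly automated. The publisher guidelines linked above describe bibliographic meta tags such as citation_title, citation_author and citation_publication_date in the HTML head. Here is a minimal Python sketch that reports which of those tags a page lacks. Treating exactly these three tags as required is an assumption made for the example, and the sample page is made up.

```python
from html.parser import HTMLParser

# Tag names come from Google Scholar's publisher guidelines; treating
# exactly these three as "required" is an assumption of this sketch.
REQUIRED_TAGS = ("citation_title", "citation_author", "citation_publication_date")


class MetaTagCollector(HTMLParser):
    """Collects <meta name="..." content="..."> pairs from an HTML page."""

    def __init__(self):
        super().__init__()
        self.tags = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        name = (attr_map.get("name") or "").lower()
        content = (attr_map.get("content") or "").strip()
        # Only record tags that have both a name and a non-empty value.
        if name and content:
            self.tags.setdefault(name, []).append(content)


def missing_scholar_tags(html):
    """Return the required citation_* tags that are absent or empty."""
    collector = MetaTagCollector()
    collector.feed(html)
    return [name for name in REQUIRED_TAGS if name not in collector.tags]


# Hypothetical article page, used only for illustration.
sample_page = """
<html><head>
<meta name="citation_title" content="An Example Study">
<meta name="citation_author" content="Doe, Jane">
</head><body><h1>An Example Study</h1></body></html>
"""

print(missing_scholar_tags(sample_page))
```

Running this over a list of article pages gives a quick view of which publications are missing the bibliographic tags, which is one plausible reason for a crawler falling back to text taken from the PDF.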
Remember: Getting indexed in Google Scholar takes time and effort. Implement these best practices consistently and be patient.
-
I understand your concern about inconsistent indexing on Google Scholar. Let's address the issues step by step:
Issue 1: Publications not being indexed in Google Scholar 50% of the time
Solution:
- Ensure Proper Metadata: Check if your client's publications have accurate and comprehensive metadata. This includes title, author(s), abstract, keywords, publication date, and references. Metadata helps Google Scholar understand the content and index it correctly.
- Consistent Citation Format: Ensure that the citation format used in the publications is consistent and follows standard academic conventions. This helps Google Scholar accurately identify and index the content.
- Robots.txt File: Make sure the website's robots.txt file allows Google Scholar's bots to crawl and index the publications. Check for any disallow rules that might be preventing indexing.
- Sitemap Submission: Submit a sitemap to Google Search Console containing URLs of the publications. This helps Google Scholar discover and index the content more efficiently.
- Canonical URLs: Ensure that canonical URLs are implemented correctly for each publication. This helps Google understand the preferred version of the content and avoid duplicate indexing issues.
- Quality Content: Ensure that the content of the publications is of high quality and relevance. Google Scholar prioritizes scholarly and authoritative content.
- Indexing Frequency: Understand that Google Scholar's indexing process may not be immediate. It may take some time for new publications to be indexed. Patience is key.
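The robots.txt check in the list above is easy to script. The sketch below uses Python's standard urllib.robotparser to list which publication URLs a given robots.txt blocks. The "Googlebot" user agent, the example.org URLs and the sample rules are assumptions for illustration, so substitute the client's real file and paths.

```python
from urllib.robotparser import RobotFileParser


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows for user_agent.

    The default user agent is an assumption; adjust it to the crawler you
    care about.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Hypothetical robots.txt that blocks the folder where PDFs are stored.
sample_robots = """\
User-agent: *
Disallow: /downloads/
"""

# Hypothetical publication URLs.
sample_urls = [
    "https://publisher.example.org/articles/example-study.html",
    "https://publisher.example.org/downloads/example-study.pdf",
]

print(blocked_urls(sample_robots, sample_urls))
```

If the HTML pages are allowed but the PDF folder is blocked (or the other way around), that mismatch is worth fixing before looking at anything else.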
Issue 2: Google pulling random content from PDF instead of HTML page
Solution:
- Structured HTML: Ensure that the HTML version of the publications is properly structured using semantic markup. This helps Google Scholar understand the hierarchy and content of the page more accurately.
- Text Accessibility: Make sure the text content of the publications is accessible within the HTML page and not embedded within images or other non-text formats. Google Scholar primarily indexes text-based content.
- Metadata Alignment: Check if the metadata provided in the HTML version aligns with the content of the publication. Discrepancies between metadata and actual content may confuse Google Scholar's indexing process.
- PDF Optimization: If the publications are available in both HTML and PDF formats, optimize the PDFs for indexing by adding proper metadata, text layer, and bookmarks. This can improve the chances of Google Scholar correctly extracting content from PDFs.
- PDF-to-HTML Conversion: Consider converting PDF publications to HTML format using tools or services that preserve the structure and formatting. This ensures better compatibility with Google Scholar's indexing algorithms.
- Reporting Issues: If the problem persists despite following these steps, consider reporting the issue to Google Scholar through their support channels or forums. Provide detailed information about the specific publications and indexing issues encountered.
By implementing these best practices and troubleshooting steps, your client can improve the indexing and visibility of their scholarly publications on Google Scholar. Regularly monitor indexing status and make necessary adjustments to ensure ongoing visibility.
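For the text accessibility point, a rough check can flag HTML pages where the article is probably embedded as images instead of real text. This is only a heuristic sketch in Python: the 200-word threshold is an arbitrary assumption, and both sample pages are made up.

```python
from html.parser import HTMLParser


class TextAndImageCounter(HTMLParser):
    """Counts visible words and <img> tags, ignoring script and style blocks."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.word_count = 0
        self.image_count = 0
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1
        elif tag == "img":
            self.image_count += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Text inside <script> or <style> is not visible to readers.
        if self._skip_depth == 0:
            self.word_count += len(data.split())


def looks_image_only(html, min_words=200):
    """Heuristic: the page has images but very little real text.

    min_words is an arbitrary threshold chosen for this sketch.
    """
    counter = TextAndImageCounter()
    counter.feed(html)
    return counter.image_count > 0 and counter.word_count < min_words


# Hypothetical pages: one made of scanned page images, one with real text.
scanned_page = (
    "<html><body><h1>An Example Study</h1>"
    '<img src="page-1.png"><img src="page-2.png"></body></html>'
)
text_page = "<html><body><h1>An Example Study</h1><p>" + "word " * 250 + "</p></body></html>"

print(looks_image_only(scanned_page), looks_image_only(text_page))
```

Pages that get flagged are good candidates for the PDF-to-HTML conversion step, since a crawler has very little text to work with there.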
-
@SimpleSearch This sometimes happens if the following are not done properly.
- Ensure Structured Data: Implement Schema.org markup for scholarly articles including title, author names and affiliations, publication date, keywords, and abstract. This provides clear and consistent information for Google Scholar bots to understand.
- Format Author Names Consistently: Use the same format (e.g., first name, last name) across all publications and platforms. Consider using ORCID identifiers for authors.
- Use High-Quality PDFs: Create searchable PDFs with full text and avoid image-based PDFs. Embed metadata within the PDF document properties.
- Optimize HTML Pages: Make sure the HTML pages for each publication are well-structured with clear titles, headings, and body content. Include the full text of the research and relevant keywords naturally throughout the text.
- Link to Related Publications: Use internal links to connect related research on your website. This helps Google Scholar understand the context and subject matter of your publications.
This comes from experience working on a website with exam-guide content related to nursing licensing.
Hope this helps you out.
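As an illustration of the structured data point, here is a small Python sketch that builds schema.org ScholarlyArticle markup as JSON-LD, including author affiliations and an ORCID link. The function name, the input shape and every sample value (including the placeholder ORCID) are hypothetical. Whether Google Scholar itself reads this markup is not confirmed in this thread, so treat it as a complement to the bibliographic meta tags in the publisher guidelines, not a replacement.

```python
import json


def scholarly_article_jsonld(title, authors, date_published, keywords, abstract):
    """Build a schema.org ScholarlyArticle description as a JSON-LD string.

    `authors` is a list of dicts with "name", "affiliation" and an optional
    "orcid" key (a hypothetical input shape for this sketch).
    """
    author_nodes = []
    for author in authors:
        node = {
            "@type": "Person",
            "name": author["name"],
            "affiliation": {"@type": "Organization", "name": author["affiliation"]},
        }
        if author.get("orcid"):
            # Link the person to their ORCID record.
            node["sameAs"] = "https://orcid.org/" + author["orcid"]
        author_nodes.append(node)

    document = {
        "@context": "https://schema.org",
        "@type": "ScholarlyArticle",
        "headline": title,
        "author": author_nodes,
        "datePublished": date_published,
        "keywords": ", ".join(keywords),
        "abstract": abstract,
    }
    return json.dumps(document, indent=2)


# Placeholder values for illustration only.
example = scholarly_article_jsonld(
    title="An Example Study",
    authors=[
        {
            "name": "Jane Doe",
            "affiliation": "Example Institute",
            "orcid": "0000-0000-0000-0000",
        }
    ],
    date_published="2024-01-15",
    keywords=["example", "indexing"],
    abstract="A short placeholder abstract.",
)

print(example)
```

The resulting string would go inside a script tag of type application/ld+json in the head of the article's HTML page.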