Scholar

Support for Scholarly Publishers

Google and Google Scholar can boost the worldwide visibility and accessibility of your content. We work with publishers of scholarly information to index peer-reviewed papers, theses, preprints, abstracts, and technical reports from all disciplines of research and make them searchable on Google and Google Scholar. This page provides policy and technical information for scholarly publishers and societies. Detailed technical inclusion guidelines for webmasters can be found here.

Publisher Policies

Multiple versions of a work are grouped to improve its ranking. In many research areas versions of a work may appear as preprints and conference papers before being published as a journal article. These preliminary versions of a work are often cited in addition to the authoritative journal version. The number of citations to a particular work is an important part of determining its rank in the Google Scholar search results. Grouping versions allows us to collect all citations to all versions of a work. In practice, this can significantly improve the position of an article in the search results.

Publisher's full-text, if indexed, is the primary version. When multiple versions of a work are indexed, we select the full and authoritative text from the publisher as the primary version. We can only do this if we are able to successfully identify, crawl and process the full text of the publisher’s version.

Publishers have control over access to their articles. We work with publishers to preserve their control over access to their content and only cache articles and papers that are not access-controlled. Publishers can help us by identifying which regions of their sites are access-controlled. For details, please click here.

Google users must see at least the complete abstract or the first full page. This is a necessary component of our indexing program. For papers with access restrictions, all users clicking on search results must see at least the full author-written abstract or the first full page of the article without requiring to login or click on additional links.

We will respond to complaints regarding copyright infringement. Our policy is to respond to all notices of alleged copyright infringement that comply with the Digital Millennium Copyright Act. For directions and more information, please click here.


Frequently Asked Questions

Common Questions

  1. I'm a publisher of scholarly works and would like to have my content included in Google and Google Scholar?
  2. I publish scholarly textbooks and monographs. Can my content be included in Google Scholar?
  3. Can I see usage statistics for my content?
  4. What do I do if I believe you're linking to a webpage that infringes my copyright?

Technical Questions

  1. My articles are in PDF format. Can you still index my site?
  2. How can I tell if a PDF file has searchable text?
  3. Some of my articles are split into multiple files, one file per section. Can you work with these?
  4. How do I remove a listing from your search results?
  5. I see a 'cached' (or 'View as HTML') link for my access-controlled articles. I need to have this fixed right away!
  6. Is there anything I can do to help rank my articles better?
  7. All my articles are available to your crawlers, but not all of them seem to show up in Google Scholar. Can I do something to help improve coverage?

Common Questions

  1. I'm a publisher of scholarly works and would like to have my content included in Google and Google Scholar?

    Your content is most welcome. As noted above, an abstract (at least) of each work must be available to non-subscribers who come from Google and Google Scholar. Please configure your website according to our technical inclusion guidelines, and then contact us to consider it for inclusion in Google Scholar.
     
  2. I publish scholarly textbooks and monographs. Can my content be included in Google Scholar?

    Maybe. For now, Google Scholar indexes mostly scholarly articles. For textbooks and monographs, we recommend Google Book Search. Google Scholar automatically includes scholarly works from Google Book Search.
     
  3. Can I see usage statistics for my content?

    Since users click through to your website, your web server logs should have all the usage statistics.
     
  4. What do I do if I believe you're linking to a webpage that infringes my copyright?

    It is our policy to respond to notices of alleged infringement that comply with the Digital Millennium Copyright Act. For directions and more information, please click here.

Technical Questions

  1. My articles are in PDF format. Can you still index my site?

    Yes. We can index PDF articles as long as they're searchable and as long as their size doesn't exceed 5MB. For larger documents and for scanned images that require OCR, we recommend Google Book Search.
     
  2. How can I tell if a PDF file has searchable text?

    Open the file in Adobe Acrobat Reader. Click 'Find' (look for the binocular icon), and confirm that you can search for and find several words on the page.
     
  3. Some of my articles are split into multiple files, one file per section. Can you work with these?

    Alas, we can't. We can index only one file per article at the moment.
     
  4. How do I remove a listing from your search results?

    Refer to Google webmaster help here.
     
  5. I see a 'cached' (or 'View as HTML') link to my access-controlled articles. I need to have this fixed right away!

    Of course! Please email us with specific examples of where the links appear; we'll investigate and fix as soon as possible. This is not intentional but may happen due to technical issues. For example, our methodical crawlers may accidentally discover a forgotten alternative interface to your content. You'll need to tell us of all such interfaces, because crawlers can go places where you least expect them. Please email us and we'll look into it.

    If you believe another site is infringing your copyright, please see our directions on the DMCA process.
     
  6. Is there anything I can do to help rank my articles better?

    Indeed you can. Our indexing algorithms automatically extract bibliographic data, citations and other information from articles and use it for ranking purposes. Providing authoritative metadata about your articles can help facilitate this and can increase the likelihood of identifying all the citations to your articles. We strongly recommend this approach. Please refer to the following technical inclusion guidelines for the details of how to implement it.
     
  7. All my articles are available to your crawlers, but not all of them seem to show up in Google Scholar. Can I do something to help improve coverage?

    Yes. Gaps in coverage are certainly not intentional; but they could be caused by a number of different technical issues in the automatic processing of your website by our search robots. The troubleshooting section in our technical inclusion guidelines describes ways to identify and fix common coverage issues. We encourage you to take a look.