This crawler allows you to define a list of scholars for which it then creates a CSV file of all the scholars' publications listed on Google Scholar.
Currently, the following information per publication will be crawled automatically:
- Author
- Title of Paper
- Publication Year
- Number of Citations
- Number of co-authors
- Citations per year -> Creates a column for each year during a specific time span. The span can be defined as needed.
I would suggest running the crawler in Google Colab (https://colab.research.google.com/)
- Create a list of scholars for which you want to crawl their publications.
- Make sure they have a Google Scholar profile and that the names in the list match the names in the Google Scholar profiles.
- Paste the names into the 'author_names' list in the code file.
- Optional: Edit the crawler as you see fit.
- Install the "scholarly" module.
- Run the crawler!
You can interrupt the crawler's runtime. As long as the CSV file is still in the directory, it will continue where it left off when you restart it.