Skip to content

vcutrona/scholar-export

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

scholar-export

This CLI tool helps you in exporting publications from Google Scholar for a set of authors, given their IDs. Publications are exported in CSV format. Duplicated entries are filtered out if they share the same <title, author> pair.

How to use

Clone the repository

git clone git@github.com:vcutrona/scholar-export.git
cd scholar-export

Install dependencies

NOTE: The script has been tested with Python 3.9.

Install the required dependencies by running:

virtualenv -p python3.9 venv # we suggest to create a virtual environment
source venv/bin/activate
pip install -r requirements.txt

Usage with CLI

To show the command help, simply run:

python cli.py --help

WARNING: The tool relies on the scholarly package, without using a proxy. It may happen Google Scholar blocks this tools.

Single author

Export publications of a single author by executing the following command:

python cli.py \
  author \
  --author-id "your-scholar-id" \
  --data-pkl "/path/to/pkl/file" \
  --start-year 1900 \
  --end-year 2030 \
  --output-file "/path/to/output/csv/file"

Multiple authors

Export publications for multiple users. The Google Scholar IDs are read from a CSV file. The column containing the IDs must be set as a command parameter. Given a researchers.csv file structured as follows:

ScholarID Name
xyz123 Research1
qkj987 Research2

The following command will download the publications made by all the listed authors:

python cli.py \
  authors \
  --authors-file "researchers.csv" \
  --author-id-column "ScholarID" \
  --data-pkl "/path/to/pkl/file" \
  --start-year 1900 \
  --end-year 2030 \
  --output-file "/path/to/output/csv/file"

About

Exporting publications from Google Scholar

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages