Python 3.9- Link - tested on Python 3.9 on MacOSX Sonomapip- Link - package installer for Pythonvenv- Link - Serves files in virtual environment
- Create and activate a Python virtual environment. We have tested using Python3.9.
brew install gccexport CC=/opt/homebrew/Cellar/gcc/14.1.0_2/bin/g++-14This will be different for different users/systems.export CFLAGS="-Wa,-q"pip install --upgrade pip setuptools wheelpython -m pip install -r requirements.txtpython -m spacy download en_core_web_smpython -m nltk.downloader popular- Set
OPEN_AI_KEYinconfig.py. export TOKENIZERS_PARALLELISM=false
Note: pip cache purge might be needed sometimes to start fresh installation.
- Configure file paths in
config.py. Sample data files are provided:data/sample-dataset-sans-embeddings.tsv- the output file from the scraper module as the input file to compute embeddings.data/sample-dataset-with-embeddings.tsv- the output file with computed embeddings.
- Run
python embed.py
vitaLITy was created by Arpit Narechania, Alireza Karduni, Ryan Wesslen, and Emily Wall.
@article{narechania2021vitality,
title={vitaLITy: Promoting Serendipitous Discovery of Academic Literature with Transformers \& Visual Analytics},
author={Narechania, Arpit and Karduni, Alireza and Wesslen, Ryan and Wall, Emily},
journal={IEEE Transactions on Visualization and Computer Graphics},
year={2022},
doi={10.1109/TVCG.2021.3114820},
publisher={IEEE}
}The software is available under the MIT License.
If you have any questions, feel free to open an issue or contact Arpit Narechania.