Hex.pm exposes some useful API endpoints. For example, to build popular_packages.txt we could iterate over a few pages of https://hex.pm/api/packages?sort=total_downloads&page=1, incrementing the page parameter each time. If that sounds reasonable, I could write a mix task to fetch and decode the results.
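To make the proposal concrete, here is a rough sketch of what such a task might look like. It is only an illustration under several assumptions: the task name (`Mix.Tasks.PopularPackages`), the page count (3), the output location, and the user-agent string are all placeholders I made up; it assumes each package object in the response has a "name" field; and it relies on Elixir 1.18+ for the built-in `JSON` module and OTP 25+ for `:public_key.cacerts_get/0`.

```elixir
defmodule Mix.Tasks.PopularPackages do
  @moduledoc "Hypothetical sketch: writes the most-downloaded Hex package names to a file."
  @shortdoc "Fetches popular Hex package names"

  use Mix.Task

  @base_url "https://hex.pm/api/packages"
  # Assumption: three pages is "a few"; adjust as needed.
  @pages 1..3
  @output "popular_packages.txt"

  @impl Mix.Task
  def run(_args) do
    # :httpc lives in :inets and needs :ssl for HTTPS requests.
    {:ok, _} = Application.ensure_all_started(:inets)
    {:ok, _} = Application.ensure_all_started(:ssl)

    names = Enum.flat_map(@pages, fn page -> page |> fetch_page() |> package_names() end)

    File.write!(@output, Enum.join(names, "\n") <> "\n")
    Mix.shell().info("Wrote #{length(names)} package names to #{@output}")
  end

  # Decodes one page of the response body (a JSON array of package objects)
  # and keeps only the package names. Assumes each object has a "name" key.
  def package_names(body) do
    body |> JSON.decode!() |> Enum.map(& &1["name"])
  end

  # Fetches one page sorted by total downloads and returns the raw body.
  defp fetch_page(page) do
    url = String.to_charlist("#{@base_url}?sort=total_downloads&page=#{page}")

    headers = [
      # Placeholder user-agent string.
      {~c"user-agent", ~c"popular-packages-sketch"},
      {~c"accept", ~c"application/json"}
    ]

    http_opts = [
      ssl: [
        verify: :verify_peer,
        cacerts: :public_key.cacerts_get(),
        customize_hostname_check: [
          match_fun: :public_key.pkix_verify_hostname_match_fun(:https)
        ]
      ]
    ]

    case :httpc.request(:get, {url, headers}, http_opts, body_format: :binary) do
      {:ok, {{_version, 200, _reason}, _resp_headers, body}} -> body
      other -> Mix.raise("Request for page #{page} failed: #{inspect(other)}")
    end
  end
end
```

With this in place the task would be run as `mix popular_packages`. Depending on the project setup, `:inets` and `:ssl` may need to be listed under `extra_applications` in mix.exs for the task to start them. A real version would probably also want to handle rate limiting and take the page count as an argument.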