After reviewing https://github.com/chanzuckerberg/software-mentions/blob/main/sample_notebooks/Interacting%20with%20the%20dataset.ipynb, I think there are some mentions missing. For example "Scikit learn" is nowhere to be found in our dataset. I think there may have been an issue somewhere in the cleaning process, removing extra labels.