If possible, extend PyKoi (or a similar toolset) to run inference against any or all of the following datasets:

- arXiv
- PubMed
- Wikipedia