Collection of stand-alone supplementary materials (reports, scripts, tables, and figures) for the journal version of the paper "Corrections of Zipf's and Heaps' Laws Derived from Hapax Rate Models". The article is to appear in the Journal of Quantitative Linguistics.
This project processes text files to identify hapax legomena (words that appear only once) and saves the results in an Excel file. It uses tokenization, optional lemmatization, and frequency analysis to extract and list these rare words.
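The core frequency step can be illustrated with a minimal sketch. It is not the project's actual script: the function names `tokenize` and `find_hapaxes` are hypothetical, the regex tokenizer is a simplifying assumption, and lemmatization and the Excel export are left out so that the example depends only on the Python standard library.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return runs of letters as tokens.

    Assumption: a simple regex tokenizer; the project's own tokenizer
    (and its optional lemmatization step) may behave differently.
    """
    return re.findall(r"[^\W\d_]+(?:'[^\W\d_]+)*", text.lower())


def find_hapaxes(text):
    """Return the alphabetically sorted words that occur exactly once."""
    counts = Counter(tokenize(text))
    return sorted(word for word, n in counts.items() if n == 1)


if __name__ == "__main__":
    sample = "The cat saw the dog. The dog ran."
    print(find_hapaxes(sample))  # ['cat', 'ran', 'saw']
```

In this sample, "the" and "dog" occur more than once, so only "cat", "ran", and "saw" are reported as hapax legomena. The resulting list could then be written to a spreadsheet with whichever library the project uses.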