A simple script to parse a text corpus in the Khmer language and convert it to a form that can be imported by FieldWorks.
- Install .Net Core SDK 3.0 or later
- Run
dotnet tool restore - Run
dotnet restore - Run
dotnet fsi src/Khmer/doit.fsx file1.txt file2.txt ... fileNNN.txt > all-output.txtto create output file - Run
dotnet fsi src/Khmer/split.fsx all-output.txtto createsplit-output-NNN.sfmfiles