I can't seem to get passim to work. I made an example data file containing:
{"id": "t1", "text": "This is a text. It contains repeated words. This is the repeated sentence and passim is supposed to find it."}
{"id": "t2", "text": "This is the repeated sentence and passim is supposed to find it."}
{"id": "t3", "text": "This text does not contain the repeated sentence. It is about nothing really."}
{"id": "t4", "text": "Another text. This is the repeated sentence and passim is supposed to find it. What a nice text"}
Then I do:
$ passim data/data.json out
It runs and finishes without error messages. There is a file in out/out.json, but it is empty:
$ ls out/out.json/
part-r-00000-e21212ac-8adf-4adb-9f09-5eab3d2a9136.json _SUCCESS
$ cat out/out.json/part-r-00000-e21212ac-8adf-4adb-9f09-5eab3d2a9136.json
$
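The same check can be scripted so it is easy to repeat after each run with different options. This is only a sketch: the `part-*` file name pattern is taken from the listing above, and the function names are made up for illustration.

```python
from pathlib import Path


def part_file_sizes(out_dir):
    """Map each part-* file in out_dir to its size in bytes."""
    return {p.name: p.stat().st_size for p in sorted(Path(out_dir).glob("part-*"))}


def empty_part_files(out_dir):
    """Return the names of part-* files that contain no data at all."""
    return [name for name, size in part_file_sizes(out_dir).items() if size == 0]
```

In my case, `empty_part_files("out/out.json")` would list the single part file, since it has zero bytes.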
I tried adding different options, including --min-match 2, --pairwise, and --n 2, but the output remains empty.
Am I missing something? Could you maybe provide a minimal working example?