Minimal working example #4

@jvdzwaan

I can't seem to get passim to work. I made an example data file containing:

{"id": "t1", "text": "This is a text. It contains repeated words. This is the repeated sentence and passim is supposed to find it."}
{"id": "t2", "text": "This is the repeated sentence and passim is supposed to find it."}
{"id": "t3", "text": "This text does not contain the repeated sentence. It is about nothing really."}
{"id": "t4", "text": "Another text. This is the repeated sentence and passim is supposed to find it. What a nice text"}

Then I do:

$ passim data/data.json out

It runs and finishes without error messages. There is a file in out/out.json, but it is empty:

$ ls out/out.json/
part-r-00000-e21212ac-8adf-4adb-9f09-5eab3d2a9136.json  _SUCCESS
$ cat out/out.json/part-r-00000-e21212ac-8adf-4adb-9f09-5eab3d2a9136.json
$
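
To check this without opening each part file by hand, something like the following could be used. It is a rough sketch, assuming the output directory follows the `part-*` plus `_SUCCESS` layout seen in the listing above; `count_records` is a name I made up, not part of passim.

```python
from pathlib import Path


def count_records(out_dir="out/out.json"):
    """Return {part file name: number of non-blank lines} for an output directory.

    Assumes each part file holds one JSON record per line (hypothetical helper).
    """
    counts = {}
    for part in sorted(Path(out_dir).glob("part-*")):
        with part.open(encoding="utf-8") as f:
            counts[part.name] = sum(1 for line in f if line.strip())
    return counts


if __name__ == "__main__":
    for name, n in count_records().items():
        print(f"{name}: {n} record(s)")
```

With the output described here, every part file would be reported with 0 records.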

I tried adding different options, including --min-match 2, --pairwise, --n 2, but the output remains empty.

Am I missing something? Could you maybe provide a minimal working example?
