Add ability to compute trec_eval metrics directly on in-memory data structures

From you paper:

<img width="335" alt="Screen Shot 2019-11-08 at 3 52 14 PM" src="https://user-images.githubusercontent.com/313837/68509788-c89aba00-023f-11ea-86f8-a742b2c0772d.png">

This is now pretty easy... with [pyserini on PyPI](https://pypi.org/project/pyserini/).

But the real point of this issue is this: currently, as a I understand it, the input to evaluation is a file. Can we make it so that we can compute evaluation metrics directly from in-memory data structures?

Question is, what should the in-memory data structures look like? A Panda DF with the standard trec output format columns? A dictionary to support random access by qid? Something else?

If we can converge on something, I can even try to volunteer some of my students to contribute to this effort... :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add ability to compute trec_eval metrics directly on in-memory data structures #13

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Add ability to compute trec_eval metrics directly on in-memory data structures #13

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions