Skip to content

Evaluation suite #4

@patham9

Description

@patham9

An initial evaluation suite for incremental open-ended Q&A is available in
https://github.com/patham9/NarsGPT/tree/gptONA
mem.json symbolic-vector hybrid database file from running the incremental Q&A can be obtained here:
https://github.com/patham9/NarsGPT/releases/tag/Evaluation_Memfile
This allows interactive Q&A testing with the belief learned from the csv.

Special thanks to Peter Voss for providing an incremental Q&A benchmark: https://github.com/patham9/NarsGPT/blob/Evaluation/INT_Inf_benchmarkTest.csv

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions