A structured, open dataset of NGSA past papers (paper 1 question and answers), compiled for educational and non-commercial use. Includes multiple choice questions, answers, and metadata in JSON format for easy integration.
I wanted this data for a personal project, but couldn’t find it on the Ministry of Education’s website, so I built it myself. I'm sharing it so others (teachers, developers, students) can benefit too.
ngsa-dataset/
├── README.md
├── LICENSE
├── papers
| ├── english-papers/
| | ├── National Grade 6 Assessment - 2010 - English P1
| | ├── National Grade 6 Assessment - 2011 - English P1
| | └── ...
| └── math-papers/
| ├── National Grade 6 Assessment - 2010 - Mathematics P1
| ├── National Grade 6 Assessment - 2011 - Mathematics P1
| └── ...
└── data/
├── english/
│ ├── ngsa-english-dataset-qa-2021.json
│ ├── ngsa-english-dataset-qa-2022.json
│ └── ...
└── math/
├── ngsa-mathematics-dataset-qa-2010-unfinished.json
└── ...
NOTE:
- Files labeled with
filename-unfinishedcontain question numbers and correct answer data only. The question text, answer option labels, instruction text, and context have not yet been added. - Files labeled with
filename-uncontain question numbers, correct answer data, option labels, and question texts. The instruction text and context have not yet been added. - Files labeled with
filename-testscontain test metadata such as the subject, level, paper type, and a list of references (question_ids) that point to the individual questions. - Files labeled with
filename-qacontain the individual questions for each subject range
This dataset is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). You may use, share, and adapt the data for non-commercial purposes, provided you give credit to:
- Ezra Minty - Compiler of the NGSA Paper 1 Dataset
Full license details: [ https://creativecommons.org/licenses/by-nc/4.0/ ]