Dear António,
Thank you for your kind words and for reaching out. We are glad to hear that you find COCONUT useful.
You are correct in noting that while the curated COCONUT dataset itself is distributed under a CC0 license, some of the underlying source databases from which the data is aggregated have their own licensing terms, including restrictions such as non-commercial use. Because of this, you should still take the licensing conditions of the original sources into account when reusing the data.
In the downloadable COCONUT dataset, each entry includes references to its source database(s). This information can be used to identify which records originate from databases with more restrictive licenses, such as NPAtlas or KNApSAcK. If your intended use requires full commercial compatibility, the safest approach would be to filter out entries that originate exclusively from sources with non-commercial or otherwise restrictive licenses.
In other words, while the curated dataset is released under CC0 to facilitate open reuse, users should review the provenance metadata provided with each entry and ensure that their use complies with the licensing terms of the original sources.
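As a rough illustration of the filtering described above, here is a minimal Python sketch. It is not an official COCONUT tool. The column name `collections`, the `|` separator, and the contents of `RESTRICTED_SOURCES` are assumptions made for the example, so please check them against the file you actually download and against the current license of each source.

```python
import csv

# Assumption: the sources you consider too restrictive for your use case.
# This set is illustrative only; verify each source's license yourself.
RESTRICTED_SOURCES = {"NPAtlas", "KNApSAcK"}


def is_commercially_safe(sources, restricted=RESTRICTED_SOURCES):
    """Return True if at least one named source is outside `restricted`.

    Entries with no source information at all return False, which is the
    conservative choice when provenance cannot be checked.
    """
    cleaned = {s.strip() for s in sources if s.strip()}
    return bool(cleaned - restricted)


def filter_entries(rows, source_column="collections", separator="|"):
    """Yield only the rows that do not come exclusively from restricted sources.

    `rows` is any iterable of dicts, for example a csv.DictReader.
    `source_column` and `separator` are hypothetical defaults.
    """
    for row in rows:
        sources = (row.get(source_column) or "").split(separator)
        if is_commercially_safe(sources):
            yield row


def filter_csv(in_path, out_path, **kwargs):
    """Copy `in_path` to `out_path`, keeping only the rows that pass the filter."""
    with open(in_path, newline="", encoding="utf-8") as src, \
         open(out_path, "w", newline="", encoding="utf-8") as dst:
        reader = csv.DictReader(src)
        writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
        writer.writeheader()
        writer.writerows(filter_entries(reader, **kwargs))
```

Matching here is exact and case-sensitive, so the names in `RESTRICTED_SOURCES` need to be spelled exactly as they appear in the download. An entry listed under both a restricted and an unrestricted source is kept, in line with removing only the entries that originate exclusively from restricted sources.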
Please feel free to reach out if you have further questions.