Annual Report: TDWG Data Quality Interest Group for 2023
Arthur Chapman edited this page Nov 20, 2023
The Interest Group currently has two active Task Groups; requests were made during the year to wind up the other two.
[TG1 – Framework on Data Quality - CLOSED]
TG2 – Data Quality Tests and Assertions
[TG3 – Data Quality Use Cases - CLOSED]
TG4 – Best Practices for Development of Vocabularies of Value
A lot of work has been carried out during the year, especially with respect to Task Group 2 (see the report for that Task Group below). Requests to wind up Task Groups 1 and 3 have been made to the executive, as the work of those two groups has been completed and the outcomes folded into Task Group 2 in the lead-up to a new Biodiversity Data Quality Standard (tentatively named BDQ Core).
- A lot of progress has been made in Task Group 2 leading to the start of a draft Standards document (https://github.com/tdwg/bdq/wiki/TG2-Tests-and-Assertions-Standards-Document)
- The winding up of Task Group 1: Framework on Data Quality (see https://github.com/tdwg/bdq/wiki/Winding-up-of-Task-Group-1-%E2%80%93-Framework-on-Data-Quality)
- The winding up of Task Group 3: Data Quality Use Cases (see https://github.com/tdwg/bdq/wiki/Final-Report-of-Task-Group-3:-Data-Quality-Use-Cases)
- No real progress on Task Group 4.
- Inability to meet ‘face-to-face’ to finalize the writing of the standard document. Zoom and equivalents are useful but far from optimal for collaboration and the different time zones do not help.
- Zero
- We hope to complete the implementation of the outstanding tests, the test data and the standards document.
- Submission of the work as a TDWG standard.
Task group wound up - see final report at https://github.com/tdwg/bdq/wiki/Winding-up-of-Task-Group-1-%E2%80%93-Framework-on-Data-Quality
- Completing test data suite (last task before submission of Standard)
- The generation of test datasets that can be used to validate the installation of the test code is approaching completion, with some issues still to be worked out on the final structure of the test data. See https://github.com/tdwg/bdq/tree/master/tg2/core/testdata for a subset.
- Tests and specifications (based on a single template) are final
- Test data template agreed
- Code has been written to extract the parameters of each of the tests to RDF, and we believe that this will form the basis of the proposed TDWG standard for the Tests and Assertions.
- Finalized 99 CORE tests and documented them against a standard template: https://github.com/tdwg/bdq/issues?q=is%3Aissue+is%3Aopen+label%3ATest
- Inability to meet ‘face-to-face’.
- Busy TG2 members
- ‘Burnt out’ TG2 members. This work has taken much longer than anyone in the group anticipated, largely due to the complexity of the task and to COVID-19.
- Zero
- Proof and finalise Test Data and make available for public review
- Develop a technical specification
- Submit the work of TG2 as a TDWG standard.
Task Group wound up - see final report at https://github.com/tdwg/bdq/wiki/Final-Report-of-Task-Group-3:-Data-Quality-Use-Cases
Preparing best practices document
Very slow progress during 2023 due to the convener's personal circumstances.
None
The Task Group does not plan to propose a new data standard or any modification to existing ones but intends to provide a best current practice for building TDWG vocabularies of values.