I'm accessing the evaluation datasets directly via huggingface datasets like this:
ds = datasets.load_dataset(
"DBD-research-group/BirdSet",
"HSN",
trust_remote_code=True,
cache_dir=cache_dir,
)
classes = ds['test'].info.features['ebird_code'].names
l=ds['test'].info.features['ebird_code']
I can loop over the entries and get the file names, but there may have been audio files that were annotated and had zero annotations. Does the dataset contain a list of all annotated audio files somewhere?
Thanks!