-
Notifications
You must be signed in to change notification settings - Fork 3
Description
how many are there?
From a quick Chado query there are 425 publications with an approved Canto session but no annotation. This is the breakdown by triage status:
count | canto_triage_status -------+------------------------------------ 1 | Bioinformatics 1 | Browser datasets, hosted 3 | Cell composition or WT feature 406 | Curatable 2 | Method or reagent 1 | Not English 3 | Other 3 | Phylogeny and evolutionary studies 2 | Review or comment 3 | Sequence feature or region
Originally posted by @kimrutherford in #1302
I'd like this list to understand what it mainly contains.
I expect it will be mainly articles we class as curated, because we read them, but there are other data types (like HTP data, browser datasets, structure but with no annotations), etc.
In effect, we have 'curated them' even though we made no annotations in Canto.
We should perhaps have another bin for these.
Papers read but no Canto curation.
(browser datasets hosted would be a subset of this set)
We can discuss once I have seen the contents.
It is sometimes difficult to classify these woth our 'binary classification'
We often classify papers as "uncuratable" at triage, and never read them.
Some we read, but are unable to make any annotations. This is still 'work'