List of PMID/title Sessions through canto classed as 'curatable' with no annotation #1304

@ValWood

How many are there?

From a quick Chado query there are 425 publications with an approved Canto session but no annotation. This is the breakdown by triage status:

 count |        canto_triage_status         
-------+------------------------------------
     1 | Bioinformatics
     1 | Browser datasets, hosted
     3 | Cell composition or WT feature
   406 | Curatable
     2 | Method or reagent
     1 | Not English
     3 | Other
     3 | Phylogeny and evolutionary studies
     2 | Review or comment
     3 | Sequence feature or region

Originally posted by @kimrutherford in #1302
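For illustration, here is a minimal sketch of the kind of anti-join that could produce the PMID/title list for the 'Curatable' group. It runs against an in-memory SQLite database with hypothetical, simplified tables (`pub`, `annotation`) and made-up sample rows. The table names, column names, and the `APPROVED` status value are assumptions for the example, not the actual Chado schema or the query used above.

```python
import sqlite3

# Hypothetical, simplified stand-in tables and sample rows (assumptions only);
# the real Chado schema stores this information differently.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE pub (pmid TEXT PRIMARY KEY, title TEXT,
                  triage_status TEXT, session_status TEXT);
CREATE TABLE annotation (id INTEGER PRIMARY KEY, pmid TEXT);
INSERT INTO pub VALUES
  ('PMID:1', 'Paper with annotations', 'Curatable', 'APPROVED'),
  ('PMID:2', 'Paper read, nothing to annotate', 'Curatable', 'APPROVED'),
  ('PMID:3', 'Hosted dataset', 'Browser datasets, hosted', 'APPROVED'),
  ('PMID:4', 'Session still in progress', 'Curatable', 'CURATION_IN_PROGRESS');
INSERT INTO annotation (pmid) VALUES ('PMID:1');
""")


def curatable_without_annotation(conn):
    """Return (pmid, title) for approved 'Curatable' papers with no annotation rows."""
    return conn.execute("""
        SELECT p.pmid, p.title
        FROM pub p
        LEFT JOIN annotation a ON a.pmid = p.pmid
        WHERE p.session_status = 'APPROVED'
          AND p.triage_status = 'Curatable'
          AND a.id IS NULL          -- anti-join: keep papers with no match
        ORDER BY p.pmid
    """).fetchall()


rows = curatable_without_annotation(conn)
for pmid, title in rows:
    print(pmid, title, sep="\t")
```

Dropping the `triage_status` filter and grouping by it instead would give a per-status count like the table above.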

I'd like this list to understand what it mainly contains.

I expect it will mainly be articles we class as curated because we read them, but which contain other data types (HTP data, browser datasets, structures, etc.) and so have no annotations.
In effect, we have 'curated' them even though we made no annotations in Canto.

We should perhaps have another bin for these:

"Papers read but no Canto curation"

('Browser datasets, hosted' would be a subset of this set.)

We can discuss once I have seen the contents.

It is sometimes difficult to classify these with our 'binary classification'.
We often classify papers as "uncuratable" at triage and never read them.

Some we read, but are unable to make any annotations. This is still 'work'.
