Skip to content

Animesh911/accumulation-curve

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Accumulation-curve

Plot accumulation curve based on subsampling with replacement.

Species accumulation curve (collector's curve, species richness curve) are used to estimate number of species in a particular area.

accumulation_curve.py takes input data as TSV file where each value is Taxonomic id. Here, algorithm performs sampling by fraction on columns and counts unique taxonomic id in a sample (ignoring unclassified taxonomic ids). This process is simulated and repeated for the specified sample fraction.

Usage

Usage: accumulation_curve_kaiju.py [-h] [--file FILE] [--sample_frac SAMPLE_FRAC] [--threshold [THRESHOLD]] [--sim [SIM]] [--ci CI] [--splot SPLOT] [--format {png,jpeg,jpg,tiff,pdf}] [--sdata SDATA]


  Options:

    -h, --help                        show this help message and exit
    --file FILE                       TSV file where samples are in column
    --sample_frac SAMPLE_FRAC         comma seperated fraction of sample (without spaces).
                                      Example: --sample_frac 0,0.01,0.05,0.1,0.15,0.2,0.25,0.3,0.35,0.4,0.45,0.5,0.55,0.6,0.65,0.7,0.75,0.8,.85,0.9,0.95,1
    --threshold [THRESHOLD]           Minimum occurance in a sample to claim a species, default = 2
    --sim [SIM]                       No of times to simulate, default = 10
    --ci CI                           Size of confidence interval, ci: None (default) or sd (standard deviation) or int (95)
    --splot SPLOT                     Save the plot as...
    --format {png,jpeg,jpg,tiff,pdf}  Save plot output format as..., default = png
    --sdata SDATA                     Save the final output data used for plotting as...

Example

python accumulation_curve_kaiju.py --file test.txt -s test

TSV file with sample header and detected taxonomic id

test.txt: 
sample1	sample2
0	288793
0	0
0	0
0	0
0	0
1696	0
0	0
0	0
0	0
0	0
0	398767
0	0
0	2591109
881	58050
454601	0
1555112	1367477
0	0
218208	0
179	0
443144	0
1930071	0
1917158	0
0	1936003
0	0
0	2575375
0	2601894
0	936155
585455	0
0	1110502
135619	1851148

Output Accumulation curve

test.png

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages