Info • Polyphilus sieberi DSE2052 v1.0


[June 2021] The Polyphilus sieberi DSE2052 v1.0 genome was sequenced with PacBio, assembled with Flye, and annotated with the JGI Annotation Pipeline. The transcriptome was sequenced with Illumina and assembled with Trinity. A large amount of rRNA contamination resulted in poor RNA mapping to the genome. The mitochondrial genome was assembled separately and is available in the Download tab.

Genome Assembly
Genome Assembly size (Mbp) 69.89
Sequencing read coverage depth 72.7X
# of contigs 63
# of scaffolds 63
# of scaffolds >= 2Kbp 63
Scaffold N50 10
Scaffold L50 (Mbp) 2.67
# of gaps 0
% of scaffold length in gaps 0.0%
Three largest Scaffolds (Mbp) 7.72, 5.53, 3.28

ESTs Data set # sequences total # mapped to genome % mapped to genome
EstClusters ESTclusters 66543 63081 94.8%
Ests est.fasta 177188101 120121549 67.8%

Gene Models FilteredModels1
length (bp) of: average median
gene 1772 1543
transcript 1605 1381
exon 521 324
intron 82 57
protein length (aa) 459 377
exons per gene 3.08 3
# of gene models 19299


The work conducted by the U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.