Status
[March 2020] The Aspergillus flavus NRRL 3357 genome was not sequenced and assembled at the Joint Genome Institute, but rather by Jeff Skerker using a combination of long-read and short-read datasets (PacBio, Oxford Nanopore, and Illumina). Eight chromosomes were assembled using a hybrid assembly method and the Canu assembler (v1.7.1); 7 of the 8 are complete telomere-to-telomere assemblies. The assembly was polished with PacBio data using pbalign (v0.3.1), blasr (v5.3), and the Arrow (v2.2.2) algorithm. Final error correction was performed using Pilon and Illumina data.
This assembly was then annotated using the JGI annotation pipeline, with several modifications. To preserve original gene names as much as possible, previously produced models available on FungiDB were mapped forward onto the new assembly; these mapped models constitute 43.29% of the Filtered Model (FM) set. Additionally, to improve capture of UTRs using RNA-seq data available at NCBI, we applied our EST extension procedure to the mapped FungiDB models, producing a new track, estExt_Aspflav1_ExternalModels (20.79% of FM). Our standard filtering parameters were also adjusted to allow capture of more models with transcriptomic support and to prioritize mapped-forward models (and their EST-extended versions) for inclusion in the FM set.
Genome Assembly | |
Genome Assembly size (Mbp) | 37.75 |
Sequencing read coverage depth | 650x |
# of contigs | 8 |
# of scaffolds | 8 |
# of scaffolds >= 2Kbp | 8 |
Scaffold N50 | 4 |
Scaffold L50 (Mbp) | 4.81 |
# of gaps | 0 |
% of scaffold length in gaps | 0.0% |
Three largest Scaffolds (Mbp) | 6.51, 6.31, 5.20 |
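The scaffold statistics above can be reproduced from a list of scaffold lengths. The following is a minimal Python sketch, assuming this page's labeling, in which "Scaffold N50" is a scaffold count and "Scaffold L50" is a length (many other tools swap the two names). The three largest lengths come from the table and the fourth is the reported L50 value; the remaining four lengths are hypothetical, chosen only so that the total is 37.75 Mbp. The function name `half_assembly_stats` is likewise illustrative.

```python
def half_assembly_stats(lengths_bp):
    """Return (count, length_bp): the fewest largest scaffolds whose summed
    length reaches half of the assembly, and the length of the last one added."""
    ordered = sorted(lengths_bp, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return count, length
    raise ValueError("lengths_bp must not be empty")


# First four values follow the table; the last four are hypothetical placeholders.
scaffolds_bp = [6_510_000, 6_310_000, 5_200_000, 4_810_000,
                4_500_000, 4_200_000, 3_400_000, 2_820_000]

count, length_bp = half_assembly_stats(scaffolds_bp)
print(f"total = {sum(scaffolds_bp) / 1e6:.2f} Mbp")
print(f"scaffold count at 50% = {count}")
print(f"scaffold length at 50% = {length_bp / 1e6:.2f} Mbp")
```

With these inputs the count is 4 and the length is 4.81 Mbp, matching the table: the three largest scaffolds sum to 18.02 Mbp, just short of half of 37.75 Mbp, so the fourth scaffold is the one that crosses the halfway point.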
ESTs | Data set | # sequences total | # mapped to genome | % mapped to genome |
ESTs | est.fasta | 19038495 | 18641048 | 97.9% |
Other | Trinity_assembled_Illumina_transcriptome | 110843 | 58815 | 53.1% |
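The "% mapped to genome" column is simply the mapped count divided by the total count. A short Python sketch of that arithmetic, using the counts from the table (the helper name `percent_mapped` is an illustrative assumption, not part of any JGI tool):

```python
def percent_mapped(mapped, total):
    """Percentage of sequences mapped to the genome, rounded to one decimal."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * mapped / total, 1)


# (mapped, total) pairs taken from the table above.
datasets = {
    "est.fasta": (18_641_048, 19_038_495),
    "Trinity_assembled_Illumina_transcriptome": (58_815, 110_843),
}

for name, (mapped, total) in datasets.items():
    print(f"{name}: {percent_mapped(mapped, total)}%")
```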
Gene Models | FilteredModels4 | |
length (bp) of: | average | median |
gene | 1898 | 1570 |
transcript | 1709 | 1413 |
exon | 528 | 299 |
intron | 87 | 62 |
description: | average | median |
protein length (aa) | 462 | 390 |
exons per gene | 3.24 | 3 |
# of gene models | 13715 |
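The averages in the gene model table can be cross-checked against each other. The sketch below, in Python, assumes that the mean transcript length is roughly the mean exon count times the mean exon length, and that the mean gene length adds one intron between each pair of adjacent exons. Because the table values are rounded and may be computed over slightly different sets of models, only approximate agreement is expected. The variable and function names are illustrative.

```python
def relative_diff(expected, reported):
    """Absolute difference as a fraction of the reported value."""
    return abs(expected - reported) / reported


# Reported averages (bp) and counts from the tables on this page.
gene, transcript, exon, intron = 1898, 1709, 528, 87
exons_per_gene = 3.24
n_models, assembly_mbp = 13715, 37.75

# Mean transcript length ~ (mean exons per gene) x (mean exon length).
expected_transcript = exons_per_gene * exon
# Mean gene length ~ transcript plus one intron per exon-exon junction.
expected_gene = transcript + (exons_per_gene - 1) * intron

print(f"transcript: expected {expected_transcript:.0f}, reported {transcript}")
print(f"gene: expected {expected_gene:.0f}, reported {gene}")
print(f"gene density: {n_models / assembly_mbp:.0f} models per Mbp")
```

Both estimates fall within 1% of the reported averages, which suggests the table is internally consistent.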
Collaborators
- Jeffrey Skerker, UC Berkeley, CA
- Louise Glass, UC Berkeley, CA
- Nancy Keller, U. Wisconsin-Madison, WI
Genome Reference(s)
Skerker JM, Pianalto KM, Mondo SJ, Yang K, Arkin AP, Keller NP, Grigoriev IV, Glass NL
Chromosome assembled and annotated genome sequence of Aspergillus flavus NRRL 3357.
G3 (Bethesda). 2021 Aug 7;11(8). doi: 10.1093/g3journal/jkab213
Links
- JGI PhyloGroup Portals: Fungi, Dikarya, Ascomycota, Pezizomycotina, Eurotiomycetes, Eurotiales, Aspergillus