Sequencing and Assembly Information |
|
|
Assembly name |
Zm-B104-DRAFT-ISU_USDA-0.1 |
|
WGS accession |
|
|
Sequencing description |
Sequence service provider: Dow AgroSciences Sequencing technologies: Illumina paired end sequencing Sequencing method: Illumina pair-end sequencing (50x) technology was used to sequence the B104 maize line followed by its genome assembly using a reference-guided approach with B73 as the reference genome. De novo assembly was used for assembling the unmapped Illumina reads. Draft beta pseudomolecule assembly was done using reference-guided assembly via ALLMAPS with B73 and W22 synteny markers generated with SynMap using the Ensembl B73 version 4.32 genome or the W22 genome vs the unassembled B104 scaffolds. Many scaffolds with annotations are still unplaced. Additionally, in the upcoming full-release, 20x PacBio reads are being generated to create pseudomolecules. Sequencing hardware: HiSeq2000 |
|
Assembly description |
Assembly methods: Reference based assembly plus de novo assembly for unmapped reads Construction of pseudomolecules: Yes |
|
Browse Genome |
Genome browser at MaizeGDB |
|
Data download |
https://ftp.maizegdb.org/MaizeGDB/FTP/Zm-B104-DRAFT-ISU_USDA-0.1/
|
|
Release date |
fall 2017 |
|
Sequencing method |
Illumina paired end sequencing |
|
Finishing strategy |
Draft |
|
Seq hardware |
HiSeq2000 |
|
Genome used for alignment |
Reference Based scaffold assembly based on B73 genome v2 version. Pseudomolecule assembly using B73 v4 and W22. |
|
Genome coverage |
50X |
|
Seq service provider |
Dow AgroSciences |
|
Assembly statistics |
|
Scaff num |
407,410 |
|
Perc seq scaffold |
85 |
|
Perc seq unscaffold |
15 |
|
Total scaff length |
2,270,000,000 bp |
|
Longest scaff |
2,021,246 bp |
|
Shortest scaff |
201 bp |
|
N50 scaff length |
6,422 bp |
|
N50 scaff count |
21,663 |
|
N90 scaff length |
5,567 bp |
|
N90 scaff count |
359 |
|
Total contig length |
2,256,653,983 bp |
|
Longest contig |
469,437 bp |
|
N50 contig length |
32,474 bp |
|
N50 contig count |
18,287 |
|
N90 contig length |
2,363 bp |
|
N90 contig count |
105,178 |
|
Total number of scaffolds in assembly.
|
% assembly in scaffolded contigs.
|
% assembly in UNscaffolded contigs.
|
Total sequence length represented by scaffolds.
|
Longest scaffold in assembly.
|
Shortest scaffold in assembly.
|
The length of scaffold which takes the sum length (summing from longest to shortest scaffold) past 50% of the total assembly size.
|
How many scaffolds are counted in reaching the N50 threshold.
|
The length of scaffold which takes the sum length (summing from longest to shortest scaffold) past 90% of the total assembly size.
|
How many scaffolds are counted in reaching the N90 threshold.
|
Total sequence length represented by contigs.
|
The longest contig.
|
The length of contig which takes the sum length (summing from longest to shortest contig) past 50% of the total assembly size.
|
How many contig are counted in reaching the N50 threshold.
|
The length of contig which takes the sum length (summing from longest to shortest contig) past 90% of the total assembly size.
|
How many contig are counted in reaching the N90 threshold.
|
|
|
|
A contig is a contiguous consensus sequence that is
derived from a collection of overlapping reads.
A scaffold is set of a ordered and orientated contigs
that are linked to one another by mate pairs of sequencing reads.
|