Annotation of the genome assembly (version 2) of the microalga Tisochrysis lutea
We recently published the first draft genome of T. lutea obtained with the Illumina short-read technology. While this technology has a very low sequencing error rates, the assemblers are known to misassemble the long repeated sequences, resulting into the fragmentation of the genome assembly. The genome of T. lutea was re-sequenced with the long-read technology Pacific Bioscience. Indeed, long-read assemblers show efficiency to resolve the assembly of long repeated elements such as TEs. However, this technology have to date a high sequencing error rates and its combination with short-read Illumina data is became a common method to overcome this error rate. A de novo genome assembly was perform from the long-reads and was improved with Illumina short-read data, used in the first genome assembly version.
The de novo genome of T. lutea is composed of 193 contigs and has a size of 82 Mb. A gain of around 30 Mb was obtained (+34%), compared to the previous genome assembly, having a size of 54 Mb and composed of 7,659 contigs. The size of the coding regions has fewly increased between the both genome versions. While the de novo genome assembly encodes for ~16,000 genes, corresponding to a coding region length of 28 Mb, the previous gene proportion of the draft genome version was of 25 Mb. This suggest that the new assembled regions are mostly repeated elements. This new genome version is by far away more accurate than the previous one and was suitable to properly detect and annotate the TE content.
To identify potential autonomous TEs, we designed a pipeline named PiRATE (Pipeline to Retrieve and Annotate TEs) and conducted an accurate TE annotation in a de novo genome of T. lutea. We established that its genome is composed of 15.9% and 4.9% of Class I and Class II TEs respectively. Among them 3.8% and 15.95% correspond to potentially autonomous and non-autonomous TEs respectively.
owner : {{md.getOwnername()}}
{{'mdStatusRecord' | translate}}: {{('mdStatus-' + md.mdStatus) | translate}}
- Identification
- Content
- ReferenceSystem
- Quality
- DomainConsistancy
- Constraints
- Distribution
- Meta-metadata
- ObjectCatalogue
Date | 2018-01-01 |
---|---|
Date type | Creation: Date identifies when the resource was brought into existence |
Date type | Revision: Date identifies when the resource was examined or re-examined and improved or amended |
Date | 2019-08-06 |
Date type | Publication: Date identifies when the resource was issued |
Date type | Publication: Date identifies when the resource was issued |
Unique resource identifier | 4c25bb60-8d90-4127-92ed-c5df6a989e56 |
Unique resource identifier |
Point of contact
Organisation name | Ifremer, Scientific Information Systems for the sea |
---|---|
Delivery point | IFREMER Centre de Bretagne ZI Pointe du diable CS 10070 |
City | PLOUZANE |
Postal code | 29280 |
Country | France |
Electronic mail address | sismer@ifremer.fr |
Linkage | http://data.ifremer.fr/SISMER |
Role | Dataset Holding Organisation: Dataset Holding Organisation |
Point of contact
Organisation name | SEA scieNtific Open data Edition |
---|---|
Delivery point | By address: IFREMER / IDM / SISMER - Scientific Information Systems for the SEA, IFREMER Centre de Bretagne, ZI Pointe du diable CS 10070 |
City | PLOUZANE |
Postal code | 29280 |
Country | France |
Electronic mail address | data@seanoe.org |
Linkage | https://www.seanoe.org/ |
Role | Publisher: Publisher |
Point of contact
Organisation name | Ifremer, Scientific Information Systems for the sea |
---|---|
Delivery point | IFREMER Centre de Bretagne ZI Pointe du diable CS 10070 |
City | PLOUZANE |
Postal code | 29280 |
Country | France |
Electronic mail address | sismer@ifremer.fr |
Linkage | http://data.ifremer.fr/SISMER |
Role | Author: Party who authored the resource |
Descriptive keywords
GEMET - INSPIRE themes, version 1.0 | |
---|---|
SeaDataNet Agreed Parameter Groups | Other biological measurements |
SeaDataNet Parameter Discovery Vocabulary |
Concentration of other substances in biota
|
Microphytobenthos biomass
|
MEDIN data format categories | Text or Plaintext |
Language | English: English |
---|---|
Character set | UTF8: 8-bit variable size UCS Transfer Format, based on ISO/IEC 10646 |
Topic category code |
|
Supplemental Information | How to cite: Berthelier Jérémy, Casse Nathalie, Daccord Nicolas, Jamilloux Véronique, Saint-Jean Bruno, Carrier Gregory (2018). Annotation of the genome assembly (version 2) of the microalga Tisochrysis lutea. SEANOE. https://doi.org/10.17882/52231 |
Reference System Information
Anchor | WGS 1984 |
---|
Hierarchy level | Dataset: Information applies to the dataset |
---|
Lineage
Statement | Quality controlled data |
---|
Domain consistency
Conformance result
|
Domain consistency
Conformance result
|
Domain consistency
Conformance result
|
mdLegalAndSecurityConstraintsSection
Resource constraints
|
|||||||||||
Resource constraints
|
Transfer options
|
File identifier | 4c25bb60-8d90-4127-92ed-c5df6a989e56 | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Metadata language | English | ||||||||||||||||
Character set | UTF8: 8-bit variable size UCS Transfer Format, based on ISO/IEC 10646 | ||||||||||||||||
Hierarchy level | Dataset: Information applies to the dataset | ||||||||||||||||
Date stamp | 2024-06-25T00:00:00 | ||||||||||||||||
Metadata standard name | ISO 19115:2003/19139 | ||||||||||||||||
Metadata standard version | 1.0 | ||||||||||||||||
Contact
|
Overviews
extent
- geoDesc
- {{d}}
- geoBox
-
- geoDescCode
- {{mdView.current.record.geoDescCode}}
tempExtent
- creationDate
- publicationDate
- revisionDate
- tempExtentBegin
Associated resources
Not available