Track topics on Twitter Track topics that are important to you
Completion of eukaryal genomes can be difficult task with the highly repetitive sequences along the chromosomes and short read lengths of second-generation sequencing. Saccharomyces cerevisiae strain CEN.PK113-7D, widely used as a model organism and a cell factory, was selected for this study to demonstrate the superior capability of very long sequence reads for de novo genome assembly. We generated long reads using two common third-generation sequencing technologies (Oxford Nanopore Technology (ONT) and Pacific Biosciences (PacBio)) and used short reads obtained using Illumina sequencing for error correction. Assembly of the reads derived from all three technologies resulted in complete sequences for all 16 yeast chromosomes, as well as the mitochondrial chromosome, in one step. Further, we identified three types of DNA methylation (5mC, 4mC and 6mA). Comparison between the reference strain S288C and strain CEN.PK113-7D identified chromosomal rearrangements against a background of similar gene content between the two strains. We identified full-length transcripts through ONT direct RNA sequencing technology. This allows for the identification of transcriptional landscapes, including untranslated regions (UTRs) (5' UTR and 3' UTR) as well as differential gene expression quantification. About 91% of the predicted transcripts could be consistently detected across biological replicates grown either on glucose or ethanol. Direct RNA sequencing identified many polyadenylated non-coding RNAs, rRNAs, telomere-RNA, long non-coding RNA and antisense RNA. This work demonstrates a strategy to obtain complete genome sequences and transcriptional landscapes that can be applied to other eukaryal organisms.
This article was published in the following journal.
Name: Nucleic acids research
Transcriptional reporter systems allow researchers to investigate the function and regulation of transcription factors. Conventional systems employ artificial cDNA overexpression vectors containing ei...
A regularly shaped grid is useful for analyzing data particularly at multilayer levels, where patterns can be visually represented and analytically compared-conceptually similar to Picasso's cubism. H...
Somatic genome mutations occur due to combinations of various intrinsic/extrinsic mutational processes and DNA repair mechanisms. Different molecular processes frequently generate different signatures...
We integrated the genomic sequencing of 1,918 breast cancers, including 1,501 hormone receptor-positive tumors, with detailed clinical information and treatment outcomes. In 692 tumors previously expo...
The genomic landscape of metastatic castration-resistant prostate cancer (mCRPC) differs from that of the primary tumor and is dynamic during tumor progression. The real-time and repeated characteriza...
The investigators propose to conduct a retrospective study of single agent ceritinib in patients with previously untreated anaplastic lymphoma kinase (ALK) rearranged adenocarcinoma of the...
The investigators propose to conduct a pilot feasibility study of single agent afatinib in patients with previously untreated metastatic EGFR (epidermal growth factor receptor) mutant aden...
This is a multicenter prospective collection of leftover respiratory tract secretions, paired blood and NP swabs, and clinical circumstances from pediatric HCT patients, followed by next g...
This study will prospectively characterize the molecular, cellular and genetic properties of primary and metastatic neuroblastoma, osteosarcoma, retinoblastoma, Ewing sarcoma family of tum...
The aim of this study is to establish large tissue sections for 10 kinds of tumors. in order to observe the tumor landscape on microscope. The tumors including esophageal carcinoma,gastric...
An increase number of repeats of a genomic, tandemly repeated DNA sequence from one generation to the next.
The genomic analysis of assemblages of organisms.
The detection of RESTRICTION FRAGMENT LENGTH POLYMORPHISMS by selective PCR amplification of restriction fragments derived from genomic DNA followed by electrophoretic analysis of the amplified restriction fragments.
The generation of theories from analysis of empirical data.
Bacteria that can survive and grow in the complete, or nearly complete absence of oxygen.
DNA sequencing is the process of determining the precise order of nucleotides within a DNA molecule. During DNA sequencing, the bases of a small fragment of DNA are sequentially identified from signals emitted as each fragment is re-synthesized from a ...