Advertisement

Topics

Comprehensive benchmarking of SNV callers for highly admixed tumor data.

08:00 EDT 11th October 2017 | BioPortfolio

Summary of "Comprehensive benchmarking of SNV callers for highly admixed tumor data."

Precision medicine attempts to individualize cancer therapy by matching tumor-specific genetic changes with effective targeted therapies. A crucial first step in this process is the reliable identification of cancer-relevant variants, which is considerably complicated by the impurity and heterogeneity of clinical tumor samples. We compared the impact of admixture of non-cancerous cells and low somatic allele frequencies on the sensitivity and precision of 19 state-of-the-art SNV callers. We studied both whole exome and targeted gene panel data and up to 13 distinct parameter configurations for each tool. We found vast differences among callers. Based on our comprehensive analyses we recommend joint tumor-normal calling with MuTect, EBCall or Strelka for whole exome somatic variant calling, and HaplotypeCaller or FreeBayes for whole exome germline calling. For targeted gene panel data on a single tumor sample, LoFreqStar performed best. We further found that tumor impurity and admixture had a negative impact on precision, and in particular, sensitivity in whole exome experiments. At admixture levels of 60% to 90% sometimes seen in pathological biopsies, sensitivity dropped significantly, even when variants were originally present in the tumor at 100% allele frequency. Sensitivity to low-frequency SNVs improved with targeted panel data, but whole exome data allowed more efficient identification of germline variants. Effective somatic variant calling requires high-quality pathological samples with minimal admixture, a consciously selected sequencing strategy, and the appropriate variant calling tool with settings optimized for the chosen type of data.

Affiliation

Journal Details

This article was published in the following journal.

Name: PloS one
ISSN: 1932-6203
Pages: e0186175

Links

DeepDyve research library

PubMed Articles [21585 Associated PubMed Articles listed on BioPortfolio]

Reliability of algorithmic somatic copy number alteration detection from targeted capture data.

Whole exome and gene panel sequencing are increasingly used for oncological diagnostics. To investigate the accuracy of SCNA detection algorithms on simulated and clinical tumor samples, the precision...

Assessment of variant pathogenicity in a highly admixed population.

Analysis of ancestry informative markers in three main ethnic groups from Ecuador supports a trihybrid origin of Ecuadorians.

Ancestry inference is traditionally done using autosomal SNPs that present great allele frequency differences among populations from different geographic regions. These ancestry informative markers (A...

Tobacco Use Cessation Among Quitline Callers Who Implemented Complete Home Smoking Bans During the Quitting Process.

The implementation of a home smoking ban (HSB) is associated with tobacco use cessation. We identified which quitline callers were most likely to report 30-day cessation among those who implemented co...

Bioinformatics Data Analysis of Next-Generation Sequencing Data from Heterogeneous Tumor Samples.

Tumor heterogeneity is a major challenge when it comes to treating cancer and also complicates research aimed at determining genetic sources for tumorigenesis. Leveraging high-throughput sequencing te...

Clinical Trials [7256 Associated Clinical Trials listed on BioPortfolio]

OPtimal Type 2 dIabetes Management Including Benchmarking and Standard trEatment.

Demonstrate that the use of benchmarking improves quality of patient care, in particular the control of diabetes, lipids and blood pressure, by determining the percentage of patients in th...

Suitability of Some Data Quality Controls Thresholds for Genetic Association Studies of Admixed Population

Background: In genetic studies, the quality of DNA samples is tested first. Samples that are low-quality are not used. Some studies involve minority ethnic groups. And example is admixed A...

Integrating Cancer Control Referrals and Navigators Into United Way 211 Missouri

The proposed study will: 1. estimate the prevalence of need for cancer screening and prevention in a population of 211 callers; 2. determine whether cancer communication i...

Comprehensive Segmental Revision System

Clinical Data evaluation to document the performance and clinical outcomes of the Comprehensive Segmental Revision System.

Degree of Worry as a Predictor for Utilization of Acute Health Care Within 48 Hours After Contact With Out-of-hours

The overall aim of the study is to construct a scale that systematically incorporates the callers' perspective in a "degree of worry - scale" and to explore the consequences for the actors...

Medical and Biotech [MESH] Definitions

Information application based on a variety of coding methods to minimize the amount of data to be stored, retrieved, or transmitted. Data compression can be applied to various forms of data, such as images and signals. It is used to reduce costs and increase efficiency in the maintenance of large volumes of data.

Comprehensive, methodical analysis of complex biological systems by monitoring responses to perturbations of biological processes. Large scale, computerized collection and analysis of the data are used to develop and test models of biological systems.

Method of measuring performance against established standards of best practice.

A rare but highly lethal childhood tumor found almost exclusively in infants. Histopathologically, it resembles RHABDOMYOSARCOMA but the tumor cells are not of myogenic origin. Although it arises primarily in the kidney, it may be found in other parts of the body. The rhabdoid cytomorphology is believed to be the expression of a very primitive malignant cell. (From Holland et al., Cancer Medicine, 3d ed, p2210)

Various units or machines that operate in combination or in conjunction with a computer but are not physically part of it. Peripheral devices typically display computer data, store data from the computer and return the data to the computer on demand, prepare data for human use, or acquire data from a source and convert it to a form usable by a computer. (Computer Dictionary, 4th ed.)

Quick Search
Advertisement
 


DeepDyve research library

Relevant Topics

Cancer
  Bladder Cancer Brain Cancer Breast Cancer Cancer Cervical Cancer Colorectal Head & Neck Cancers Hodgkin Lymphoma Leukemia Lung Cancer Melanoma Myeloma Ovarian Cancer Pancreatic Cancer ...

Cancer Disease
Cancer is not just one disease but many diseases. There are more than 100 different types of cancer. Most cancers are named for the organ or type of cell in which they start - for example, cancer that begins in the colon is called colon cancer; cancer th...

Antiretroviral therapy
Standard antiretroviral therapy (ART) consists of the combination of at least three antiretroviral (ARV) drugs to maximally suppress the HIV virus and stop the progression of HIV disease. Huge reductions have been seen in rates of death and suffering whe...


Searches Linking to this Article