Unified Methods for Feature Selection in Large-Scale Genomic Studies with Censored Survival Outcomes.

08:00 EDT 10th March 2020 | BioPortfolio

Summary of "Unified Methods for Feature Selection in Large-Scale Genomic Studies with Censored Survival Outcomes."

A major goal of large-scale genomic studies is to identify genes that have a prognostic impact on time-to-event outcomes and thereby provide insight into the disease process. With rapid developments in high-throughput genomic technologies over the past two decades, the scientific community is able to monitor the expression levels of tens of thousands of genes and proteins, resulting in enormous data sets in which the number of genomic features far exceeds the number of subjects. Methods based on univariate Cox regression are often used to select genomic features related to survival outcome; however, the Cox model assumes proportional hazards (PH), which is unlikely to hold for every feature. When applied to genomic features exhibiting some form of non-proportional hazards (NPH), these methods could under- or overestimate the effects. We propose a broad array of marginal screening techniques that aid in feature ranking and selection by accommodating various forms of NPH. First, we develop an approach based on Kullback-Leibler information divergence and the Yang-Prentice model that includes methods for the PH and proportional odds (PO) models as special cases. Next, we propose R² measures for the PH and PO models that can be interpreted in terms of explained randomness. Lastly, we propose a generalized pseudo-R² index that includes the PH, PO, crossing hazards and crossing odds models as special cases and can be interpreted as the percentage of separability, according to feature measurements, between subjects who experience the event and those who do not.
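As an illustration of the conventional baseline the summary contrasts against (marginal screening with univariate Cox regression under PH), the following is a minimal sketch in plain Python. It is not the authors' proposed method: it ranks features by the score test of the Cox partial likelihood evaluated at a zero coefficient, using Breslow-style risk sets. The function names, the feature names `gene_a` and `gene_b`, and the toy data are all hypothetical.

```python
from typing import Dict, List, Sequence, Tuple


def cox_score_statistic(times: Sequence[float],
                        events: Sequence[int],
                        x: Sequence[float]) -> float:
    """Chi-square score statistic for one feature in a univariate Cox model.

    Evaluated at beta = 0: each observed event contributes the difference
    between its feature value and the risk-set mean (the score) and the
    risk-set variance (the information).
    """
    score = 0.0
    info = 0.0
    for i, (t_i, d_i) in enumerate(zip(times, events)):
        if not d_i:
            continue  # censored subjects only appear in risk sets
        # Risk set: everyone still under observation at time t_i.
        risk = [x[j] for j, t_j in enumerate(times) if t_j >= t_i]
        mean = sum(risk) / len(risk)
        var = sum((v - mean) ** 2 for v in risk) / len(risk)
        score += x[i] - mean
        info += var
    # A feature with no variation in any risk set carries no information.
    return score * score / info if info > 0 else 0.0


def rank_features(times: Sequence[float],
                  events: Sequence[int],
                  features: Dict[str, Sequence[float]]) -> List[Tuple[str, float]]:
    """Rank features from strongest to weakest marginal association."""
    stats = [(name, cox_score_statistic(times, events, values))
             for name, values in features.items()]
    return sorted(stats, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy data: 8 subjects, event indicator 1 = event, 0 = censored.
times = [2, 3, 5, 7, 11, 13, 17, 19]
events = [1, 1, 0, 1, 1, 0, 1, 1]
features = {
    "gene_a": [8, 7, 6, 5, 4, 3, 2, 1],  # high expression, early events
    "gene_b": [1, 2, 1, 2, 1, 2, 1, 2],  # no systematic relation to time
}

ranked = rank_features(times, events, features)
for name, stat in ranked:
    print(f"{name}: score statistic = {stat:.3f}")
```

Because this statistic is derived under the PH assumption, a feature whose effect changes direction over time (for example, crossing hazards) can receive a small score even when it separates subjects well, which is the limitation that motivates the NPH-aware screening measures described above.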


Journal Details

This article was published in the following journal.

Name: Bioinformatics (Oxford, England)
ISSN: 1367-4811



PubMed Articles [31505 Associated PubMed Articles listed on BioPortfolio]

Distributed Selection of Continuous Features in Multilabel Classification Using Mutual Information.

Multilabel learning is a challenging task demanding scalable methods for large-scale data. Feature selection has been shown to improve multilabel accuracy while defying the curse of dimensionality of high-...

Functional Genomic Selection in Crop Breeding.

Genomic selection (GS) is rapidly being adopted by many plant and animal breeding programs. New statistical methods that increase prediction accuracy are needed to enable effective GS. This chapter wi...

Bi-level feature selection in high dimensional AFT models with applications to a genomic study.

We propose a new bi-level feature selection method for high-dimensional accelerated failure time models by formulating the models as a single-index model. The method yields sparse solutions at both th...

Clinical study designs and patient selection methods based on genomic biomarkers: Points-to-consider documents.

Recently, genomic biomarkers have been widely used clinically for prediction of the efficacy and safety of pharmacotherapy and diagnosis and prognosis of pathological conditions. Therefore, genomic bi...

Biomarker discovery in inflammatory bowel diseases using network-based feature selection.

Reliable identification of inflammatory biomarkers from metagenomics data is a promising direction for developing non-invasive, cost-effective, and rapid clinical tests for early diagnosis of IBD. We ...

Clinical Trials [9192 Associated Clinical Trials listed on BioPortfolio]

Genomic Evaluation in Patients With Diffuse Large B Cell Lymphoma After First Relapse/Progression

DLBCL is the most frequent of all lymphoid malignancies. With the recent development of antitumor agents targeting intracellular/extracellular cell signaling pathways, patients ha...

Return of Genomic Results and Aggregate Penetrance in Population-Based Cohorts

The PopSeq Project is a prospective cohort study that will develop and implement a genomic return of results (gRoR) process in the Framingham Heart Study (FHS) and Jackson Heart Study (JHS...

CPAP Device In-lab Assessment NZ

The purpose of this trial is to assess device performance with participants in an overnight study to ensure the product meets user and clinical requirements.

Quest Sound Recover (SR2) vs. Venture SR2

The goal of this study is to determine the benefit of an improved feature on a new hearing aid platform. To investigate the improvements, this feature is compared on a new and older hearing...

Effectiveness of a Unified Transdiagnostic Treatment in Routine Care

The purpose of this study is to examine the effectiveness and implementation of the Unified Protocol for the Transdiagnostic Treatment of Emotional Disorders in trauma-exposed veterans.

Medical and Biotech [MESH] Definitions

Contiguous large-scale (1000-400,000 basepairs) differences in the genomic DNA between individuals, due to SEQUENCE DELETION; SEQUENCE INSERTION; or SEQUENCE INVERSION.

The techniques used to produce molecules exhibiting properties that conform to the demands of the experimenter. These techniques combine methods of generating structural changes with methods of selection. They are also used to examine proposed mechanisms of evolution under in vitro selection conditions.

Small-scale tests of methods and procedures to be used on a larger scale if the pilot study demonstrates that these methods and procedures can work.

A method for analyzing and mapping differences in the copy number of specific genes or other large sequences between two sets of chromosomal DNA. It is used to look for large sequence changes such as deletions, duplications, or amplifications within the genomic DNA of an individual (with a tumor for example) or family members or population or between species.

Methods for cultivation of cells, usually on a large-scale, in a closed system for the purpose of producing cells or cellular products to harvest.


Relevant Topic

A diagnostic test is any kind of medical test performed to aid in the diagnosis or detection of disease. For example: to diagnose diseases; to measure the progress or recovery from disease; to confirm that a person is free from disease. Clin...
