Track topics on Twitter Track topics that are important to you
The original random forests (RFs) algorithm has been widely used and has achieved excellent performance for the classification and regression tasks. However, the research on the theory of RFs lags far behind its applications. In this article, to narrow the gap between the applications and the theory of RFs, we propose a new RFs algorithm, called random Shapley forests (RSFs), based on the Shapley value. The Shapley value is one of the well-known solutions in the cooperative game, which can fairly assess the power of each player in a game. In the construction of RSFs, RSFs use the Shapley value to evaluate the importance of each feature at each tree node by computing the dependency among the possible feature coalitions. In particular, inspired by the existing consistency theory, we have proved the consistency of the proposed RFs algorithm. Moreover, to verify the effectiveness of the proposed algorithm, experiments on eight UCI benchmark datasets and four real-world datasets have been conducted. The results show that RSFs perform better than or at least comparable with the existing consistent RFs, the original RFs, and a classic classifier, support vector machines.
This article was published in the following journal.
Name: IEEE transactions on cybernetics
In this article, we propose a novel entropy and confidence-based undersampling boosting (ECUBoost) framework to solve imbalanced problems. The boosting-based ensemble is combined with a new undersampl...
Studies using time-frequency analysis have reported that somatosensory evoked potentials provide information regarding the location of spinal cord injury. However, a better understanding of the time-f...
Environmental data may be "large" due to number of records, number of covariates, or both. Random forests has a reputation for good predictive performance when using many covariates with nonlinear rel...
Empirical testing of chemicals for drug efficacy costs many billions of dollars every year. The ability to predict the action of molecules in silico would greatly increase the speed and decrease the c...
Oblique random forests (ObRFs) have attracted increasing attention recently. Their popularity is mainly driven by learning oblique hyperplanes instead of expensively searching for axis-aligned hyperpl...
The purpose of this prospective single-center study is to evaluate the laboratory outcome after random start ovarian stimulation in oocytes donors. The study will be performed in egg-donor...
The goal of this project is to determine the barriers to, and risks and benefits of random laboratory drug testing for adolescents with identified drug or alcohol problems.
The purpose of this study is to assess the efficacy of random placenta margin incision for hemorrhage control during cesarean section of complete placenta previa pregnancies.
A random sample a random sample of half of all men born in 1943 and living in the city of Gothenburg, Sweden, have been investigated in 1993 at 50 years of age and will be followed continu...
The objective of this study is to determine whether treatment with random nicotine delivery via a nicotine film both before and after the target quit date will facilitate smoking cessation...
The interaction of two or more persons or organizations directed toward a common goal which is mutually beneficial. An act or instance of working or acting together for a common purpose or benefit, i.e., joint action. (From Random House Dictionary Unabridged, 2d ed)
In statistics, a technique for numerically approximating the solution of a mathematical problem by studying the distribution of some random variable, often generated by a computer. The name alludes to the randomness characteristic of the games of chance played at the gambling casinos in Monte Carlo. (From Random House Unabridged Dictionary, 2d ed, 1993)
Clinical trials that involve at least one test treatment and one control treatment, concurrent enrollment and follow-up of the test- and control-treated groups, and in which the treatments to be administered are selected by a random process, such as the use of a random-numbers table.
Work consisting of a clinical trial that involves at least one test treatment and one control treatment, concurrent enrollment and follow-up of the test- and control-treated groups, and in which the treatments to be administered are selected by a random process, such as the use of a random-numbers table.
Works consisting of discourses for the purpose of religious instruction or exhortation, especially one based on a text of Scripture and delivered by a member of the clergy, as part of a religious service. (From: Random House Unabridged Dictionary, 2d ed)