FAST RELEVANCE-REDUNDANCY DOMINANCE: FEATURE SELECTION FOR HIGH DIMENSIONAL DATA

Home

Document Info

Title:	FAST RELEVANCE-REDUNDANCY DOMINANCE: FEATURE SELECTION FOR HIGH DIMENSIONAL DATA
Author(s):	David Browne, Carlo Manna and Steven D. Prestwich
ISBN:	978-989-8533-66-1
Editors:	Yingcai Xiao and Ajith P. Abraham
Year:	2017
Edition:	Single
Keywords:	Feature-Selection, Filter-Based, High-Dimensional, Scalability, Microarray
Type:	Full Paper
First Page:	255
Last Page:	262
Language:	English
Cover:
Full Contents:	click to dowload
Paper Abstract:	Feature selection is used to select a subset of features for model construction. This reduces dimensionality which is important for simplification, efficiency and reducing overfitting. Filter-based methods are the most scalable, rating features by their relevance to the target variable via appropriate statistics. Browne, et al. proposed a filter feature selection method, called Relevance-Redundancy Dominance (RRD) with useful properties (no threshold setting, adaptability to any statistics etc.), but with a poor scalability. In this paper, we present a scalable version of RRD, called Fast Relevance-Redundancy Dominance which holds the same properties as RRD while improving scalability. To show the effectiveness of the proposed approach we have carried out extensive numerical experiments on high dimensional datasets (DNA microarray datasets) which shows that it outperforms state-of-the-art algorithms.

	Go Back