Publication: Evolutionary feature selection on high dimensional data using a search space reduction approach
Publisher
Elsevier
Abstract
Feature selection is becoming increasingly challenging as the dimensionality of data grows. The complexity of the interactions among features and the size of the search space make it infeasible to find the optimal subset of features. To reduce the search space, feature grouping has emerged as an approach that clusters features according to the information they share about the class. Metaheuristic algorithms, in turn, have been shown to reach good, though sub-optimal, solutions within a reasonable time. In this work we propose a Scatter Search (SS) strategy that uses feature grouping to generate an initial population of diverse, high-quality solutions. Solutions are then evolved by applying random mechanisms in combination with the feature group structure, with the aim of keeping the population both good and as diverse as possible throughout the search. The proposed strategy not only provides the best subset of features found but also reduces the redundancy structure of the data. We test the strategy on high-dimensional data from the biomedical and text-mining domains. The results are compared with those obtained by other adaptations of SS and by other popular strategies. They show that the proposed strategy finds, on average, the smallest subsets of features without degrading the performance of the classifier.
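The idea of seeding a population from feature groups can be illustrated with a minimal sketch. It is not the method of the paper: the abstract does not say how groups are built or how solutions are evolved, so the grouping rule here (absolute Pearson correlation against a group's first member, with a hypothetical threshold of 0.8) and the helper names pearson, group_features and initial_population are assumptions made only for illustration. The sketch covers the initial population only, not the Scatter Search loop.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    denom = math.sqrt(sxx * syy)
    return sxy / denom if denom else 0.0


def group_features(columns, threshold=0.8):
    """Greedy grouping (assumed rule): a feature joins the first group whose
    first member it correlates with at or above the threshold, else starts a new group."""
    groups = []
    for idx, col in enumerate(columns):
        for group in groups:
            if abs(pearson(columns[group[0]], col)) >= threshold:
                group.append(idx)
                break
        else:
            groups.append([idx])
    return groups


def initial_population(groups, size, rng):
    """Build candidate subsets that take at most one feature from each group,
    so no candidate contains two features from the same redundant group."""
    population = []
    for _ in range(size):
        k = rng.randint(1, len(groups))       # how many groups to draw from
        chosen = rng.sample(groups, k)        # which groups
        population.append(sorted(rng.choice(g) for g in chosen))
    return population


# Toy data: one list per feature, five samples each.
columns = [
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.1, 3.9, 6.2, 8.0, 9.9],  # nearly a multiple of feature 0
    [5.0, 1.0, 4.0, 2.0, 3.0],
    [4.9, 1.2, 4.1, 1.8, 3.1],  # nearly a copy of feature 2
]
groups = group_features(columns)
population = initial_population(groups, size=5, rng=random.Random(0))
```

On this toy data the four features collapse into two groups, so each candidate subset holds one or two features. A grouping based on information shared about the class, as the abstract describes, would need the class labels as well; the correlation rule above ignores them.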
Research projects
FECYT -- APRENDIZAJE PROFUNDO Y APRENDIZAJE ONLINE EXPLICABLES PARA SOST...
PY20-00870
UPO-138516
Bibliographic reference
Engineering Applications of Artificial Intelligence, vol. 117, p. 105556






