A Gaussian Model for Feature Selection in Protein Fold Recognition

Pedro Shiguihara-Juárez, Nils Murrugarra-Llerena

Resultado de la investigación: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Protein fold recognition is an important task to discover new biological functions of proteins. In this context, machine learning techniques have been used to protein fold recognition, stating this task as a classification problem. However, in many cases, the similarity of patterns to protein fold recognition becomes this process in a complex task, limiting the performance of the machine learning techniques. In this paper, we propose a feature selection method to support machine learning methods for protein fold recognition, using gaussian distributions in the process of features analysis. We cluster features by gaussian distributions. These clusters give information to reduce the dimensionality of the features. After that, we use baselines classifiers to protein fold recognition, using a well-known dataset for this task. The results suggest that the clustering and reduction of dimensionality of features using gaussian distribution can help to improve the accuracy of machine learning techniques on this task.

Idioma originalInglés
Título de la publicación alojadaProceedings of the 2018 IEEE Sciences and Humanities International Research Conference, SHIRCON 2018
EditorialInstitute of Electrical and Electronics Engineers Inc.
ISBN (versión digital)9781538683743
DOI
EstadoPublicada - 27 dic. 2018
Publicado de forma externa
Evento2018 IEEE Sciences and Humanities International Research Conference, SHIRCON 2018 - Lima, Perú
Duración: 20 nov. 201822 nov. 2018

Serie de la publicación

NombreProceedings of the 2018 IEEE Sciences and Humanities International Research Conference, SHIRCON 2018

Conferencia

Conferencia2018 IEEE Sciences and Humanities International Research Conference, SHIRCON 2018
País/TerritorioPerú
CiudadLima
Período20/11/1822/11/18

Huella

Profundice en los temas de investigación de 'A Gaussian Model for Feature Selection in Protein Fold Recognition'. En conjunto forman una huella única.

Citar esto