Exploring Deep Neural Networks and Decision Tree for Spanish Text Classification

Pedro Shiguihara, Lilian Berton

Resultado de la investigación: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Nowadays, huge amounts of information are available on social networks, blogs, websites, and digital libraries. Most of this information is in unstructured text format, so text mining approaches have become increasingly studied to process all this data. Text classification aims to automatically classify documents into predetermined categories, applying machine learning (ML) algorithms. In this paper, we collected a dataset set related to reviews of a food store in Peru and compared different vectorization models, such as Term Frequency Inverse Document Frequency (TF-IDF), Bag of Words (BoW), and classification algorithms, such as traditional ML classifiers SVM, Decision Tree, MLP, KNN, Naive Bayes and a recent approach "deep jointly informed neural networks"(DJINN) that initialize deep feedforward neural networks based on decision trees. The results show DJINN gets a F1-score higher than traditional ML, being a promising technique for text classification.

Idioma originalInglés
Título de la publicación alojadaProceedings of the 2022 IEEE 29th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022
EditorialInstitute of Electrical and Electronics Engineers Inc.
Páginas1-4
Número de páginas4
ISBN (versión digital)9781665486361
ISBN (versión impresa)9781665486361
DOI
EstadoPublicada - 11 ago. 2022
Publicado de forma externa
Evento29th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022 - Lima, Perú
Duración: 11 ago. 202213 ago. 2022

Serie de la publicación

NombreProceedings of the 2022 IEEE 29th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022

Conferencia

Conferencia29th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2022
País/TerritorioPerú
CiudadLima
Período11/08/2213/08/22

Huella

Profundice en los temas de investigación de 'Exploring Deep Neural Networks and Decision Tree for Spanish Text Classification'. En conjunto forman una huella única.

Citar esto