TY - GEN
T1 - A Method to Construct Guidelines for Spanish Comments Annotation for Sentiment Analysis
AU - Urpay-Camasi, John
AU - Garcia-Calderon, Jorge
AU - Shiguihara, Pedro
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - The application of sentiment analysis in social networks supports the understanding of complaints and claims of users' comments. To train the models that automate this analysis, it is important to construct guidelines that generate a more robust corpus. As far as we know, no related work of guidelines for spanish comments annotation has been found. We propose a method to construct guidelines to annotators reach a consensus in the entire annotation process of spanish comments from social networks. We annotated 3259 spanish comments using our guidelines, where the concordance analysis from our annotators was 84%. We employed our corpus and eight baseline classifiers for sentiment analysis detection, achieving 78.63% as the highest F1-Score with Multilayer Perceptron. Our method is useful to tackle labeling spanish comments which can be used in NLP tasks such as sentiment analysis.
AB - The application of sentiment analysis in social networks supports the understanding of complaints and claims of users' comments. To train the models that automate this analysis, it is important to construct guidelines that generate a more robust corpus. As far as we know, no related work of guidelines for spanish comments annotation has been found. We propose a method to construct guidelines to annotators reach a consensus in the entire annotation process of spanish comments from social networks. We annotated 3259 spanish comments using our guidelines, where the concordance analysis from our annotators was 84%. We employed our corpus and eight baseline classifiers for sentiment analysis detection, achieving 78.63% as the highest F1-Score with Multilayer Perceptron. Our method is useful to tackle labeling spanish comments which can be used in NLP tasks such as sentiment analysis.
KW - annotation of spanish comments
KW - dataset labeling
KW - guidelines for corpus annotation
KW - sentiment analysis
UR - http://www.scopus.com/inward/record.url?scp=85124396931&partnerID=8YFLogxK
U2 - 10.1109/SHIRCON53068.2021.9652313
DO - 10.1109/SHIRCON53068.2021.9652313
M3 - Contribución a la conferencia
AN - SCOPUS:85124396931
T3 - Proceedings of the 2021 IEEE Sciences and Humanities International Research Conference, SHIRCON 2021
BT - Proceedings of the 2021 IEEE Sciences and Humanities International Research Conference, SHIRCON 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 17 November 2021 through 19 November 2021
ER -