Violencia Identificada en el Lenguaje (VIL): Creación de recurso para mensajes violentos

Patricio Martínez Barco; Estela Saquete Boró; Beatriz Botella Gil; Robiert Sepúlveda Torres

Ayuda

Violencia Identificada en el Lenguaje (VIL): Creación de recurso para mensajes violentos

Autores: Patricio Martínez Barco , Estela Saquete Boró , Beatriz Botella Gil, Robiert Sepúlveda Torres
Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 70, 2023, págs. 187-198
Idioma: español
DOI: 10.26342/2023-70-15
Títulos paralelos:
- Violence Identified in Language (VIL): Creation of a resource for the detection of violent messages
Enlaces
- Texto completo
Resumen
- español
  La sociedad avanza cargada de conocimientos nuevos y muy accesibles, que se publican en el mundo virtual. Es una realidad que las Tecnologías de la Información y la Comunicación (TIC) han traído muchos beneficios a nuestras vidas pero también vemos como año tras año aumenta el uso de violencia en plataformas digitales. Nuestro trabajo se enfoca en la creación de recursos que permitan la detección de mensajes violentos en la red social Twitter. Se parte de la creación de una guía de anotación de grano fino para anotar un corpus de mensajes violentos (VIL) con el fin de utilizar herramientas de aprendizaje automático que nos ayuden a detectar automáticamente el problema. Con este corpus se entrenan dos modelos de lenguaje (BETO y RoBERTa base) con los que se alcanza un valor en la métrica F1m de 97.03% y 96.51% clasificando si un tuit es o no violento.
- English
  Society is moving forward full of new and very accessible knowledge, which is published in the virtual world. It is a reality that ICTs have brought many benefits to our lives but we also see how year after year the use of violence on digital platforms increases. Our work focuses on the detection of violent messages in the social network Twitter. Starting from the creation of a fine-grained annotation guide to obtain a corpus of violent messages (VIL) in order to use Machine Learning tools that help us to automatically detect the problem Two language models are trained with this corpus (BETO and RoBERTa base) with which a value of 97.03% and 96.51% is reached in the F1m metric, classifying whether or not a tweet is violent.
Referencias bibliográficas
- Alonso, L. y V. J. Vázquez. 2017. Sobre la libertad de expresión y el discurso del odio: Textos críticos. Athenaica ediciones universitarias.
- Arcila-Calderón, C., J. J. Amores, P. Sánchez-Holgado, y D. Blanco-Herrero. 2021. Using shallow and deep learning to automatically detect...
- Badjatiya, P., S. Gupta, M. Gupta, y V. Varma. 2017. Deep learning for hate speech detection in tweets. En Proceedings of the 26th international...
- Basile, V., C. Bosco, E. Fersini, D. Nozza, V. Patti, F. M. R. Pardo, P. Rosso, y M. Sanguinetti. 2019. Semeval-2019 task 5: Multilingual...
- Bassignana, E., V. Basile, y V. Patti. 2018. Hurtlex: A multilingual lexicon of words to hurt. En 5th Italian Conference on Computational...
- Bruns, A. 2019. After the ‘apicalypse’: Social media platforms and their fight against critical scholarly research. Information, Communication...
- Burnap, P. y M. L. Williams. 2014. Hate speech, machine classification and statistical modelling of information flows on twitter: Interpretation...
- Cañete, J., G. Chaperón, R. Fuentes, y J. Pérez. 2020. Spanish pre-trained bert model and evaluation data. PML4DC at ICLR, 2020.
- Cohen, J. 1960. A coefficient of agreement for nominal scales. Educational and psychological measurement, 20(1):37–46.
- Dadvar, M., D. Trieschnigg, R. Ordelman, y F. d. Jong. 2013. Improving cyberbullying detection with user context. En European Conference on...
- del Arco, F. M. P., M. D. Molina-González, L. A. Ureña-López, y M.-T. MartınValdivia. 2022. Integrating implicit and explicit linguistic phenomena...
- Devlin, J., M.-W. Chang, K. Lee, y K. Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv...
- Fernández, J., F. Llopis, P. Martínez-Barco, Y. Gutiérrez, y A. Dıez. 2017. Analizando opiniones en las redes sociales. Procesamiento del...
- Flores, J. y M. Casal. 2008. Ciberbullying. Guıa rápida para la prevención del acoso por medio de las nuevas tecnologías.
- Fortuna, P. y S. Nunes. 2018. A survey on automatic detection of hate speech in text. ACM Computing Surveys (CSUR), 51(4):1–30.
- Frenda, S., A. T. Cignarella, V. Basile, C. Bosco, V. Patti, y P. Rosso. 2022. The unbearable hurtfulness of sarcasm. Expert Systems with...
- Frenda, S., V. Patti, y P. Rosso. 2022. Killing me softly: Creative and cognitive aspects of implicitness in abusive language online. Natural...
- Gitari, N. D., Z. Zuping, H. Damien, y J. Long. 2015. A lexicon-based approach for hate speech detection. International Journal of Multimedia...
- Gutiérrez-Fandiño, A., J. Armengol Estapé, M. P`amies, J. Llop-Palao, J. Silveira-Ocampo, C. P. Carrino, A. Gonzalez-Agirre, C. Armentano-Oller,...
- Liu, Y., M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, y V. Stoyanov. 2019. Roberta: A robustly optimized...
- Martins, R., M. Gomes, J. J. Almeida, P. Novais, y P. Henriques. 2018. Hate speech classification in social media using emotional analysis....
- Mathew, B., P. Saha, S. M. Yimam, C. Biemann, P. Goyal, y A. Mukherjee. 2021. Hatexplain: A benchmark dataset for explainable hate speech...
- McMenamin, G. R. 2017. Introducción a la lingüística forense: un libro de curso. Press at California State University, Fresno.
- Nielsen, L. B. 2002. Subtle, pervasive, harmful: Racist and sexist remarks in public as hate speech. Journal of Social Issues, 58:265–280,...
- Nobata, C., J. Tetreault, A. Thomas, Y. Mehdad, y Y. Chang. 2016. Abusive language detection in online user content. En Proceedings of the...
- Olteanu, A., C. Castillo, J. Boy, y K. Varshney. 2018. The effect of extremist violence on hateful speech online. En Proceedings of the international...
- Ott, B. L. 2017. The age of twitter: Donald j. trump and the politics of debasement. Critical studies in media communication, 34(1):59–68.
- Plaza-Del-Arco, F.-M., M. D. MolinaGonzález, L. A. Ureña-López, y M. T. Martın-Valdivia. 2020. Detecting misogyny and xenophobia in spanish...
- Plaza-del Arco, F. M., A. B. P. Portillo, P. L. Úbeda, B. Gil, y M.-T. Martın-Valdivia. 2022. Share: A lexicon of harmful expressions by...
- Poletto, F., V. Basile, M. Sanguinetti, C. Bosco, y V. Patti. 2021. Resources and benchmark corpora for hate speech detection: a systematic...
- Qian, J., M. ElSherief, E. Belding, y W. Y. Wang. 2019. Learning to decipher hate symbols. arXiv preprint arXiv:1904.02418.
- Rosenthal, S., P. Atanasova, G. Karadzhov, M. Zampieri, y P. Nakov. 2020. A largescale semi-supervised dataset for offensive language identification....
- Salado, M. R. 2022. Análisis ling¨uıstico del discurso de odio en redes sociales. VISUAL REVIEW. International Visual Culture Review/Revista...
- Sánchez-Junquera, J., P. Rosso, M. Montes, B. Chulvi, y others. 2021. Masking and bert-based models for stereotype identication. Procesamiento...
- Sarkar, D., M. Zampieri, T. Ranasinghe, y A. Ororbia. 2021. Fbert: A neural transformer for identifying offensive content. arXiv preprint...
- Song, B., C. Pan, S. Wang, y Z. Luo. 2021. Deepblueai at semeval-2021 task 7: Detecting and rating humor and offense with stacking diverse...
- Sood, S. O., E. F. Churchill, y J. Antin. 2012. Automatic identification of personal insults on social news sites. Journal of the American...
- Stenetorp, P., S. Pyysalo, G. Topic, T. Ohta, S. Ananiadou, y J. Tsujii. 2012. Brat: a web-based tool for nlp-assisted text annotation. En...
- Tiedemann, J. 2012. Parallel data, tools and interfaces in OPUS. En Proceedings of the Eighth International Conference on Language Resources...
- WeAreSocial y Hootsuite. 2022. Digital report espaNa 2022: Nueve de cada diez españoles usan las redes sociales y pasan casi dos horas al...
- Wiegand, M., J. Ruppenhofer, A. Schmidt, y C. Greenberg. 2018. Inducing a lexicon of abusive words–a feature-based approach. En Proceedings...
- Xu, J.-M., K.-S. Jun, X. Zhu, y A. Bellmore. 2012. Learning from bullying traces in social media. En Proceedings of the 2012 conference of...