The Construction of a Corpus for Detecting Irony and Sarcasm in Portuguese / A construção de um corpus para detectar a ironia e o sarcasmo em Português

Gabriel Schubert Marten, Larissa Astrogildo de Freitas

Abstract


Portuguese is a low-resource language, where a few works developed corpora for specific Natural Language Processing tasks, such as sarcasm and irony detection, sentiment analysis and others. In this work, we developed a corpus in the Portuguese language to sarcasm and irony detection task. In the future, we intend to develop a tool to recognize sarcasm and irony and we intend to use the corpus presented in this article.


Keywords


Corpus, Portuguese Language, Sarcasm and Irony Detection.

References


Amir, S., Wallace, B. C., Lyu, H., e Silva, P. C. M. J. (2016). Modelling context with user embeddings for sarcasm detection in social media. arXiv preprint arXiv:1607.00976.

Angrimani, D. (1994). Espreme que sai sangue: um estudo do sensacionalismo na imprensa, volume 47. Summus Editorial.

Cho, J. e Garcia-Molina, H. (1999). The evolution of the web and implications for an incremental crawler. Technical report, Stanford.

Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., e Kuksa, P. (2011). Natural language processing (almost) from scratch. Journal of machine learning research, 12(1): 2493–2537.

Freitas, L. A., Vanin, A., Hogetop, D., Bochernitsan, M., e Vieira, R. (2014). Pathways for irony detection in tweets. In 29th Symposium on Applied Computing, pages 628–633.

Ghanem, B., Karoui, J., Benamara, F., Moriceau, V., e Rosso, P. (2019). Idat at fire2019: Overview of the track on irony detection in Arabic tweets. In 11th Forum for Information Retrieval Evaluation, pages 10–13.

Lee, C. J. e Katz, A. N. (1998). The differential role of ridicule in sarcasm and irony. Metaphor and symbol, 13(1):1–15.

Misra, R. e Arora, P. (2019). Sarcasm detection using hybrid neural network. arXiv preprint arXiv:1908.07414.

Ortega-Bueno, R., Rangel, F., Hernández Farias, D., Rosso, P., Montes-y Gómez, M., e Medina Pagola, J. E. (2019). Overview of the task on irony detection in Spanish variants. In Iberian Languages Evaluation Forum (IberLEF 2019),co-located with 34th Conference of the Spanish Society for Natural Language Processing (SEPLN 2019). CEUR-WS. org.

Pang, B. e Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2:1–135.

Sardinha, T. B. (2000). Linguística de corpus: histórico e problemática. Delta: documentação de estudos em linguística teórica e aplicada, 16(2):323–367.

Silva, F. R. A. e Bonfante, A. G. (2018). Detecção de ironia e sarcasmo em língua portuguesa: uma abordagem utilizando deep learning. Monografia (Bacharel em Ciência da Computação), UFMG (Universidade Federal do Mato Grosso), Brasil.

Van Hee, C., Lefever, E., e Hoste, V. (2018). Semeval-2018 task 3: Irony detection in English tweets. In 12th International Workshop on Semantic Evaluation, pages 39–50.




DOI: https://doi.org/10.34117/bjdv.v7i5.29714

Refbacks

  • There are currently no refbacks.