nach oben

Erschienen in:

2024 | OriginalPaper | Buchkapitel

LEMT: A Label Enhanced Multi-task Learning Framework for Malevolent Dialogue Response Detection

verfasst von : Kaiyue Wang, Fan Yang, Yucheng Yao, Xiabing Zhou

Erschienen in: Advances in Knowledge Discovery and Data Mining

Verlag: Springer Nature Singapore

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Malevolent Dialogue Response Detection has gained much attention from the NLP community recently. Existing methods have difficulties in effectively utilizing the conversational context and the malevolent information. In this work, we propose a novel framework, the Label Enhanced Multi-task learning (LEMT), which incorporates a structured representation of malevolence description information and exploits malevolence shift detection as an auxiliary task. Specifically, we introduce a hierarchical structure encoder based on prior probability knowledge to capture the semantic information of different malevolent types and integrate it with utterance information. In addition, the malevolence shift detection is modeled to improve the ability of the model to distinguish between different malevolent information. Experimental results show that our LEMT outperforms state-of-the-art methods and verifies the effectiveness of the modules.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel SASBO: Sparse Attack via Stochastic Binary Optimization

Nächstes Kapitel Two-Stage Knowledge Graph Completion Based on Semantic Features and High-Order Structural Features

https://huggingface.co/bert-base-uncased.

https://chat.openai.com/.

Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11, pp. 512–515 (2017)

Gao, Q., et al.: Emotion recognition in conversations with emotion shift detection based on multi-task learning. Knowl.-Based Syst. 248, 108861 (2022)CrossRef

Ghosal, D., Majumder, N., Poria, S., Chhaya, N., Gelbukh, A.: DialogueGCN: a graph convolutional neural network for emotion recognition in conversation. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 154–164 (2019)

Kenton, J.D.M.W.C., Toutanova, L.K.: BERT: pre-training of deep bidirectional transformers for language understanding 1, 2 (2019)

Kumar, R., Ojha, A.K., Malmasi, S., Zampieri, M.: Benchmarking aggression identification in social media. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), pp. 1–11 (2018)

Li, S., Yan, H., Qiu, X.: Contrast and generation make BART a good dialogue emotion recognizer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 11002–11010 (2022)

Perez, E., et al.: Red teaming language models with language models. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 3419–3448 (2022)

Roussinov, D., Robles-Flores, J.A.: Applying question answering technology to locating malevolent online content. Decis. Support Syst. 43(4), 1404–1418 (2007)CrossRef

Sheng, E., Chang, K.W., Natarajan, P., Peng, N.: “Nice try, kiddo”: investigating ad Hominems in dialogue responses. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 750–767 (2021)

10.

Sun, H., et al.: On the safety of conversational models: taxonomy, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 3906–3923 (2022)

11.

Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL Student Research Workshop, pp. 88–93 (2016)

12.

Wolf, M.J., Miller, K., Grodzinsky, F.S.: Why we should have seen that coming: comments on microsoft’s tay “experiment,’’ and wider implications. ACM SIGCAS Comput. Soc. 47(3), 54–64 (2017)CrossRef

13.

Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: Predicting the type and target of offensive posts in social media. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 1415–1420 (2019)

14.

Zhang, M., Jin, L., Song, L., Mi, H., Chen, W., Yu, D.: SafeConv: explaining and correcting conversational unsafe behavior. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 22–35 (2023)

15.

Zhang, Y., Ren, P., Deng, W., Chen, Z., Rijke, M.: Improving multi-label malevolence detection in dialogues through multi-faceted label correlation enhancement. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 3543–3555 (2022)

16.

Zhang, Y., Ren, P., de Rijke, M.: A taxonomy, data set, and benchmark for detecting and classifying malevolent dialogue responses. J. Am. Soc. Inf. Sci. 72(12), 1477–1497 (2021)

17.

Zhou, J., et al: Hierarchy-aware global model for hierarchical text classification. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1106–1117 (2020)

18.

Zhou, J., et al.: Towards identifying social bias in dialog systems: framework, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 3576–3591 (2022)

Titel: LEMT: A Label Enhanced Multi-task Learning Framework for Malevolent Dialogue Response Detection
verfasst von: Kaiyue Wang
Fan Yang
Yucheng Yao
Xiabing Zhou
Verlag: Springer Nature Singapore
Buch: Advances in Knowledge Discovery and Data Mining
Print ISBN: 978-981-9722-41-9

Electronic ISBN: 978-981-9722-42-6

Copyright-Jahr: 2024
DOI: https://doi.org/10.1007/978-981-97-2242-6_11

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner