Skip to main content
Top

2024 | OriginalPaper | Chapter

LEMT: A Label Enhanced Multi-task Learning Framework for Malevolent Dialogue Response Detection

Authors : Kaiyue Wang, Fan Yang, Yucheng Yao, Xiabing Zhou

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Malevolent Dialogue Response Detection has gained much attention from the NLP community recently. Existing methods have difficulties in effectively utilizing the conversational context and the malevolent information. In this work, we propose a novel framework, the Label Enhanced Multi-task learning (LEMT), which incorporates a structured representation of malevolence description information and exploits malevolence shift detection as an auxiliary task. Specifically, we introduce a hierarchical structure encoder based on prior probability knowledge to capture the semantic information of different malevolent types and integrate it with utterance information. In addition, the malevolence shift detection is modeled to improve the ability of the model to distinguish between different malevolent information. Experimental results show that our LEMT outperforms state-of-the-art methods and verifies the effectiveness of the modules.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11, pp. 512–515 (2017) Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11, pp. 512–515 (2017)
2.
go back to reference Gao, Q., et al.: Emotion recognition in conversations with emotion shift detection based on multi-task learning. Knowl.-Based Syst. 248, 108861 (2022)CrossRef Gao, Q., et al.: Emotion recognition in conversations with emotion shift detection based on multi-task learning. Knowl.-Based Syst. 248, 108861 (2022)CrossRef
3.
go back to reference Ghosal, D., Majumder, N., Poria, S., Chhaya, N., Gelbukh, A.: DialogueGCN: a graph convolutional neural network for emotion recognition in conversation. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 154–164 (2019) Ghosal, D., Majumder, N., Poria, S., Chhaya, N., Gelbukh, A.: DialogueGCN: a graph convolutional neural network for emotion recognition in conversation. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 154–164 (2019)
4.
go back to reference Kenton, J.D.M.W.C., Toutanova, L.K.: BERT: pre-training of deep bidirectional transformers for language understanding 1, 2 (2019) Kenton, J.D.M.W.C., Toutanova, L.K.: BERT: pre-training of deep bidirectional transformers for language understanding 1, 2 (2019)
5.
go back to reference Kumar, R., Ojha, A.K., Malmasi, S., Zampieri, M.: Benchmarking aggression identification in social media. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), pp. 1–11 (2018) Kumar, R., Ojha, A.K., Malmasi, S., Zampieri, M.: Benchmarking aggression identification in social media. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), pp. 1–11 (2018)
6.
go back to reference Li, S., Yan, H., Qiu, X.: Contrast and generation make BART a good dialogue emotion recognizer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 11002–11010 (2022) Li, S., Yan, H., Qiu, X.: Contrast and generation make BART a good dialogue emotion recognizer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 11002–11010 (2022)
7.
go back to reference Perez, E., et al.: Red teaming language models with language models. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 3419–3448 (2022) Perez, E., et al.: Red teaming language models with language models. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 3419–3448 (2022)
8.
go back to reference Roussinov, D., Robles-Flores, J.A.: Applying question answering technology to locating malevolent online content. Decis. Support Syst. 43(4), 1404–1418 (2007)CrossRef Roussinov, D., Robles-Flores, J.A.: Applying question answering technology to locating malevolent online content. Decis. Support Syst. 43(4), 1404–1418 (2007)CrossRef
9.
go back to reference Sheng, E., Chang, K.W., Natarajan, P., Peng, N.: “Nice try, kiddo”: investigating ad Hominems in dialogue responses. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 750–767 (2021) Sheng, E., Chang, K.W., Natarajan, P., Peng, N.: “Nice try, kiddo”: investigating ad Hominems in dialogue responses. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 750–767 (2021)
10.
go back to reference Sun, H., et al.: On the safety of conversational models: taxonomy, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 3906–3923 (2022) Sun, H., et al.: On the safety of conversational models: taxonomy, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 3906–3923 (2022)
11.
go back to reference Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL Student Research Workshop, pp. 88–93 (2016) Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL Student Research Workshop, pp. 88–93 (2016)
12.
go back to reference Wolf, M.J., Miller, K., Grodzinsky, F.S.: Why we should have seen that coming: comments on microsoft’s tay “experiment,’’ and wider implications. ACM SIGCAS Comput. Soc. 47(3), 54–64 (2017)CrossRef Wolf, M.J., Miller, K., Grodzinsky, F.S.: Why we should have seen that coming: comments on microsoft’s tay “experiment,’’ and wider implications. ACM SIGCAS Comput. Soc. 47(3), 54–64 (2017)CrossRef
13.
go back to reference Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: Predicting the type and target of offensive posts in social media. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 1415–1420 (2019) Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: Predicting the type and target of offensive posts in social media. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 1415–1420 (2019)
14.
go back to reference Zhang, M., Jin, L., Song, L., Mi, H., Chen, W., Yu, D.: SafeConv: explaining and correcting conversational unsafe behavior. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 22–35 (2023) Zhang, M., Jin, L., Song, L., Mi, H., Chen, W., Yu, D.: SafeConv: explaining and correcting conversational unsafe behavior. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 22–35 (2023)
15.
go back to reference Zhang, Y., Ren, P., Deng, W., Chen, Z., Rijke, M.: Improving multi-label malevolence detection in dialogues through multi-faceted label correlation enhancement. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 3543–3555 (2022) Zhang, Y., Ren, P., Deng, W., Chen, Z., Rijke, M.: Improving multi-label malevolence detection in dialogues through multi-faceted label correlation enhancement. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 3543–3555 (2022)
16.
go back to reference Zhang, Y., Ren, P., de Rijke, M.: A taxonomy, data set, and benchmark for detecting and classifying malevolent dialogue responses. J. Am. Soc. Inf. Sci. 72(12), 1477–1497 (2021) Zhang, Y., Ren, P., de Rijke, M.: A taxonomy, data set, and benchmark for detecting and classifying malevolent dialogue responses. J. Am. Soc. Inf. Sci. 72(12), 1477–1497 (2021)
17.
go back to reference Zhou, J., et al: Hierarchy-aware global model for hierarchical text classification. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1106–1117 (2020) Zhou, J., et al: Hierarchy-aware global model for hierarchical text classification. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1106–1117 (2020)
18.
go back to reference Zhou, J., et al.: Towards identifying social bias in dialog systems: framework, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 3576–3591 (2022) Zhou, J., et al.: Towards identifying social bias in dialog systems: framework, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 3576–3591 (2022)
Metadata
Title
LEMT: A Label Enhanced Multi-task Learning Framework for Malevolent Dialogue Response Detection
Authors
Kaiyue Wang
Fan Yang
Yucheng Yao
Xiabing Zhou
Copyright Year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2242-6_11

Premium Partner