Skip to main content

2024 | OriginalPaper | Buchkapitel

LEMT: A Label Enhanced Multi-task Learning Framework for Malevolent Dialogue Response Detection

verfasst von : Kaiyue Wang, Fan Yang, Yucheng Yao, Xiabing Zhou

Erschienen in: Advances in Knowledge Discovery and Data Mining

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Malevolent Dialogue Response Detection has gained much attention from the NLP community recently. Existing methods have difficulties in effectively utilizing the conversational context and the malevolent information. In this work, we propose a novel framework, the Label Enhanced Multi-task learning (LEMT), which incorporates a structured representation of malevolence description information and exploits malevolence shift detection as an auxiliary task. Specifically, we introduce a hierarchical structure encoder based on prior probability knowledge to capture the semantic information of different malevolent types and integrate it with utterance information. In addition, the malevolence shift detection is modeled to improve the ability of the model to distinguish between different malevolent information. Experimental results show that our LEMT outperforms state-of-the-art methods and verifies the effectiveness of the modules.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11, pp. 512–515 (2017) Davidson, T., Warmsley, D., Macy, M., Weber, I.: Automated hate speech detection and the problem of offensive language. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 11, pp. 512–515 (2017)
2.
Zurück zum Zitat Gao, Q., et al.: Emotion recognition in conversations with emotion shift detection based on multi-task learning. Knowl.-Based Syst. 248, 108861 (2022)CrossRef Gao, Q., et al.: Emotion recognition in conversations with emotion shift detection based on multi-task learning. Knowl.-Based Syst. 248, 108861 (2022)CrossRef
3.
Zurück zum Zitat Ghosal, D., Majumder, N., Poria, S., Chhaya, N., Gelbukh, A.: DialogueGCN: a graph convolutional neural network for emotion recognition in conversation. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 154–164 (2019) Ghosal, D., Majumder, N., Poria, S., Chhaya, N., Gelbukh, A.: DialogueGCN: a graph convolutional neural network for emotion recognition in conversation. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 154–164 (2019)
4.
Zurück zum Zitat Kenton, J.D.M.W.C., Toutanova, L.K.: BERT: pre-training of deep bidirectional transformers for language understanding 1, 2 (2019) Kenton, J.D.M.W.C., Toutanova, L.K.: BERT: pre-training of deep bidirectional transformers for language understanding 1, 2 (2019)
5.
Zurück zum Zitat Kumar, R., Ojha, A.K., Malmasi, S., Zampieri, M.: Benchmarking aggression identification in social media. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), pp. 1–11 (2018) Kumar, R., Ojha, A.K., Malmasi, S., Zampieri, M.: Benchmarking aggression identification in social media. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), pp. 1–11 (2018)
6.
Zurück zum Zitat Li, S., Yan, H., Qiu, X.: Contrast and generation make BART a good dialogue emotion recognizer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 11002–11010 (2022) Li, S., Yan, H., Qiu, X.: Contrast and generation make BART a good dialogue emotion recognizer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 11002–11010 (2022)
7.
Zurück zum Zitat Perez, E., et al.: Red teaming language models with language models. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 3419–3448 (2022) Perez, E., et al.: Red teaming language models with language models. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 3419–3448 (2022)
8.
Zurück zum Zitat Roussinov, D., Robles-Flores, J.A.: Applying question answering technology to locating malevolent online content. Decis. Support Syst. 43(4), 1404–1418 (2007)CrossRef Roussinov, D., Robles-Flores, J.A.: Applying question answering technology to locating malevolent online content. Decis. Support Syst. 43(4), 1404–1418 (2007)CrossRef
9.
Zurück zum Zitat Sheng, E., Chang, K.W., Natarajan, P., Peng, N.: “Nice try, kiddo”: investigating ad Hominems in dialogue responses. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 750–767 (2021) Sheng, E., Chang, K.W., Natarajan, P., Peng, N.: “Nice try, kiddo”: investigating ad Hominems in dialogue responses. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 750–767 (2021)
10.
Zurück zum Zitat Sun, H., et al.: On the safety of conversational models: taxonomy, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 3906–3923 (2022) Sun, H., et al.: On the safety of conversational models: taxonomy, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 3906–3923 (2022)
11.
Zurück zum Zitat Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL Student Research Workshop, pp. 88–93 (2016) Waseem, Z., Hovy, D.: Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL Student Research Workshop, pp. 88–93 (2016)
12.
Zurück zum Zitat Wolf, M.J., Miller, K., Grodzinsky, F.S.: Why we should have seen that coming: comments on microsoft’s tay “experiment,’’ and wider implications. ACM SIGCAS Comput. Soc. 47(3), 54–64 (2017)CrossRef Wolf, M.J., Miller, K., Grodzinsky, F.S.: Why we should have seen that coming: comments on microsoft’s tay “experiment,’’ and wider implications. ACM SIGCAS Comput. Soc. 47(3), 54–64 (2017)CrossRef
13.
Zurück zum Zitat Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: Predicting the type and target of offensive posts in social media. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 1415–1420 (2019) Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: Predicting the type and target of offensive posts in social media. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 1415–1420 (2019)
14.
Zurück zum Zitat Zhang, M., Jin, L., Song, L., Mi, H., Chen, W., Yu, D.: SafeConv: explaining and correcting conversational unsafe behavior. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 22–35 (2023) Zhang, M., Jin, L., Song, L., Mi, H., Chen, W., Yu, D.: SafeConv: explaining and correcting conversational unsafe behavior. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 22–35 (2023)
15.
Zurück zum Zitat Zhang, Y., Ren, P., Deng, W., Chen, Z., Rijke, M.: Improving multi-label malevolence detection in dialogues through multi-faceted label correlation enhancement. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 3543–3555 (2022) Zhang, Y., Ren, P., Deng, W., Chen, Z., Rijke, M.: Improving multi-label malevolence detection in dialogues through multi-faceted label correlation enhancement. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 3543–3555 (2022)
16.
Zurück zum Zitat Zhang, Y., Ren, P., de Rijke, M.: A taxonomy, data set, and benchmark for detecting and classifying malevolent dialogue responses. J. Am. Soc. Inf. Sci. 72(12), 1477–1497 (2021) Zhang, Y., Ren, P., de Rijke, M.: A taxonomy, data set, and benchmark for detecting and classifying malevolent dialogue responses. J. Am. Soc. Inf. Sci. 72(12), 1477–1497 (2021)
17.
Zurück zum Zitat Zhou, J., et al: Hierarchy-aware global model for hierarchical text classification. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1106–1117 (2020) Zhou, J., et al: Hierarchy-aware global model for hierarchical text classification. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1106–1117 (2020)
18.
Zurück zum Zitat Zhou, J., et al.: Towards identifying social bias in dialog systems: framework, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 3576–3591 (2022) Zhou, J., et al.: Towards identifying social bias in dialog systems: framework, dataset, and benchmark. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 3576–3591 (2022)
Metadaten
Titel
LEMT: A Label Enhanced Multi-task Learning Framework for Malevolent Dialogue Response Detection
verfasst von
Kaiyue Wang
Fan Yang
Yucheng Yao
Xiabing Zhou
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2242-6_11

Premium Partner