Skip to main content

2024 | OriginalPaper | Buchkapitel

SATJiP: Spatial and Augmented Temporal Jigsaw Puzzles for Video Anomaly Detection

verfasst von : Liheng Shen, Tetsu Matsukawa, Einoshin Suzuki

Erschienen in: Advances in Knowledge Discovery and Data Mining

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Video Anomaly Detection (VAD) is a significant task, which refers to taking a video clip as input and outputting class labels, e.g., normal or abnormal, at the frame level. Wang et al. proposed a method called DSTJiP, which trains the model by solving Decoupled Spatial and Temporal Jigsaw Puzzles and achieves impressive VAD performance. However, the model sometimes fails to detect abnormal human actions where abnormal motions are accompanied by normal motions. The reason is that the model learns representations of little- and non-motion parts of training examples, resulting in being insensitive to abnormal motions. To circumvent this problem, we propose to solve Spatial and Augmented Temporal Jigsaw Puzzles (SATJiP) as an extension of DSTJiP. SATJiP encourages the model to focus on motions by a novel pretext task, enabling it to detect abnormal motions accompanied by normal motions. Experiments conducted on three standard VAD benchmarks demonstrate that SATJiP outperforms the state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Astrid, M., Zaheer, M.Z., Lee, J.Y., Lee, S.I.: Learning not to reconstruct anomalies. In: Proceedings of BMVC (2021) Astrid, M., Zaheer, M.Z., Lee, J.Y., Lee, S.I.: Learning not to reconstruct anomalies. In: Proceedings of BMVC (2021)
2.
Zurück zum Zitat Astrid, M., Zaheer, M.Z., Lee, S.I.: Synthetic temporal anomaly guided end-to-end video anomaly detection. In: Proceedings of ICCVW (2021) Astrid, M., Zaheer, M.Z., Lee, S.I.: Synthetic temporal anomaly guided end-to-end video anomaly detection. In: Proceedings of ICCVW (2021)
3.
Zurück zum Zitat Barbalau, A., et al.: SSMTL++: revisiting self-supervised multi-task learning for video anomaly detection. Comput. Vis. Image Underst. 229, 103656 (2023)CrossRef Barbalau, A., et al.: SSMTL++: revisiting self-supervised multi-task learning for video anomaly detection. Comput. Vis. Image Underst. 229, 103656 (2023)CrossRef
4.
Zurück zum Zitat Cai, R., Zhang, H., Liu, W., Gao, S., Hao, Z.: Appearance-motion memory consistency network for video anomaly detection. In: Proceedings of AAAI (2021) Cai, R., Zhang, H., Liu, W., Gao, S., Hao, Z.: Appearance-motion memory consistency network for video anomaly detection. In: Proceedings of AAAI (2021)
5.
Zurück zum Zitat Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Computi. Surv. (CSUR) 41(3), 1–58 (2009)CrossRef Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Computi. Surv. (CSUR) 41(3), 1–58 (2009)CrossRef
7.
Zurück zum Zitat Chen, C., et al.: Comprehensive regularization in a bi-directional predictive network for video anomaly detection. In: Proceedings of AAAI, vol. 36 (2022) Chen, C., et al.: Comprehensive regularization in a bi-directional predictive network for video anomaly detection. In: Proceedings of AAAI, vol. 36 (2022)
8.
Zurück zum Zitat Deng, H., Zhang, Z., Zou, S., Li, X.: Bi-directional frame interpolation for unsupervised video anomaly detection. In: Proceedings of WACV (2023) Deng, H., Zhang, Z., Zou, S., Li, X.: Bi-directional frame interpolation for unsupervised video anomaly detection. In: Proceedings of WACV (2023)
9.
Zurück zum Zitat Feichtenhofer, C., Li, Y., He, K., et al.: Masked autoencoders as spatiotemporal learners. In: Proceedings of NeurIPS, vol. 35 (2022) Feichtenhofer, C., Li, Y., He, K., et al.: Masked autoencoders as spatiotemporal learners. In: Proceedings of NeurIPS, vol. 35 (2022)
10.
Zurück zum Zitat Feng, X., Song, D., Chen, Y., Chen, Z., Ni, J., Chen, H.: Convolutional transformer based dual discriminator generative adversarial networks for video anomaly detection. In: Proceedings of MM (2021) Feng, X., Song, D., Chen, Y., Chen, Z., Ni, J., Chen, H.: Convolutional transformer based dual discriminator generative adversarial networks for video anomaly detection. In: Proceedings of MM (2021)
11.
Zurück zum Zitat Georgescu, M., Barbalau, A., Ionescu, R.T., Khan, F.S., Popescu, M., Shah, M.: Anomaly detection in video via self-supervised and multi-task learning. In: Proceedings of CVPR (2021) Georgescu, M., Barbalau, A., Ionescu, R.T., Khan, F.S., Popescu, M., Shah, M.: Anomaly detection in video via self-supervised and multi-task learning. In: Proceedings of CVPR (2021)
12.
Zurück zum Zitat Gong, D., et al.: Memorizing normality to detect anomaly: memory-augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of ICCV (2019) Gong, D., et al.: Memorizing normality to detect anomaly: memory-augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of ICCV (2019)
13.
Zurück zum Zitat Huang, X., Zhao, C., Wu, Z.: A video anomaly detection framework based on appearance-motion semantics representation consistency. In: Proceedings of ICASSP (2023) Huang, X., Zhao, C., Wu, Z.: A video anomaly detection framework based on appearance-motion semantics representation consistency. In: Proceedings of ICASSP (2023)
14.
Zurück zum Zitat Ilg, E., Mayer, N., Saikia, T., Keuper, M., Dosovitskiy, A., Brox, T.: Flownet 2.0: evolution of optical flow estimation with deep networks. In: Proceedings of CVPR (2017) Ilg, E., Mayer, N., Saikia, T., Keuper, M., Dosovitskiy, A., Brox, T.: Flownet 2.0: evolution of optical flow estimation with deep networks. In: Proceedings of CVPR (2017)
15.
Zurück zum Zitat Ionescu, R.T., Khan, F.S., Georgescu, M.I., Shao, L.: Object-centric auto-encoders and dummy anomalies for abnormal event detection in video. In: Proceedings of CVPR (2019) Ionescu, R.T., Khan, F.S., Georgescu, M.I., Shao, L.: Object-centric auto-encoders and dummy anomalies for abnormal event detection in video. In: Proceedings of CVPR (2019)
16.
Zurück zum Zitat Lai, Y., Han, Y., Wang, Y.: Anomaly detection with prototype-guided discriminative latent embeddings. In: Proceedings of ICDM (2021) Lai, Y., Han, Y., Wang, Y.: Anomaly detection with prototype-guided discriminative latent embeddings. In: Proceedings of ICDM (2021)
17.
Zurück zum Zitat Lee, S., Kim, H.G., Ro, Y.M.: BMAN: bidirectional multi-scale aggregation networks for abnormal event detection. IEEE Trans. Image Process. 29, 2395–2408 (2020)CrossRef Lee, S., Kim, H.G., Ro, Y.M.: BMAN: bidirectional multi-scale aggregation networks for abnormal event detection. IEEE Trans. Image Process. 29, 2395–2408 (2020)CrossRef
18.
Zurück zum Zitat Liu, W., Luo, W., Lian, D., Gao, S.: Future frame prediction for anomaly detection–a new baseline. In: Proceedings of CVPR (2018) Liu, W., Luo, W., Lian, D., Gao, S.: Future frame prediction for anomaly detection–a new baseline. In: Proceedings of CVPR (2018)
19.
Zurück zum Zitat Liu, Y., Liu, J., Zhao, M., Yang, D., Zhu, X., Song, L.: Learning appearance-motion normality for video anomaly detection. In: Proceedings of ICME (2022) Liu, Y., Liu, J., Zhao, M., Yang, D., Zhu, X., Song, L.: Learning appearance-motion normality for video anomaly detection. In: Proceedings of ICME (2022)
20.
Zurück zum Zitat Liu, Z., Nie, Y., Long, C., Zhang, Q., Li, G.: A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In: Proceedings of ICCV (2021) Liu, Z., Nie, Y., Long, C., Zhang, Q., Li, G.: A hybrid video anomaly detection framework via memory-augmented flow reconstruction and flow-guided frame prediction. In: Proceedings of ICCV (2021)
21.
Zurück zum Zitat Lu, C., Shi, J., Jia, J.: Abnormal event detection at 150 FPS in MATLAB. In: Proceedings of ICCV (2013) Lu, C., Shi, J., Jia, J.: Abnormal event detection at 150 FPS in MATLAB. In: Proceedings of ICCV (2013)
22.
Zurück zum Zitat Luo, W., Liu, W., Gao, S.: Remembering history with convolutional LSTM for anomaly detection. In: Proceedings of ICME (2017) Luo, W., Liu, W., Gao, S.: Remembering history with convolutional LSTM for anomaly detection. In: Proceedings of ICME (2017)
23.
Zurück zum Zitat Luo, W., Liu, W., Gao, S.: A revisit of sparse coding based anomaly detection in stacked RNN framework. In: Proceedings of ICCV (2017) Luo, W., Liu, W., Gao, S.: A revisit of sparse coding based anomaly detection in stacked RNN framework. In: Proceedings of ICCV (2017)
24.
Zurück zum Zitat Mahadevan, V., Li, W., Bhalodia, V., Vasconcelos, N.: Anomaly detection in crowded scenes. In: Proceedings of CVPR (2010) Mahadevan, V., Li, W., Bhalodia, V., Vasconcelos, N.: Anomaly detection in crowded scenes. In: Proceedings of CVPR (2010)
25.
Zurück zum Zitat Park, H., Noh, J., Ham, B.: Learning memory-guided normality for anomaly detection. In: Proceedings of CVPR (2020) Park, H., Noh, J., Ham, B.: Learning memory-guided normality for anomaly detection. In: Proceedings of CVPR (2020)
26.
Zurück zum Zitat Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., Efros, A.A.: Context encoders: feature learning by inpainting. In: Proceedings of CVPR (2016) Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., Efros, A.A.: Context encoders: feature learning by inpainting. In: Proceedings of CVPR (2016)
28.
Zurück zum Zitat Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: Visual explanations from deep networks via gradient-based localization. In: Proceedings of CVPR (2017) Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: Visual explanations from deep networks via gradient-based localization. In: Proceedings of CVPR (2017)
29.
Zurück zum Zitat Shen, L., Matsukawa, T., Suzuki, E.: Detecting video anomalous events with an enhanced abnormality score. In: Proceedings of PRICAI, vol. 13629 (2022) Shen, L., Matsukawa, T., Suzuki, E.: Detecting video anomalous events with an enhanced abnormality score. In: Proceedings of PRICAI, vol. 13629 (2022)
30.
Zurück zum Zitat Sun, C., Shi, C., Jia, Y., Wu, Y.: Learning event-relevant factors for video anomaly detection. In: Proceedings of AAAI, vol. 37 (2023) Sun, C., Shi, C., Jia, Y., Wu, Y.: Learning event-relevant factors for video anomaly detection. In: Proceedings of AAAI, vol. 37 (2023)
31.
Zurück zum Zitat Tang, Y., Zhao, L., Zhang, S., Gong, C., Li, G., Yang, J.: Integrating prediction and reconstruction for anomaly detection. Pattern Recogn. Lett. 129, 123–130 (2020)CrossRef Tang, Y., Zhao, L., Zhang, S., Gong, C., Li, G., Yang, J.: Integrating prediction and reconstruction for anomaly detection. Pattern Recogn. Lett. 129, 123–130 (2020)CrossRef
32.
Zurück zum Zitat Tishby, N., Zaslavsky, N.: Deep learning and the information bottleneck principle. In: Proceedings of ITW, pp. 1–5 (2015) Tishby, N., Zaslavsky, N.: Deep learning and the information bottleneck principle. In: Proceedings of ITW, pp. 1–5 (2015)
33.
Zurück zum Zitat Tong, Z., Song, Y., Wang, J., Wang, L.: VideoMAE: masked autoencoders are data-efficient learners for self-supervised video pre-training. In: Procedings of NeurIPS, vol. 35 (2022) Tong, Z., Song, Y., Wang, J., Wang, L.: VideoMAE: masked autoencoders are data-efficient learners for self-supervised video pre-training. In: Procedings of NeurIPS, vol. 35 (2022)
34.
Zurück zum Zitat Vu, H., Nguyen, T.D., Travers, A., Venkatesh, S., Phung, D.: Energy-based localized anomaly detection in video surveillance. In: Proceedings of PAKDD (2017) Vu, H., Nguyen, T.D., Travers, A., Venkatesh, S., Phung, D.: Energy-based localized anomaly detection in video surveillance. In: Proceedings of PAKDD (2017)
36.
Zurück zum Zitat Wang, X., Wang, X., et al.: Robust unsupervised video anomaly detection by multipath frame prediction. IEEE Trans. Neural Netw, Learn. Syst. 33(6), 2301–2312 (2022)MathSciNetCrossRef Wang, X., Wang, X., et al.: Robust unsupervised video anomaly detection by multipath frame prediction. IEEE Trans. Neural Netw, Learn. Syst. 33(6), 2301–2312 (2022)MathSciNetCrossRef
37.
Zurück zum Zitat Wang, Y., Qin, C., Bai, Y., Xu, Y., Ma, X., Fu, Y.: Making reconstruction-based method great again for video anomaly detection. In: Proceedings of ICDM (2022) Wang, Y., Qin, C., Bai, Y., Xu, Y., Ma, X., Fu, Y.: Making reconstruction-based method great again for video anomaly detection. In: Proceedings of ICDM (2022)
38.
Zurück zum Zitat Wang, Z., Zou, Y., Zhang, Z.: Cluster attention contrast for video anomaly detection. In: Proceedings of MM (2020) Wang, Z., Zou, Y., Zhang, Z.: Cluster attention contrast for video anomaly detection. In: Proceedings of MM (2020)
39.
Zurück zum Zitat Yang, Z., Liu, J., Wu, Z., Wu, P., Liu, X.: Video event restoration based on keyframes for video anomaly detection. In: Proceedings of CVPR (2023) Yang, Z., Liu, J., Wu, Z., Wu, P., Liu, X.: Video event restoration based on keyframes for video anomaly detection. In: Proceedings of CVPR (2023)
40.
Zurück zum Zitat Ye, M., Peng, X., Gan, W., Wu, W., Qiao, Y.: AnoPCN: video anomaly detection via deep predictive coding network. In: Proceedings of MM (2019) Ye, M., Peng, X., Gan, W., Wu, W., Qiao, Y.: AnoPCN: video anomaly detection via deep predictive coding network. In: Proceedings of MM (2019)
41.
Zurück zum Zitat Yu, G., et al.: Cloze test helps: effective video anomaly detection via learning to complete video events. In: Proceedings of MM (2020) Yu, G., et al.: Cloze test helps: effective video anomaly detection via learning to complete video events. In: Proceedings of MM (2020)
42.
Zurück zum Zitat Zhou, W., Li, Y., Zhao, C.: Object-guided and motion-refined attention network for video anomaly detection. In: Proceedings of ICME (2022) Zhou, W., Li, Y., Zhao, C.: Object-guided and motion-refined attention network for video anomaly detection. In: Proceedings of ICME (2022)
Metadaten
Titel
SATJiP: Spatial and Augmented Temporal Jigsaw Puzzles for Video Anomaly Detection
verfasst von
Liheng Shen
Tetsu Matsukawa
Einoshin Suzuki
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2242-6_3

Premium Partner