Skip to main content

2024 | OriginalPaper | Buchkapitel

DEAL: Data-Efficient Active Learning for Regression Under Drift

verfasst von : Béla H. Böhnke, Edouard Fouché, Klemens Böhm

Erschienen in: Advances in Knowledge Discovery and Data Mining

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Current work on Active Learning (AL) tends to assume that the relationship between input and target variables does not change, i.e., the oracle is static. However, oracles can be stream-like and exhibit concept drift, which requires updating the learned relationship. Standard drift detection and adaption methods rely on constantly observing the target variables, which is too costly in AL. Current work on AL for regression has not addressed the challenge of frequently drifting oracles. We propose a new AL method that estimates its error due to drift by learning statistics about how often and how severe drift occurs, based on a Gaussian Process model with a time-variant kernel. Whenever the estimated error reaches a user-required threshold, our model measures the target variables and recalibrates the learned relationship as well as the drift statistics. Our drift-aware model requires up to 20 times fewer measurements than widely used methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
Note one can reduce the complexity of DEAL down to \(O(n \cdot i)\) (with learning epochs \(i \ll n\)), by using gradient-based GP models and batch training. Further, one can cap the number of measurements used for training, reducing the factor n to a constant.
 
Literatur
1.
Zurück zum Zitat Bachman, P., Sordoni, A., Trischler, A.: Learning algorithms for active learning. In: ICML (2017) Bachman, P., Sordoni, A., Trischler, A.: Learning algorithms for active learning. In: ICML (2017)
2.
Zurück zum Zitat Bifet, A., Holmes, G., Kirkby, R., Pfahringer, B.: MOA: massive online analysis. J. Mach. Learn. Res. 11, 1601–1604 (2010) Bifet, A., Holmes, G., Kirkby, R., Pfahringer, B.: MOA: massive online analysis. J. Mach. Learn. Res. 11, 1601–1604 (2010)
3.
Zurück zum Zitat Klebaner, F.C.: Introduction to Stochastic Calculus with Applications. WSPC, London (2005) Klebaner, F.C.: Introduction to Stochastic Calculus with Applications. WSPC, London (2005)
4.
Zurück zum Zitat Carne, G.D., Buticchi, G., Liserre, M., Vournas, C.: Load control using sensitivity identification by means of smart transformer. IEEE Trans. Smart Grid 9(4), 2606–2615 (2018)CrossRef Carne, G.D., Buticchi, G., Liserre, M., Vournas, C.: Load control using sensitivity identification by means of smart transformer. IEEE Trans. Smart Grid 9(4), 2606–2615 (2018)CrossRef
5.
Zurück zum Zitat Carne, G.D., Buticchi, G., Liserre, M., Vournas, C.: Real-time primary frequency regulation using load power control by smart transformers. IEEE Trans. Smart Grid 10(5), 5630–5639 (2019)CrossRef Carne, G.D., Buticchi, G., Liserre, M., Vournas, C.: Real-time primary frequency regulation using load power control by smart transformers. IEEE Trans. Smart Grid 10(5), 5630–5639 (2019)CrossRef
6.
Zurück zum Zitat Han, D., Ma, J., He, R., Dong, Z.Y.: A real application of measurement-based load modeling in large-scale power grids and its validation. IEEE Trans. Power Syst. 24, 1756–1764 (2009)CrossRef Han, D., Ma, J., He, R., Dong, Z.Y.: A real application of measurement-based load modeling in large-scale power grids and its validation. IEEE Trans. Power Syst. 24, 1756–1764 (2009)CrossRef
7.
Zurück zum Zitat Gama, J., Zliobaite, I., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. 46, 1–37 (2014)CrossRef Gama, J., Zliobaite, I., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. 46, 1–37 (2014)CrossRef
9.
Zurück zum Zitat Iwashita, A.S., Papa, J.P.: An overview on concept drift learning. IEEE Access 7, 1532–1547 (2019)CrossRef Iwashita, A.S., Papa, J.P.: An overview on concept drift learning. IEEE Access 7, 1532–1547 (2019)CrossRef
10.
Zurück zum Zitat Iwata, T.: Active learning for regression with aggregated outputs. CoRR (2022) Iwata, T.: Active learning for regression with aggregated outputs. CoRR (2022)
12.
Zurück zum Zitat Konyushkova, K., Sznitman, R., Fua, P.: Learning active learning from data. In: NIPS (2017) Konyushkova, K., Sznitman, R., Fua, P.: Learning active learning from data. In: NIPS (2017)
13.
Zurück zum Zitat Krawczyk, B., Cano, A.: Adaptive ensemble active learning for drifting data stream mining. In: IJCAI (2019) Krawczyk, B., Cano, A.: Adaptive ensemble active learning for drifting data stream mining. In: IJCAI (2019)
14.
Zurück zum Zitat Krawczyk, B., Pfahringer, B., Wozniak, M.: Combining active learning with concept drift detection for data stream mining. In: IEEE BigData (2018) Krawczyk, B., Pfahringer, B., Wozniak, M.: Combining active learning with concept drift detection for data stream mining. In: IEEE BigData (2018)
15.
Zurück zum Zitat Kurlej, B., Wozniak, M.: Learning curve in concept drift while using active learning paradigm. In: ICAIS (2011) Kurlej, B., Wozniak, M.: Learning curve in concept drift while using active learning paradigm. In: ICAIS (2011)
16.
Zurück zum Zitat Kurlej, B., Wozniak, M.: Active learning approach to concept drift problem. Log. J. IGPL 20(3), 550–559 (2012)MathSciNetCrossRef Kurlej, B., Wozniak, M.: Active learning approach to concept drift problem. Log. J. IGPL 20(3), 550–559 (2012)MathSciNetCrossRef
17.
Zurück zum Zitat Lindgren, G., Rootzen, H., Sandsten, M.: Stationary Stochastic Processes for Scientists and Engineers. T &F (2013) Lindgren, G., Rootzen, H., Sandsten, M.: Stationary Stochastic Processes for Scientists and Engineers. T &F (2013)
18.
Zurück zum Zitat Liu, S., et al.: Online active learning for drifting data streams. IEEE Trans. Neural Networks Learn. Syst. 34(1), 186–200 (2023)CrossRef Liu, S., et al.: Online active learning for drifting data streams. IEEE Trans. Neural Networks Learn. Syst. 34(1), 186–200 (2023)CrossRef
19.
Zurück zum Zitat Liu, W., Zhang, H., Ding, Z., Liu, Q., Zhu, C.: A comprehensive active learning method for multiclass imbalanced data streams with concept drift. Knowl. Based Syst. 215, 106778 (2021)CrossRef Liu, W., Zhang, H., Ding, Z., Liu, Q., Zhu, C.: A comprehensive active learning method for multiclass imbalanced data streams with concept drift. Knowl. Based Syst. 215, 106778 (2021)CrossRef
20.
Zurück zum Zitat Lu, J., Liu, A., Dong, F., Gu, F., Gama, J., Zhang, G.: Learning under concept drift: a review. IEEE Trans. Knowl. Data Eng. 31(12), 2346–2363 (2019) Lu, J., Liu, A., Dong, F., Gu, F., Gama, J., Zhang, G.: Learning under concept drift: a review. IEEE Trans. Knowl. Data Eng. 31(12), 2346–2363 (2019)
21.
Zurück zum Zitat Micchelli, C.A., Xu, Y., Zhang, H.: Universal kernels. J. Mach. Learn. Res. 7, 2651–2667 (2006)MathSciNet Micchelli, C.A., Xu, Y., Zhang, H.: Universal kernels. J. Mach. Learn. Res. 7, 2651–2667 (2006)MathSciNet
22.
Zurück zum Zitat Mohamad, S., Sayed Mouchaweh, M., Bouchachia, A.: Active learning for data streams under concept drift and concept evolution. In: ECML-PKDD (2016) Mohamad, S., Sayed Mouchaweh, M., Bouchachia, A.: Active learning for data streams under concept drift and concept evolution. In: ECML-PKDD (2016)
23.
Zurück zum Zitat Montiel, J., et al.: River: machine learning for streaming data in Python. JMLR 22, 1–8 (2021)MathSciNet Montiel, J., et al.: River: machine learning for streaming data in Python. JMLR 22, 1–8 (2021)MathSciNet
24.
Zurück zum Zitat Park, C.H., Kang, Y.: An active learning method for data streams with concept drift. In: IEEE BigData (2016) Park, C.H., Kang, Y.: An active learning method for data streams with concept drift. In: IEEE BigData (2016)
25.
Zurück zum Zitat Riquelme, C., Johari, R., Zhang, B.: Online active linear regression via thresholding. In: AAAI (2017) Riquelme, C., Johari, R., Zhang, B.: Online active linear regression via thresholding. In: AAAI (2017)
27.
Zurück zum Zitat Shan, J., Zhang, H., Liu, W., Liu, Q.: Online active learning ensemble framework for drifted data streams. IEEE Trans. Neural Networks Learn. Syst. 30(2), 486–498 (2019)CrossRef Shan, J., Zhang, H., Liu, W., Liu, Q.: Online active learning ensemble framework for drifted data streams. IEEE Trans. Neural Networks Learn. Syst. 30(2), 486–498 (2019)CrossRef
28.
Zurück zum Zitat Stefano, M., Bruno, S.: An active-learning algorithm that combines sparse polynomial chaos expansions and bootstrap for structural reliability analysis. Struct. Saf. 75, 67–74 (2018)CrossRef Stefano, M., Bruno, S.: An active-learning algorithm that combines sparse polynomial chaos expansions and bootstrap for structural reliability analysis. Struct. Saf. 75, 67–74 (2018)CrossRef
29.
Zurück zum Zitat Lookman, T., Balachandran, P.V., Xue, D., Yuan, R.: Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design. NPJ Comput. Mater. 5, 21 (2019)CrossRef Lookman, T., Balachandran, P.V., Xue, D., Yuan, R.: Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design. NPJ Comput. Mater. 5, 21 (2019)CrossRef
30.
Zurück zum Zitat Viktor, L., Barbara, H., Wersing, H.: KNN classifier with self adjusting memory for heterogeneous concept drift. In: ICDM (2016) Viktor, L., Barbara, H., Wersing, H.: KNN classifier with self adjusting memory for heterogeneous concept drift. In: ICDM (2016)
31.
Zurück zum Zitat Wu, D., Lin, C., Huang, J.: Active learning for regression using greedy sampling. Inf. Sci. 474, 90–105 (2019)MathSciNetCrossRef Wu, D., Lin, C., Huang, J.: Active learning for regression using greedy sampling. Inf. Sci. 474, 90–105 (2019)MathSciNetCrossRef
32.
Zurück zum Zitat Yoo, D., Kweon, I.S.: Learning loss for active learning. In: CVPR (2019) Yoo, D., Kweon, I.S.: Learning loss for active learning. In: CVPR (2019)
33.
Zurück zum Zitat Zhang, H., Liu, W., Shan, J., Liu, Q.: Online active learning paired ensemble for concept drift and class imbalance. IEEE Access 6, 73815–73828 (2018)CrossRef Zhang, H., Liu, W., Shan, J., Liu, Q.: Online active learning paired ensemble for concept drift and class imbalance. IEEE Access 6, 73815–73828 (2018)CrossRef
34.
Zurück zum Zitat Zliobaite, I.: Learning under concept drift: an overview. CoRR (2010) Zliobaite, I.: Learning under concept drift: an overview. CoRR (2010)
35.
Zurück zum Zitat Zliobaite, I., Bifet, A., Holmes, G., Pfahringer, B.: MOA concept drift active learning strategies for streaming data. In: WAPA. JMLR Proceedings (2011) Zliobaite, I., Bifet, A., Holmes, G., Pfahringer, B.: MOA concept drift active learning strategies for streaming data. In: WAPA. JMLR Proceedings (2011)
36.
Zurück zum Zitat Zliobaite, I., Bifet, A., Pfahringer, B., Holmes, G.: Active learning with evolving streaming data. In: ECML/PKDD (3) (2011) Zliobaite, I., Bifet, A., Pfahringer, B., Holmes, G.: Active learning with evolving streaming data. In: ECML/PKDD (3) (2011)
37.
Zurück zum Zitat Zliobaite, I., Bifet, A., Pfahringer, B., Holmes, G.: Active learning with drifting streaming data. IEEE Trans. Neural Networks Learn. Syst. 25(1), 27–39 (2014)CrossRef Zliobaite, I., Bifet, A., Pfahringer, B., Holmes, G.: Active learning with drifting streaming data. IEEE Trans. Neural Networks Learn. Syst. 25(1), 27–39 (2014)CrossRef
Metadaten
Titel
DEAL: Data-Efficient Active Learning for Regression Under Drift
verfasst von
Béla H. Böhnke
Edouard Fouché
Klemens Böhm
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2266-2_15

Premium Partner