Skip to main content

2024 | OriginalPaper | Buchkapitel

ScaleViz: Scaling Visualization Recommendation Models on Large Data

verfasst von : Ghazi Shazan Ahmad, Shubham Agarwal, Subrata Mitra, Ryan Rossi, Manav Doshi, Vibhor Porwal, Syam Manoj Kumar Paila

Erschienen in: Advances in Knowledge Discovery and Data Mining

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Automated visualization recommendation (Vis-Rec) models help users to derive crucial insights from new datasets. Typically, such automated Vis-Rec models first calculate a large number of statistics from the datasets and then use machine-learning models to score or classify multiple visualizations choices to recommend the most effective ones, as per the statistics. However, state-of-the-art models rely on a very large number of expensive statistics and therefore using such models on large datasets becomes infeasible due to prohibitively large computational time, limiting the effectiveness of such techniques to most large real-world datasets. In this paper, we propose a novel reinforcement-learning (RL) based framework that takes a given Vis-Rec model and a time budget from the user and identifies the best set of input statistics, specifically for a target dataset, that would be most effective while generating accurate enough visual insights. We show the effectiveness of our technique as it enables two state of the art Vis-Rec models to achieve up to 10X speedup in time-to-visualize on four large real-world datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Deng, H., Runger, G.: Feature selection via regularized trees. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2012) Deng, H., Runger, G.: Feature selection via regularized trees. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2012)
2.
Zurück zum Zitat Ding, R., Han, S., Xu, Y., Zhang, H., Zhang, D.: QuickInsights: quick and automatic discovery of insights from multi-dimensional data. In: ICMD (2019) Ding, R., Han, S., Xu, Y., Zhang, H., Zhang, D.: QuickInsights: quick and automatic discovery of insights from multi-dimensional data. In: ICMD (2019)
3.
Zurück zum Zitat Farahat, A.K., Ghodsi, A., Kamel, M.S.: An efficient greedy method for unsupervised feature selection. In: ICDM, pp. 161–170. IEEE (2011) Farahat, A.K., Ghodsi, A., Kamel, M.S.: An efficient greedy method for unsupervised feature selection. In: ICDM, pp. 161–170. IEEE (2011)
6.
Zurück zum Zitat Hu, K., Bakker, M.A., Li, S., Kraska, T., Hidalgo, C.: VizML: a machine learning approach to visualization recommendation. In: CHI, pp. 1–12 (2019) Hu, K., Bakker, M.A., Li, S., Kraska, T., Hidalgo, C.: VizML: a machine learning approach to visualization recommendation. In: CHI, pp. 1–12 (2019)
7.
Zurück zum Zitat Hulsebos, M., Demiralp, C., Groth, P.: Gittables: a large-scale corpus of relational tables. Proc. ACM Manag. Data 1, 1–17 (2023) Hulsebos, M., Demiralp, C., Groth, P.: Gittables: a large-scale corpus of relational tables. Proc. ACM Manag. Data 1, 1–17 (2023)
8.
Zurück zum Zitat Idreos, S., Papaemmanouil, O., Chaudhuri, S.: Overview of data exploration techniques. In: SIGMOD (2015) Idreos, S., Papaemmanouil, O., Chaudhuri, S.: Overview of data exploration techniques. In: SIGMOD (2015)
9.
Zurück zum Zitat Kachuee, M., et al.: Opportunistic learning: budgeted cost-sensitive learning from data streams. arXiv preprint arXiv:1901.00243 (2019) Kachuee, M., et al.: Opportunistic learning: budgeted cost-sensitive learning from data streams. arXiv preprint arXiv:​1901.​00243 (2019)
10.
Zurück zum Zitat Li, J., et al.: Feature selection: a data perspective. ACM Comput. Surv. (CSUR) 50, 1–45 (2017) Li, J., et al.: Feature selection: a data perspective. ACM Comput. Surv. (CSUR) 50, 1–45 (2017)
11.
Zurück zum Zitat Luo, Y., Qin, X., Tang, N., Li, G.: DeepEye: towards automatic data visualization. In: ICDE, pp. 101–112. IEEE (2018) Luo, Y., Qin, X., Tang, N., Li, G.: DeepEye: towards automatic data visualization. In: ICDE, pp. 101–112. IEEE (2018)
12.
Zurück zum Zitat Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, E.A.: Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015)CrossRef Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, E.A.: Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015)CrossRef
13.
Zurück zum Zitat Qian, X., et al.: Learning to recommend visualizations from data. In: KDD 2021. ACM (2021) Qian, X., et al.: Learning to recommend visualizations from data. In: KDD 2021. ACM (2021)
15.
Zurück zum Zitat Vartak, M., Huang, S., Siddiqui, T., Madden, S., Parameswaran, A.: Towards visualization recommendation systems. ACM SIGMOD Rec. 45, 34–39 (2017)CrossRef Vartak, M., Huang, S., Siddiqui, T., Madden, S., Parameswaran, A.: Towards visualization recommendation systems. ACM SIGMOD Rec. 45, 34–39 (2017)CrossRef
16.
Zurück zum Zitat Wang, C., Chen, M.H., Schifano, E., Wu, J., Yan, J.: Statistical methods and computing for big data. Stat. Interf. 9(4), 399 (2016)MathSciNetCrossRef Wang, C., Chen, M.H., Schifano, E., Wu, J., Yan, J.: Statistical methods and computing for big data. Stat. Interf. 9(4), 399 (2016)MathSciNetCrossRef
17.
Zurück zum Zitat Xu, Z., Weinberger, K., Chapelle, O.: The greedy miser: learning under test-time budgets. arXiv preprint arXiv:1206.6451 (2012) Xu, Z., Weinberger, K., Chapelle, O.: The greedy miser: learning under test-time budgets. arXiv preprint arXiv:​1206.​6451 (2012)
Metadaten
Titel
ScaleViz: Scaling Visualization Recommendation Models on Large Data
verfasst von
Ghazi Shazan Ahmad
Shubham Agarwal
Subrata Mitra
Ryan Rossi
Manav Doshi
Vibhor Porwal
Syam Manoj Kumar Paila
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2262-4_8

Premium Partner