Skip to main content
Top

2024 | OriginalPaper | Chapter

ScaleViz: Scaling Visualization Recommendation Models on Large Data

Authors : Ghazi Shazan Ahmad, Shubham Agarwal, Subrata Mitra, Ryan Rossi, Manav Doshi, Vibhor Porwal, Syam Manoj Kumar Paila

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Automated visualization recommendation (Vis-Rec) models help users to derive crucial insights from new datasets. Typically, such automated Vis-Rec models first calculate a large number of statistics from the datasets and then use machine-learning models to score or classify multiple visualizations choices to recommend the most effective ones, as per the statistics. However, state-of-the-art models rely on a very large number of expensive statistics and therefore using such models on large datasets becomes infeasible due to prohibitively large computational time, limiting the effectiveness of such techniques to most large real-world datasets. In this paper, we propose a novel reinforcement-learning (RL) based framework that takes a given Vis-Rec model and a time budget from the user and identifies the best set of input statistics, specifically for a target dataset, that would be most effective while generating accurate enough visual insights. We show the effectiveness of our technique as it enables two state of the art Vis-Rec models to achieve up to 10X speedup in time-to-visualize on four large real-world datasets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Deng, H., Runger, G.: Feature selection via regularized trees. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2012) Deng, H., Runger, G.: Feature selection via regularized trees. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2012)
2.
go back to reference Ding, R., Han, S., Xu, Y., Zhang, H., Zhang, D.: QuickInsights: quick and automatic discovery of insights from multi-dimensional data. In: ICMD (2019) Ding, R., Han, S., Xu, Y., Zhang, H., Zhang, D.: QuickInsights: quick and automatic discovery of insights from multi-dimensional data. In: ICMD (2019)
3.
go back to reference Farahat, A.K., Ghodsi, A., Kamel, M.S.: An efficient greedy method for unsupervised feature selection. In: ICDM, pp. 161–170. IEEE (2011) Farahat, A.K., Ghodsi, A., Kamel, M.S.: An efficient greedy method for unsupervised feature selection. In: ICDM, pp. 161–170. IEEE (2011)
6.
go back to reference Hu, K., Bakker, M.A., Li, S., Kraska, T., Hidalgo, C.: VizML: a machine learning approach to visualization recommendation. In: CHI, pp. 1–12 (2019) Hu, K., Bakker, M.A., Li, S., Kraska, T., Hidalgo, C.: VizML: a machine learning approach to visualization recommendation. In: CHI, pp. 1–12 (2019)
7.
go back to reference Hulsebos, M., Demiralp, C., Groth, P.: Gittables: a large-scale corpus of relational tables. Proc. ACM Manag. Data 1, 1–17 (2023) Hulsebos, M., Demiralp, C., Groth, P.: Gittables: a large-scale corpus of relational tables. Proc. ACM Manag. Data 1, 1–17 (2023)
8.
go back to reference Idreos, S., Papaemmanouil, O., Chaudhuri, S.: Overview of data exploration techniques. In: SIGMOD (2015) Idreos, S., Papaemmanouil, O., Chaudhuri, S.: Overview of data exploration techniques. In: SIGMOD (2015)
9.
go back to reference Kachuee, M., et al.: Opportunistic learning: budgeted cost-sensitive learning from data streams. arXiv preprint arXiv:1901.00243 (2019) Kachuee, M., et al.: Opportunistic learning: budgeted cost-sensitive learning from data streams. arXiv preprint arXiv:​1901.​00243 (2019)
10.
go back to reference Li, J., et al.: Feature selection: a data perspective. ACM Comput. Surv. (CSUR) 50, 1–45 (2017) Li, J., et al.: Feature selection: a data perspective. ACM Comput. Surv. (CSUR) 50, 1–45 (2017)
11.
go back to reference Luo, Y., Qin, X., Tang, N., Li, G.: DeepEye: towards automatic data visualization. In: ICDE, pp. 101–112. IEEE (2018) Luo, Y., Qin, X., Tang, N., Li, G.: DeepEye: towards automatic data visualization. In: ICDE, pp. 101–112. IEEE (2018)
12.
go back to reference Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, E.A.: Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015)CrossRef Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, E.A.: Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015)CrossRef
13.
go back to reference Qian, X., et al.: Learning to recommend visualizations from data. In: KDD 2021. ACM (2021) Qian, X., et al.: Learning to recommend visualizations from data. In: KDD 2021. ACM (2021)
15.
go back to reference Vartak, M., Huang, S., Siddiqui, T., Madden, S., Parameswaran, A.: Towards visualization recommendation systems. ACM SIGMOD Rec. 45, 34–39 (2017)CrossRef Vartak, M., Huang, S., Siddiqui, T., Madden, S., Parameswaran, A.: Towards visualization recommendation systems. ACM SIGMOD Rec. 45, 34–39 (2017)CrossRef
16.
go back to reference Wang, C., Chen, M.H., Schifano, E., Wu, J., Yan, J.: Statistical methods and computing for big data. Stat. Interf. 9(4), 399 (2016)MathSciNetCrossRef Wang, C., Chen, M.H., Schifano, E., Wu, J., Yan, J.: Statistical methods and computing for big data. Stat. Interf. 9(4), 399 (2016)MathSciNetCrossRef
17.
Metadata
Title
ScaleViz: Scaling Visualization Recommendation Models on Large Data
Authors
Ghazi Shazan Ahmad
Shubham Agarwal
Subrata Mitra
Ryan Rossi
Manav Doshi
Vibhor Porwal
Syam Manoj Kumar Paila
Copyright Year
2024
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-2262-4_8

Premium Partner