2024 | OriginalPaper | Chapter

ScaleViz: Scaling Visualization Recommendation Models on Large Data

Authors: Ghazi Shazan Ahmad, Shubham Agarwal, Subrata Mitra, Ryan Rossi, Manav Doshi, Vibhor Porwal, Syam Manoj Kumar Paila

Published in: Advances in Knowledge Discovery and Data Mining

Publisher: Springer Nature Singapore

Abstract

Automated visualization recommendation (Vis-Rec) models help users derive crucial insights from new datasets. Typically, such models first compute a large number of statistics from the dataset and then use machine-learning models to score or classify candidate visualizations, recommending the most effective ones according to those statistics. However, state-of-the-art models rely on a very large number of expensive statistics, so running them on large datasets becomes infeasible because of the prohibitively long computation time. This limits the usefulness of such techniques on most large real-world datasets. In this paper, we propose a novel reinforcement-learning (RL) based framework that takes a given Vis-Rec model and a user-specified time budget and identifies, specifically for a target dataset, the set of input statistics that is most effective within that budget while still producing sufficiently accurate visual insights. We demonstrate the effectiveness of our technique by showing that it enables two state-of-the-art Vis-Rec models to achieve up to a 10X speedup in time-to-visualize on four large real-world datasets.
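To make the budgeted-selection problem in the abstract concrete, the following is a minimal sketch under stated assumptions. It is not the RL framework proposed in the paper; it swaps in a simple greedy heuristic (best quality gain per unit cost) purely to illustrate the input and output of such a selector. The statistic names, their costs, the weights, and the `toy_quality` function are all hypothetical placeholders; in practice the quality signal would presumably come from comparing a Vis-Rec model's output on a subset of statistics against its output on the full set.

```python
from typing import Callable, Dict, FrozenSet, List


def select_statistics(
    costs: Dict[str, float],
    budget: float,
    quality: Callable[[FrozenSet[str]], float],
) -> List[str]:
    """Greedily pick statistics whose total cost stays within `budget`.

    At each step, add the affordable statistic with the largest quality
    gain per unit cost; stop when no affordable statistic improves quality.
    Assumes every cost is strictly positive.
    """
    chosen: List[str] = []
    spent = 0.0
    current = quality(frozenset())
    while True:
        best_name, best_ratio, best_quality = None, 0.0, current
        for name, cost in costs.items():
            if name in chosen or spent + cost > budget:
                continue  # already selected, or it would exceed the budget
            q = quality(frozenset(chosen + [name]))
            ratio = (q - current) / cost
            if ratio > best_ratio:
                best_name, best_ratio, best_quality = name, ratio, q
        if best_name is None:
            return chosen
        chosen.append(best_name)
        spent += costs[best_name]
        current = best_quality


# Hypothetical per-statistic compute costs (seconds) and importance weights.
COSTS = {"mean": 1.0, "variance": 2.0, "entropy": 5.0, "correlation": 8.0}
WEIGHTS = {"mean": 0.2, "variance": 0.3, "entropy": 0.4, "correlation": 0.1}


def toy_quality(subset: FrozenSet[str]) -> float:
    # Stand-in for how closely recommendations from `subset` match
    # recommendations computed from the full set of statistics.
    return sum(WEIGHTS[s] for s in subset)


selected = select_statistics(COSTS, budget=8.0, quality=toy_quality)
print(selected)
```

With an 8-second budget, this toy setup selects the three cheaper statistics and skips the expensive, low-weight one. A learned policy, as the abstract describes, could in principle account for interactions between statistics and for dataset-specific costs that a fixed greedy rule ignores.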

https://www.kaggle.com/datasets/mexwell/carrier-dataset.

https://www.kaggle.com/datasets/manishkc06/usa-census-income-data.

https://www.kaggle.com/datasets/mrdheer/cars-dataset.

https://www.kaggle.com/datasets/ashydv/housing-dataset.

Title: ScaleViz: Scaling Visualization Recommendation Models on Large Data
Authors: Ghazi Shazan Ahmad, Shubham Agarwal, Subrata Mitra, Ryan Rossi, Manav Doshi, Vibhor Porwal, Syam Manoj Kumar Paila
Publisher: Springer Nature Singapore
Book: Advances in Knowledge Discovery and Data Mining
Print ISBN: 978-981-9722-64-8

Electronic ISBN: 978-981-9722-62-4

Copyright Year: 2024
DOI: https://doi.org/10.1007/978-981-97-2262-4_8
