Impact Factor:6.549
 Scopus Suggested Journal: UNDER REVIEW for TITLE INCLUSSION

International Journal
of Computer Engineering in Research Trends (IJCERT)

Scholarly, Peer-Reviewed, Open Access and Multidisciplinary


Welcome to IJCERT

International Journal of Computer Engineering in Research Trends. Scholarly, Peer-Reviewed,Open Access and Multidisciplinary

ISSN(Online):2349-7084                 Submit Paper    Check Paper Status    Conference Proposal

Back to Current Issues

Assessing the Performance of Python Data Visualization Libraries: A Review

Addepalli Lavanya, Lokhande Gaurav, Sakinam Sindhuja, Hussain Seam, Mookerjee Joydeep, Vamsi Uppalapati, Waqas Ali, Vidya Sagar S.D
Affiliations
Universidad Politécnica De Valencia, Valencia, Spain
:10.22362/ijcert/2023/v10/i01/v10i0104


Abstract
Python is one of the most widely used programming languages for data analysis, visualization, and machine learning. One of Python's key strengths is its rich library ecosystem that provides powerful data visualization tools. Several Python data visualization libraries have emerged in recent years, making it challenging for data analysts and scientists to choose the right library for their visualization needs. Therefore, this research paper aims to assess the performance of Python data visualization libraries and comprehensively review their strengths and limitations. The research paper begins by providing an overview of the most popular Python data visualization libraries, including Matplotlib, Seaborn, Plotly, Bokeh, Altair, and ggplot. We then evaluate each library's performance in terms of its functionality, ease of use, flexibility, and speed.. Additionally, we assess the visual quality of the plots produced by each library and compare them to industry standards. We evaluate the performance of each library by testing them on various datasets and use cases, including large and small datasets, static and interactive visualizations, and different plot types, such as scatter plots, line plots, bar charts, and heatmaps. Our findings suggest that each library has unique strengths and limitations, making choosing one library that fits all visualization needs difficult. However, Matplotlib, Seaborn, and Plotly are the most popular and widely used Python data visualization libraries, each with unique strengths. Matplotlib is a powerful and flexible library that offers a broad range of plotting options, making it ideal for creating complex and customized plots. Seaborn is a high-level library that simplifies the plotting process by providing a consistent interface and easy-to-use functions. Plotly is an interactive visualization library offering rich features for creating web-based visualizations and dashboards. We also find that Bokeh, Altair, and ggplot are less popular but offer unique features and functionality. Bokeh is a library for creating interactive visualizations and dashboards, while Altair is a declarative visualization library that simplifies the plotting process by enabling users to create plots using a simple and intuitive syntax. ggplot is a library that offers a grammar of graphics approach to plotting, making it ideal for users familiar with the R programming language. Overall, this research paper provides a comprehensive review of the most popular Python data visualization libraries and their performance in terms of functionality, ease of use, flexibility, and speed. The findings of this research can help data analysts and scientists choose the a good library for their visualization needs to be based on their specific requirements. Additionally, this research paper can provide a starting point for future research on improving the performance and functionality of Python data visualization libraries.


Citation
Addepalli Lavanya, Lokhande Gaurav,Sakinam Sindhuja, Hussain Seam,Mookerjee Joydeep, Vamsi Uppalapati,Waqas Ali,Vidya Sagar S.D."Assessing the Performance of Python Data Visualization Libraries: A Review". International Journal of Computer Engineering In Research Trends (IJCERT) ,ISSN:2349-7084 ,Vol.10, Issue 01,pp.29-39, January - 2023, URL :https://ijcert.org/ems/ijcert_papers/V10I0104.pdf,


Keywords : Python, Data Visualization, Performance, Li- braries, Review

References
[1]	S. Cao, Y. Zeng, S. Yang, and S. Cao, "Research on Python data visualization technology," in Journal of Physics: Conference Series, vol. 1757, 2021, p. 012122.
[2]	I. Stanand  A.  Jovic', "An  overview  and  comparison  of  free  Python  li- braries for data mining and big data analysis," in 2019 42nd International convention on information and communication technology, electronics and microelectronics (MIPRO), 2019, pp. 977–982.
[3]	K. Dale, Data Visualization with Python and JavaScript.
[4]	M. C. Mihaescu and P. S. Popescu, "Review on publicly available datasets for educational data mining," Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 11, no. 3, p. e1403, 2021.
[5]	M. L. Waskom, "Seaborn: statistical data visualization," Journal of Open Source Software, vol. 6, no. 60, p. 3021, 2021.
[6]	I. Stanand  A.  Jovic', "An  overview  and  comparison  of  free  Python  li- braries for data mining and big data analysis," in 2019 42nd International convention on information and communication technology, electronics and microelectronics (MIPRO), 2019, pp. 977–982.
[7]	T. Zhang and L. Mei, "Analysis and research on computer visualization in data science with bokeh and JavaScript," in Journal of Physics: Conference Series, vol. 2033, 2021, p. 012154.
[8]	A. Batch and N. Elmqvist, "The interactive visualization gap in initial exploratory data analysis," IEEE transactions on visualization and computer graphics, vol. 24, no. 1, pp. 278–287, 2017.
[9]	R. Wang, Y. Perez-Riverol, H. Hermjakob, and J. A. Vizcaíno, "Open source libraries and frameworks for biological data visualization: A guide for developers," Proteomics, vol. 15, no. 8, pp. 1356–1374, 2015.
[10]	X. Lou, S. V. D. Lee, and S. Lloyd, "AIMBAT: A python/matplotlib  tool for measuring teleseismic arrival times," Seismological Research Letters, vol. 84, no. 1, pp. 85–93, 2013.
[11]	R. Kumar, "Future for scientific computing using Python," International Journal of Engineering Technologies and Management Research, vol. 2, no. 1, pp. 30–41, 2015.
[12]	C. Rossant, Learning IPython for interactive computing and data visualization. Packt Publishing Ltd, 2015.
[13]	D. Rolon-Mérette, M. Ross, T. Rolon-Mérette, and K. Church, "In- troduction to Anaconda and Python: Installation and setup," Quant. Methods Psychol, vol. 16, no. 5, pp. 3–11, 2016.
[14]	W. S. Pittard and S. Li, "The essential toolbox of data science: Python, R, Git, and Docker," Computational Methods and Data Analysis for Metabolomics, pp. 265–311, 2020.
[15]	P. Bruce, A. Bruce, and P. Gedeck, Practical statistics for data scientists: 50+ essential concepts using R and Python. O'Reilly Media, 2020.
[16]	M. Allen, D. Poggiali, K. Whitaker, T. R. Marshall, and R. A. Kievit, "Raincloud plots: a multi-platform tool for robust data visualization," Wellcome open research, vol. 4, 2019.
[17]	D. P. Kroese, Z. Botev, T. Taimre, and R. Vaisman, Data science and machine learning: mathematical and statistical methods. CRC Press, 2019.
[18]	C. Sievert, Interactive web-based data visualization with R, plotly, and shiny. CRC Press, 2020.
[19]	S. M. Ali, N. Gupta, G. K. Nayak, and R. K. Lenka, "Big data visual- ization: Tools and challenges," in 2016 2nd International Conference on Contemporary Computing and Informatics (IC3I), 2016, pp. 656–660.
[20]	I. Stanand  A.  Jovic', "An  overview  and  comparison  of  free  Python  li- braries for data mining and big data analysis," in 2019 42nd International convention on information and communication technology, electronics and microelectronics (MIPRO), 2019, pp. 977–982.
[21]	C.  Gubala  and  L.   Melonçon, "Data   Visualizations:   An  Integrative Literature Review of Empirical Studies  Across  Disciplines," in 2022 IEEE International Professional Communication Conference (ProComm), 2022, pp. 112–119. [Online]. Available: 10.1109/ProComm53155.2022.00024
[22]	L. Podo and P. Velardi, "Plotly. plus, an Improved Dataset for Visualiza- tion Recommendation," in Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022, pp. 4384–4388.
[23]	K. Jolly, Hands-on data visualization with Bokeh: Interactive web plotting for Python using Bokeh. Packt Publishing Ltd, 2018.
[24]	C. Chai, C. J. Ammon, M. Maceira, and R. B. Herrmann, "Interactive visualization of complex seismic data and models using Bokeh," Seis- mological Research Letters, vol. 89, no. 2A, pp. 668–676, 2018.
[25]	D. O. Embarak and O. Embarak, "Data visualization," Data Analysis and Visualization Using Python: Analyze Data to Create Visualizations for BI Systems, pp. 293–342, 2018.
[26]	S. A. Fahad and A. E. Yahya, "Big data visualization: Allotting by r  and python with gui tools," in 2018 International Conference on Smart Computing and Electronic Enterprise (ICSCEE), 2018, pp. 1–8.
[27]	D. Y. Chen, Pandas for everyone: Python data analysis. Addison- Wesley Professional, 2017.
[28]	P. Lemenkova, "Processing oceanographic data by Python libraries NumPy, SciPy and Pandas," Aquatic Research, vol. 2, no. 2, pp. 73–  91, 2019.
[29]	A. Pal and P. K. S. Prakash, Practical time series analysis: master time series data processing, visualization, and modeling using Python. Packt Publishing Ltd, 2017.
[30]	T. Petrou, Pandas Cookbook: Recipes for Scientific Computing, Time Series Analysis and Data Visualization using Python. Packt Publishing Ltd, 2017.
[31]	C. R. Harris, K. J. Millman, S. J. V. D. Walt, R. Gommers, P. Virtanen, D. Cournapeau, E. Wieser, J. Taylor, S. Berg, and N. J. Smith, "Array programming with NumPy," Nature, vol. 585, no. 7825, pp. 357–362, 2020.
[32]	P. Lemenkova, "Processing oceanographic data by Python libraries NumPy, SciPy and Pandas," Aquatic Research, vol. 2, no. 2, pp. 73–  91, 2019.
[33]	W. McKinney, Python for data analysis: Data wrangling with Pandas, NumPy, and IPython.
[34]	——, "Pandas, python data analysis library," URL http://pandas. pydata. org, pp. 3–15, 2015.
[35]	C. Fuhrer, J. E. Solem, and O. Verdier, Scientific Computing with Python: High-performance scientific computing with NumPy, SciPy, and pandas. Packt Publishing Ltd, 2021.
[36]	F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, and V. Dubourg, "Scikit-learn: Machine learning in Python," the Journal of machine Learning research, vol. 12, pp. 2825–2830, 2011.
[37]	O. Kramer and O. Kramer, "Scikit-learn," Machine learning for evolu- tion strategies, pp. 45–53, 2016.
[38]	J. Hao and T. K. Ho, "Machine learning made easy: a review of scikit- learn package in python programming language," Journal of Educational and Behavioral Statistics, vol. 44, no. 3, pp. 348–361, 2019.
[39]	R. Garreta and G. Moncecchi, Learning scikit-learn: machine learning in Python. Packt Publishing Ltd, 2013.
[40]	K. Ravishankara, V. Dhanush, and I. S. Srajan, "Whatsapp Chat Ana- lyzer," International Journal of Engineering Research & Technol- ogy, vol. 9, no. 5, pp. 897–900, 2020.
[41]	T. Haslwanter, "An Introduction to Statistics with Python," With Ap- plications in the Life Sciences.. Switzerland: Springer International Publishing, 2016.
[42]	E. Bisong and E. Bisong, "Matplotlib and seaborn," Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners, pp. 151–165, 2019.
[43]	I. Stanand  A.  Jovic', "An  overview  and  comparison  of  free  Python  li- braries for data mining and big data analysis," in 2019 42nd International convention on information and communication technology, electronics and microelectronics (MIPRO), 2019, pp. 977–982.
[44]	D. O. Embarak and O. Embarak, "Data visualization," Data Analysis and Visualization Using Python: Analyze Data to Create Visualizations for BI Systems, pp. 293–342, 2018.
[45]	E. Dabbas, Interactive Dashboards and Data Apps with Plotly and Dash: Harness the power of a fully fledged frontend web framework in Python- no JavaScript required. Packt Publishing Ltd, 2021.
[46]	J. VanderPlas, B. Granger, J. Heer, D. Moritz, K. Wongsuphasawat, A. Satyanarayan, E. Lees, I. Timofeev, B. Welsh, and S. Sievert, "Altair: interactive statistical visualizations for Python," Journal of open source software, vol. 3, no. 32, p. 1057, 2018.
[47]	S. A. Fahad and A. E. Yahya, "Big data visualization: Allotting by r  and python with gui tools," in 2018 International Conference on Smart Computing and Electronic Enterprise (ICSCEE), 2018, pp. 1–8.
[48]	D. O. Embarak and O. Embarak, "Data visualization," Data Analysis and Visualization Using Python: Analyze Data to Create Visualizations for BI Systems, pp. 293–342, 2018.
[49]	A. Cuttone, S. Lehmann, and J. E. Larsen, "Geoplotlib: a python toolbox for visualizing geographical data," arXiv preprint arXiv:1608.01933, 2016.
[50]	C. Room, "Machine Learning in Python," algorithms, vol. 8, no. 46,      p. 30, 2022.


DOI Link : https://doi.org/10.22362/ijcert/2023/v10/i01/v10i0104

Download :
  V10I0104.pdf


Refbacks : Currently there are no Refbacks

Announcements


Authors are not required to pay any article-processing charges (APC) for their article to be published open access in Journal IJCERT. No charge is involved in any stage of the publication process, from administrating peer review to copy editing and hosting the final article on dedicated servers. This is free for all authors. 

News & Events


Latest issue :Volume 10 Issue 1 Articles In press

A plagiarism check will be implemented for all the articles using world-renowned software. Turnitin.


Digital Object Identifier will be assigned for all the articles being published in the Journal from September 2016 issue, i.e. Volume 3, Issue 9, 2016.


IJCERT is a member of the prestigious.Each of the IJCERT articles has its unique DOI reference.
DOI Prefix : 10.22362/ijcert


IJCERT is member of The Publishers International Linking Association, Inc. (“PILA”)


Emerging Sources Citation Index (in process)


IJCERT title is under evaluation by Scopus.


Key Dates


☞   INVITING SUBMISSIONS FOR THE NEXT ISSUE :
☞   LAST DATE OF SUBMISSION : 31st March 2023
☞  SUBMISSION TO FIRST DECISION :
In 7 Days
☞  FINAL DECISION :
IN 3 WEEKS FROM THE DAY OF SUBMISSION

Important Announcements


All the authors, conference coordinators, conveners, and guest editors kindly check their articles' originality before submitting them to IJCERT. If any material is found to be duplicate submission or sent to other journals when the content is in the process with IJCERT, fabricated data, cut and paste (plagiarized), at any stage of processing of material, IJCERT is bound to take the following actions.
1. Rejection of the article.
2. The author will be blocked for future communication with IJCERT if duplicate articles are submitted.
3. A letter regarding this will be posted to the Principal/Director of the Institution where the study was conducted.
4. A List of blacklisted authors will be shared among the Chief Editors of other prestigious Journals
We have been screening articles for plagiarism with a world-renowned tool: Turnitin However, it is only rejected if found plagiarized. This more stern action is being taken because of the illegal behavior of a handful of authors who have been involved in ethical misconduct. The Screening and making a decision on such articles costs colossal time and resources for the journal. It directly delays the process of genuine materials.

Citation Index


Citations Indices All
Citations 1026
h-index 14
i10-index 20
Source: Google Scholar

Acceptance Rate (By Year)


Acceptance Rate (By Year)
Year Rate
2021 10.8%
2020 13.6%
2019 15.9%
2018 14.5%
2017 16.6%
2016 15.8%
2015 18.2%
2014 20.6%

Important Links



Conference Proposal




DOI:10.22362/ijcert