27-33Python_EmpoweringDataScienceApplicationsandResearch
27-33Python_EmpoweringDataScienceApplicationsandResearch
net/publication/372960535
CITATION READS
1 3,267
7 authors, including:
SEE PROFILE
All content following this page was uploaded by Shilpi Saxena on 08 August 2023.
STM JOURNALS
Development & Trends
http://computers.stmjournals.com/index.php?journal=JoOSDT&page=index
Review JoOSDT
Abstract
It was in 1991 that the first mention of Python came to light. Since it is one of the best programming
languages, it is widely used in the data analytics industry. It is speedy, easy to use, and can smoothly
alter data with no hiccups. It aids in the processes of data analytics generally, such as data collection,
analysis, modelling, and visualization. Python stands out from other languages because it provides all
the necessary tools for developing cutting-edge applications and machine learning quickly and easily.
The fields of data science and analytics have benefited greatly from recent advances in computing
power, the capacity to store vast volumes of data, and knowledge of relevant methodologies in areas
such as Data Analytics, Artificial Intelligence, Machine Learning, etc. The goal of this study is to have
a discussion on the. All the fundamental resources for developing Python applications may be found in
a single place with the help of an IDE (Integrated Development Environment). Integrated development
environments (IDEs) are research-oriented software packages that bring together a familiar Graphical
User Interface (GUI) with the most widely used programming languages and tools. It often consists of
a source code editor for writing programs and local build automation for creating a local version of
the software, like compiling computer source code. Additionally, it enables you to duplicate the
programs for use on your personal computer. These editors may be used by developers and software
engineers to design apps for personal computers or the web; by DevOps engineers to carry out
continuous integration; and to aid in job automation and boost the software engineer's productivity and
expertise. Some of the many great advantages of adopting a top-tier Python integrated development
environment are listed below. Build automation, code linking, testing, and debugging are just some of
the features available in Python IDEs that may help you save time. This research will begin with an
overview of the Python Data Science Library before moving on to a discussion of data science's many
practical applications. In addition, this research provides a comprehensive evaluation of Python IDEs
used in the fields of data science and research.
this study, we will dive into what Python can do for the fields of data science and analytics. We will
also talk about the value of integrated development environments (IDEs) for Python and how they may
help data scientists work more effectively and efficiently. Many academics, scientists, and data
analysts prefer this language. Python's adaptability, simplicity, and stability make it a prime language
for creating cutting-edge applications and machine learning techniques. In this study, we will dive into
what Python can do for the fields of data science and analytics. We will also talk about the value of
integrated development environments (IDEs) for Python and how they may help data scientists work
more effectively and efficiently.
Introduction to GUI
Python has a number of different alternatives for designing graphical user interfaces (GUI), but
tkinter is the one that is most often used. It is the quickest and most straightforward method for
developing GUI apps.
Climate Science
Research in the field of climate science makes extensive use of Python for a variety of tasks, including
the construction of climate models, the analysis of climate data, and the production of climate
visualizations. Libraries like as xarray and Cartopy make it easier to handle and analyze
multidimensional information relating to climate, while Matplotlib and Plotly provide assistance when
it comes to visualizing climate trends and changes [20].
CONCLUSIONS
In this study we have presented usage of python in various research areas like data science, machine
learning. We also discussed Data science using python, Python libraries for data science, Features of
python for dada science and Applications of python. Python is High-Level programming language used
to design software applications as support language for build control and management, testing and in
different sectors. Python provides access to many libraries like Pandas, NumPy, Scrapy, TensorFlow
can be used to support many aspects of Data Science and Machine learning.
REFERENCES
1. Cabo C. Effectiveness of flowcharting as a scaffolding tool to learn python. In 2018 IEEE Frontiers
in Education Conference (FIE). 2018 Oct 3; 1–7.
2. Vadlamani A, Kalicheti R, Chimalakonda S. API Scanner - Towards Automated Detection of
Deprecated APIs in Python Libraries. 2021 IEEE/ACM 43rd International Conference on Software
Engineering: Companion Proceedings (ICSE-Companion), Madrid, ES. 2021; 5–8. doi:
10.1109/ICSE-Companion52605.2021.00022.
3. Loulergue F, Philippe J. New List Skeletons for the Python Skeleton Library. 2019 20th
International Conference on Parallel and Distributed Computing, Applications and Technologies
(PDCAT), Gold Coast, QLD, Australia. 2019; 392–397. doi: 10.1109/PDCAT46702.2019.00077.
4. Ponnapalli VS, sai Manish AV, Ramu P, Sudhiksha S, Greeshma M. Array Factor Code
Development of Fractal Array Antenna using Python: A Mini-Study on Free and Open Source
Software for Antennas. In 2021 IEEE International Conference on Recent Trends on Electronics,
Information, Communication & Technology (RTEICT). 2021 Aug 27; 448–451.
5. Li Y, Schwiebert L. Boosting Python performance on Intel Processors: A case study of optimizing
music recognition. In 2016 IEEE 6th Workshop on Python for High-Performance and Scientific
Computing (PyHPC). 2016 Nov 14; 52–58.
6. Hoving R, Slot G, Jansen S. Python: Characteristics identification of a free open source software
ecosystem. 2013 7th IEEE International Conference on Digital Ecosystems and Technologies
(DEST), Menlo Park, CA, USA. 2013; 13–18. doi: 10.1109/DEST.2013.6611322.
7. Guillermo M, et al. Graph Database-modelled Public Transportation Data for Geographic Insight
Web Application. 2022 IEEE/ACIS 23rd International Conference on Software Engineering,
Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), Taichung,
Taiwan. 2022; 2–7. doi: 10.1109/SNPD54884.2022.10051802.
8. Foyer C, Conejero J, Ejarque J, Badia RM, Tate A, McIntosh-Smith S. Enabling System Wide
Shared Memory for Performance Improvement in PyCOMPSs Applications. 2020 IEEE/ACM 9th
Workshop on Python for High-Performance and Scientific Computing (PyHPC), GA, USA. 2020;
22–31. doi: 10.1109/PyHPC51966.2020.00008.
9. Elewah A, Badawi AA, Khalil H, Rahnamayan S, Elgazzar K. 3D-RadViz: Three Dimensional
Radial Visualization for Large-Scale Data Visualization. 2021 IEEE Congress on Evolutionary
Computation (CEC), Kraków, Poland. 2021; 1037–1046. doi: 10.1109/CEC45853.2021.9504983.
10. van Oort B, Cruz L, Aniche M, van Deursen A. The Prevalence of Code Smells in Machine
Learning projects. 2021 IEEE/ACM 1st Workshop on AI Engineering - Software Engineering for
AI (WAIN), Madrid, Spain. 2021; 1–8. doi: 10.1109/WAIN52551.2021.00011.
11. Serrano-Muñoz A, Elguea-Aguinaco Í, Chrysostomou D, BØgh S, Arana-Arexolaleiba N. A
Scalable and Unified Multi-Control Framework for KUKA LBR iiwa Collaborative Robots. 2023
IEEE/SICE International Symposium on System Integration (SII), Atlanta, GA, USA. 2023; 1–5.
doi: 10.1109/SII55687.2023.10039308.
12. Pinard A, Hammerling DM, Baker AH. Assessing Differences in Large Spatio-temporal Climate
Datasets with a New Python package. 2020 IEEE International Conference on Big Data (Big Data),
Atlanta, GA, USA. 2020; 2699–2707. doi: 10.1109/BigData50022.2020.9378100.
13. Loulergue F, Philippe J. New List Skeletons for the Python Skeleton Library. 2019 20th
International Conference on Parallel and Distributed Computing, Applications and Technologies
(PDCAT), Gold Coast, QLD, Australia. 2019; 392–397. doi: 10.1109/PDCAT46702.2019.00077.
14. Tohid R, et al. Asynchronous Execution of Python Code on Task-Based Runtime Systems. 2018
IEEE/ACM 4th International Workshop on Extreme Scale Programming Models and Middleware
(ESPM2), Dallas, TX, USA. 2018; 37–45. doi: 10.1109/ESPM2.2018.00009.
15. Ibba P, et al. Fruit Meter: An AD5933-Based Portable Impedance Analyzer for Fruit Quality
Characterization. 2020 IEEE International Symposium on Circuits and Systems (ISCAS), Seville,
Spain. 2020; 1–5. doi: 10.1109/ISCAS45731.2020.9181287.
16. Stančin I, Jović A. An overview and comparison of free Python libraries for data mining and big
data analysis. 2019 42nd International Convention on Information and Communication
Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia. 2019; 977–982. doi:
10.23919/MIPRO.2019.8757088.
17. Singh G, Gupta I, Singh J, Kaur N. Face Recognition using Open Source Computer Vision Library
(OpenCV) with Python. 2022 10th International Conference on Reliability, Infocom Technologies
and Optimization (Trends and Future Directions) (ICRITO), Noida, India. 2022; 1–6. doi:
10.1109/ICRITO56286.2022.9964836.
18. Nance D, Tomov S, Wong K. A Python Library for Matrix Algebra on GPU and Multicore
Architectures. 2022 IEEE 19th International Conference on Mobile Ad Hoc and Smart Systems
(MASS), Denver, CO, USA. 2022; 770–775. doi: 10.1109/MASS56207.2022.00121.
19. Mucesh S, et al. A machine learning approach to galaxy properties: joint redshift–stellar mass
probability distributions with Random Forest. Mon Notices Royal Astron Soc. 2021 Jan; 502(2):
2770–2786. doi: 10.1093/mnras/stab164.
20. Ortigoza Capetillo GM, Lorandi Medina AP, Neri I. Modeling and simulating urban afforestation:
an alternative for urban climate change mitigation. 2020 IEEE International Conference on
Engineering Veracruz (ICEV), Boca del Rio, Mexico. 2020; 1–4. doi:
10.1109/ICEV50249.2020.9289691.
21. Qin M. Machine Translation Technology Based on Natural Language Processing. 2022 European
Conference on Natural Language Processing and Information Retrieval (ECNLPIR), Hangzhou,
China. 2022; 10–13. doi: 10.1109/ECNLPIR57021.2022.00014
22. López CD, Cvetković M, Palensky P. Enhancing Power Factory Dynamic Models with Python for
Rapid Prototyping. 2019 IEEE 28th International Symposium on Industrial Electronics (ISIE),
Vancouver, BC, Canada. 2019; 93–99. doi: 10.1109/ISIE.2019.8781432.