Internship Report
SUBMITTED BY
KAVYA N
(U18KN22S0009)
CERTIFICATE BY COLLEGE
This document serves to certify that the internship report entitled “DATA SCIENCE WITH
PYTHON” has been conducted at “DIGIKSHETRA” by KAVYA N, Registration No.
U18KN22S0009. This report constitutes an original work completed by the aforementioned
student, who has successfully completed a 4-week internship program.
The internship report has been submitted to BENGALURU CITY UNIVERSITY for the
academic year 2024-25, fulfilling a key requirement for the successful completion of the VI
semester BACHELOR OF COMPUTER APPLICATION (BCA) program. This report is
also intended to support the award of the BCA degree from Bengaluru City University.
This document serves to certify that KAVYA N is a final year student currently enrolled in the
BACHELOR OF COMPUTER APPLICATIONS program (Batch 2022-2025) at MVM
COLLEGE OF ARTS SCIENCE AND MANAGEMENT, Bengaluru.
She has submitted an internship report developed under the guidance of our esteemed faculty,
in partial fulfillment of the requirements for the award of the degree of “Bachelor of Computer
Application” from Bengaluru City University. It is essential to emphasize that this report has
not been used as a basis for the award of any other degree or diploma from any university or
institution.
Head of Department of
I hereby declare that this internship report on “DATA SCIENCE WITH PYTHON” is an
original work based on my experience during my internship at DIGIKSHETRA. This internship
study was conducted by me as part of the requirements for the successful completion of the VI
semester Bachelor of Computer Application program at Bengaluru City University. This report
has not been submitted to any university for the purpose of obtaining a degree or diploma.
Science and Management, for her continuous support and for dedicating her valuable time to
guide me toward the successful completion of this report, as well as for providing invaluable insights
that facilitated a successful internship experience.
I would also like to thank Ms. Pushpalatha P, Head of Department of Bachelors of Computer
Applications at MVM College of Arts Science and Management, for her guidance,
encouragement, and support.
I sincerely thank my internship guide Mr. Dilip Kumar V, Assistant Professor, Department of
Computer Applications at MVM College of Arts Science and Management, for his invaluable
guidance, continuous support, and insightful suggestions, which played a vital role in the
successful completion of my internship.
3.2 Libraries
4.4 Regression
5 Conclusion
CHAPTER 1
INTRODUCTION
1.1 INTRODUCTION TO STUDY
An internship program is essential for all BCA students because it helps them gain exposure to
real-life work situations. I selected DIGIKSHETRA, which is one of the leading programming
training academies in Bengaluru. The internship report is one of the requirements for completing
the BCA degree of the department of Science Administration, Bengaluru City University. This
report is on "The Process and Implementation of Software Skills: A Study on DIGIKSHETRA". It
was a chance for me to learn about the management and skills procedures of the company.
CONTEXT:
Data science with Python is a pivotal field in today’s technology-driven world, where
organizations rely on data-driven insights to make informed decisions and optimize
processes. The internship provided an opportunity to delve into this discipline, gaining
hands-on experience in manipulating data, building predictive models, and extracting
meaningful insights.
1.2 OBJECTIVE OF STUDY
The primary objective of this internship was to enhance my skills in data analysis, machine
learning algorithms, and their application in real-world scenarios. Specifically, the objectives
included:
interpreting results to derive actionable insights from datasets. The findings contribute to the
broader understanding of how these techniques can be effectively applied in real-world
business contexts, paving the way for future research and applications in the field.
ORGANIZATION PROFILE
COMPANY
1.1 Company Profile
With a strong focus on user experience, accessibility, and scalability, our solutions are
tailored to meet the unique needs of diverse educational settings, from K-12 schools to
higher education institutions and corporate training programs. By delivering
customizable, adaptable, and future-ready solutions, we empower educators to deliver
high-quality instruction and empower learners to achieve their full potential in an
increasingly digital world.
Our comprehensive suite of products and services encompasses everything from learning
management systems and virtual classrooms to interactive content authoring tools and
professional development programs. Whether it's providing educators with the tools they
need to create immersive learning experiences or offering learners access to dynamic and
interactive learning environments, Digikshetra Solutions is committed to driving positive
change in education through technology.
One of the key strengths of Digikshetra Solutions lies in our team of experts, who bring a
wealth of experience and expertise from diverse fields such as education, technology,
instructional design, and user experience. By combining our knowledge and skills, we are
able to develop innovative solutions that address the evolving needs of the education
sector and deliver measurable results.
Recruitment and Talent Acquisition: This involves identifying staffing needs, sourcing
candidates through job postings and recruitment agencies, conducting interviews, and
selecting the best-fit candidates.
Employee Onboarding and Orientation: HR ensures that new hires receive necessary
training, complete required paperwork, understand company policies, and integrate
smoothly into their roles and the organization.
Cybersecurity: This team protects the organization from cyber threats by implementing
firewalls, antivirus software, intrusion detection systems (IDS), and conducting regular
security audits and assessments.
Technical Support: The IT helpdesk provides technical assistance, resolves user issues,
and offers guidance on software usage, hardware configuration, and network
connectivity.
Figure 1.3. Information Technology (IT) Department Structure
Market Research and Analysis: R&D conducts market research to identify industry trends,
customer preferences, and competitive landscapes to inform product development
strategies.
Product Innovation: This team conceptualizes and prototypes new products, features, and
services, leveraging emerging technologies and customer feedback to drive innovation.
Prototype Development: R&D builds and tests prototypes to validate concepts, assess
feasibility, and gather user feedback before full-scale development and
commercialization.
Sales and Marketing Department:
Market Segmentation and Targeting: Sales and
marketing professionals identify and segment target markets based on demographics,
behaviours, and preferences to tailor marketing messages and campaigns.
Lead Generation and Nurturing: This team generates leads through advertising, content
marketing, and networking efforts, then nurtures those leads through email campaigns,
follow-up calls, and personalized communication.
Digital Marketing: This involves leveraging digital channels such as websites, social media
platforms, search engine optimization (SEO), and pay-per-click (PPC) advertising to reach
and engage target audiences.
Sales Enablement: Sales enablement teams provide sales reps with the tools, resources,
and training needed to effectively communicate product features, benefits, and value
propositions to prospects and customers.
Figure 1.3.4(a) Virtual Marketing Department
Figure 1.3.4(b) Marketing Department
Customer Onboarding: This involves guiding new customers through the implementation
process, providing training, support, and resources to ensure successful adoption and
Financial Reporting and Compliance: This team prepares financial statements, regulatory
filings, and ensures compliance with accounting standards, tax laws, and industry
regulations.
Facilities Management: This team maintains and manages office facilities, equipment, and
infrastructure to create a safe, productive, and comfortable work environment.
Quality Assurance and Control: Operations experts develop and implement quality
standards, processes, and metrics to ensure products and services meet or exceed
customer expectations.
User Experience (UX) Design: This team focuses on designing intuitive, user-friendly
interfaces and interactions for digital products and platforms to enhance customer
satisfaction and engagement.
Market Expansion: This team explores new markets, geographies, and customer
segments to identify growth opportunities and develop market entry strategies to
capitalize on emerging trends.
Channel Development: Business development experts establish and manage distribution
channels, reseller networks, and affiliate programs to extend Digikshetra Solutions'
reach and increase sales volume.
Quality Assurance (QA) Testing is a critical function that ensures products and services
meet specified quality standards before they are released to customers. This involves
systematically testing various aspects of software, hardware, or other deliverables to
identify defects or areas for improvement.
QA testers develop comprehensive test plans, scenarios, and scripts based on project
requirements and specifications. These documents outline the testing approach, test
cases, and expected outcomes for each test scenario.
Different types of testing are performed throughout the development lifecycle, including
functional testing, which verifies that the system meets specified functional
requirements, and regression testing, which ensures that new changes do not adversely
impact existing functionality.
User acceptance testing (UAT) is conducted to validate the product against user
expectations and requirements. This involves engaging end-users to evaluate the
product's usability, accessibility, and overall user experience.
QA testers use a variety of testing techniques and tools to identify defects and bugs in the
system. This may include manual testing, automated testing, performance testing,
security testing, and usability testing, among others.
Throughout the testing process, QA testers document their findings, including any defects
or issues discovered during testing. They work closely with development teams to
prioritize and address these issues, ensuring that high-priority defects are resolved
promptly.
The Grievance and Conflict Resolution department is responsible for managing disputes,
grievances, and conflicts within the organization. This department serves as a neutral
mediator, helping parties resolve issues in a fair and equitable manner.
One of the primary functions is to provide employees and stakeholders with a platform
to express their concerns and grievances. This could involve setting up confidential
reporting mechanisms, such as hotlines or online portals, where individuals can raise
issues anonymously.
Once the investigation is complete, the department works towards finding a resolution
that addresses the concerns of all parties involved. This could involve implementing
corrective actions, revising policies or procedures, or providing training and support to
prevent similar issues from arising in the future.
Figure 1.3.10 Customer Complaint Resolution
Additionally, the Grievance and Conflict Resolution department may play a role in
promoting a positive organizational culture by fostering open communication, trust, and
collaboration among employees. This includes promoting conflict resolution skills and
providing resources for managing interpersonal conflicts effectively.
These departments play integral roles in ensuring the success and growth of Digikshetra
Solutions by effectively managing resources, delivering high-quality products and
services, and providing comprehensive educational solutions that empower learners,
educators, and institutions to succeed in a rapidly evolving digital landscape.
• Virtual Classroom:
In today's digital age, our Virtual Classroom platform offers educators a versatile and
immersive online learning environment. Equipped with video conferencing, screen
sharing, interactive whiteboards, and chat functionality, this platform facilitates real-
time collaboration and interaction between educators and students. Whether
conducting live classes, webinars, or virtual workshops, educators can engage
learners effectively and foster meaningful learning experiences. Seamless
integration with our LMS ensures streamlined course delivery and administration.
• IoT Projects:
Internet of Things (IoT) projects involve the integration of physical devices, sensors,
and software applications to enable connectivity and data exchange. These projects
often focus on leveraging IoT technologies to create smart systems, automate
processes, and gather insights from real-world data. Digikshetra Solutions offers a
range of IoT projects aimed at providing hands-on learning experiences to students
and professionals.
Data Acquisition and Processing: Projects involve collecting data from sensors and
processing it using microcontrollers or single-board computers like Arduino,
Raspberry Pi, or ESP8266/ESP32.
Connectivity and Communication: Students explore different communication
protocols such as Wi-Fi, Bluetooth, Zigbee, and LoRa to enable connectivity between
IoT devices and networks.
IoT Applications: Projects may focus on developing IoT applications for smart
homes, smart cities, industrial automation, healthcare monitoring, environmental
monitoring, agriculture, and more.
Cloud Integration: Integration with cloud platforms such as AWS IoT, Google Cloud
IoT, or Microsoft Azure IoT enables students to store, analyze, and visualize IoT data,
as well as implement cloud-based services like remote monitoring and control.
• AI-ML Projects:
Artificial Intelligence (AI) and Machine Learning (ML) projects involve the
development and application of algorithms and models to analyze data, make
predictions, and automate tasks without explicit programming instructions. These
projects harness the power of AI and ML techniques to solve complex problems,
optimize processes, and extract insights from data. Digikshetra Solutions offers a
variety of AI-ML projects aimed at fostering understanding and proficiency in these
transformative technologies.
Key components of AI-ML projects include:
Data Collection and Preparation: Projects begin with data collection from various
sources such as sensors, databases, or web APIs. Data preprocessing techniques are
then applied to clean, transform, and prepare the data for analysis.
Model Training and Evaluation: The models are trained using labelled data, and
their performance is evaluated using metrics such as accuracy, precision, recall, and
F1-score. Hyperparameter tuning and cross-validation techniques are applied to
optimize model performance.
Deployment and Integration: Once trained and evaluated, the models are
deployed into production environments and integrated with existing systems or
applications to deliver real-world value. This may involve deploying models on edge
devices, cloud platforms, or IoT devices.
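The evaluation metrics named above can be illustrated with a small sketch. It uses only the Python standard library and assumes a binary classification problem; the function name `classification_metrics` and the sample labels are hypothetical and are not taken from any project at Digikshetra.

```python
from collections import Counter


def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for a binary problem."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equally long")
    counts = Counter()
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            counts["tp" if truth == positive else "fp"] += 1
        else:
            counts["fn" if truth == positive else "tn"] += 1
    tp, fp, fn, tn = (counts[k] for k in ("tp", "fp", "fn", "tn"))
    accuracy = (tp + tn) / len(y_true)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical labels, made up purely for illustration.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = classification_metrics(y_true, y_pred)
```

In this made-up example there are three true positives, three true negatives, one false positive and one false negative, so all four metrics come out to 0.75. In practice a library such as scikit-learn would normally be used for these calculations.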
Overall, IoT and AI-ML projects offered by Digikshetra Solutions provide participants with
practical experience, critical thinking skills, and technical expertise in emerging
technologies, preparing them for careers in the rapidly evolving fields of IoT and AI-ML.
CHAPTER 3
DATA SCIENCE
2.1 Introduction
In the digital era, data has become the cornerstone of decision-making processes across
industries. From predicting customer preferences to optimizing business operations,
organizations rely on data-driven insights to gain a competitive edge in the market. Data
Science, as a multidisciplinary field, lies at the intersection of statistics, computer science,
and domain expertise, offering a systematic approach to extract actionable insights from
vast and complex datasets.
Through this internship at Digikshetra Solutions, we delve into the dynamic field of Data
Science, exploring its methodologies, tools, applications, and future trends. By gaining
hands-on experience in data manipulation, visualization, predictive modelling, and data-
driven decision-making, interns will acquire the essential skills and knowledge required
to excel in the rapidly evolving landscape of Data Science.
Throughout this internship program, interns will have the opportunity to work on real-
world projects, collaborate with industry experts, and contribute to the development of
innovative solutions that harness the power of data to drive business success. By
immersing themselves in the practical application of Data Science techniques and
methodologies, interns will emerge as adept practitioners capable of making meaningful
contributions to organizations' data-driven initiatives.
In summary, this internship at Digikshetra Solutions serves as a gateway for aspiring Data
Scientists to embark on a rewarding journey in the field of Data Science, equipping them
with the skills, knowledge, and experience needed to thrive in an increasingly data-centric
world. Through hands-on learning, mentorship, and exposure to cutting-edge
technologies, interns will be empowered to harness the transformative potential of data
and drive impactful outcomes for businesses and society at large.
2.2 Overview
In today's digital landscape, the proliferation of data has transformed the way
organizations operate, offering unprecedented opportunities for insights-driven
decision-making. Data Science, as a multidisciplinary field, encompasses a wide range of
techniques, methodologies, and tools aimed at extracting actionable insights from large
and complex datasets. At Digikshetra Solutions, the exploration of Data Science begins
with a comprehensive overview that delves into the fundamental concepts,
methodologies, and applications within this dynamic domain.
At the heart of Data Science lies a solid foundation in statistics, mathematics, and
computer science. Interns at Digikshetra Solutions embark on a journey to understand
the theoretical underpinnings of Data Science, exploring concepts such as probability
theory, linear algebra, and calculus. By mastering these foundational principles, interns
gain the analytical prowess needed to manipulate, analyze, and interpret data effectively.
Data Science begins with the collection and preprocessing of raw data from various
sources. Digikshetra Solutions provides interns with hands-on experience in data
acquisition techniques, including web scraping, APIs, and databases. Interns learn to
clean, preprocess, and transform data to ensure its quality, consistency, and suitability for
analysis.
Exploratory Data Analysis (EDA) forms a crucial step in the Data Science workflow,
enabling interns to gain insights into the underlying patterns, trends, and relationships
within the data. Through visualizations, summary statistics, and hypothesis testing,
interns uncover key insights that inform subsequent analysis and decision-making
processes.
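As a minimal sketch of the summary statistics used in EDA, the following example relies on Python's built-in `statistics` module. The helper name `summarize` and the sample values are assumptions made for illustration only.

```python
import statistics


def summarize(values):
    """Return basic summary statistics for a list of numbers."""
    q1, _, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    return {
        "count": len(values),
        "mean": statistics.fmean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "min": min(values),
        "max": max(values),
        "iqr": q3 - q1,  # interquartile range
    }


# Hypothetical measurements, made up for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]
summary = summarize(values)
for name, value in summary.items():
    print(f"{name}: {value}")
```

Comparing the mean with the median and the standard deviation with the interquartile range gives a quick indication of skew and of possible outliers before any plots are drawn.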
As datasets continue to grow in size and complexity, the ability to process and analyze big
data becomes increasingly critical. Digikshetra Solutions equips interns with the tools and
techniques needed to work with big data frameworks such as Apache Hadoop and Apache
Spark. Interns learn to leverage distributed computing paradigms to handle massive
datasets efficiently and extract actionable insights at scale.
Advancements in Deep Learning and Artificial Intelligence have revolutionized the field
of Data Science, enabling the development of complex models capable of learning from
vast amounts of data. Digikshetra Solutions exposes interns to deep learning frameworks
such as TensorFlow and PyTorch, allowing them to explore advanced neural network
architectures and applications in image recognition, natural language processing, and
more.
Data Science finds applications across a wide range of industries and domains, from
finance and healthcare to marketing and cybersecurity. Digikshetra Solutions provides
interns with exposure to real-world projects and use cases, allowing them to apply their
skills and expertise to solve complex problems and drive business outcomes.
In summary, the overview of Data Science at Digikshetra Solutions provides interns with
a comprehensive understanding of the principles, methodologies, and applications within
this dynamic field. By mastering the foundational concepts, exploring cutting-edge
technologies, and applying their skills to real-world projects, interns emerge prepared to
tackle the challenges and opportunities in the ever-evolving landscape of Data Science.
2.3 Data Acquisition
Data acquisition is the process of collecting raw data from various sources to be used for
analysis and modelling in data science projects. At Digikshetra Solutions, interns are
trained in comprehensive data acquisition techniques to gather relevant and high-quality
data for their projects.
• Identifying Data Sources: Interns begin by identifying the sources from which data
will be collected. These sources may include databases, APIs, web scraping, sensor
networks, social media platforms, IoT devices, and third-party data providers.
Understanding the nature of each data source is essential for effective acquisition.
• Web Scraping: When data is not readily available in structured formats, interns
employ web scraping techniques to extract information from websites. They use tools
like Beautiful Soup in Python to parse HTML and extract data elements such as text,
tables, or images from web pages.
• Data Cleaning and Preprocessing: Once data is acquired, interns perform data
cleaning and preprocessing to ensure its quality and usability for analysis. This
involves tasks such as removing duplicates, handling missing values, standardizing
formats, and transforming data into a suitable structure for analysis.
• Data Privacy and Compliance: Interns adhere to data privacy regulations and ethical
guidelines when acquiring data, especially when dealing with sensitive or personally
identifiable information (PII). They ensure compliance with regulations like GDPR,
HIPAA, or CCPA and implement measures to protect data privacy and security.
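The web scraping step can be sketched as follows. To keep the example self-contained, it uses Python's built-in `html.parser` module in place of Beautiful Soup and parses an inline HTML string instead of downloading a page. The class name `TableExtractor` and the sample table are hypothetical.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Hypothetical page content; a real scraper would download this instead.
SAMPLE_HTML = """
<table>
  <tr><th>City</th><th>Temperature (C)</th></tr>
  <tr><td>Bengaluru</td><td>27</td></tr>
  <tr><td>Mysuru</td><td>29</td></tr>
</table>
"""

parser = TableExtractor()
parser.feed(SAMPLE_HTML)
header, *records = parser.rows
table = [dict(zip(header, record)) for record in records]
```

Each table row becomes a dictionary keyed by the header cells, which is a convenient structure for the cleaning and preprocessing steps that follow. Scraped values arrive as text, so numeric columns still need to be converted.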
2.4 Data Preparation and Preprocessing
Data preparation and preprocessing are crucial steps in the data science workflow that
involve cleaning, transforming, and organizing raw data into a format suitable for analysis
and modelling. At Digikshetra Solutions, interns are trained in comprehensive data
preparation and preprocessing techniques to ensure the quality and integrity of the data
used in their projects.
• Data Cleaning: Interns start by identifying and addressing issues such as missing
values, duplicates, inconsistencies, and errors in the dataset. They employ techniques
like imputation, deletion, or interpolation to handle missing data and remove
redundant or irrelevant observations.
• Data Integration: When working with multiple datasets or sources, interns integrate
and merge the data to create a unified dataset for analysis. They ensure compatibility
between datasets by standardizing variables, resolving naming inconsistencies, and
aligning data structures.
• Data Sampling: In cases where the dataset is too large or imbalanced, interns may
employ data sampling techniques to create representative subsets for analysis. This
may include random sampling, stratified sampling, or oversampling/undersampling
methods to balance class distributions.
By mastering these data preparation and preprocessing techniques, interns at
Digikshetra Solutions can ensure that the data used in their projects is clean, consistent,
and well-structured, enabling them to build accurate and reliable predictive models and
extract valuable insights for business decision-making.
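Three of the techniques described above (removing duplicates, imputing missing values, and stratified sampling) can be sketched with the standard library alone. The function names, the column names `age` and `label`, and the records are all hypothetical; median imputation is only one of several possible imputation strategies.

```python
import random
import statistics
from collections import defaultdict


def drop_duplicates(rows):
    """Remove exact duplicate records while keeping the first occurrence."""
    seen, unique = set(), []
    for row in rows:
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def impute_median(rows, column):
    """Replace missing (None) values in `column` with the column median."""
    observed = [row[column] for row in rows if row[column] is not None]
    fill = statistics.median(observed)
    return [{**row, column: fill if row[column] is None else row[column]}
            for row in rows]


def stratified_sample(rows, label, fraction, seed=0):
    """Draw the same fraction of records from every class in `label`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[label]].append(row)
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = []
    for members in groups.values():
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical records, made up for illustration.
records = [
    {"age": 61, "label": "A"},
    {"age": None, "label": "A"},   # missing value
    {"age": 58, "label": "B"},
    {"age": 58, "label": "B"},     # exact duplicate
    {"age": 70, "label": "A"},
    {"age": 66, "label": "A"},
    {"age": 49, "label": "B"},
]

deduplicated = drop_duplicates(records)
cleaned = impute_median(deduplicated, "age")
sample = stratified_sample(cleaned, "label", fraction=0.5)
```

With these records, one duplicate is dropped, the missing age is filled with the median of the observed ages (61), and the 50% stratified sample keeps two records of class A and one of class B, preserving the 2:1 class ratio.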
2.5 Exploratory Data Analysis (EDA)
Exploratory Data Analysis (EDA) is a crucial phase in the data science workflow, where
analysts explore and visualize datasets to gain insights and identify patterns. At
Digikshetra Solutions, interns are trained in various techniques and methodologies
involved in EDA to uncover hidden trends and relationships in data.
• Data Visualization: Visualization plays a vital role in EDA, enabling interns to create
informative plots and charts to visualize the distribution, trends, and relationships
within the data. Techniques such as histograms, box plots, scatter plots, and heatmaps
are utilized to explore different aspects of the dataset.
• Data Cleaning and Imputation: Before conducting EDA, interns perform data
cleaning to address missing values, outliers, and inconsistencies in the dataset.
Imputation techniques such as mean imputation, median imputation, and
interpolation are applied to handle missing data.
• Outlier Detection: Outliers can significantly impact the results of data analysis.
Interns employ outlier detection techniques such as z-score, modified z-score, and
isolation forests to identify and remove outliers from the dataset.
• Data Transformation: Transforming the data into a more suitable form for analysis
is an important step in EDA. Techniques such as logarithmic transformation, square
root transformation, and normalization are applied to ensure that the data meets the
assumptions of statistical tests and models.
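The z-score and modified z-score checks mentioned above can be sketched as follows. The thresholds of 3.0 and 3.5 are commonly used defaults rather than values taken from the internship, and the function names and readings are hypothetical.

```python
import statistics


def z_score_outliers(values, threshold=3.0):
    """Flag values more than `threshold` standard deviations from the mean."""
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)
    if stdev == 0:
        return []
    return [v for v in values if abs(v - mean) / stdev > threshold]


def modified_z_score_outliers(values, threshold=3.5):
    """Flag values using the median and the median absolute deviation (MAD)."""
    median = statistics.median(values)
    mad = statistics.median([abs(v - median) for v in values])
    if mad == 0:
        return []
    # 0.6745 rescales the MAD so the score is comparable to a z-score
    # when the data are roughly normally distributed.
    return [v for v in values if 0.6745 * abs(v - median) / mad > threshold]


# Hypothetical sensor readings with one obviously extreme value.
readings = [10, 12, 11, 13, 12, 11, 10, 12, 95]
plain = z_score_outliers(readings)
robust = modified_z_score_outliers(readings)
```

On this small sample the plain z-score flags nothing, because the extreme value 95 inflates the mean and the standard deviation it is measured against. The modified z-score, which is based on the median, flags 95. This is one reason the median-based variant is often preferred for small datasets.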
In summary, Exploratory Data Analysis (EDA) is a critical phase in the data science
process, allowing interns at Digikshetra Solutions to gain a deeper understanding of the
dataset and uncover valuable insights that drive informed decision-making and further
analysis. Through practical training in EDA techniques, interns develop the skills and
expertise needed to extract meaningful information from data and deliver actionable
insights to stakeholders.
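The data transformations named in this section can be sketched in a few lines. Min-max scaling is used here as one common form of normalization, which is an assumption, since the text does not say which form was applied. The function names and values are hypothetical.

```python
import math


def log_transform(values):
    """Apply log(1 + x), which compresses large values; needs x >= 0."""
    if any(v < 0 for v in values):
        raise ValueError("log_transform expects non-negative values")
    return [math.log1p(v) for v in values]


def sqrt_transform(values):
    """Apply the square root, a milder compression than the logarithm."""
    if any(v < 0 for v in values):
        raise ValueError("sqrt_transform expects non-negative values")
    return [math.sqrt(v) for v in values]


def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    lowest, highest = min(values), max(values)
    if highest == lowest:
        return [0.0] * len(values)  # constant column: nothing to scale
    return [(v - lowest) / (highest - lowest) for v in values]


# Hypothetical right-skewed values, made up for illustration.
amounts = [0, 9, 99, 999]
logged = log_transform(amounts)
scaled = min_max_normalize(amounts)
```

The logarithm turns the multiplicative spacing of the sample values into nearly even steps, which is why it is often applied to right-skewed data before modelling.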
2.6 Model Selection
Model selection is a critical step in the data science process that involves choosing the
most appropriate machine learning algorithm or statistical model for a given dataset and
problem domain. At Digikshetra Solutions, interns are trained in various model selection
techniques to ensure that the chosen model effectively captures the underlying patterns
and relationships in the data.
• Exploratory Data Analysis (EDA): Interns conduct EDA to gain insights into the
characteristics and distributions of the dataset. This helps in identifying potential
relationships between variables, detecting outliers, and understanding the
underlying data patterns, which guide the selection of appropriate models.
• Validation and Testing: Once the final model is selected, interns validate its
performance on a held-out validation dataset or through cross-validation.
Additionally, the model's performance is evaluated on an unseen test dataset to
assess its ability to generalize to new data.
By carefully considering these factors and methodologies, interns at Digikshetra
Solutions are equipped to select and deploy robust and reliable models that effectively
address real-world challenges and deliver actionable insights for stakeholders.
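Cross-validation as a way of comparing candidate models can be sketched as follows. The example covers only the cross-validation step, not the final check on an unseen test set. The two candidate models (a majority-class baseline and a nearest-centroid classifier on a single feature), the function names, and the data are all hypothetical.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_indices, validation_indices) for k shuffled folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and the number of samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i, validation in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation


def fit_majority_class(features, labels):
    """Baseline: always predict the most frequent training label."""
    majority = max(set(labels), key=labels.count)
    return lambda x: majority


def fit_nearest_centroid(features, labels):
    """Predict the class whose mean feature value is closest to x."""
    sums = {}
    for x, y in zip(features, labels):
        total, count = sums.get(y, (0.0, 0))
        sums[y] = (total + x, count + 1)
    centroids = {y: total / count for y, (total, count) in sums.items()}
    return lambda x: min(centroids, key=lambda y: abs(x - centroids[y]))


def cross_val_accuracy(fit, features, labels, k=4):
    """Average validation accuracy of a model-fitting function over k folds."""
    scores = []
    for train, validation in k_fold_indices(len(features), k):
        model = fit([features[i] for i in train], [labels[i] for i in train])
        correct = sum(model(features[i]) == labels[i] for i in validation)
        scores.append(correct / len(validation))
    return sum(scores) / len(scores)


# Hypothetical one-feature dataset with two well-separated classes.
features = [1.0, 1.2, 0.8, 1.1, 0.9, 1.3, 3.0, 3.2, 2.9, 3.1, 2.8, 3.3]
labels = [0] * 6 + [1] * 6

baseline_score = cross_val_accuracy(fit_majority_class, features, labels)
centroid_score = cross_val_accuracy(fit_nearest_centroid, features, labels)
```

The model with the higher average validation accuracy would be the one carried forward to the final test-set evaluation. Because the two classes here are well separated, the nearest-centroid model classifies every validation sample correctly, while the baseline does not.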
CHAPTER 4
MINI PROJECT
PARKINSON’S DISEASE
Abstract:
Parkinson’s Disease (PD) is a progressive neurodegenerative disorder that primarily affects
motor functions due to the loss of dopamine-producing neurons in the brain. This project
explores the causes, symptoms, stages, and current treatment options for Parkinson’s Disease.
It also investigates emerging technologies such as wearable sensors, AI-based diagnostic tools,
and mobile health applications for symptom tracking and management. The study aims to
evaluate how technological integration can improve early diagnosis, treatment personalization,
and patient quality of life.
4.1 Introduction
Parkinson’s disease (PD) is a progressive neurodegenerative disorder that primarily affects
movement. It is the second most common neurodegenerative disease after Alzheimer’s.
Parkinson's is caused by the degeneration of dopamine-producing neurons in the brain,
particularly in a region called the substantia nigra.
Parkinson’s disease is a chronic and progressive neurological disorder that primarily affects
movement. It occurs when certain nerve cells (neurons) in a part of the brain called the
substantia nigra become damaged or die. These neurons normally produce dopamine, a
chemical messenger essential for smooth and coordinated muscle movement. As dopamine
levels decrease, people with Parkinson’s experience symptoms such as tremors, stiffness, slow
movements, and balance problems.
First described in 1817 by British doctor James Parkinson as the "Shaking Palsy," the disease
affects millions of people worldwide, particularly those over the age of 60. Although the exact
cause of Parkinson’s remains unknown, a combination of genetic and environmental factors is
believed to play a role. While there is no cure yet, treatments including medication, therapy,
and surgical options can help manage the symptoms and improve quality of life.
Objectives:
• To understand the pathophysiology and progression of Parkinson’s Disease.
• To review existing diagnostic and treatment methods.
• To explore technological innovations aiding PD management.
• To propose or evaluate a system for improved patient care and monitoring.
• To develop a system or model to detect Parkinson’s disease in its early stages.
• To improve the accuracy of Parkinson’s disease diagnosis using data science techniques in Python.
• To develop strategies or interventions to manage Parkinson’s disease symptoms and
improve quality of life.
Methodology:
• Literature Review: Study of clinical journals and medical research papers.
• Case Study Analysis: Reviewing patient management through mobile health
applications or AI models.
• Prototype/Tool Development (Optional): Developing a simple symptom tracker or
diagnostic model using machine learning (if applicable).
• Interviews/Surveys: (Optional) Collecting patient or caregiver feedback.
Future Scope:
• Development of AI-based early detection tools.