Unit - II (Bca01)
• Data Collection: The process begins by collecting relevant data from various sources,
which can include structured data from databases, unstructured data from text
documents, or semi-structured data from sources like social media or IoT devices.
• Data Cleaning and Preparation: Raw data often contains errors, missing values, or
inconsistencies. Data analysts must clean and preprocess the data to ensure its
accuracy and consistency. This may involve imputing missing values, removing
outliers, and standardizing formats.
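The cleaning steps named above (imputation, outlier removal, format standardization) can be sketched in a few lines of Python. This is only an illustrative sketch using the standard library: the records, the field names (`id`, `age`, `joined`), the two accepted date formats, and the choice of the 1.5 × IQR rule for outliers are all assumptions made for the example, not part of the course material.

```python
from datetime import datetime
from statistics import median, quantiles

# Hypothetical raw records: one missing age, one implausible age, mixed date formats.
raw = [
    {"id": 1, "age": 34, "joined": "2023-01-15"},
    {"id": 2, "age": None, "joined": "15/02/2023"},
    {"id": 3, "age": 29, "joined": "2023-03-01"},
    {"id": 4, "age": 31, "joined": "2023-03-20"},
    {"id": 5, "age": 420, "joined": "01/04/2023"},
    {"id": 6, "age": 38, "joined": "2023-04-11"},
]

def standardize_date(text):
    """Return the date as YYYY-MM-DD, trying each assumed input format in turn."""
    # Assumption: slash-separated dates are day-first (DD/MM/YYYY).
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date format: {text!r}")

def clean(records):
    """Impute missing ages, drop outlier ages, and standardize date formats."""
    known = [r["age"] for r in records if r["age"] is not None]
    # Outlier bounds from the interquartile range (1.5 * IQR rule).
    q1, _, q3 = quantiles(known, n=4, method="inclusive")
    low, high = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
    # Value used for imputation: median of the ages that are not outliers.
    typical = median(a for a in known if low <= a <= high)
    cleaned = []
    for r in records:
        age = r["age"]
        if age is None:
            age = typical          # imputation of a missing value
        elif not low <= age <= high:
            continue               # outlier removal
        cleaned.append(
            {"id": r["id"], "age": age, "joined": standardize_date(r["joined"])}
        )
    return cleaned
```

Calling `clean(raw)` on the sample data drops record 5 (age 420), fills the missing age of record 2 with the median of the remaining ages, and rewrites every date as YYYY-MM-DD. Whether to drop, cap, or keep outliers is a judgment call that depends on the dataset; dropping is used here only to keep the sketch short.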
Overview of Data Analytics
• Data Storage: Data is typically stored in data warehouses, databases, or data lakes,
making it easily accessible for analysis. Proper data storage is crucial for efficient
analytics.
• Data Transformation: This step involves converting data into a suitable format for
analysis. It may include aggregating, filtering, and structuring data to create features
for modeling.
• Data Analysis: Data analysts use various statistical and machine learning techniques
to analyze the data. Descriptive analytics focuses on summarizing data, while
diagnostic analytics identifies the cause of past events. Predictive analytics forecasts
future trends, and prescriptive analytics recommends actions to optimize outcomes.
• Data Visualization: Visualizing data through charts, graphs, and dashboards is
essential for presenting results in a comprehensible manner. Data visualization
aids in conveying insights and patterns to non-technical stakeholders.
• Data Reporting: The findings are often documented and shared through
reports, presentations, or interactive dashboards. Clear reporting ensures that
stakeholders can make informed decisions based on the data.
• Continuous Monitoring: Data analytics is an ongoing process. Organizations
continuously collect, analyze, and interpret data to adapt to changing
circumstances and make data-driven decisions over time.
• Challenges: Data analytics faces challenges related to data quality, privacy,
security, and the need for skilled analysts. Data governance and compliance with
regulations such as the GDPR are also significant concerns.
• Tools and Technologies: Various tools and technologies are used in data analytics,
including programming languages like Python and R, data visualization tools like
Tableau, and machine learning libraries such as TensorFlow and scikit-learn.
• Applications: Data analytics is used in various domains, including
business (for market analysis and customer insights), healthcare (for
patient outcomes and disease prediction), finance (for risk
assessment and fraud detection), and many others.