DVT and TSA 2 marks
DVT and TSA 2 marks
Unit-3
Gestalt principles describe how humans perceive visual elements as organized patterns
rather than separate parts.
They help in grouping related data points, making visualizations clearer and more
intuitive.
Elements with similar attributes (color, shape, size) are perceived as belonging together.
Helps in categorizing data efficiently without additional labels.
Color theory guides the selection of colors for clarity, distinction, and visual appeal.
It enhances communication by ensuring contrast, hierarchy, and accessibility.
Analogous Colors: Colors adjacent on the color wheel (e.g., blue, teal, green) create
harmony.
Complementary Colors: Opposite colors (e.g., blue and orange) provide high contrast
and visual impact.
7. What is the importance of color contrast in visualization?
Hue represents the pure color without modifications from white (tint), black (shade), or
gray (tone).
It defines the base identity of a color (e.g., red, blue, green).
Figure-Ground helps viewers distinguish between the main content (figure) and
background (ground).
Ensures that key data stands out clearly, preventing confusion.
Information Overload: Too much data can make it difficult to extract meaningful
insights.
Ambiguous Representations: Poor design choices (e.g., unclear labels) can lead to
misinterpretation.
A high cognitive load makes interpretation slow and difficult, leading to confusion.
Simplifying the design reduces cognitive strain and enhances understanding.
Unit-4
Boxplots show the distribution of a variable, including median, quartiles, and outliers.
They help in identifying data skewness and extreme values.
Pandas: Used for data manipulation, summarization, and basic statistical analysis.
Seaborn: Provides advanced visualization capabilities, including distribution and
correlation plots.
13. Why is understanding distribution important in data analysis?
Understanding distribution helps in selecting the right statistical tests and models.
It provides insights into data normality, skewness, and the presence of outliers.
A histogram represents the distribution of numerical data using bins, with no gaps between
bars.
A bar chart compares categorical data with distinct, spaced-out bars.
15. List any two challenges faced during exploratory data analysis.
Missing data can distort statistical results and predictive models if not handled properly.
Techniques like imputation or deletion ensure data integrity and improve model accuracy.
17. Define outliers in a dataset and mention one method to detect them.
Outliers are extreme values that deviate significantly from the rest of the data.
A common detection method is the IQR (Interquartile Range) method, where values outside
1.5 times the IQR are considered outliers.
Unit-3
1. What is a chatbot?
o A chatbot is an AI-powered program that interacts with users through text or
voice.
o It is used for customer support, automation, and user engagement in various
industries.
A chatbot personality defines its tone, style, and way of interacting with users.
It can be designed to reflect brand identity, making interactions more relatable and
engaging.
The Turing Test evaluates whether a chatbot can generate responses indistinguishable
from a human’s.
If users cannot tell whether they are interacting with a bot or a human, the chatbot is
considered intelligent.
13. What is an example of a real-world chatbot?
Siri (Apple): A voice assistant that helps users with tasks, searches, and automation.
ChatGPT: An AI chatbot that answers queries and engages in natural conversations.
Use Natural Language Processing (NLP) to understand and process user inputs
accurately.
Store conversation history and context to provide relevant and meaningful responses.
Trigger phrases are keywords or sentences that activate specific chatbot responses or
workflows.
They help guide conversations by detecting user intent and initiating appropriate
actions.
Dialogue flow defines the sequence of chatbot interactions, ensuring smooth and
meaningful conversations.
A well-structured dialogue flow improves user engagement and prevents chatbot
confusion.
Unit-4
Acoustic models map raw audio waveforms to phonetic units using machine learning
techniques.
They help ASR systems understand speech variations due to accents, noise, and
pronunciation differences.
Deep learning models, such as neural networks, can learn complex speech patterns
and variations more effectively than traditional models.
They improve recognition accuracy by handling noise, different accents, and
continuous speech more efficiently.
14. What is the purpose of Hidden Markov Models (HMM) in speech recognition?
HMMs are probabilistic models used to represent sequential data like speech.
They help in predicting the most likely sequence of phonemes or words from an audio
signal.
Voice Assistants: Used in Siri, Alexa, and Google Assistant for hands-free
interaction.
Medical Transcription: Converts doctors' spoken notes into text for record-keeping.
16. What is the Nyquist theorem and its relevance in speech processing?
The Nyquist theorem states that a signal must be sampled at least twice its highest
frequency to be accurately reconstructed.
In speech processing, it ensures that audio recordings capture all necessary frequency
components without loss of information.
18. What are MFCCs, and why are they used in ASR?
Viterbi decoding is an algorithm used to find the most probable sequence of words or
phonemes in ASR.
It efficiently decodes speech by selecting the best path through an HMM-based
model.
Noise can distort the speech signal, making it harder for ASR models to identify
words correctly.
It can introduce misinterpretations, leading to incorrect transcriptions and reduced
system performance.