0% found this document useful (0 votes)
12 views14 pages

DVT and TSA 2 marks

Uploaded by

shenojustin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views14 pages

DVT and TSA 2 marks

Uploaded by

shenojustin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Data Visualization Techniques

Unit-3

1. Define Gestalt principles in data visualization.

 Gestalt principles describe how humans perceive visual elements as organized patterns
rather than separate parts.
 They help in grouping related data points, making visualizations clearer and more
intuitive.

2. List the key Gestalt principles used in visualization.

 Proximity: Elements close to each other are perceived as a group.


 Similarity: Similar-looking elements (color, shape, size) are seen as related.
 Figure-Ground: Differentiates between the main subject (figure) and background.
 Closure: The brain fills in gaps to recognize familiar shapes.
 Continuity: Elements aligned in a line or curve are perceived as a unit.

3. What is the role of proximity in data visualization?

 Proximity groups related data points, making comparisons easier.


 Reduces clutter by organizing elements based on spatial relationships.

4. Summarize the principle of similarity in Gestalt theory.

 Elements with similar attributes (color, shape, size) are perceived as belonging together.
 Helps in categorizing data efficiently without additional labels.

5. Define color theory in the context of visualization.

 Color theory guides the selection of colors for clarity, distinction, and visual appeal.
 It enhances communication by ensuring contrast, hierarchy, and accessibility.

6. Compare analogous and complementary colors.

 Analogous Colors: Colors adjacent on the color wheel (e.g., blue, teal, green) create
harmony.
 Complementary Colors: Opposite colors (e.g., blue and orange) provide high contrast
and visual impact.
7. What is the importance of color contrast in visualization?

 Enhances readability by distinguishing elements clearly.


 Improves accessibility, ensuring that data is understandable for colorblind users.

8. Define the term "cognition" in visualization.

 Cognition refers to the mental process of understanding and interpreting visual


information.
 It influences how quickly and accurately viewers comprehend data in a visualization.

9. What is a hue in color theory?

 Hue represents the pure color without modifications from white (tint), black (shade), or
gray (tone).
 It defines the base identity of a color (e.g., red, blue, green).

10. What is the role of perception in designing visualizations?

 Perception determines how users process and interpret graphical elements.


 Helps designers create intuitive visuals that align with human cognitive abilities.

11. Summarize the concept of figure-ground in Gestalt principles.

 Figure-Ground helps viewers distinguish between the main content (figure) and
background (ground).
 Ensures that key data stands out clearly, preventing confusion.

12. What is the significance of alignment in visualization design?

 Ensures a structured and organized layout for better readability.


 Helps create a logical flow, guiding users' eyes through the visualization.

13. Define saturation in the context of color theory.

 Saturation refers to the intensity or purity of a color.


 High saturation makes colors vivid, while low saturation results in a more muted,
grayscale appearance.

14. What is the importance of design best practices in visualization?

 Enhances clarity and prevents misinterpretation of data.


 Improves user experience by making visuals engaging and intuitive.

15. List two cognitive challenges in interpreting complex visualizations.

 Information Overload: Too much data can make it difficult to extract meaningful
insights.
 Ambiguous Representations: Poor design choices (e.g., unclear labels) can lead to
misinterpretation.

16. What is the key principle of simplicity in visualization?

 Reducing unnecessary elements to focus on essential data.


 Using clear labels, simple layouts, and minimalistic designs for better comprehension.

17. What is the advantage of using tooltips in visualizations?

 Provides additional details without cluttering the main chart or graph.


 Enhances user interaction by allowing exploration of data points dynamically.

18. How does cognitive load affect data visualization?

 A high cognitive load makes interpretation slow and difficult, leading to confusion.
 Simplifying the design reduces cognitive strain and enhances understanding.

19. How does the choice of font affect data visualization?

 Readable fonts improve accessibility and prevent misinterpretation.


 Consistent font styles ensure a professional and organized appearance.

20. Why is it important to use a limited color palette in data visualization?


 Prevents visual overload, making the visualization easier to interpret.
 Maintains consistency and ensures color distinctions remain meaningful.

Unit-4

1. Define exploratory data analysis (EDA).


o EDA is the process of analyzing and summarizing datasets using visual and
statistical techniques.
o It helps in identifying patterns, anomalies, relationships, and trends in the data.

2. What is the purpose of univariate analysis?


o Univariate analysis examines a single variable at a time to understand its
distribution and properties.
o It helps in detecting outliers, understanding central tendency (mean, median, mode),
and measuring dispersion (variance, standard deviation).

3. List any two key benefits of bivariate visualization.


o Helps in understanding relationships and correlations between two variables.
o Provides insights into trends and patterns that may not be visible in univariate
analysis.

4. List two tools commonly used for EDA.


o Python libraries like Pandas, Matplotlib, and Seaborn.
o R programming with ggplot2 and dplyr for data visualization and analysis.

5. What is a distribution plot? Give an example.


o A distribution plot shows how values of a variable are spread across different
ranges.
o Example: A histogram displaying the distribution of ages in a dataset.

6. Define pair plots in data visualization.


o A pair plot is a matrix of scatter plots used to visualize relationships between
multiple numerical variables.
o It helps identify correlations and potential patterns among features.

7. Summarize the role of scatter plots in data analysis.


o Scatter plots help visualize relationships between two continuous variables.
o They assist in detecting correlations, clusters, and outliers in the data.

8. What is the importance of correlation matrices in EDA?


o Correlation matrices quantify the relationships between multiple variables in a
dataset.
o They help in feature selection by identifying highly correlated variables that may
lead to multicollinearity.

9. Differentiate between univariate and bivariate analysis.

10. What does a boxplot visualize in a dataset?

 Boxplots show the distribution of a variable, including median, quartiles, and outliers.
 They help in identifying data skewness and extreme values.

11. Outline the concept of multivariate analysis.

 Multivariate analysis examines three or more variables simultaneously to understand


complex relationships.
 Techniques include Principal Component Analysis (PCA), multiple regression, and
clustering.

12. Name two Python libraries widely used for EDA.

 Pandas: Used for data manipulation, summarization, and basic statistical analysis.
 Seaborn: Provides advanced visualization capabilities, including distribution and
correlation plots.
13. Why is understanding distribution important in data analysis?

 Understanding distribution helps in selecting the right statistical tests and models.
 It provides insights into data normality, skewness, and the presence of outliers.

14. How does a histogram differ from a bar chart?

 A histogram represents the distribution of numerical data using bins, with no gaps between
bars.
 A bar chart compares categorical data with distinct, spaced-out bars.

15. List any two challenges faced during exploratory data analysis.

 Handling missing or inconsistent data can be time-consuming.


 Large datasets may require significant computational resources and efficient visualization
techniques.

16. What is the significance of handling missing data in EDA?

 Missing data can distort statistical results and predictive models if not handled properly.
 Techniques like imputation or deletion ensure data integrity and improve model accuracy.

17. Define outliers in a dataset and mention one method to detect them.

 Outliers are extreme values that deviate significantly from the rest of the data.
 A common detection method is the IQR (Interquartile Range) method, where values outside
1.5 times the IQR are considered outliers.

18. What is the role of feature scaling in data analysis?

 Feature scaling ensures all variables contribute equally to a model by normalizing or


standardizing their values.
 It improves the performance of distance-based models like k-means clustering and SVM.

19. Differentiate between categorical and numerical data.


 Categorical data represents discrete groups or categories (e.g., gender, colors).
 Numerical data consists of measurable values, which can be continuous (height, weight) or
discrete (number of students in a class).

20. Show the purpose of data transformation in EDA.

 Data transformation helps in normalizing distributions and improving model performance.


 It enables better handling of skewed data, reducing bias in statistical and machine learning
models.
Text and Speech Analysis

Unit-3

1. What is a chatbot?
o A chatbot is an AI-powered program that interacts with users through text or
voice.
o It is used for customer support, automation, and user engagement in various
industries.

2. What are conversational systems?


o Conversational systems are AI-based platforms designed to communicate with
users through natural language.
o They include chatbots, voice assistants, and interactive voice response (IVR)
systems.

3. What are entities in chatbot development?


o Entities are specific pieces of information that a chatbot extracts from user
input (e.g., dates, locations, product names).
o They help the chatbot understand and process user requests accurately.

4. What is Natural Language Understanding (NLU)?


o NLU is a branch of AI that enables chatbots to comprehend human language,
including intent and context.
o It involves machine learning models for interpreting user input and responding
appropriately.

5. What is the role of dialogue management in chatbots?


o Dialogue management controls how a chatbot maintains and organizes
conversations with users.
o It ensures logical flow, context retention, and smooth user interactions.

6. What is rule-based chatbots?


o Rule-based chatbots follow predefined rules and decision trees to respond to
user inputs.
o They lack learning capabilities and are limited to handling structured
conversations.
7. How do AI-based chatbots differ from rule-based chatbots?
o AI-based chatbots use machine learning and NLU to understand and generate
responses dynamically.
o Unlike rule-based chatbots, they improve over time by learning from past
interactions.

8. What is context-awareness in chatbots?


o Context-awareness enables chatbots to remember user history and maintain
continuity in conversations.
o It allows better personalization and improves the chatbot's ability to respond
naturally.

9. What is sentiment analysis in chatbots?


o Sentiment analysis helps chatbots detect emotions (positive, negative, neutral)
in user messages.
o It allows chatbots to adjust responses based on user mood, enhancing
engagement.

10. What is a chatbot personality?

 A chatbot personality defines its tone, style, and way of interacting with users.
 It can be designed to reflect brand identity, making interactions more relatable and
engaging.

11. What is multi-turn conversation in chatbots?

 Multi-turn conversation enables chatbots to engage in back-and-forth dialogues rather


than responding to single queries.
 It improves chatbot interactions by allowing users to refine and elaborate on their
inputs.

12. What is the Turing Test for chatbots?

 The Turing Test evaluates whether a chatbot can generate responses indistinguishable
from a human’s.
 If users cannot tell whether they are interacting with a bot or a human, the chatbot is
considered intelligent.
13. What is an example of a real-world chatbot?

 Siri (Apple): A voice assistant that helps users with tasks, searches, and automation.
 ChatGPT: An AI chatbot that answers queries and engages in natural conversations.

14. What is an API in chatbot development?

 An API (Application Programming Interface) allows chatbots to integrate with


external applications and databases.
 It helps chatbots fetch data, process user requests, and connect with services like
weather, payments, and customer support.

15. How to handle user inputs and providing context in chatbot?

 Use Natural Language Processing (NLP) to understand and process user inputs
accurately.
 Store conversation history and context to provide relevant and meaningful responses.

16. What is the difference between a chatbot and a virtual assistant?

 Chatbots focus on specific tasks like customer service and FAQs.


 Virtual Assistants (e.g., Alexa, Google Assistant) offer broader functionality,
including smart home control and scheduling.

17. What are trigger phrases in chatbot design?

 Trigger phrases are keywords or sentences that activate specific chatbot responses or
workflows.
 They help guide conversations by detecting user intent and initiating appropriate
actions.

18. What is a knowledge base in chatbot architecture?

 A knowledge base is a structured repository of information that a chatbot uses to


answer user queries.
 It can include FAQs, product details, and troubleshooting guides to enhance chatbot
intelligence.
19. What is the role of machine learning in chatbot development?

 Machine learning enables chatbots to improve over time by learning from


interactions.
 It helps in better intent recognition, response generation, and personalized user
experiences.

20. What is the purpose of dialogue flow in chatbots?

 Dialogue flow defines the sequence of chatbot interactions, ensuring smooth and
meaningful conversations.
 A well-structured dialogue flow improves user engagement and prevents chatbot
confusion.

Unit-4

1. What is Automatic Speech Recognition (ASR)?


o ASR is a technology that converts spoken language into text using machine
learning and signal processing techniques.
o It is used in applications like voice assistants, transcription services, and
automated customer support.

2. What are the main components of an ASR system?


o Acoustic Model: Converts speech signals into phonetic representations.
o Language Model: Predicts the probability of word sequences for accurate
transcription.
o Feature Extraction: Extracts relevant information from raw audio signals.

3. Define the Log Mel Spectrum in speech processing.


o It represents speech signals on the Mel scale, emphasizing frequencies
perceived by humans.
o The logarithmic transformation helps match human auditory perception,
improving recognition accuracy.

4. Why is feature extraction important in ASR?


o It reduces raw speech data into meaningful representations that machine
learning models can process efficiently.
o It helps improve recognition accuracy by removing irrelevant noise and
redundancy.
5. What is the role of the Mel Filter Bank in ASR?
o It simulates the human ear’s perception by applying filters that emphasize
important speech frequencies.
o It converts the linear frequency scale into the Mel scale, making the speech
features more distinguishable for ASR models.

6. What is Connectionist Temporal Classification (CTC)?


o CTC is a loss function used in ASR models to train systems without requiring
precise alignments between speech and text.
o It allows the model to predict sequences of characters or words even when the
input and output lengths are different.

7. How does CTC handle alignment issues in speech recognition?


o It assigns probabilities to different alignments and sums them up to compute
the overall likelihood of a sequence.
o It eliminates the need for manually labeled timestamps, allowing the model to
learn from raw speech-text pairs.

8. Define phonemes and their importance in ASR.


o Phonemes are the smallest units of sound that distinguish words in a language
(e.g., /p/, /b/, /t/).
o ASR systems use phonemes to model speech variations and improve
recognition accuracy across different speakers and accents.

9. What is the difference between speech recognition and voice recognition?


10. What are acoustic models in speech recognition?

 Acoustic models map raw audio waveforms to phonetic units using machine learning
techniques.
 They help ASR systems understand speech variations due to accents, noise, and
pronunciation differences.

11. Mention any two challenges in speech recognition.

 Background Noise: Noisy environments can degrade recognition accuracy.


 Speaker Variability: Differences in accents, pronunciation, and speech patterns can
make recognition difficult.

12. What is a spectrogram in speech processing?

 A spectrogram is a visual representation of sound, showing frequency (y-axis), time


(x-axis), and intensity (color).
 It helps in analyzing speech patterns and extracting relevant features for ASR models.

13. How does deep learning improve ASR performance?

 Deep learning models, such as neural networks, can learn complex speech patterns
and variations more effectively than traditional models.
 They improve recognition accuracy by handling noise, different accents, and
continuous speech more efficiently.

14. What is the purpose of Hidden Markov Models (HMM) in speech recognition?

 HMMs are probabilistic models used to represent sequential data like speech.
 They help in predicting the most likely sequence of phonemes or words from an audio
signal.

15. Mention two applications of ASR in real life.

 Voice Assistants: Used in Siri, Alexa, and Google Assistant for hands-free
interaction.
 Medical Transcription: Converts doctors' spoken notes into text for record-keeping.
16. What is the Nyquist theorem and its relevance in speech processing?

 The Nyquist theorem states that a signal must be sampled at least twice its highest
frequency to be accurately reconstructed.
 In speech processing, it ensures that audio recordings capture all necessary frequency
components without loss of information.

17. Define Language Model (LM) in ASR.

 A language model predicts the probability of word sequences to improve transcription


accuracy.
 It helps ASR systems choose the most likely words or phrases when recognizing
speech.

18. What are MFCCs, and why are they used in ASR?

 Mel-Frequency Cepstral Coefficients (MFCCs) are features extracted from speech


that represent important frequency characteristics.
 They are used in ASR because they closely mimic human auditory perception,
improving recognition performance.

19. What is Viterbi decoding in speech recognition?

 Viterbi decoding is an algorithm used to find the most probable sequence of words or
phonemes in ASR.
 It efficiently decodes speech by selecting the best path through an HMM-based
model.

20. How does noise affect speech recognition accuracy?

 Noise can distort the speech signal, making it harder for ASR models to identify
words correctly.
 It can introduce misinterpretations, leading to incorrect transcriptions and reduced
system performance.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy