Business Analytics Viva Questions
Interpretation of p-value:
• p < 0.05 → The variable is statistically significant and likely impacts the dependent variable.
• p > 0.05 → The variable is not statistically significant, meaning its effect may be due to chance.
Que 12) How to check the impact of independent variables on dependent variables?
Ans 12) To check the impact of independent variables on a dependent variable, various statistical techniques
are used in regression analysis. The regression coefficients (β) indicate the direction and strength of the
relationship, with larger absolute values suggesting a stronger effect. The t-test and p-value help determine
statistical significance, where a p-value below 0.05 indicates that the variable has a meaningful impact. The
R² and adjusted R² values show how much of the dependent variable’s variance is explained by the independent
variables, while the F-test assesses the overall model significance. To ensure accuracy, checking for
multicollinearity using the Variance Inflation Factor (VIF) is also essential.
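A minimal sketch of how this is typically checked in R, using the built-in mtcars dataset purely for illustration:
    model <- lm(mpg ~ wt + hp, data = mtcars)  # mtcars is a built-in example dataset, used here for illustration
    summary(model)                             # coefficients, t-values, p-values, R², adjusted R², F-test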
Que 13) What is adjusted R²?
Ans 13) Adjusted R² is a version of R² that adjusts for the number of predictors in a regression model. R² tells
us how well the model explains the variation in the dependent variable (higher is better). However, R² always
increases when more predictors are added, even if they do not actually improve the model. Adjusted R² corrects
for this by penalising models with unnecessary predictors.
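The adjustment can be written as Adjusted R² = 1 − (1 − R²) × (n − 1) / (n − k − 1), where n is the number of observations and k the number of predictors. A small illustrative sketch in R (again using the built-in mtcars data):
    model <- lm(mpg ~ wt + hp, data = mtcars)  # illustrative example
    summary(model)$r.squared                   # R²
    summary(model)$adj.r.squared               # adjusted R², slightly lower because it penalises extra predictors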
Que 14) What do you mean by coefficient in the regression table?
Ans 14) In a regression table, the coefficient is represented by the symbol β (beta) and indicates the effect
of an independent variable on the dependent variable. It shows how much the dependent variable changes
when the independent variable increases by one unit while keeping other variables constant. A positive
coefficient means a direct relationship, while a negative coefficient suggests an inverse relationship between
the variables.
Que 15) What is a confidence interval?
Ans 15) A confidence interval (CI) is a range of values that estimates the true population parameter with a
certain level of confidence. In regression analysis, it is used to indicate the range in which the true coefficient
of an independent variable is likely to fall. It is calculated using the estimated coefficient, its standard error,
and a critical value from the t-distribution or z-distribution.
A 95% confidence interval means that if the study were repeated many times, 95% of the intervals
would contain the true parameter value. A narrower confidence interval suggests higher precision, while a
wider interval indicates greater uncertainty. If a confidence interval includes zero, the variable may not be
statistically significant. Confidence intervals are crucial in statistical analysis, research, and predictive
modeling for understanding the reliability of estimates.
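In R, confidence intervals for regression coefficients can be obtained directly with confint(); a brief sketch using the built-in mtcars data for illustration:
    model <- lm(mpg ~ wt, data = mtcars)  # illustrative example
    confint(model, level = 0.95)          # 95% confidence interval for each coefficient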
Que 16) What is a prediction interval?
Ans 16) A prediction interval (PI) is a range that estimates where a future individual observation of the
dependent variable is likely to fall, given specific values of the independent variables. Unlike a confidence
interval, which estimates the mean of the dependent variable, a prediction interval accounts for both sampling
variability and individual variability, making it wider than a confidence interval. It is typically denoted as PI
and calculated using the predicted value, the standard error of the regression, and the variance of individual
observations. A 95% prediction interval means that approximately 95% of future observations are expected to fall within this
range.
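A short sketch of the distinction in R, where predict() can return either interval (mtcars and the chosen weight value are purely illustrative):
    model <- lm(mpg ~ wt, data = mtcars)
    new_obs <- data.frame(wt = 3.0)                             # hypothetical new observation
    predict(model, newdata = new_obs, interval = "confidence")  # interval for the mean response
    predict(model, newdata = new_obs, interval = "prediction")  # wider interval for a single new observation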
Que 17) What is text mining?
Ans 17) Text mining is the process of extracting meaningful information, patterns, and insights from
unstructured text data using techniques from natural language processing (NLP), machine learning, and
statistical analysis. It involves steps like text preprocessing (removing stopwords, stemming, and
tokenization), feature extraction (converting text into numerical data), and pattern recognition (such as
sentiment analysis, topic modeling, and named entity recognition). Text mining is widely used in business
analytics, social media monitoring, customer feedback analysis, and fraud detection to derive actionable
insights from large volumes of text data.
Que 18) Which commands do we use for textual analysis?
Ans 18) In R, textual analysis uses packages like tm. To begin, a text corpus is created using the Corpus()
function from the tm package, allowing text data to be processed. Common preprocessing steps include
converting text to lowercase via tm_map() with content_transformer(tolower), removing punctuation with
removePunctuation(), and eliminating stopwords with removeWords(), all applied through tm_map(). A Document-Term Matrix (DTM) can
then be generated using DocumentTermMatrix(), which helps analyze word frequency and patterns. For
sentiment analysis, the syuzhet package’s get_sentiment() function is used to extract emotional tone.
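A minimal end-to-end sketch of these commands, assuming the tm package is installed and using two made-up sentences as the corpus:
    library(tm)
    docs <- c("Great product, works really well!", "Terrible service and slow delivery.")  # invented examples
    corpus <- Corpus(VectorSource(docs))
    corpus <- tm_map(corpus, content_transformer(tolower))       # lowercase
    corpus <- tm_map(corpus, removePunctuation)                  # strip punctuation
    corpus <- tm_map(corpus, removeWords, stopwords("english"))  # drop common stopwords
    dtm <- DocumentTermMatrix(corpus)                            # word-frequency matrix
    inspect(dtm)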
Que 19) What are the pre-processing text requirements in textual analysis?
Ans 19) In textual analysis, preprocessing is a crucial step to clean and standardize text data for better accuracy
and meaningful insights. The process begins with lowercasing, which ensures uniformity by converting all
text to lowercase, preventing duplication of words due to case differences. Next, punctuation removal
eliminates unnecessary symbols, while stopword removal filters out common words like “the” and “is” that
do not add significant value to the analysis. Tokenization then breaks text into individual words or phrases,
making it easier to analyze. To further refine the text, stemming and lemmatization reduce words to their root
or base form, improving consistency (e.g., “running” to “run” or “better” to “good”). Additionally, removing
numbers helps when numerical data is not relevant, while whitespace removal ensures proper formatting. In
some cases, spelling correction is applied to fix typos and enhance text quality.
Que 20) What is sentiment analysis?
Ans 20) Sentiment analysis is the process of determining the emotional tone or opinion expressed in a piece
of text. It uses natural language processing (NLP), machine learning, and text analysis techniques to classify
text as positive, negative, or neutral. Sentiment analysis is widely used in social media monitoring, customer
feedback analysis, brand reputation management, and market research to understand public opinion and trends.
In R, sentiment analysis can be performed using the syuzhet, tidytext, or text packages. For example, the
get_sentiment() function in the syuzhet package can analyze text using different sentiment lexicons like Bing,
NRC, or AFINN. Advanced sentiment analysis also involves aspect-based sentiment analysis (ABSA), where
emotions are identified for specific topics, and deep learning models for more accurate predictions. By
analyzing emotions in text, sentiment analysis helps businesses and researchers make data-driven decisions
based on public opinion.
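A brief sketch, assuming the syuzhet package is installed and using two invented review sentences:
    library(syuzhet)
    reviews <- c("I love this product, it is excellent", "This was a complete waste of money")  # invented examples
    get_sentiment(reviews, method = "afinn")  # numeric scores: positive > 0, negative < 0
    get_nrc_sentiment(reviews)                # counts per NRC emotion category (joy, anger, trust, ...)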
Que 21) What are the libraries that we import for running textual analysis?
Ans 21) In R, several libraries are used for textual analysis, enabling tasks like text preprocessing, tokenization,
sentiment analysis, and visualization. The tm package is widely used for text mining, helping clean text and
create a Document-Term Matrix (DTM) for further analysis. The tidytext package allows text processing using
tidy data principles, making it easier to work with structured text data. For advanced text processing, quanteda
provides tools for tokenization, word frequency analysis, and text classification. Sentiment analysis is
commonly performed using the syuzhet package, which offers multiple sentiment lexicons like Bing, NRC,
and AFINN to determine the emotional tone of text. Additionally, wordcloud is useful for visualizing frequent
words in a dataset, making textual insights more interpretable.
Que 22) What is the difference between data analytics and data analysis?
Ans 22) Data analytics and data analysis are closely related but differ in scope and purpose. Data analysis
refers to the process of examining, cleaning, transforming, and interpreting data to extract meaningful insights.
It focuses on identifying patterns, trends, and relationships within data using statistical and exploratory
techniques.
On the other hand, data analytics is a broader discipline that incorporates predictive modeling, machine learning, automation, and
business intelligence tools to drive decision-making. Data analytics is more action-oriented, aiming to optimize
processes, forecast future trends, and provide data-driven recommendations.
Que 23) What is the classification of data analytics?
Ans 23) Data analytics is classified into four main types, each serving a different purpose in extracting insights
and making data-driven decisions:
1. Descriptive Analytics – This type focuses on summarizing past data to understand trends and
patterns. It answers the question, “What happened?” using techniques like data visualization, reporting, and
dashboards. Examples include sales reports, website traffic analysis, and financial statements.
2. Diagnostic Analytics – Going a step further, this type aims to determine the reasons behind past
outcomes by identifying patterns and relationships in data. It answers the question, “Why did it happen?” using
techniques like drill-down analysis, correlation analysis, and statistical modeling.
3. Predictive Analytics – This type focuses on forecasting future trends based on historical data.
It answers the question, “What is likely to happen?” using methods like machine learning, regression analysis,
and time series forecasting. Businesses use predictive analytics for customer behavior prediction, fraud
detection, and risk assessment.
4. Prescriptive Analytics – The most advanced type, prescriptive analytics suggests actions to
achieve desired outcomes. It answers the question, “What should be done?” using optimization algorithms,
artificial intelligence (AI), and decision science. Examples include recommendation engines, supply chain
optimization, and personalized marketing strategies.
Que 24) What are moving averages?
Ans 24) Moving averages are statistical techniques used to analyze trends in time-series data by smoothing
out short-term fluctuations. They help in identifying patterns and forecasting future movements in areas such
as stock prices, sales trends, and economic indicators. The two main types are Simple Moving Average (SMA)
and Exponential Moving Average (EMA). SMA calculates the average of a fixed number of past data points,
giving equal weight to all values, while EMA assigns greater weight to recent data points, making it more
responsive to changes.
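A small illustrative sketch in base R with an invented price series (the 3-period window is chosen arbitrarily); dedicated packages such as TTR provide ready-made SMA/EMA functions, but the calculation itself looks like this:
    prices <- c(10, 12, 11, 13, 15, 14, 16, 18, 17, 19)   # invented time series
    n <- 3
    sma <- stats::filter(prices, rep(1/n, n), sides = 1)   # simple moving average: equal weights over n periods
    alpha <- 2 / (n + 1)                                    # common EMA smoothing factor
    ema <- numeric(length(prices)); ema[1] <- prices[1]
    for (i in 2:length(prices)) ema[i] <- alpha * prices[i] + (1 - alpha) * ema[i - 1]  # recent values weighted more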
Que 25) What software tools are used for making interactive dashboards?
Ans 25) Several software tools are used for creating interactive dashboards, helping businesses visualize and
analyze data effectively. Tableau is a powerful data visualization tool known for its drag-and-drop
functionality and real-time data integration. Microsoft Power BI is another widely used tool that enables users
to build dynamic dashboards with AI-powered insights and seamless integration with Excel and other
Microsoft applications. Google Data Studio (Looker Studio) is a free tool that allows users to create interactive
reports, especially useful for Google Analytics, Ads, and Sheets integration.
Que 26) What is a command in R?
Ans 26) In R, a command refers to an instruction given to the R programming environment to perform a
specific task. Commands are executed in the R console or script to manipulate data, create visualizations,
perform statistical analysis, or build machine learning models. Commands in R typically involve functions,
which take inputs (arguments) and return outputs. For example, the command mean(c(1, 2, 3, 4, 5)) calculates
the mean (average) of the given numbers. Similarly, plot(x, y) generates a scatter plot of two variables.
Que 27) What is syntax in R?
Ans 27) In R, syntax refers to the set of rules that define how code must be written and structured for the R
interpreter to understand and execute it correctly. The syntax includes the way functions, variables, operators,
and commands are used in R.
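A tiny sketch illustrating typical R syntax elements (the values are arbitrary):
    x <- c(5, 10, 15)        # assignment with <- and a call to the c() function
    avg <- mean(x)           # functions take arguments in parentheses
    if (avg > 5) {           # control structures use braces
      print("Average is greater than 5")
    }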
Que 28) What are packages in R?
Ans 28) In R, a package is a collection of functions, datasets, and documentation that extends the capabilities
of base R, allowing users to perform specialized tasks such as data manipulation, visualization, machine
learning, and statistical analysis. Packages are stored in repositories like CRAN (Comprehensive R Archive
Network) and can be easily installed using the install.packages("package_name") command. Once installed,
they need to be loaded into the R session using library(package_name). Some popular packages include dplyr
for data manipulation, ggplot2 for visualization, caret for machine learning, and shiny for building interactive
dashboards.
Que 29) What is a library in R?
Ans 29) In R, a library is a collection of installed packages that provide additional functions and tools for data
analysis, visualization, statistical modeling, and machine learning. While the terms library and package are
often used interchangeably, a package refers to the individual software bundle, whereas a library is the location
where installed packages are stored. To use a package, it must first be installed using
install.packages("package_name") and then loaded into the R session using library(package_name). For
example, library(ggplot2) loads the ggplot2 package for data visualization.
Que 30) How do we install a package in R?
Ans 30) To install a package in R, the install.packages() function is used, followed by the package name in
quotation marks. For example, to install the ggplot2 package, the command install.packages("ggplot2") is
executed. If multiple packages need to be installed at once, they can be specified within a vector, such as
install.packages(c("dplyr", "tidyverse", "caret")). Once a package is installed, it must be loaded into the R
session using the library() function, such as library(ggplot2), to make its functions available.
Que 31) What are the different data structures in R?
Ans 31) In R, data can be stored and manipulated using different data structures, which define how information
is organized and accessed. The primary structures in R include vectors, lists, matrices, data frames, and arrays.
1. Vectors—The simplest data structure in R, a vector contains elements of the same data type
(numeric, character, logical, etc.). It is created using the c() function, such as x <- c(1, 2, 3, 4).
2. Lists—Unlike vectors, lists can store elements of different data types, including numbers,
strings, and even other lists. A list is created using list(), for example, my_list <- list(1, "text", TRUE).
3. Matrices—A matrix is a two-dimensional data structure where all elements must be of the same
type. It is created using the matrix() function, such as mat <- matrix(1:6, nrow=2, ncol=3), which generates a
2×3 matrix.
4. Data Frames—The most commonly used structure, a data frame is a table-like format where
each column can contain different data types. It is created using data.frame(), for example, df <-
data.frame(Name = c("A", "B"), Age = c(25, 30)).
5. Arrays—Similar to matrices but with more than two dimensions, arrays store data in multi-
dimensional space. An array is created using array(), such as arr <- array(1:12, dim = c(2,3,2)), which forms a
2×3×2 array.
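The same examples collected into one runnable sketch; str() (shown at the end) is a convenient way to inspect any of these structures:
    x   <- c(1, 2, 3, 4)                                    # vector
    lst <- list(1, "text", TRUE)                            # list with mixed types
    mat <- matrix(1:6, nrow = 2, ncol = 3)                  # 2 x 3 matrix
    df  <- data.frame(Name = c("A", "B"), Age = c(25, 30))  # data frame
    arr <- array(1:12, dim = c(2, 3, 2))                    # 2 x 3 x 2 array
    str(df)                                                 # inspect the structure of an object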
Que 32) What is the Variance Inflation Factor?
Ans 32) The Variance Inflation Factor (VIF) is a statistical measure used to detect multicollinearity in a
multiple regression model, which occurs when independent variables are highly correlated with each other.
Multicollinearity can distort the estimated coefficients and make the model unreliable. VIF quantifies how
much the variance of a regression coefficient is inflated due to correlation among predictors. A VIF value of
1 indicates no multicollinearity, while values between 1 and 5 suggest moderate correlation but are generally
acceptable. However, a VIF greater than 10 signals a high level of multicollinearity, requiring corrective
measures such as removing highly correlated variables, combining features, or using dimensionality reduction
techniques like Principal Component Analysis (PCA).
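In R, VIF values are commonly obtained with the vif() function from the car package (assumed installed); mtcars is used purely for illustration:
    library(car)
    model <- lm(mpg ~ wt + hp + disp, data = mtcars)  # illustrative multiple regression
    vif(model)   # values near 1 are fine; values above about 10 signal serious multicollinearity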
Que 33) What are the assumptions of the simple linear regression model?
Ans 33) The simple linear regression model in R, like any statistical model, is based on several key
assumptions that ensure the validity and reliability of the results. These assumptions include:
1. Linearity—There must be a linear relationship between the independent variable (X) and the
dependent variable (Y). This can be checked using scatter plots or correlation analysis.
2. Independence – The observations should be independent of each other, meaning that the value
of one observation does not influence another. This assumption is particularly important in time-series data,
where autocorrelation can be an issue.
3. Homoscedasticity (Constant Variance of Errors) – The residuals (errors) should have constant
variance across all levels of the independent variable. If the spread of residuals increases or decreases
systematically, it indicates heteroscedasticity, which can be detected using residual plots.
4. Normality of Errors – The residuals should be approximately normally distributed, which can be
checked with a histogram or a Q-Q plot of the residuals.
To validate these assumptions in R, diagnostic tools such as residual plots, histograms, Q-Q plots, and
statistical tests (e.g., the Durbin-Watson test for independence and the Breusch-Pagan test for
homoscedasticity) can be used.
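A short diagnostic sketch, assuming the lmtest package is installed for the formal tests (mtcars again serves only as an example):
    model <- lm(mpg ~ wt, data = mtcars)
    plot(model)      # residuals-vs-fitted, Q-Q, scale-location and leverage plots
    library(lmtest)  # assumed installed
    dwtest(model)    # Durbin-Watson test for independence (autocorrelation of residuals)
    bptest(model)    # Breusch-Pagan test for homoscedasticity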
Que 34) What are standard errors in regression analysis?
Ans 34) In regression analysis, standard errors measure the accuracy of estimated coefficients by indicating
the variability of the coefficient estimates across different samples. A smaller standard error suggests that the
estimate is more precise, while a larger standard error indicates greater uncertainty.
Standard errors are crucial for hypothesis testing and constructing confidence intervals for regression
coefficients. They help determine whether an independent variable significantly influences the dependent
variable by calculating the t-statistic (Coefficient / Standard Error) and the p-value. A high standard error may
suggest multicollinearity or insufficient data points. In R, standard errors can be found in the output of the
summary() function applied to a regression model, which provides insight into the reliability of the estimated
coefficients.
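A brief sketch of where the standard errors appear in the summary() output and how the t-statistic is formed (mtcars is illustrative):
    model <- lm(mpg ~ wt, data = mtcars)
    coefs <- summary(model)$coefficients          # columns: Estimate, Std. Error, t value, Pr(>|t|)
    coefs[, "Estimate"] / coefs[, "Std. Error"]   # reproduces the reported t-statistics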
Que 35) What are the different types of measurement scales?
Ans 35) There are four main types of measurement scales in research: Nominal, Ordinal, Interval, and Ratio.
Nominal scales classify data into categories without any order (e.g., gender, nationality). Ordinal scales rank
data but without equal intervals (e.g., satisfaction levels, education levels). Interval scales have equal
differences between values but no true zero (e.g., temperature in Celsius, IQ scores). Ratio scales have a true
zero, allowing all mathematical operations (e.g., height, weight, income). These scales determine the type of
statistical analysis that can be applied.
Que 36) How do you identify outliers in a dataset? Name one method that can be used to identify an outlier.
Ans 36) Outliers in a dataset can be identified using statistical methods, visualization techniques, or machine
learning algorithms. An outlier is a data point that significantly deviates from the rest of the observations,
potentially affecting the accuracy of statistical models.
One commonly used method to detect outliers is the Interquartile Range (IQR) Method, where the IQR = Q3
- Q1 (the range between the 75th percentile and the 25th percentile). Any data point that falls below Q1 - 1.5
× IQR or above Q3 + 1.5 × IQR is classified as an outlier. Another effective way to identify outliers is using
a boxplot, which visually represents the distribution of data and highlights extreme values beyond the
“whiskers.” In R, outliers can be detected using the boxplot() function, which makes it easier to spot deviations.
Que 37) What are the different types of data based on structures?
Ans 37) Data can be categorized into three main types based on its structure: structured, semi-structured, and
unstructured data. Each type differs in its organization, format, and how it is stored and processed.
1. Structured Data – This type of data is highly organized and stored in predefined formats such
as tables with rows and columns in relational databases (e.g., MySQL, PostgreSQL). Examples include sales
records, customer details, and financial transactions. Structured data is easy to query using SQL.
2. Semi-Structured Data – This data does not follow a strict tabular format but still contains tags
or markers to separate elements, providing some level of organization. Examples include JSON, XML, and
NoSQL databases (e.g., MongoDB, Firebase). Semi-structured data is commonly used in web applications,
APIs, and big data storage.
3. Unstructured Data – This type lacks a fixed format, making it more challenging to store and
analyze. Examples include text files, images, videos, social media posts, and emails. Since unstructured data
does not fit into traditional databases, specialized tools like Hadoop, Spark, and AI-driven analytics are used
for processing.
Que 38) What is a box plot?
Ans 38) A box plot, also known as a box-and-whisker plot, is a graphical representation used to visualize the
distribution, central tendency, and spread of a dataset. It summarizes key statistical measures, including the
minimum, first quartile (Q1), median (Q2), third quartile (Q3), and maximum, while also highlighting potential
outliers. The box represents the interquartile range (IQR), spanning from Q1 to Q3, with a line inside indicating
the median. The whiskers extend from Q1 to the minimum and from Q3 to the maximum, excluding outliers,
which are plotted separately as individual points beyond 1.5 × IQR. Box plots help detect skewness, variability,
and extreme values in data, making them useful in exploratory data analysis. In R, box plots can be generated
using the boxplot() function, which provides an efficient way to identify outliers and understand data
distribution.
Que 39) Why do we need data cleaning?
Ans 39) Data cleaning is an essential step in data analysis and machine learning as it ensures the accuracy,
consistency, and reliability of data. Raw data often contains errors, missing values, duplicates, inconsistencies,
and outliers, which can lead to misleading conclusions and incorrect predictions. By cleaning data, we improve
its quality and make it suitable for analysis.
Data cleaning helps in removing irrelevant or redundant information, correcting errors, handling missing
values, and standardizing formats. This process enhances the performance of machine learning models,
improves decision-making, and ensures compliance with data governance standards. Without proper data
cleaning, biased insights, incorrect statistical results, and faulty business decisions may occur. Overall, data
cleaning is crucial for maintaining the integrity and usability of data in analytics and predictive modeling.
Que 40) Why is data processing required before running the analysis?
Ans 40) Data processing is required before running analysis to ensure that the data is accurate, structured, and
ready for meaningful insights. Raw data is often messy, containing inconsistencies, missing values, duplicates,
and outliers, which can distort the results. By processing the data, we transform it into a clean, structured, and
usable format for statistical analysis or machine learning models.
Key steps in data processing include data cleaning (removing errors and inconsistencies), data transformation
(converting data into the required format), feature engineering (creating new meaningful variables), and
normalization (scaling data for consistency). Proper data processing enhances the accuracy, reliability, and
efficiency of analysis, leading to better decision-making and predictive performance. Without it, analysis may
produce incorrect or misleading results, affecting business strategies and research conclusions.