0% found this document useful (0 votes)
424 views12 pages

Soal Latihan IT Specialist Data Analytics

Data analytics latihan

Uploaded by

Amanda Ratu L
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
424 views12 pages

Soal Latihan IT Specialist Data Analytics

Data analytics latihan

Uploaded by

Amanda Ratu L
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

IT-SPECIALIST

DATA ANALYTICS CERTIFICATION EXAM

1. You need to compare three (3) values of each data point in a series.
Which chart type should you use?
A. Waterfall chart
B. Area chart
C. Scatter chart
D. Bubble chart
2. You need to create a data view based on aggregations for further visual analysis. Your data includes
sales information for the past five years for food products at your company's stores. Each product
belongs to one category. For example, Milk belongs to the Dairy category. The data view must meet
the following requirements.
. Include all products and their associated categories.
. Include sales subtotals for each category and year.
. Display a grand total of sales for each category.
. Create a summary of each category for every year.
Which two aggregation methods should you use to create the data view? (Choose 2.)
Note: You will receive partial credit for each correct selection.
A Merging
B Filtering
C Pivoting
D Grouping
3. Which scenario is an example of predictive analytics?
A. A real estate agent mails flyers to homeowners in a neighborhood after reviewing a sales report.
B. A student tosses a coin to determine how to answer a True or False question.
C. A car salesperson approaches every fifth person who walks into a car dealership to attempt to
make a sale.
D. A stock trader examines financial reports of a company to determine whether the stock price
is likely to keep rising.
4. For each statement, here are the appropriate responses:
Remove the rows that contain NULL values. Yes or No?
Replace each NULL value with a random value. Yes or No?
Use a statistic such as average to account for the NULL values. Yes or No?
5. Which two chart types should you use to rank values in ascending or descending order? (Choose
2.)
Note: You will receive partial credit for each correct selection.
A. Bubble chart
B. Line chart
C. Column chart
D. Bar chart
6. How is an unstructured data set different from a structured data set?
A. The data organization of an unstructured data set is explicitly defined.
B. An unstructured data set can be quickly searched without manipulation.
C. An unstructured data set can store large amounts of unrelated data.
D. An unstructured data set has a predefined data model.
7. For which scenario should you use a line chart to represent the data?
A. The weekly average stock price during a one-year period
B. The binned distribution for the heights of different students
C. The maximum, minimum, and average values for a set of data
D. The proportion of Yes and No answers to a survey question
8. You have been given a large data set that includes location, income, and age.
Why should you disaggregate the data?
A. To combine data sets and present a summary of your findings
B. To form generalizations about the entire data set
C. To analyze income within different age groups or locations
D. To hide differences among subgroups
9. You are analyzing sales activity that occurs on national holidays
What level of data granularity will enable you to perform the most precise analysis?
Data Granularity Options
Years
Months
Weeks
Days
Hours
10. True or False statements.
Data mining is used to find anomalies. True or False?
Data mining is used to summarize raw data from large data sets. True or False?
Data mining is used to review underlying details in a given table. True or False?
11. Which concept most comprehensively describes the general meaning of data in the context of
data analytics?
A. Interpreted evidence
B. Analyzed details
C. Meaningful statistics
D. Unprocessed facts
12. What is an example of machine learning in predictive analysis?
A. Your computer automatically goes into sleep mode because the battery has less than 10 percent
power.
B. Your vehicle turns on a warning sensor because one of its components requires maintenance.
C. Your thermostat adjusts to a higher temperature because you programmed it based on the time of
day.
D. Your streaming service suggests a category of movies based on the last 10 movies you
watched.
13. You will be analyzing sales and determining trends based on a very large dataset that includes the
following columns:
· CustomerName
. CustomerEmail
· Birthdate
· FirstPurchaseDate
· MostRecentPurchaseDate
· TotalQuantityPurchased
. TotalSalesAmount
You need to validate the data before you start the analysis.
What should you do?
A. Verify date ranges and values for all date columns.
B. Analyze FirstPurchaseDates to determine purchasing trends.
C. Create aggregations of all numeric columns.
D. Calculate statistics on TotalQuantityPurchased.
14. Slicers can be used to filter data. True or False?
Sorts can be used to display a subset of data. True or False?
Filters can be used to display a subset of data. True or False?
15. You are tasked with completing a data analysis project for a large organization. During the project,
you must handle personally identifiable information (PII). Throughout the project, which three
principles should you follow? (Choose 3.)
A. Retain only the PII that you might need for future analysis.
B. Limit your handling of the PII to only what is necessary for the current analysis.
C. Keep track of the PII that you have during the analysis.
D. Request all data from the database that contains the PII.
E. Remove all PII from your computer after the analysis is complete.
16. A data scientist at your company creates a machine learning model to help the hiring manager
select candidates from thousands of job applicants.
Which statement best describes how machine learning is used in this scenario?
A. The machine learning model uses historical data and algorithms to predict future applicant
performance.
B. The hiring manager queries the machine learning database for qualified applicants.
C. The machine learning model defines the qualifications necessary for a given job or role.
D. The machine learning system converts applicant information into a common format.
17. A data set indicates that global temperature increases 0.08 degrees Celsius every 10 years.
You use this data and a linear regression model to determine the Earth's temperature in the year
2050.
What type of data analytics are you using?
A. Predictive analytics
B. Prescriptive analytics
C. Diagnostic analytics
D. Descriptive analytics
18. You ran a t-test with an alpha value of 1% (a =0.01).
Which p-value would cause you to reject the null hypothesis?
A. 0.001
B. 0.011
C. 0.09
D. 0.10
19. Which sentence most accurately describes the relationship between data and statistics?
A. All statistics are data, but not all data are statistics.
B. All data are statistics, but not all statistics are data.
C. Data and statistics are both purely quantitative in nature.
D. Data and statistics are both purely qualitative in nature.
20. In the United States and Europe, which two data points are considered non-sensitive PlI (personal
identifiable information)? (Choose 2.)
A. Date of birth
B. Gender
C. Medical records
D. Mailing address
21. Which data structure has multiple rows and multiple columns?
A. Series
B. List
C. Table
D. One-dimensional array
22. Move each function from the list on the left to the correct description on the right.
COUNT()
MAX()
MIN()
SUM()
Descriptions
Returns the largest value ….. MAX()
Returns the smallest value …. MIN()
Returns the number of values …. COUNT()
Returns the total of the values …. SUM()
23. You are responsible for e-commerce sales at your company. You need to present the quarterly
data shown in the table to upper management using the most accurate, unbiased visualization.
Which visualization should you choose?
Select the correct visualization in the answer area.
24. You are using data analytics to help answer business questions about a new product your
company released. Move each type of data analytics from the list on the left to the correct question
on the right.
Note: You will receive partial credit for each correct match.

25. You are performing descriptive analytics on quarterly sales data.


Move the appropriate statistical metrics from the list on the left to the correct locations on the right.
You may use each metric once, more than once, or not at all.
Note: You will receive partial credit for each correct response.

26. A group of students is asked about their favorite flavor of ice cream. The pie chart below illustrates
the proportions of each response.
What can you conclude from the visualization below about ice cream preferences for this group of
students?

A. The most students chose vanilla.


B. The fewest students chose chocolate.
C. The most students chose chocolate.
D. The fewest students chose strawberry.
27. You create the bar chart below, which shows sales for different years. Management asks for a way
to see demographic information associated with the individual sales records for each year.
You decide to create tables for each year that show the demographic information for the sales in that
year. When someone clicks a year in the chart, the associated table will open.
Which reporting technique does this represent?

A. Disaggregating
B. Distributing
C. Pivoting
D. Unpivoting
28. You are examining a visualization that displays the correlation between Variable X and Variable Y.
Evaluate the image and answer the questions by selecting the correct option from each drop-down
list.
29. What type of data is too complex to be stored in a traditional database management system
(DBMS)?
A. Metadata
B. Big data
C. Qualitative data
D. Imputed data
30. You have a dataset that includes product review scores and demographic information about the
reviewers. There are no subcategories associated with the demographic answers. The table shows a
selection of the data.
Which scenario is an example of disaggregating the dataset?

A. Display the overall average and mode of all scores and a count of all reviews.
B. Display average and mode of the scores for each product grouped by the ethnicity of the
reviewers.
C. Display the overall average and mode of all scores on a per-product basis.
D. Display a list of ethnicities that are included in the Other option.
31. From the data in the table below, you create a PivotTable to show the combined number of
certified virtual and in-person teachers for each class at each school
Move the appropriate labels from the list on the left to the correct locations in the PivotTable on the
right. You may use each label once or not at all.
Note: You will receive partial credit for each correct answer.
32. What are two goals of data privacy and protection laws such as GDPR, FERPA, and HIPAA?
(Choose 2.)
A. To regulate data storage.
B. To regulate data validation.
C. To regulate how data is shared.
D. To regulate how data is sorted.
33. You are given a data set displaying the time of day and number of minutes customers waited in
line for service. You need to remove bias from the results by eliminating outliers.
Which visualization illustrates outliers in your dataset?
Select the correct visualization in the answer area.
34. The visualization below displays sales data for two salespeople. A conclusion indicates that
Salesperson 1 has a higher lead-to-sales rate than Salesperson 2. (A lead-to-sales rate is the number
of actual sales divided by the number of attempted sales.)
You need to determine the accuracy of this conclusion.
What should you conclude?

A. The conclusion is inaccurate because the visualization uses size manipulation.


B. The conclusion is accurate.
C. The conclusion is inaccurate because the visualization is missing sales lead data.
D. The conclusion is inaccurate because the visualization uses scale manipulation.
35. Your marketing department attends a variety of events each year and distributes promotional
items to event participants. The table below shows the quantity distributed of each promotional item.
You are performing analysis on the data. Complete the sentences about data organization by
selecting the correct option from each drop-down list.
Note: You will receive partial credit for each correct selection.
36. Which data type results from processing a conditional statement?
A. Integer
B. Boolean
C. String
D. Character
37. Which data structure describes the following data?
['Aabid',"Jesenia","Mark"]
A. List
B. Multi-dimensional array
C. Table
D. Graph
38. You are preparing to export data from a database to a flat file to be used by another system. Each
field in the file should be separated by a comma.
You notice that the data in several columns includes commas. You decide to enclose the values in
each of these columns in double quotes (").
What feature of delimited files defines enclosing column data in double quotes?
A. Data formatter
B. Column delimiter
C. Text qualifier
D. Row delimiter
39. As part of an ETL process, which process represents transformation?
A. Retrieving data from many sources into a single destination.
B. Changing data from a summary level to a detailed level.
C. Importing a percentage of rows from the source data.
D. Converting data from one data type to another data type or structure.
40. Which data type is appropriate for a phone number using the format (###) ## -##?
A. String
B. Numeric
C. Binary
D. Boolean

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy