Data Science Questions
Data Science Questions
3. A researcher is gathering data from four geographical areas designated: South =1,
East=2, North =3, and West =4. The designated geographical area represents
a. Categorical Data
b. Qualitative data
c. Quantitative data
d. Label data
7. Do you think 50 small decision trees are better than a large one. If yes state reason
why?
------------------------------------------
Ans: yes, More robust model (ensemble of weak learners that come and make
a strong learner) Better to improve a model by taking many small steps
than fewer large steps
If one tree is erroneous, it can be auto-corrected by the following
Less prone to over fitting
9. Which of the following is used to hide limitations of Java behind an API for Cascading?
a. Scalding
b. Cascalog
c. Hcatalog
d. Hcalding
11. After you have the data which is the next step you will proceed for:
a. Data Wrangling
b. Data Modeling
c. Data Visualization
d. Data Mining
12. Which of the following statements describes how mobile devices, the use of computers
in more and more everyday interactions, and the ability to connect with other devices
almost anywhere are changing society?
I. People are able to use mobile devices for new applications such as finding
directions or finding restaurants
II. Data can be collected from thousands of sources and can be combined to
provide new services to individuals and companies
III. Buildings, cars, classrooms, and offices can now be engineered with sensors to
automate tasks like adjusting the thermostat or even driving
IV. Data that is collected can be used to identify social problems
a. III only
b. I and III
c. II and IV
d. I, II, III, and IV
13. Which data mining technique is more suitable for categorical data analysis?
a. Decision Tree
b. Neural Network
c. Association Rule
d. Linear Regression
15. When working on Neural Network models the model training time depends on the size
of the network?
a. True
b. False
20. For the table given below write the following queries:
a. Find the total sales at the end of the year 2013 for all regions.
Select total_sales from table_name where year =’2013’
b. How to show schema of the table.
Select * from table_name;