Data Science
Data Science
Third Semester
(Regulations 2021)
20 30 30 10 10 100
ii.Write a NumPy program to convert the values of Centigrade degrees into Fahrenheit
degrees and vice versa. Values have to be stored into a NumPy array.
2. i. Write a NumPy program to find the real and imaginary parts of an array of complex
numbers
ii. Write a NumPy program to convert a NumPy array into a csv file
4. i. Use NumPy , Create an array with 5 dimensions and verify that it has 5 dimensions
5. i. Create your own simple Pandas DataFrame and print its values
6. Perform appending, slicing, addition and deletion of rows with a Pandas DataFrame.
Page 1 of 3
7. i. Using Pandas, Create a DataFrame with a list of dictionaries, row indices, and column
indices.
ii. Use index label to delete or drop rows from a Pandas DataFrame.
9. Compare the results of the Univariate and Bivariate analysis for the UCI diabetes data set
11. Use the diabetes data set from UCI, Perform Univariate analysis
12. Use the diabetes data set from Pima Indians Diabetes , Perform Bivariate analysis
13. Perform Multiple Regression analysis on your own dataset ( For example, Car dataset
with information Company Name, Model, Volume, Weight, CO2) with more than one
independent value to predict a value based on two or more variables.
14. i. Using various functions in numpy library, mathematically calculate the values for a
normal distribution and create Histograms to plot the probability distribution curve.
15. i.Using the “concrete strength” dataset, explore relationships between two continuous
variables with Scatterplots
ii. Draw a Scatter Plot for the following Pandas DataFrame with Team name and Rank
Points as x and y axis
16. Make a three-dimensional plot with randomly generate 50 data points for x, y, and z. Set
the point color as red, and size of the point as 50.
Page 2 of 3
17. How will you plot and visualize geographical data with the help of Basemap. State the
Procedure for it with an example.
18. Perform Reading data from text files, Excel and the web and exploring various commands
for doing descriptive analytics on the Iris data set
20. Perform Bivariate analysis using the pandas DataFrame that contains information about
two variables: (1) Hours spent studying and (2) Exam score received by 20 different
students:
Page 3 of 3