2nd Year Cirriculum
2nd Year Cirriculum
DSE 2123 Object Oriented Programming 3 1 0 4 DSE 2223 Design & Analysis of Algorithms 2 1 0 3
II DSE 2124 Data Communication and Networks 3 0 0 3 DSE 2224 Principles of Data Privacy & Security 3 0 0 3
DSE 2142 Data Structures Lab 0 0 3 1 DSE 2241 Database Systems Lab 0 0 3 2
15 2 9 21 14 4 6 22
Statistics: Measures of central tendency – mean, median, mode, measures of dispersion – mean
deviation, standard deviation, quartile deviation, skewness and kurtosis. Correlation coefficient,
regression, least squares principles of curve fitting. Probability: Introduction, finite sample spaces,
conditional probability and independence, Baye’s theorem, one dimensional random variable,
mean, variance. Two and higher dimensional random variables: mean, variance, correlation
coefficient. Mathematical expectation and conditional expectation.
Distributions: Binomial, Poisson, geometric, uniform, normal, gamma, Chi-square and exponential
distributions, properties and simple problems. Transformation of random variable. Sampling theory:
Central limit theorem and applications. Estimation – methods and goodness of estimators. Testing of
hypothesis-one sample, two sample t-test, one way & two-way analysis of variance. Testing for
independence of attributes and goodness of fit.
References:
1. Gupta, S.C and Kapoor, V.K. Fundamentals of Mathematical Statistics, Sultan Chand &
Sons,11th Edition, 2016
2. Probability, Markov Chains, Queues, and Simulation: The Mathematical Basis of Performance
Modeling, William J Stewart(2009) Princeton University Press
3. George Casella and Roger L Berger , Cengage Learning, Statistical Inference (second edition
2021).
Steps in Data Analytics Projects, Data Analytics tasks and methods, Data Gathering and Preparation:
Data Formats, Time series data, Parsing and Transformation, Scalability and Real-time Issues; Data
Cleaning: Consistency Checking, Noisy and Missing Data, Data Integration, Data Transformation, and
Segmentation; Exploratory Analysis: Descriptive and comparative statistics, Hypothesis testing,
Statistical Inference. Data Analysis, Market Basket Analysis, Association rule mining: Apriori, FP
Growth, Partitioning, measures of pattern interestingness, Recommender Systems, Item-based
Recommendation, Collaborative Filtering. Clustering: Partitioning, Hierarchical, Density-based
approaches. Anomaly Detection. Visualization: Visual Representation of Data, Tufts & Gestalt
Principles, Visualization tools. Relevant case studies
References:
1. Anil Maheshwari, Data Analytics Made Accessible, McGraw Hill, 2021.
2. Manaranjan Pradhan, U Dinesh Kumar, “Machine Learning using Python”, Wiley
Publication, 2019.
3. Glenn J. Myatt, “Making Sense of Data: A Practical Guide to Exploratory Data
Analysis and Data Mining”, John Wiley Publication, November 2006.
4. Glenn J. Myatt, Wayne P. Johnson, “Making Sense of Data II”, John Wiley Publication,
2009.
5. Cole Nussbaumer Knaflic, “Storytelling With Data: A Data Visualization Guide for
Business Professionals”, (1e), John Wiley and Sons, 2015.
6. Jiawei Han and Micheline Kamber, “Data Mining Concepts And Techniques”, 3rd
Edition, Morgan Kauffmann
7. Pang-Ning Tan, Michael Steinbach,Vipin Kumar, ”Introduction to Data Mining”,
Pearson Education, 2nd Edition.
SDL: Graphs- Storage representations, BFS, DFS, Spanning tree, Minimum spanning tree.
References:
1. Behrouz A. Forouzan, Richard F. Gilberg, A Structured Programming Approach
Using C, (3e), Cengage Learning India Pvt. Ltd, India, 2007
2. Ellis Horowitz, Sartaj Sahni, Susan Anderson and Freed, Fundamentals of Data
Structures in C, (2e), Silicon Press, 2007
3. Richard F. Gilberg, Behrouz A. Forouzan, Data structures, A Pseudocode Approach
with C, (2e), Cengage Learning India Pvt. Ltd, India , 2009
4. Tenenbaum Aaron M., Langsam Yedidyah, Augenstein Moshe J., Data structures
using C, Pearson Prentice Hall of India Ltd., 2007
5. Debasis Samanta, Classic Data Structures, (2e), PHI Learning Pvt. Ltd., India, 2010
DSE 2123 OBJECT-ORIENTED PROGRAMMING [3 0 0 3]
References:
1. Schildt H, Java: The Complete Reference, (10e), Tata McGraw-Hill Education Group,
2017.
2. Balagurusamy E, Programming with Java, (5e), Tata McGraw Hill Education Group,
2017.
3. Daniel Liang Y, Introduction to Java Programming, (10e), Pearson Education India,
2018.
4. Horstmann CS, Big Java: Early Objects, (5e), Wiley’s Interactive Edition, 2015.
References:
1. Behrouz A. Forouzan, Data Communications and Networking, 5th Edition, Tata
McGraw Hill, 2013.
2. Behrouz A. Forouzan, TCP/IP Protocol Suite, 4th Edition, Tata McGraw Hill, 2010.
3. Tannenbaum, A.S, Computer Networks, 5th Edition, Prentice Hall of India EE
Edition, 2011.
4. Leon Garcia and Widjala, Communication Networks, 5th Edition, Tata McGraw Hill,
2017.
Tutorial on tools for Data Analytics & Visualization. Suggested tools are Python, R, MATLAB,
WEKA. Experiments with datasets to be defined in lab manual, to implement concepts of data
pre-processing, exploratory analysis, comparative statistics, statistical inference, Association,
and clustering. Creating Visual Representations- Suggested tools are MS Excel, Power BI,
Tableau. MS Excel Pivot Tables and charts, Case Studies.
References:
1. Manaranjan Pradhan, U Dinesh Kumar, “Machine Learning using Python”, Wiley
Publication, 2019.
2. Brian R. Hunt, Ronald L. Lipsman, Jonathan M. Rosenberg, Kevin R. Coombes, John E.
Osborn, Garrett J. Stuck, Guide to MATLAB: For Beginners and Experienced Users,(2e),
Cambridge University Press, 2011.
Reviewing the concepts of pointers, structures and recursion, Studying the operation of stacks
and queues and the associated application programs, Creating dynamic allocation of memory
for linked list and applying it to examples using singly, doubly and circular linked list and
their applications, Creation of binary trees and the application associated with the trees.
References:
1. Behrouz A. Forouzan, Richard F. Gilberg, A Structured Programming Approach Using
C, (3e), Cengage Learning India Pvt. Ltd, India, 2007
2. Ellis Horowitz, Sartaj Sahni, Susan Anderson and Freed, Fundamentals of Data
Structures in C, (2e), Silicon Press, 2007
3. Richard F. Gilberg, Behrouz A. Forouzan, Data structures, A Pseudocode Approach
with C, (2e), Cengage Learning India Pvt. Ltd, India, 2009
4. Tenenbaum Aaron M., Langsam Yedidyah, Augenstein Moshe J., Data structures
using C, Pearson Prentice Hall of India Ltd., 2007
5. Debasis Samanta, Classic Data Structures, (2e), PHI Learning Pvt. Ltd., India, 2010
DSE 2143 OBJECT-ORIENTED PROGRAMMING LAB [0 0 3 1]
Class and Objects: Class Definition, Creating Objects; Array and Strings: Programs Based
Upon 1-D, 2-D and Dynamic Arrays, String Comparison and Manipulation; Inheritance:
Inheritance and Its Types, Abstract Class, Inner and Outer Class, Super, Final, Static
Keywords; Collection Framework & Generics: Using Collection Classes such as Array Lists
and Linked Lists Writing Generic Classes; Exception Handling: Errors and Exceptions, Types
of Exceptions; Multithreading: Thread Class, Runnable, Synchronization, Thread Priority;
Event Handling and GUI Programming: Action Listener, Swing Package.
References:
1. Schildt H, Java: The Complete Reference, (10e), Tata McGraw-Hill Education Group,
2017.
2. Balagurusamy E, Programming with Java, (5e), Tata McGraw Hill Education Group,
2017.
3. Daniel Liang Y, Introduction to Java Programming, (10e), Pearson Education India,
2018.
4. Horstmann CS, Big Java: Early Objects, (5e), Wiley’s Interactive Edition, 2015.
FOURTH SEMESTER
Stochastic Processes – types, fundamental concepts, limiting and transient behaviour. Poisson
process – properties. Birth-death models, Queueing models-performance measures.
Multivariate analysis-normal distribution and its transformed (Wishart)-properties & their uses
in testing hypothesis, classification methods and multidimensional scaling. Graphical
representation of multivariate data (Self Directed Learning).
References:
1. William J Stewart, Probability, Markov Chains, Queues, and Simulation: The
Mathematical Basis of Performance Modeling, Princeton University Press, 2009
2. Hardle, W.K & Simar, L, Applied Multivariate Statistical Analysis, 2014.
3. Dean W. Wichern and Richard A. Johnson, Applied Multivariate Statistical Analysis,
Pearson, 2017.
Database System Applications, View of data, Database languages, Database users and
Administrator, Introduction to Relational Model: database schema, keys, schema diagrams,
Relational Query Languages, Introduction to SQL: Data Definition, Basic structure of SQL
queries, Basic operations, Set operations, Nested subqueries, Modification of the database.
Intermediate SQL: Join expressions, Views, Advanced SQL-PL/SQL, Cursors, Functions,
Procedures, Triggers, recursive queries, Entity-Relationship Model: Design Process, ER
Model, Reduction to Relational schema, Relational Database design: Functional dependencies,
Normal forms, Closure, Canonical cover, Lossless joins, dependency preserving
decomposition, File structure, Transactions, Concurrency Control, Unstructured database
SDL: Unstructured database
References:
1. Abraham Silberschatz, Henry Korth, S. Sudarshan, Database System Concepts, 7th
Edition, McGraw Hill, 2019.
2. Ramez Elmasri, Shamkant Navathe, Fundamentals of Database System, 6th Edition,
Addison Wesley Publications Co., 2010
3. Raghu Ramakrishnan, Johannes Gehrke, Database Management System, 3rd Edition,
WCB/McGraw Hill Publisher, 2007
4. Shashank Tiwari, Professional NOSQL, Wiley, 2015.
5. Ivan Bayross, SQL, PL/SQL-The Programming Language of ORACLE, 4th Edition,
BPB Publications, 2010.
6. Benjamin Rosenzweig, Elen Silvestrova, “Oracle Pl/Sql By Example”,4th Edition,
Addion-Wesley,2009.
Machine Learning Basics: Types of Machine Learning, Supervised vs. Unsupervised Learning,
Parametric vs. non-parametric models., Instance Based learning – k-nearest neighbors, Simple
Regression Models: Linear, Logistic, Cost functions, Gradient Descent, Batch Gradient
Descent, Overfitting, Model Selection, No free lunch theorem, bias/variance trade-off, union
and Chernoff bounds, VC dimensions. Bayesian Models: Bayesian concept learning, Bayesian
Decision Theory, Naïve Bayesian, Laplacian Correction, Bayesian Belief Networks. Tree
Models: information theory, decision tree induction, tuning tree size, ID3,C4.5, CHAID,
Decision Stump. Support Vector Machines: kernel functions, Regression Models: Ridge and
Lasso Regression, GLM and theexponential Family. Bagging algorithm, Random Forests, Grid
search and randomized grid search, Partial dependence plots. Ensembling and Boosting
Algorithms: Concept of weak learners, Adaptive Boosting, Extreme Gradient Boosting
(XGBoost). Artificial Neural Networks: Perceptron, Back propagation, Hopfield Network.
Curse of Dimensionality: Factor Analysis, Principal Component Analysis (PCA), Difference
between PCAs and Latent Factors
References:
1. K. Murphy, Machine Learning: A Probabilistic Perspective, MIT Press, 2012.
2. G. James, D. Witten, T Hastie, R Tibshirani, An introduction to statistical
learning withapplications in R, Springer, 2013.
3. J. Han, M. Kamber, J. Pei, Data Mining concepts and techniques, (2e), Morgan
Kaufmann-Elsevier, 2011.
4. T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning, (2e), Springer,
2009.
5. T. M. Mitchell, Machine Learning, (Indian Edition), MacGraw Hill, 2017.
6. C. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, 2019
References:
1. Anany Levitin, Introduction to the Design and Analysis of Algorithms, (3e), Pearson
Education, 2011
2. Ellis Horowitz and Sartaj Sahni, Computer Algorithms/C++, (2e), University Press,
2008.
3. Thomas H. Cormen, Charles E. Leiserson, Ronal L, Rivest, Clifford Stein,
Introduction to Algorithms, (3e), PHI, 2009
SDL: System Security, Key Distribution and Authentication, Transport Layer Security,
Wireless Network Security, E-mail Security, IP Security, Security Management Systems,
Need for IT Security, Intrusion Prevention and Detection Systems, Cyber Security
References:
1. Ronald Leenes , Rosamunde van Brakel , Serge Gutwirth , De Hert, Paul, Data Protection and
Privacy: The Age of Intelligent Machines (Computers, Privacy and Data Protection), Hart
Publishing (December 28, 2017)
2. B. Raghunathan, The Complete Book of Data Anonymization: From Planning to
Implementation, Auerbach Pub, 2016.
3. L. Sweeney, Computational Disclosure Control: A Primer on Data Privacy Protection, MIT
Computer Science, 2017
4. William Stallings, Cryptography and Network Security: Principles and Practice, 7th Edition,
Pearson Education, 2017.
5. William Stallings, Network Security Essentials: Applications and Standards, 6th Edition,
Pearson Education, 2014.
6. Atul Kahate, Cryptography and Network Security, 3rd Edition, Tata McGraw-Hill Publishing
Company Limited, 2013.
SDL: The Linux System: Design Principles, Kernel Modules, Scheduling, Memory
Management.
References:
Introduction to basics of front-end developing tools needed for developing mini project.
Introduction to SQL, Integrity Constraints in SQL, Simple and complex queries, PL/SQL
Basics, Exception Handling, Cursors, exceptions, Procedures, Functions and Packages,
Triggers. Design and development of database driven mini project based on database concepts
discussed in the theory. The front-end tool is as per the prerogative of the faculty who is
handling the lab.
References:
1. Silberschatz, Korth, Sudarshan, Database System Concepts, (6e), McGraw-Hill, 2011
2. Ivan Bayross, SQL, PL/SQL: The Programming Language of Oracle ,4th Revised
Edition, BPB Publications,2010
3. Steven Feuerstein, Bill Pribyl, Oracle PL/SQL Programming, 6th Edition, O'Reilly
Media, 2014,
Tutorial on tools for Machine Learning. Python suggested. Experiments with datasets to be
defined in lab manual to perform preprocessing and deploy classifiers such as Bayesian,
Decision Trees, Support Vector Machines, k-nearest neighbor, Regression Models.
Classification accuracy measures, improving classifier performance through ensembling,
boosting etc.
References:
1. Hans Peter Langtangen, Python Scripting for Computational Science, (3e),
Springer Publishers, 2014
2. Naomi R. Ceder, The Quick Python Book, (2e), Manning Publications Co., 2010
3. Wesley J. Chun, Core Python Applications Programming, (3e), Prentice Hall
Publishers, 2012
4. G. James, D. Witten, T Hastie, R Tibshirani, An introduction to statistical
learning with applications in R, Springer, 2013.