IA Big Data Lab Works
IA Big Data Lab Works
Lab Work 1
The objective of this practical work is to design, build, and manage a large-scale relational
database using an open dataset from Kaggle. You will import data, establish relationships
between tables, and execute advanced SQL queries.
1. Dataset Selection
Choose a large and structured dataset from Kaggle.com that can be organized into
multiple related tables.
Examples of suitable datasets:
o E-commerce transactions
o Movie ratings and reviews
o Financial transactions
o Healthcare records
o Social media interactions
3. Data Analysis
Design a web-based interface using HTML, CSS, and JavaScript to interact with
the database.
Implement basic CRUD operations (Create, Read, Update, Delete) to allow users to
manage records.
Mohamed Khider University - Biskra 2024/2025
Department of Computer Science Level: Master 1
Module: Cloud Computing & Big Data Option : IA
Lab Work 2
The goal of this practical work is to implement intelligent query processing techniques to
enhance user interactions with databases. You will explore:
Instructions :
2. Create a web-based interface using HTML, CSS, and JavaScript to interact with the
SQL query .
4. Implement Autocomplete using a Trie (Prefix Tree). Example: If the user types "SEL",
This structure is useful for quickly finding the closest matches to a given input.
Example: If searching for "Biksra", the system finds similar names like "Biskra",.
Mohamed Khider University - Biskra 2024/2025
Department of Computer Science Level: Master 1
Module: Cloud Computing & Big Data Option : IA
Lab Work 3
The goal of this practical work is to explore database indexing techniques to optimize query
performance and implement TF-IDF (Term Frequency - Inverse Document Frequency)
for text search relevance. You will:
Lab Work 4
The aim of this practical work is to build a recommendation system using TF-IDF (Term
Lab Work 5
Big Data Processing with Hadoop
The objective of this practical work is to introduce students to Hadoop, a powerful framework
for distributed storage and processing of large datasets. Students will set up a Hadoop
environment, process data using HDFS (Hadoop Distributed File System), and perform
MapReduce operations to analyze a dataset.
Lab Work 6
NoSQL Database Management with MongoDB
The objective of this practical work is to introduce students to MongoDB, a NoSQL database
used for handling large amounts of unstructured and semi-structured data. Students will learn
how to:
Lab Work 7
Big Data Storage and Processing with Cassandra
Execute advanced queries using CQL (Cassandra Query Language) and Java API
Instructions
c. Update Data
d. Delete Data