Lab 2 Data Transformation in PBI
Lab 2 Data Transformation in PBI
Dr Huan Vu
Business AI Lab
1
Faculty of Data Science and Artificial Intelligence, NEU
Prerequisites
You should pick up where you left off last time on the Adventure Works project. Reminder, we have:
• Created a .pbix file named AdventureWorks.pbix.
• Loaded data from the AdventureWorks Territory Lookup.csv file, and renamed the Query
Territory Lookup.
• Loaded data from the AdventureWorks Product Categories Lookup.csv file, and renamed
the Query Product Categories Lookup.
• Loaded data from the AdventureWorks Product Subcategories Lookup.csv file, and re-
named the Query Product Subcategories Lookup.
• Loaded data from the AdventureWorks Product Lookup.csv file, and renamed the Query
Product Lookup.
• Changed the type of the columns ProductCost and ProductPrice from Decimal number to
Fixed decimal number.
• Created a new SKUType column, using the ProductSKU column, by extracting all characters
before the second dash (”-”).
• Replaced all values ”0” in the ProductStyle column with ”NA”.
Objectives
The objective of this Lab is to complete importing other data files of the AdventureWorks project and
transform the data if necessary.
1.2 Instructions
1. Import the AdventureWorks Customer Lookup.csv file to the existing PowerBI project. Make
sure that the type of columns is detected correctly.
2. Use Column quality tool to check for errors or empty values. Check the rows that contain these
errors and remove them if needed.
3. User Column distribution tool to view the distribution of values in each column.
4. User Column profile tool to view the column statistics.
@BAI Lab 2
Faculty of Data Science and Artificial Intelligence, NEU
3 Numerical Tools
1. Create a new column name ”DiscountedPrice”. The value of this column is 0.9× the Product-
Price.
2. Find the following values without storing it anywhere in our data model:
3. The Start of Week column starts on Sunday, transform the column so that the week starts on
Monday.
4. Add the following columns: MonthName (e.g. January), MonthNumber (e.g. ”1”), Start of
Year (e.g. ”1/1/2020”), and Year (e.g. ”2020”).
@BAI Lab 3