You are on page 1of 4

Assignment 3: Data Cleaning with Microsoft Excel SET A

NAME: MATRIC NUMBER:

SECTION: PROGRAM:

Instructions:
1. Download the workbook according to the provided set.
2. Rename your workbook by changing the YourMatricNo and YourSectionNo part
ONLY.
3. Write ALL answers in the space provided using a DARK BLUE/BLACK pen.
4. Submit both the HARD COPY (hand in this paper) and SOFT COPY (upload the
completed workbook with the cleaned dataset).

Scenario:
This messy dataset contains information about employees in a company. It has seven
columns. Your task is to clean the data using Microsoft Excel, making it ready for further
analysis.

Requirement:
Make a duplicate of the Messy 1 worksheet and rename the duplicated copy as Clean 1.

QUESTION 1: UNORGANIZED DATA (5 marks)


One example of the Clean 1 dataset is:

101; Muhammad Tan; 28; Manager; IT; 55000; 2022-05-10

a. Organize the dataset into several columns using Text to Columns feature.
List the content of the following cell references:

Cell Cell display


C10
E28

b. After applying Text-to-Columns with a delimiter of _________, the data is transformed


into ________ separate columns.
c. In cell B35, write the correct function to remove spaces from the content in B15.
=______________________________.

Page 1 of 4
Assignment 3: Data Cleaning with Microsoft Excel SET A

QUESTION 2: MISSING DATA (4 marks)


Check each column for missing values (blanks or empty cells).
a. A feature that will select the empty cells is called
________________________________.
b. Write any TWO (2) cell references for the missing data.

Cell

c. Name a shortcut key other than F5, to access the Go To dialog box.
___________________

QUESTION 3: DUPLICATE DATA (4 marks)


Check and remove duplicate records from the dataset.
a. Write any TWO (2) cell references for the duplicate data.

Cell

b. After applying the Remove Duplicates feature to ID and Name columns in Clean 1
sheet, ________ duplicates were removed, leaving ________ unique values in the
dataset.

QUESTION 4: INACCURATE DATA (5 marks)


Correct misspellings in the Position and Department columns.
a. Misspelled entries can be corrected by using the ___________________________
feature.
b. Write any TWO (2) cell references for the misspelled data with the cell display after
correction.

Cell Cell display after misspelled correction

Page 2 of 4
Assignment 3: Data Cleaning with Microsoft Excel SET A

QUESTION 5: INCONSISTENT DATA (2 marks)


The Clean 1 dataset lacks consistency.
a. Format the Salary column to Currency format.
b. Format the Joining Date column to follow the dd-mmm-yyyy standard.
Write the data for the following cells:
Cell Cell display
F11
G27

Page 3 of 4
Assignment 3: Data Cleaning with Microsoft Excel SET A

20

Page 4 of 4

You might also like