
Sort Stage

Important Instructions: 
· Please read the document thoroughly before you code. 
· Please do not change the Business Requirements. 

Coverage: 
1. Sort Stage 

Source Files:

Controlled copy

Project ID: <Project ID. > <SCI.ID. > / Ver: <Ver No.>

Release ID: QTAD-SRDUC.doc / 1.0 / 21.07.2020 C2: Protected



Exercise 1: Remove Duplicates using Sorter

Use “SALES_TRANSACTIONS_Duplicates.csv” as the source.

Design a job that removes duplicate rows from the source and loads the distinct rows into the target.
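
The expected behaviour of the job can be sketched outside the ETL tool. The Python below is only an illustration of the logic (sort on every column, then keep one row from each run of identical rows), not the job itself. The sample rows and the PRODUCT column name are hypothetical, since the layout of the source file is not listed in this document.

```python
import csv
import io


def distinct_rows(rows):
    """Sort rows on all columns, then keep one row per run of identical rows."""
    result = []
    previous = None
    for row in sorted(rows):
        if row != previous:
            result.append(row)
        previous = row
    return result


# Hypothetical sample data standing in for the real source file.
SAMPLE = """PRODUCT,SELLING_PRICE,QUANTITY
2001,10.5,2
2000,5,1
2001,10.5,2
2000,5,1
2002,7.25,3
"""

reader = csv.reader(io.StringIO(SAMPLE))
header = next(reader)  # skip the header row so it is not sorted with the data
unique = distinct_rows(tuple(r) for r in reader)

for row in unique:
    print(",".join(row))
```

Sorting first means identical rows end up next to each other, so a single pass comparing each row with the previous one is enough to drop the duplicates.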


Exercise 2: Create product list ordered by total sales

Use “SALES_TRANSACTIONS_UnOrdered.csv” as the source.

Design a job that creates the following output, in which the products are listed in descending order of
Total Sales, i.e. SUM(SELLING_PRICE * QUANTITY).

Product   Total_Sales
2003      1791669.39
2001      1328768.16
2000      933800
2002      854381.07
2004      602491.23
86521     4650.5
86533     4486.68
86518     2495.48
86524     1987.56
….
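
The aggregation and ordering asked for here can be sketched in Python as a rough illustration of the logic, not of the job design. SELLING_PRICE and QUANTITY come from the exercise text. The function name, the tuple layout, and the sample values are assumptions made for this example and do not come from the source file.

```python
from collections import defaultdict


def total_sales_by_product(rows):
    """Sum SELLING_PRICE * QUANTITY per product, largest total first.

    Each row is assumed to be a (product, selling_price, quantity) tuple.
    """
    totals = defaultdict(float)
    for product, selling_price, quantity in rows:
        totals[product] += selling_price * quantity
    # Sort on the aggregated value, descending, as the exercise requires.
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


# Hypothetical sample transactions, not taken from the real source file.
sample = [
    ("2000", 100.0, 2),
    ("2001", 50.0, 10),
    ("2000", 100.0, 1),
    ("86521", 12.5, 4),
]

ranked = total_sales_by_product(sample)
for product, total in ranked:
    print(product, total)
```

The sort runs on the aggregated totals rather than on the raw rows, which mirrors placing the sort step after the aggregation step in the job.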
