You are on page 1of 2

Mid Exam

MK 2205 - Business Computation and Analytics 2019/2020


Time: 08.30-12.00

Instruction
1. Download the data file in Google Classroom for Mid Exam topic then read the description of
the data.
2. Provide at least 5 research questions (RQ) that can be analysed by using RapidMiner.
3. Perform the analysis using RapidMiner, then write a report in Ms Word format that includes
the screenshot of your RapidMiner process in design view, parameter view of each operator.
Scope of analysis includes but not limited to:
● Data preprocessing
● Descriptive analytics
● Predictive analytics
● Market basket analysis
● Visualization
4. Sent your RapidMiner (*.rmp format) file for your analysis and Ms. Word (*.docx format) file
(You may use Ms.Excel to prepare your analysis in RapidMiner and you need to submit the
Excel file as well) to Google Classroom with the title format:
Class_Name_NIM_RQ No. OR Report (docx file) OR Data Prep (xls file)
Example:
A_Liane Okdinawati_92017089_RQ1.rmp
A_Liane Okdinawati_92017089_RQ2.rmp
A_Liane Okdinawati_92017089_RQ3.rmp
A_Liane Okdinawati_92017089_RQ4.rmp
A_Liane Okdinawati_92017089_RQ5.rmp
A_Liane Okdinawati_92017089_Report.doc
A_Liane Okdinawati_92017089_Data Prep.xls
5. We will check your report using Turnitin software, any similarity more than 20% lead to 0
mark.
​Description of the Data
Context of the data
The growth of supermarkets in most populated cities are increasing and market competitions are
also high. The dataset is one of the historical sales of supermarket companies which has recorded in
3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this
dataset.

Attribute information:
1. Invoice id: Computer generated sales slip invoice identification number.
2. Branch: Branch of supercenter (3 branches are available identified by A, B and C).
3. City: Location of supercenters.
4. Customer type: Type of customers, recorded by Members for customers using member card and
Normal for without member card.
5. Gender: Gender type of customer.
6. Product line: General item categorization groups - Electronic accessories, Fashion accessories,
Food and beverages, Health and beauty, Home and Lifestyle, Sports and travel.
7. Unit price: Price of each product in $.
8. Quantity: Number of products purchased by the customer.
9. Tax: 5% tax fee for customer buying.
10. Total: Total price including tax.
11. Date: Date of purchase (Record available from January 2019 to March 2019).
12. Time: Purchase time (10 am to 9 pm).
13. Payment: Payment used by the customer for the purchase (3 methods are available – Cash,
Credit card and Ewallet).
14. COGS: Cost of goods sold.
15. Gross margin percentage: Gross margin percentage.
16. Gross income: Gross income.
17. Rating: Customer stratification rating on their overall shopping experience (On a scale of 1 to
10).

You might also like