
Petroleum companies maintain this petroleum data in traditional database systems.

The work to be done by the trainees is to convert the entire module to Hadoop. Tables can be converted to HBase, Hive, or both. The petroleum companies cannot provide the data to be inserted into the tables, as it is confidential. Trainees can therefore insert any kind of data into the tables or files, and a loop can be used to insert bulk data into the tables.
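As an illustration of the loop-based approach, the following is a minimal sketch that generates synthetic rows and writes them as comma-separated text. The column names card_number, pump_number and total_price come from the analysis tasks in this brief; the transaction_id column, the value ranges and the function names are assumptions made only for this example.

```python
import csv
import io
import random


def generate_transactions(n_rows, seed=0):
    """Build n_rows synthetic transaction rows with a simple loop."""
    rng = random.Random(seed)  # fixed seed so the data is reproducible
    rows = []
    for i in range(1, n_rows + 1):
        rows.append((
            i,                                   # transaction_id (assumed column)
            f"CARD{rng.randint(1, 50):04d}",     # card_number
            rng.randint(1, 8),                   # pump_number
            round(rng.uniform(5.0, 120.0), 2),   # total_price
        ))
    return rows


def write_rows(rows, stream):
    """Write rows as comma-separated lines without a header row."""
    csv.writer(stream).writerows(rows)


rows = generate_transactions(1000)
buffer = io.StringIO()
write_rows(rows, buffer)
```

Passing an open file instead of the in-memory buffer would produce a delimited text file. Such a file could then be copied to HDFS and loaded into a Hive table whose columns match, although the exact table definition is left to the trainee.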

Some free bulk data can be downloaded from the following link:
https://cwiki.apache.org/confluence/login.action;jsessionid=7B93F00967D176238C302B14CD845F9A?os_destination=%2Fpages%2Fviewpage.action%3FspaceKey%3DMAHOUT%26title%3DCollections&permissionViolation=true

Once the entire dataset has been loaded into the Hadoop DWH or databases, the following analysis activities are to be performed on the datasets:

1. Which Card_Number has the maximum number of transactions?
2. Calculate the sum of total_price for each pump_number.
3. How many times does a drawing_name appear on a given Date_received?
4. Calculate the number of ship_name entries between given dates (Date_from and Date_to).
5. Find the level_of_inventory for a given store_id.
6. Find the port_name for a given Job_enquiry_id.
7. On which pump_number is a given card_number used?
8. Which fuel_type (diesel or petrol) is used for a given customer_purchase_id?
9. Which customer (customer_id) is using which card (Card_number)?
10. Which customers have paid in cash (Cash_customer_YN)?
11. Find the address (address_details) for a given svc_station_id.
12. Which customer (customer_id) uses which fuel pump (svc_station_id)?
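Most of the activities above reduce to a filter or a GROUP BY aggregation. The sketch below works through activities 1, 2 and 7 using Python's built-in sqlite3 module as a local stand-in for Hive; the table name transactions, the lowercase column spellings and the five sample rows are assumptions made for illustration only. The SQL used is plain enough that it should carry over to HiveQL with little or no change, but that has to be verified on the actual cluster.

```python
import sqlite3

# In-memory database used only as a stand-in for the Hive table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE transactions ("
    "card_number TEXT, pump_number INTEGER, total_price REAL)"
)

# Made-up sample rows; real data would come from the loaded dataset.
sample = [
    ("CARD0001", 1, 40.0),
    ("CARD0002", 2, 25.5),
    ("CARD0001", 2, 10.0),
    ("CARD0003", 1, 60.0),
    ("CARD0001", 3, 15.0),
]
conn.executemany("INSERT INTO transactions VALUES (?, ?, ?)", sample)

# Activity 1: card with the most transactions.
top_card = conn.execute(
    "SELECT card_number, COUNT(*) AS n FROM transactions "
    "GROUP BY card_number ORDER BY n DESC LIMIT 1"
).fetchone()

# Activity 2: sum of total_price per pump.
price_by_pump = conn.execute(
    "SELECT pump_number, SUM(total_price) FROM transactions "
    "GROUP BY pump_number ORDER BY pump_number"
).fetchall()

# Activity 7: pumps on which a given card was used.
pumps_for_card = conn.execute(
    "SELECT DISTINCT pump_number FROM transactions "
    "WHERE card_number = ? ORDER BY pump_number",
    ("CARD0001",),
).fetchall()

print(top_card)        # ('CARD0001', 3)
print(price_by_pump)   # [(1, 100.0), (2, 35.5), (3, 15.0)]
print(pumps_for_card)  # [(1,), (2,), (3,)]
```

The remaining activities follow the same pattern: a WHERE clause on the given key (store_id, Job_enquiry_id, svc_station_id and so on), with COUNT or a date-range condition where the task asks for one.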
