
Introduction to IT

Individual Report
60% of total marks

Submitted by: Aditya Rathi
Student No.: NP02A180003

Submitted to: Mr. DIBYESH CHAND

8TH JANUARY, 2019

Acknowledgement

I would like to express my special thanks and gratitude to my teacher (Name of the
teacher) as well as our principal (Name of the principal), who gave me the golden
opportunity to do this wonderful project on the topic (Write the topic name). The project
also helped me do a lot of research, and I came to know about so many new things. I
am really thankful to them.
Secondly, I would also like to thank my parents and friends, who helped me a lot in
finalizing this project within the limited time frame.

Abstract

The project is about inventory management using data warehousing and big data
technology. For this coursework, a data source from Alpha Vertex and PreCog is taken
into consideration. The data source consists of supplier details, product details and
details about stock movements from suppliers to warehouse locations. The major steps
taken in the project were identifying facts and dimensions from the case scenario,
designing the dimensional model, developing it in SQL Server, populating the tables,
exporting the table data as an external file, loading the file into HDFS and analyzing
the data with Apache Hive and Pig.

Introduction

Alpha Vertex was established in 2016 with the purpose of providing sophisticated
analytical capabilities to investors, brokers, fund managers, and others in the
financial community by combining artificial intelligence with human insight to pinpoint
concealed relationships in financial information. The company uses artificial-
intelligence tools to recognize, absorb, and examine traditionally studied factors, which
include exchange rates, interest rates, and fundamentals, and non-traditional
factors, which include media content, SEC filings, weather, and litigation information.
For this coursework, a data source from Alpha Vertex and PreCog is taken into
consideration. The data source consists of supplier details, product details and details
about stock movements from suppliers to warehouse locations. A sample of the data will
be provided either as flat files or as relational tables.
Inventory management is one of the most significant business processes of a
company. Its major aim is to produce a good purchase order plan to ensure that
product items are readily available when needed. Minimizing over-stocks and
under-stocks plays a crucial role in raising revenue and customer satisfaction.
There are three main business activities:
• Sending purchase orders to suppliers when there are product items with minimum
levels of inventory. For each sent purchase order, the ordered quantity, sent date,
product and supplier details are recorded.
• Receiving purchase orders and storing stocks in appropriate locations. For each
received purchase order, the ordered quantity, received quantity, received date,
product and supplier details are recorded.
• Controlling and maintaining stocks, which involves adding new products and
adjusting stock levels of existing products. The details of each product are recorded,
including the current stock level.

The managers responsible for inventory require the following reports and
analyses:

• Daily stock levels of all products for the last month.
• A weekly report of all products with minimum stock levels.
• Analysis of stock levels by brand, product type or supplier.
• Daily and weekly sent and received stock orders for the last four weeks.
• Analysis of received stock orders by supplier and by month.

1.1 Grain Identification

A fact table is the central table in a star schema of a data warehouse. A fact table
stores quantitative information for analysis and is often denormalized. A fact table
works with dimension tables: the fact table holds the data to be analyzed, and a
dimension table stores data about the ways in which that data can be analyzed.
According to the given case scenario, there are three main business processes
of the company, which are discussed below:
1. Sending purchase orders to suppliers when there are product items with
minimum levels of inventory. For each sent purchase order, the ordered quantity,
sent date, product and supplier details are recorded.
2. Receiving purchase orders and storing stocks in appropriate locations. For each
received purchase order, the ordered quantity, received quantity, received date,
product and supplier details are recorded.
3. Controlling and maintaining stocks which involves adding new products and
adjusting stock levels of existing products. The details of each product are
recorded including the current stock level.
There are three central fact tables based on the business activities provided:

• fact_current_stock: This fact table is designed to record the current stock
levels of existing products so that, depending on the current stock, a decision
can be made on whether to order more stock, how much to order, etc.

• fact_po_to_send: This fact table helps us to analyze the purchase orders that
are sent.

• fact_po_to_receive: This fact table helps us to analyze the purchase orders
that are received.
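
The reorder decision that fact_current_stock is meant to support can be sketched as follows. This is only a minimal illustration, not the coursework's implementation: the `min_stock_level` and `target_stock_level` fields and the `reorder_quantity` helper are hypothetical names introduced here, and only `current_stock_level` comes from the design described in this report.

```python
from dataclasses import dataclass


@dataclass
class StockRow:
    """One fact_current_stock row joined to its product (illustrative only)."""
    product: str
    current_stock_level: int
    min_stock_level: int      # hypothetical: threshold that triggers an order
    target_stock_level: int   # hypothetical: level to restock up to


def reorder_quantity(row: StockRow) -> int:
    """Return how many units to order, or 0 if stock is above the minimum."""
    if row.current_stock_level > row.min_stock_level:
        return 0
    return row.target_stock_level - row.current_stock_level


rows = [
    StockRow("keyboard", current_stock_level=4, min_stock_level=10, target_stock_level=50),
    StockRow("monitor", current_stock_level=25, min_stock_level=10, target_stock_level=40),
]

# Keep only the products that actually need a purchase order.
orders = {r.product: reorder_quantity(r) for r in rows if reorder_quantity(r) > 0}
print(orders)
```

Under these assumed thresholds only the keyboard falls at or below its minimum level, so it is the only product for which a purchase order would be raised.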

Grains of fact table: The first and most important design step is declaring the fact
table grain. The grain is the business definition of what a single fact table record
represents. The grain declaration is not a list of dimensional foreign keys that
implement a primary key for the fact table. Rather, the grain is the description of the
measurement event in the physical world that gives rise to a measurement. When the
grocery store scanner measures the quantity and the charged price of a product being
purchased, the grain is literally the beep of the scanner. That is a great grain
definition.
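
The scanner example can be made concrete with a small sketch. Assuming a grain of one row per scanner beep, every measurement event becomes exactly one fact row, and coarser figures are obtained later by summing those rows. The `ScanFact` class and the sample values are invented for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class ScanFact:
    """One fact row = one scanner beep (the declared grain)."""
    scan_date: date
    product: str
    quantity: int
    charged_price: float


def record_beeps(beeps):
    """Turn each measurement event into exactly one fact row."""
    return [ScanFact(*b) for b in beeps]


def daily_quantity(facts):
    """Roll the fine-grained rows up to one total per (date, product)."""
    totals = defaultdict(int)
    for f in facts:
        totals[(f.scan_date, f.product)] += f.quantity
    return dict(totals)


# Invented sample events: (date, product, quantity, charged price).
beeps = [
    (date(2019, 1, 8), "milk", 1, 0.99),
    (date(2019, 1, 8), "milk", 2, 1.98),
    (date(2019, 1, 8), "bread", 1, 1.50),
]
facts = record_beeps(beeps)
totals = daily_quantity(facts)
print(len(facts), totals)
```

Three beeps give three fact rows, while the daily roll-up has only two entries. Storing the finest grain keeps both views available.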
Grain of fact_current_stock: current_stock_level, is_active

• current_stock_level: the present stock level of the product in the inventory.

• is_active: indicates whether any stock remains or the stock is finished
(no stock remaining).

Grain of fact_po_to_send: order_quantity

• order_quantity: the quantity of the product ordered in a purchase order, decided
after checking the current stock.

Grain of fact_po_to_receive: order_quantity, receive_quantity

• order_quantity: the quantity of the product that was ordered from the supplier.

• receive_quantity: the quantity of the product that was actually received from
the supplier.
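
Keeping both order_quantity and receive_quantity in fact_po_to_receive makes it possible to compare what was ordered with what arrived. The sketch below shows one way this could be done. The `fulfilment` helper and the derived values `shortfall` and `fill_rate` are assumptions added for illustration and are not part of the original design.

```python
def fulfilment(order_quantity: int, receive_quantity: int) -> dict:
    """Compare ordered and received quantities for one purchase order row."""
    if order_quantity <= 0:
        raise ValueError("order_quantity must be positive")
    return {
        # Units that were ordered but did not arrive.
        "shortfall": order_quantity - receive_quantity,
        # Fraction of the order that was delivered.
        "fill_rate": receive_quantity / order_quantity,
    }


result = fulfilment(order_quantity=100, receive_quantity=80)
print(result)
```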

1.2 Dimensions of Central Fact Tables

Dimensions of each central fact table:

• fact_current_stock: dim_product, dim_supplier, dim_date
• fact_po_to_send: dim_product, dim_supplier, dim_date, dim_location
• fact_po_to_receive: dim_product, dim_supplier, dim_date, dim_location
A short description of each fact table and its associated dimension tables follows.

• fact_current_stock: dim_product, dim_supplier, dim_date

To find out the facts regarding current stock, there must be a product dimension
identifying the product whose current stock is calculated. Similarly, a date
dimension is needed to find the current stock, minimum stock, etc. on a
particular day, week or month. Likewise, there must be data regarding the
supplier so that the stock level can be analyzed per supplier.

• fact_po_to_send: dim_product, dim_supplier, dim_date, dim_location

To analyze the purchase orders sent to suppliers, one must have
information regarding the product for which the order is being sent, the date of
sending the purchase order and the supplier to which the purchase order is sent.

Figure 1 Data warehousing and big data

Figure 2 Dimensional data modelling

• fact_po_to_receive: dim_product, dim_supplier, dim_date, dim_location

To analyze the purchase orders received, one must know the product for which
the order was received, the supplier who sent the order and the date on
which the order was sent.
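
The three fact tables and their dimensions described above could be laid out as a star schema along the following lines. This is a sketch under stated assumptions: SQLite (through Python's `sqlite3` module) stands in for SQL Server, and every column other than the measures named in this report (the surrogate keys, `brand`, `product_type`, `week`, `month` and the name columns) is a hypothetical choice made for illustration.

```python
import sqlite3

# Table names follow the report; most column names are assumptions.
DDL = """
CREATE TABLE dim_product  (product_key  INTEGER PRIMARY KEY, product_name TEXT,
                           brand TEXT, product_type TEXT);
CREATE TABLE dim_supplier (supplier_key INTEGER PRIMARY KEY, supplier_name TEXT);
CREATE TABLE dim_date     (date_key     INTEGER PRIMARY KEY, full_date TEXT,
                           week INTEGER, month INTEGER);
CREATE TABLE dim_location (location_key INTEGER PRIMARY KEY, location_name TEXT);

CREATE TABLE fact_current_stock (
    product_key         INTEGER REFERENCES dim_product(product_key),
    supplier_key        INTEGER REFERENCES dim_supplier(supplier_key),
    date_key            INTEGER REFERENCES dim_date(date_key),
    current_stock_level INTEGER,
    is_active           INTEGER
);
CREATE TABLE fact_po_to_send (
    product_key    INTEGER REFERENCES dim_product(product_key),
    supplier_key   INTEGER REFERENCES dim_supplier(supplier_key),
    date_key       INTEGER REFERENCES dim_date(date_key),
    location_key   INTEGER REFERENCES dim_location(location_key),
    order_quantity INTEGER
);
CREATE TABLE fact_po_to_receive (
    product_key      INTEGER REFERENCES dim_product(product_key),
    supplier_key     INTEGER REFERENCES dim_supplier(supplier_key),
    date_key         INTEGER REFERENCES dim_date(date_key),
    location_key     INTEGER REFERENCES dim_location(location_key),
    order_quantity   INTEGER,
    receive_quantity INTEGER
);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(DDL)

# List the tables that were created to confirm the schema loaded.
tables = sorted(
    row[0] for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
)
print(tables)
```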

Identifying attributes and measures of fact tables.

Business data represented by facts can offer little insight without the presence of
business concepts and context, which take the form of attributes in MicroStrategy.
Attributes provide the business model with a context in which to report on and
analyze facts. While knowing a company's total inventory is useful, knowing
where the products come from and when they arrive provides the kind of analytical
depth decision makers require on a daily basis.
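
To illustrate how attributes give facts their context, the sketch below groups received quantities by supplier and by month, which is one of the reporting requirements listed in the introduction. It assumes a cut-down version of fact_po_to_receive with SQLite standing in for SQL Server. The column names and all sample rows are invented for illustration and are not taken from the coursework data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_supplier (supplier_key INTEGER PRIMARY KEY, supplier_name TEXT);
CREATE TABLE dim_date     (date_key INTEGER PRIMARY KEY, full_date TEXT);
CREATE TABLE fact_po_to_receive (
    supplier_key INTEGER, date_key INTEGER,
    order_quantity INTEGER, receive_quantity INTEGER);

-- Invented sample rows.
INSERT INTO dim_supplier VALUES (1, 'Supplier A'), (2, 'Supplier B');
INSERT INTO dim_date VALUES
    (20181203, '2018-12-03'), (20181217, '2018-12-17'), (20190104, '2019-01-04');
INSERT INTO fact_po_to_receive VALUES
    (1, 20181203, 100, 100),
    (1, 20181217,  50,  40),
    (2, 20181217,  30,  30),
    (1, 20190104,  20,  20);
""")

# Supplier and month are the attributes; the summed quantities are the facts.
QUERY = """
SELECT s.supplier_name,
       strftime('%Y-%m', d.full_date) AS order_month,
       SUM(f.order_quantity)          AS ordered,
       SUM(f.receive_quantity)        AS received
FROM fact_po_to_receive AS f
JOIN dim_supplier AS s ON s.supplier_key = f.supplier_key
JOIN dim_date     AS d ON d.date_key     = f.date_key
GROUP BY s.supplier_name, order_month
ORDER BY s.supplier_name, order_month
"""
report = conn.execute(QUERY).fetchall()
for row in report:
    print(row)
```

The same totals without the supplier and date joins would only say how much stock arrived overall. The dimension attributes are what turn it into a per-supplier, per-month view.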

fact_current_stock Purpose of attributes

Conclusion and Reflection

The coursework is about the practical implementation of what has been learnt in
lectures. Although data warehousing and big data technology were already known in
theory, designing and implementing them for a specific case scenario has given
insight into how they work in practice.
The major steps taken in the project were identifying facts and dimensions from the
case scenario, designing the dimensional model, developing it in SQL Server, populating
the tables, exporting the table data as an external file, loading the file into HDFS and
analyzing the data with Apache Hive and Pig. Problems were faced a number of times
during this process, but with serious self-study and interaction with the teacher, they
were solved.
