Professional Documents
Culture Documents
Individual Report
60% of total marks
8TH JANUARY,2019
Acknowledgement
8TH JANUARY,2019
Abstract
The project is all about inventory management using data warehousing and big data
technology. For this coursework, data source from Alpha vertex and PreCog is taken
into consideration. The data source consists of supplier details, product details and
details about stock movements from suppliers to warehouse location. The major steps
taken in the project were identifying facts and dimensions from the case scenario,
designing dimensional modelling, developing it in sql server, populating the tables,
exporting the table data as external file, loading the file into hdfs and analyzing the data
with apache hive and pigs.
8TH JANUARY,2019
Introduction
Alpha Vertex was established in 2016 with the purpose of rendering sophisticated
analytical potentialities to investors, brokers, fund managers, and others in the
financial community by combining artificial intelligence with human insight to pinpoint
the concealed relationship in financial information. The company uses artificial-
intelligence tools to recognize, absorb, and examine traditionally studied factors that
encompasses exchange rates, interest rates, and fundamentals and non-traditional
factors that includes media content, SEC filings, weather, and litigation information.
For this coursework, data source from Alpha vertex and PreCog is taken into
consideration. The data source consists of supplier details, product details and details
about stock movements from suppliers to warehouse location. A sample of data will
be provided either as flat files or relational tables.
Inventory management is one of the most significant business processes of a
company. Its major aim is to produce a good purchase order plan to ensure that
product items are readily available when needed. Minimizing over-stocks and under-
stocks will have a crucial role in the elevation of revenue and customer satisfaction.
There are three main business activities:
Sending purchase orders to suppliers when there are product items with minimum
levels of inventory. For each sent purchase order, the ordered quantity, sent date,
product and supplier details are recorded.
Receiving purchase orders and storing stocks in appropriate locations. For each
received purchase order, the ordered quantity, received quantity, received date,
product and supplier details are recorded.
Controlling and maintaining stocks which involves adding new products and
adjusting stock levels of existing products. The details of each product are recorded
including the current stock level.
Managers responsible for managing inventory are demanding to meet the following
reporting and analysis requirements:
8TH JANUARY,2019
• A daily stock levels of all products for the last month.
• A weekly report of all products with minimum stock levels.
• Analysing stock levels by brand or product type or supplier
• Daily and weekly sent and received stock orders for the last four weeks.
• Analysing received stock orders by supplier and by month.
A Fact Table is the central table in a star schema of a data warehouse. A fact table
stores quantitative information for analysis and is often denormalized. A fact table
works with dimension tables and it holds the data to be analyzed and a dimension
table stores data about the ways in which the data can be analyzed.
According to the given case scenario, there are main three main business processes
of the company which are discussed below:
1. Sending purchase orders to suppliers when there are product items with
minimum levels of inventory. For each sent purchase order, the ordered quantity,
sent date, product and supplier details are recorded.
2. Receiving purchase orders and storing stocks in appropriate locations. For each
received purchase order, the ordered quantity, received quantity, received date,
product and supplier details are recorded.
3. Controlling and maintaining stocks which involves adding new products and
adjusting stock levels of existing products. The details of each product are
recorded including the current stock level.
There are three central fact table based on the business activities as provided. The
three central fact tables are:
fact_current_stock: This fact table is designed in order to find out the stock
levels of existing stocks and current stock levels so that depending upon the
number of current stocks, decision could be made whether to order the stock or
not, the number of stocks to be ordered etc.
fact_po_to_send: This fact table helps us to analyze the purchase orders that
are sent.
8TH JANUARY,2019
Grains of fact table: The first and most important design step is declaring the fact
table grain. The grain is the business definition of what a single fact table record
represents. The grain declaration is not a list of dimensional foreign keys that
implement a primary key for the fact table. Rather, the grain is the description of the
measurement event in the physical world that gives rise to a measurement. When the
grocery store scanner measures the quantity and the charged price of a product being
purchased, the grain is literally the beep of the scanner. That is a great grain
definition.
Grain of fact_current_stock: current_stock_level, is_active
order_quantity: It is the number of order and the product quantity that is sent
after getting Information from current stocks
Grain of fact_po_to_receive: order_quantity, receive_quantity
8TH JANUARY,2019
1.2 Dimensions of Central Fact Tables
A short descriptions of each fact table is associated with three dimension tables.
In order to find out the facts regarding current stock, there must be the dimension
of product of which the current stock is calculated. Similarly, one must have date
dimension in order to find out the current stock, minimum stock, etc on some
particular day or week or month. Likewise there must be data regarding the
supplier so that we can analyze the stock level as per the supplier.
In order to analyze the purchase orders sent to supplier, one must have
information regarding the product of which order is being sent, the data of sending
the purchase order and the supplier to which the purchase order is to be sent.
8TH JANUARY,2019
Figure 1 Data warehousing and big data
In order to analyze purchase order received, there must be the idea regarding the
product of which order is received, supplier; who sent the order and the date on
which the order was begin sent.
8TH JANUARY,2019
Business data represented by facts can offer little insight without the presence of
business concepts and context, which take the form of attributes in Micro Strategy.
Attributes provide the business model with a context in which to report on and
analyze facts. While knowing company’s total inventory is useful, knowing from
where and when the products arrive and provides the kind of analytical depth
decision makers require on a daily basis.
The coursework is all about practical implementations of what has been learnt in
lectures. Though it had been theoretically known about data warehousing and big data
technology, designing them and implementing in certain case scenarios have developed
insights about how the things work.
The major steps taken in the project were identifying facts and dimensions from the
case scenario, designing dimensional modelling, developing it in sql server, populating
the tables, exporting the table data as external file, loading the file into hdfs and
analyzing the data with apache hive and pigs. During this process problems were faced
a number of time but with serious self-study and interaction with the teacher, they were
solved.
8TH JANUARY,2019
8TH JANUARY,2019