Professional Documents
Culture Documents
Centre For Management Studies: Online Submission of Assignment-02
Centre For Management Studies: Online Submission of Assignment-02
MBA-Full Time
Semester-II (2020)
Roll No: 26
Attempt any two Questions from each assignment. Each question carries equal marks.
Max. Marks: 25 for each assignment
Assignment 2
Q1. What is the scope of data mining in industry? Discuss some advantages
of data mining?
Ans1. Data mining is a process used by companies to turn raw data into useful information.
By using software to look for patterns in large batches of data, businesses can learn more about
their customers to develop more effective marketing strategies, increase sales and decrease
costs. Data mining depends on effective data collection, warehousing, and computer
processing.
Example : Grocery stores are well-known users of data mining techniques. Many supermarkets
offer free loyalty cards to customers that give them access to reduced prices not available to
non-members. The cards make it easy for stores to track who is buying what, when they are
buying it and at what price. After analyzing the data, stores can then use this data
to offer customers coupons targeted to their buying habits and decide when to put items on sale
or when to sell them at full price.
Given databases of sufficient size and quality, data mining technology can generate new
business opportunities by providing these capabilities:
Data mining techniques can yield the benefits of automation on existing software and hardware
platforms, and can be implemented on new systems as existing platforms are upgraded and
new products developed. When data mining tools are implemented on high performance
parallel processing systems, they can analyze massive databases in minutes. Faster processing
means that users can automatically experiment with more models to understand complex data.
High speed makes it practical for users to analyze huge quantities of data. Larger databases, in
turn, yield improved predictions.
1. Marketing/Retails
In order to create models, marketing companies use data mining. This was based on history to
forecast who’s going to respond to new marketing campaigns such as direct mail, online
marketing, etc. This means that marketers can sell profitable products to targeted customers.
2. Finance/Banking
Since data extraction provides information to financial institutions on loans and credit
reports, data can determine good or bad credits by creating a model for historic customers. It
also helps banks to detect fraudulent transactions by credit cards that protect the owner of a
credit card.
3. Researchers
Data mining can motivate researchers to accelerate when the method analysis the data.
Therefore, they can work more time on other projects. Shopping behaviors can be detected.
Most of the time, you may experience new problems while designing certain shopping
patterns. Therefore data mining is used to solve these problems. All the information on these
shopping patterns can be found by mining methods. This process also creates an area where
all the unexpected shopping patterns are calculated. This data extraction can be beneficial
when shopping patterns are identified.
Q2. List out the OLAP operations and explain the same with an example?
Ans2. Online analytical processing, or OLAP, is an approach to answer multi-dimensional
analytical (MDA) queries swiftly in computing. OLAP is part of the broader category
of business intelligence, which also encompasses relational databases, report writing and data
mining.
Typical applications of OLAP include business reporting for sales, marketing, management
reporting, business process management (BPM),budgeting and forecasting, financial
reporting and similar areas, with new applications emerging, such as agriculture. The
term OLAP was created as a slight modification of the traditional database term online
transaction processing (OLTP).
OLAP tools enable users to analyze multidimensional data interactively from multiple
perspectives. OLAP consists of three basic analytical operations: consolidation (roll-up), drill-
down, and slicing and dicing. Consolidation involves the aggregation of data that can be
accumulated and computed in one or more dimensions. For example, all sales offices are rolled
up to the sales department or sales division to anticipate sales trends. By contrast, the drill-
down is a technique that allows users to navigate through the details. For instance, users can
view the sales by individual products that make up a region's sales. Slicing and dicing is a
feature whereby users can take out (slicing) a specific set of data of the OLAP cube and view
(dicing) the slices from different viewpoints. These viewpoints are sometimes called
dimensions (such as looking at the same sales by salesperson, or by date, or by customer, or by
product, or by region, etc.)
Databases configured for OLAP use a multidimensional data model, allowing for complex
analytical and ad hoc queries with a rapid execution time. They borrow aspects of navigational
databases, hierarchical databases and relational databases.
OLAP is typically contrasted to OLTP (online transaction processing), which is generally
characterized by much less complex queries, in a larger volume, to process transactions rather
than for the purpose of business intelligence or reporting. Whereas OLAP systems are mostly
optimized for read, OLTP has to process all kinds of queries (read, insert, update and delete).
1) Roll-up:
1. Reducing dimensions
2. Climbing up concept hierarchy. Concept hierarchy is a system of grouping things
based on their order or level.
2) Drill-down
In drill-down data is fragmented into smaller parts. It is the opposite of the rollup process. It
can be done via
Drill-down is performed by stepping down a concept hierarchy for the dimension time.
Initially the concept hierarchy was "day < month < quarter < year."
On drilling down, the time dimension is descended from the level of quarter to the level of
month.
When drill-down is performed, one or more dimensions from the data cube are added.
It navigates the data from less detailed data to highly detailed data.
Consider the diagram above:
3) Slice:
Here, one dimension is selected, and a new sub-cube is created. Following diagram explain
how slice operation performed:
• Here Slice is performed for the dimension "time" using the criterion time = "Q1".
• It will form a new sub-cube by selecting one or more dimensions.
A Slice represents two-dimensional view of an OLAP Cube that arranges data in a grid,
similar to a spreadsheet; a Slice functions much like a report or a query in an RDBMS in that
it returns data based on a request for what to see. In an Cube, a Slice can, optimally, be created
through drag-and-drop of Dimensions and their Members; optimally, the Slice view will
refresh instantaneously.
4)Dice:
This operation is similar to a slice. The difference in dice is you select 2 or more dimensions
that result in the creation of a sub-cube.
The dice operation on the cube based on the following selection criteria involves three
dimensions.
5) Pivot
The pivot operation is also known as rotation. It rotates the data axes in view in order to provide
an alternative presentation of data. Consider the following diagram that shows the pivot
operation.
Pivot table is a data summarization tool that is used in the context of data
processing. Pivot tables are used to summarize, sort, reorganize, group, count, total or average
data stored in a database. It allows its users to transform columns into rows and rows into
columns. It allows grouping by any data field.
In Pivot, you rotate the data axes to provide a substitute presentation of data.