
Data Warehouse

• Repository of data collected from multiple database sources through data cleaning
and the Extract, Transform and Load (ETL) process.
• Used for data analysis and decision making
• Characteristics of a data warehouse:
• Subject-oriented (organized around subjects such as sales, products, customers)
• Integrated (data from different sources is brought into a consistent format)
• Time-variant (historical data and trends, with daily, weekly, monthly views)
• Non-volatile (once data has entered the data warehouse, users cannot change or
update it; changes are recorded as new data)
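The time-variant and non-volatile characteristics can be illustrated with a small sketch. It assumes a hypothetical price_history table in an in-memory SQLite database; the table, column and function names are illustrative only and not part of any warehouse product.

```python
import sqlite3

def open_warehouse():
    """Create an in-memory table that is only ever appended to (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE price_history ("
        " product TEXT NOT NULL,"
        " price REAL NOT NULL,"
        " valid_from TEXT NOT NULL)"  # ISO date on which the value became current
    )
    return conn

def record_price(conn, product, price, valid_from):
    # Non-volatile: a change is stored as a new row, never as an UPDATE.
    conn.execute(
        "INSERT INTO price_history VALUES (?, ?, ?)",
        (product, price, valid_from),
    )

def price_as_of(conn, product, date):
    # Time-variant: the kept history lets us ask what was true on a given date.
    row = conn.execute(
        "SELECT price FROM price_history"
        " WHERE product = ? AND valid_from <= ?"
        " ORDER BY valid_from DESC LIMIT 1",
        (product, date),
    ).fetchone()
    return row[0] if row else None
```

Because old rows are never overwritten, price_as_of can answer questions about any past date for which a row exists.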
Advantages of Data Warehouse

• End users can perform multidimensional analysis
• Consolidated view of business data
• Data access is simplified
• Provides competitive advantage
• Enables historical insights

Disadvantages

• Maintenance cost
• Issues related to loading data
• Requires highly skilled manpower to manage and handle
OLTP vs OLAP

OLTP
• Online Transaction Processing
• Operational
• Transaction
• Detailed view
• Useful for administering the day-to-day transactions of an organization.

OLAP
• Online Analytical Processing
• Informational
• Analysis
• Summarized view
• Creates a single platform for all types of business analysis needs, including
planning, budgeting, forecasting, and analysis.
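The difference between a detailed transactional write and a summarized analytical read can be sketched as follows. This is only an illustration on one in-memory SQLite table; in practice OLTP and OLAP workloads usually run on separate systems, and the sales table and function names here are assumptions.

```python
import sqlite3

def open_db():
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales ("
        " order_id INTEGER PRIMARY KEY,"
        " region TEXT, month TEXT, amount REAL)"
    )
    return conn

def place_order(conn, region, month, amount):
    # OLTP-style: one short transaction that writes a single detailed row.
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT INTO sales (region, month, amount) VALUES (?, ?, ?)",
            (region, month, amount),
        )
    return cur.lastrowid

def sales_summary(conn):
    # OLAP-style: read many rows and return a summarized view.
    return conn.execute(
        "SELECT region, month, SUM(amount) FROM sales"
        " GROUP BY region, month ORDER BY region, month"
    ).fetchall()
```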
Extraction Transformation Loading (ETL)
Mapping
• A mapping represents the flow of data between source and targets
• It tells the server how to read, transform and load data
Extract
• Data is extracted from the source system into the staging area.
Transformation
• Transformations are the objects in a mapping that modify data and pass it
between sources and targets
Load
• Loading data into the target data warehouse database
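The extract, transform and load steps can be sketched as three small functions chained into one mapping. This is a minimal sketch, not how any particular ETL server works: the source rows, the fact_sales table and the cleaning rule (drop rows with no customer name) are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical source rows; a real source would be files or operational databases.
SOURCE_ROWS = [
    {"customer": " alice ", "amount": "120.50", "date": "2024-01-05"},
    {"customer": "BOB", "amount": "80", "date": "2024-01-06"},
    {"customer": "", "amount": "15", "date": "2024-01-07"},  # fails cleaning
]

def extract(rows):
    # Extract: copy raw source rows into a staging area (a plain list here).
    return [dict(r) for r in rows]

def transform(staged):
    # Transform: clean names, convert amounts, drop rows with no customer.
    cleaned = []
    for r in staged:
        name = r["customer"].strip().title()
        if not name:
            continue
        cleaned.append((name, float(r["amount"]), r["date"]))
    return cleaned

def load(conn, rows):
    # Load: write the transformed rows into the target warehouse table.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_sales"
        " (customer TEXT, amount REAL, sale_date TEXT)"
    )
    conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)

def run_mapping(conn, source_rows):
    # The mapping: the source-to-target flow expressed as one pipeline.
    return load(conn, transform(extract(source_rows)))
```

Calling run_mapping(sqlite3.connect(":memory:"), SOURCE_ROWS) loads the two valid rows and skips the one without a customer name.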

Data Mart and Metadata

Data Mart
• Subset of Data Warehouse
• Smaller and focused on a particular department/subject
• Provides fast and easy access to data
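One simple way to picture a data mart is as a department-specific view over a warehouse table. The sketch below assumes a hypothetical fact_sales table in SQLite; real data marts are often separate physical stores, so treat the view as an illustration of "subset", not as a recommended design.

```python
import sqlite3

def build_warehouse():
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE fact_sales (department TEXT, product TEXT, amount REAL)"
    )
    conn.executemany(
        "INSERT INTO fact_sales VALUES (?, ?, ?)",
        [
            ("marketing", "banner", 300.0),
            ("finance", "audit", 900.0),
            ("marketing", "flyer", 120.0),
        ],
    )
    return conn

def create_data_mart(conn, department):
    # SQLite does not allow bound parameters inside CREATE VIEW, so the
    # department is validated as a plain identifier before being interpolated.
    if not department.isidentifier():
        raise ValueError("department must be a simple identifier")
    view = f"mart_{department}"
    conn.execute(
        f"CREATE VIEW {view} AS"
        f" SELECT product, amount FROM fact_sales"
        f" WHERE department = '{department}'"
    )
    return view
```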

Metadata
• Describes the structure and meaning of data
• Ensures consistent definitions for the data in use
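As a small illustration, structural metadata can be read from the database itself and combined with agreed business definitions. The sketch uses SQLite's PRAGMA table_info; the DEFINITIONS dictionary and the fact_sales table are hypothetical.

```python
import sqlite3

# Hypothetical business definitions kept alongside the structural metadata.
DEFINITIONS = {
    "customer": "Name of the buying customer",
    "amount": "Sale value, tax excluded",
}

def describe_table(conn, table):
    # PRAGMA table_info returns one row per column:
    # (cid, name, type, notnull, dflt_value, pk)
    rows = conn.execute(f"PRAGMA table_info({table})").fetchall()
    return [
        {
            "column": r[1],
            "type": r[2],
            "required": bool(r[3]),
            "meaning": DEFINITIONS.get(r[1], "undocumented"),
        }
        for r in rows
    ]
```

Columns without an entry in DEFINITIONS show up as "undocumented", which makes gaps in the shared definitions visible.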
Data Mining
• Data mining is a process to identify useful information and patterns from a
dataset
• Patterns may be business rules, trends, associations, categorization, time
series patterns
Applications – Data mining has applications in many domains, such as:
• CRM – customer needs, profiling, retention etc.
• Banking – loan processing, fraud detection, forecasting cash flow etc.
• Retailing – predicting sales, store design, supply chain analysis etc.
• Travel – predicting services, forecasting demand, people movement etc.
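One of the pattern types mentioned above, associations, can be sketched by counting which pairs of items occur together in transactions. This is a deliberately simple pair count with a support threshold, not a full association-rule algorithm; the function name and the basket data are assumptions.

```python
from collections import Counter
from itertools import combinations

def frequent_pairs(transactions, min_support):
    """Return {pair: support} for item pairs found in at least
    min_support (a fraction between 0 and 1) of the transactions."""
    n = len(transactions)
    if n == 0:
        return {}
    counts = Counter()
    for items in transactions:
        # Sort so that each pair is always counted under the same key.
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    return {pair: c / n for pair, c in counts.items() if c / n >= min_support}

# Hypothetical retail baskets.
baskets = [
    ["bread", "milk"],
    ["bread", "milk", "eggs"],
    ["milk", "eggs"],
    ["bread", "milk"],
]
pairs = frequent_pairs(baskets, 0.5)
```

Here pairs contains ("bread", "milk") with support 0.75 and ("eggs", "milk") with support 0.5; the pair ("bread", "eggs") falls below the threshold.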
[Figure: Data warehouse. Source: Internet]
Emerging Databases
Cloud Database
• A cloud database is a database service built and accessed through
a cloud platform.
• It serves many of the same functions as a traditional database with the added
flexibility of cloud computing.
• Examples: Amazon Web Services (AWS), Microsoft Azure, Google Cloud
Platform, IBM DB2, MongoDB Atlas.
Challenges
• Data protection: even though the database is in the cloud, it still needs to be
backed up and protected.
• Companies face a lack of resources and/or skilled people.
• When a business moves to the cloud, it becomes dependent on the service
providers.

Mobile database
• A mobile database is a database that a mobile computing device can connect to
over a wireless mobile network.
• Example: SQLite, Oracle Berkeley DB, Sparksee Mobile (graph database)

Challenges
• Security issues in Mobile databases
• Frequent disconnection
• Because of the mobility and rapid resource changes of mobile units, query
optimization becomes more complicated.
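One common way to cope with frequent disconnection is to queue writes locally and send them when the network returns. The sketch below assumes a hypothetical outbox table in a local SQLite database and a caller-supplied send function that raises ConnectionError when offline; it is one possible approach, not a standard API.

```python
import sqlite3

def open_local_db():
    # ":memory:" keeps the sketch self-contained; a device would use a file path.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE outbox (id INTEGER PRIMARY KEY, payload TEXT NOT NULL)"
    )
    return conn

def save_offline(conn, payload):
    # Always write locally first, whether or not the network is available.
    with conn:
        conn.execute("INSERT INTO outbox (payload) VALUES (?)", (payload,))

def sync(conn, send):
    """Send queued rows in order; stop at the first failure and keep the rest."""
    sent = 0
    rows = conn.execute("SELECT id, payload FROM outbox ORDER BY id").fetchall()
    for row_id, payload in rows:
        try:
            send(payload)
        except ConnectionError:
            break  # still offline: leave remaining rows queued
        with conn:
            conn.execute("DELETE FROM outbox WHERE id = ?", (row_id,))
        sent += 1
    return sent
```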

Multimedia database
• Multimedia databases are used to store multimedia data such as images,
animation, audio and video, along with text.
• Example: Digital library software (an online database of digital objects that can
include text, still images, audio, video, digital documents, or other digital
media formats)
Challenges
• Multimedia data is often not well structured; therefore, standard indexing
and/or content-based search and retrieval may be complicated.
• Multimedia data often require a large storage device.
• Video, audio, and animation sequences have temporal requirements that have
implications on their storage, manipulation and presentation

Social media database

• Examples: SQLite, PostgreSQL, or NoSQL databases such as MongoDB
• MySQL with the InnoDB storage engine (used by Facebook)
• PostgreSQL and Cassandra (used by Twitter)
Challenges
• Data relevance and quality
• Difficulty in aggregating data
• Extremely rapid content turnover
• Frequent feature updates and changes

Spatial database
• A spatial database stores data that includes location (spatial) information.
• A geographic information system (GIS) integrates and analyzes spatial location
and organizes layers of information into visualizations using maps and 3D scenes.
• Example: MapInfo, ArcView
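A typical spatial question is "which places lie within a given distance of a point?". The sketch below answers it with the haversine great-circle formula over a plain list. It is an assumption-laden illustration: real spatial databases use spatial indexes and proper coordinate reference systems, and the place data here is hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def within_radius(places, lat, lon, radius_km):
    # places is a list of (name, latitude, longitude) tuples.
    return [
        name
        for name, plat, plon in places
        if haversine_km(lat, lon, plat, plon) <= radius_km
    ]
```

A linear scan like this is fine for a handful of points; a spatial index is what makes the same query practical on large datasets.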

Challenges
• Collecting new data and converting paper maps and data into digital format
continues to be a problem
• The accuracy of a map or dataset is dependent on scale and becomes
problematic when map scales are changed or when datasets are merged
