You are on page 1of 72

Big Data

Agenda
• What is Big data?
• Real world examples
• Structuring big data
• Types of data

Presented by : A.H.Shanthakumara
Big Data

What is Bigdata?
➢Lots of data is being collected and warehoused
➢Web data, e-commerce
➢purchases at department/grocery stores
➢Bank/Credit Card transactions
➢Social Network
➢A new data challenge that required leveraging
existing systems differently
➢Classified in terms of four V’s: volume variety velocity
and veracity
➢Usually unstructured and qualitative in nature
➢The process of capturing or collecting big data is
known as datafication.
➢ big data is datafied so that it can be used productivity
Presented by : A.H.Shanthakumara
Big Data

Real world examples:


➢ Data on social media websites such as Facebook and Twitter
➢ Manufacturers are monitoring minute vibration data from their
equipment which changes slightly as it wear down
➢ Customer interaction data that enables the Financial Institutions
to create increasingly relevant and sophisticated offers
➢ Social media data enables Advertising and Marketing Institutions
to campaign, promote and other advertising mediums
➢ Medical data and patient records enables hospitals to predict
those patients that are likely to seek readmission within a few
months of discharge
Presented by : A.H.Shanthakumara
Big Data

Real world examples:


➢ Customer data and product information can be used by the web
based businesses to offer more appealing recommendations
and more successful coupon programs
➢ The government is making data public at the national, state and
city level for the users to develop new applications that can
generate public good
➢ Sports teams are using data for tracking team strategies. this is
known as sports Analytics

Presented by : A.H.Shanthakumara
Big Data

Types and sources of data:


➢ Social data: refers to the information collected from social
networking sites and online portals
➢ source: Facebook, Twitter, LinkedIn
➢ Machine data: refers to the information generated from RFID
chips, bar code scanners and sensors.
➢ source: RFID chips readings. Global positioning system
results
➢ Transactional data: refers to the information generated from
online shopping sites, retailers and business to business
transactions
➢ source: Retail websites like ebuy and Amazon
Presented by : A.H.Shanthakumara
Big Data

Structuring big data:


➢ An arranging the available data in a manner such that it
becomes easy to study, analyze and derive conclusion from it.
➢ Structuring data helps in understanding user behavior
requirements and preferences to make personalized
recommendations for every individual.
➢ Different types of data can be structured only if it is sorted and
organized in some logical pattern.
➢ The process of structuring data required one to first understand
the various types of data available today

Presented by : A.H.Shanthakumara
Big Data

Types of data:
➢ Data is obtained primarily from two types of sources
1. Internal sources
➢ Provides a structured for organized data
➢ Used to support daily business operations
➢ Data generated from CRM, ERP, OLTP, POS., etc..

2. External sources
➢ Provides unstructured or unorganized data
➢ Often analyzed to understand the entities mostly external to the

organization
➢ the data generated from social media, internet, government agencies,
Syndicate data suppliers
Presented by : A.H.Shanthakumara
Big Data

Types of data:
➢ On the basis of the data received from various sources, the Big
data comprises: Structured data, unstructured data and semi-
structured data.
➢ Typically, the unstructured data is larger in volume then the
structured and semi structured data

Presented by : A.H.Shanthakumara
Big Data

Structured data:
➢ It can be defined as the data that has defined repeating pattern
➢ Much easier and faster to process
➢ Mostly in tabular form
➢ Fixed fields within a record or a file
➢ Used to query and report against predetermined data types
➢ Some sources of structured data
➢ Relational databases
➢ Flat files in the form of records
➢ multi dimensional databases
➢ Legacy databases

Presented by : A.H.Shanthakumara
Big Data

Unstructured data:
➢ It is a set of data that might or might not have any logical or
repeating patterns
➢ Typically of meta data
➢ Inconsistent data obtained from files, social media, website,
satellite etc..
➢ Data in different formats such as emails, text, audio, video or
images
➢ Some sources of unstructured data:
➢ Text data both internal and external to the organization
➢ Social media
➢ Mobile data
Presented by : A.H.Shanthakumara
Big Data

Challenges associated with unstructured data:


➢ Identifying the unstructured data that can be processed
➢ Sorting, organizing and arranging unstructured data in different
sets and formats
➢ Combining and linking unstructured data in a more structured
format
➢ Costing in terms of storage space and human resources

Presented by : A.H.Shanthakumara
Big Data

Semi-structured data:
➢ Also known as having a schema-less or self describing
structure.
➢ Data is stored inconsistently in rows and columns of a database
➢ Some sources of semi structured data:
➢ File systems such as web data in the form of cookies

➢ Data exchange format such as JavaScript Object Notation data

Presented by : A.H.Shanthakumara
Big Data

Agenda
• Elements of big data
• Big Data Analytics
– Advantages of Big Data Analytics
– The application areas of big data

Presented by : A.H.Shanthakumara
Big Data

Elements of big data:


➢Big data growth can be depicted in terms of the
following four V's
➢Volume, Velocity, Variety, Veracity
Volume:
➢The amount of data generated
➢Organizations are doing their best to handle this ever
increasing volume of data
➢Internet alone generates a huge amount of data
➢The total data stored on the internet has crossed 1
yottabyte
➢The exact size of the Internet will never be known

Presented by : A.H.Shanthakumara
Big Data

Elements of big data:


Velocity:
➢Describes the rate at which data is generated,
captured and shared
➢Information processing systems face problems
associated with data, which keeps adding up but
cannot be processed quickly
➢The sources of velocity data:
➢IT devices:- routers, switches, firewalls constantly
generate valuable data
➢Social medium:- create huge amount of data,
which is to be analyzed instantly at a fast speed
➢Portable devices:- generates data at high speed
Presented by : A.H.Shanthakumara
Big Data

Elements of big data:


Variety:
➢Data generated from different types of sources comes
in different formats such as images, text, audio,
videos, etc...
➢Even a single source can generate data in a variety of
formats
Veracity:
➢Refers to the uncertainty of data
➢Only correct and consistent data can be used for
further analysis
➢A lot of effort goes in processing the data
➢Big data takes a good amount of time and expertise
to clean that data and make it suitable for analysis
Presented by : A.H.Shanthakumara
Big Data

Big Data Analytics:


➢Reformed the ways to conduct business in many
ways
➢Business analytics uses the data and different other
techniques to improve decision making and business
process management
➢The conventional database systems are not in a
position to process big data
➢There are three main types of business analytics:
Descriptive analytics, Predictive analytics and
Prescriptive Analytics

Presented by : A.H.Shanthakumara
Big Data

Big Data Analytics:


Descriptive Analytics:
➢It answer the question "what happened in the
business?"
➢Analysis a database to provide information on the
trends of past and current business events that can
help managers, planners, leaders, etc...
➢Performs an in-depth analysis of data to reveal details
such as frequency of events operation cost and
underlying reason for failures
➢Helps in identifying the root cause of the problem

Presented by : A.H.Shanthakumara
Big Data

Big Data Analytics:


Predictive Analytics:
➢It is about understanding and predicting the future
and answer the question "what would happen?"
➢It predicts the near future probabilities and trends and
helps in what if-analysis
➢We use statistics, data mining techniques and
machine learning to analyze the future
Prescriptive Analytics:
➢It answers what should we do on the basis of complex
data obtained from descriptive and predictive analysis
➢Using the Optimization techniques, it determine the
finest substitute to minimize or maximize some
equitable finance, marketing and many other areas
Presented by : A.H.Shanthakumara
Big Data

Big Data Analytics:


Advantages of Big Data Analytics:
➢ The real power of big data lies in its analysis,
processing, studying and implementing the
conclusions derived from the analysis of big data
➢ The right analysis of the available data can improve
major business processes in various ways
➢ Using big data cuts your costs.
➢ Using big data increases your efficiency.
➢ Using big data improves your pricing.
➢ You can compete with big businesses.
➢ Allows you to focus on local preferences.
➢ Using big data helps you increase sales and loyalty.
➢ Using big data ensures you hire the right employees
Presented by : A.H.Shanthakumara
Big Data

Big Data Analytics: The application areas of big data


➢ Transportation: identify traffic jam areas, provide real
time traffic information, help traffic control authority
➢ Education: innovative approaches in e-learning,
behavioral signals of the students, distance learning
over vast geographical areas
➢ Travel: determine certain behavioral pattern in
customers, advertising for improving customer
experience
➢ Government: taking timely and informed decisions
about various issues, identifying flaws and
loopholes, assessing the areas of improvement

Presented by : A.H.Shanthakumara
Big Data

Big Data Analytics: The application areas of big data


➢ HealthCare: improve their research and development
practices, eliminating Healthcare related challenges
➢ Telecom: utilize data for extracting meaningful
information, improving Customer services, generate
more business opportunities
➢ Consumer goods industry: POS data provides
significant real-time information about customer
preferences, current market trends, increase and
decrease in demands of different products at different
regions
➢ Aviation industry: flying preferences and other trends
and patterns, improve Customer services, enhance
operational efficiency
Presented by : A.H.Shanthakumara
Big Data

Agenda
• Use of big data in social networks
• Use of big data in preventing fraudulent
activities
• Use of big data in Retail Industry

Presented by : A.H.Shanthakumara
Big Data

Use of big data in social networks:


➢ Activities on social media generate large amount of data

➢ Analysing and mining such large data shows business trends

with respect to wants and preferences and likes and dislikes of

a wide audience.

➢ Social Network Analysis is performed on the data obtained

from social media.

➢ Example: Data derived from social media enables an

organisation to calculate total revenue a customer can

influence instead of the direct revenue the customer generates


Presented by : A.H.Shanthakumara
Big Data

Use of big data in social networks:


The following are the areas in which decision-making processes
are influenced by social network data:
➢ Business intelligence, marketing, product design and
development
Business intelligence:
➢ Data analysis process to convert a raw data set to meaningful
information by using different techniques and tools for boosting
business performance.
➢ Data is analysed to gain important business insights.
➢ Example :by using the Calls record, mobile service provider can
value the customer.
➢ Can identify highly connected customers, betterment of their
products and services, maximize the profitability of the
customer's network
Presented by : A.H.Shanthakumara
Big Data

Use of big data in social networks:


Marketing: marketers aims to deliver what consumers want by
using interactive communication across digital channels
➢ Analysis generate very useful and meaningful business insights
that may help organisations to take timely and informed
decision
Product design and development:
➢ Organisation can make a right decision in the direction of their
product development and offerings by listening to what
customer want, by understanding where the gap in the
offering is.
➢ Identify and major customers behavior and online trends
➢ Study the quality of the sentiments expressed in the
conversations
Presented by : A.H.Shanthakumara
Big Data

Use of big data in preventing fraudulent activities:


➢ The most common types of financial frauds: credit card fraud,
exchange or return policy product, personal information fraud.
Preventing fraud using Big Data Analytics:
1. Keep track of and process huge volumes of data
2. Differentiate between real and fraudulent entries
3. Identify new methods of fraud and add them to the list of fraud
prevention checks
4. Determine the location of the customer and the time when the
product was actually delivered
5. Verify whether a product has actually be delivered to the valid
recipient
Presented by : A.H.Shanthakumara
Big Data

Use of big data in preventing fraudulent activities:


Fraud detection in real time:
➢ Compare live transactions with different data sources to
validate the authenticity of online transactions
➢ Example: Compare incoming IP address with the geo-data
received, examine the entire historical data to track suspicious
patterns of the customer order,
Visually analyzing fraud:
➢ Image analytics can help to detect frauds
➢ Examples: the use of barcode and QR codes, facial recognition
and position and movement analysis
➢ Create maps and graph for comparison that can be used to
analyse situations and take decisions
Presented by : A.H.Shanthakumara
Big Data
Use of big data in detecting fraudulent activity in Insurance sector:
➢ Social media provide a real-time view of the case in hand
➢ Enable the insurance agent to diagnose the patterns of
customers claims, behaviour and other related issues.
➢ Can detect patterns of fraudulent behaviour from large amounts
of structured and unstructured data
➢ Data Analytics capable of handling various sources of
information from different channels in an integrated way
➢ Public data is an important source to conduct predictive
analysis in order to avoid frauds
➢ Business organisations integrate their internal data with third
party data, this helps in investigating and restricting
fraudulent activities
Presented by : A.H.Shanthakumara
Big Data
Use of big data in detecting fraudulent activity in Insurance sector:

Social Network Analysis:


➢ It is an innovative and effective way to identify and detect
frauds
➢ It is a mixed approach includes statistical methods, pattern
analysis and link analysis to identify any kind of relationships or
patterns
Method to detect fraud using SNA:
1. Load data gathered data from different sources into the Extract, Transform
and Load(ETL) tool. ETL process this data and loaded into warehouse.
2. Marks the risk of fraud and ranks the likelihood of fraud
3. Use big data technologies including text mining and content categorization
in fraud detection and predictive modelling mechanism.
4. On the basis of score of a specific network, an alert is made
5. Start working on the fraudulent claim
6. Finally, frauds that are detected added into the use case system
Presented by : A.H.Shanthakumara
Big Data
Use of big data in Retail Industry:

➢ With immense number of transactions and their correlations the


Retail Industry offers promising space for big data to operate
➢ Examples :find a relation between the organisations sales in
store and online sales
➢ Many times extracting data in real time is not easy
➢ The information feed to big data can be long-term strategic or
immediate feed or will not be used at all

Presented by : A.H.Shanthakumara
Big Data
Use of big data in Retail Industry:

Use of RFID data in retail:


➢ Automate the process of labelling and tracking products
➢ Various types of RFID tags are available and also comes in
various sizes and are of varied capabilities including read and
write capability, memory, and power requirements
➢ Some common benefits of using a RFID:
1. Asset Management
2. Inventory control
3. Shipping and receiving
4. Regulatory compliance
5. Service and warranty authorizations
Presented by : A.H.Shanthakumara
Big Data

Agenda
• Distributed and parallel computing for big
data
• How data models and computer models
are different?:

Presented by : A.H.Shanthakumara
Big Data

New technologies for handling big data:


➢ Voluminous, varied and scattered data cannot be handled by

traditional data storage and processing systems

➢ Modern technologies are used to handle process and analyze

big data

➢ Distributed and parallel processing, Hadoop, In-memory

computing(IMC), big data clouds, etc...

➢ Help to analyze data in different ways under varying

circumstances.

Presented by : A.H.Shanthakumara
Big Data

Distributed and parallel computing for big data:


Distributed computing:
➢ Multiple computing resources are connected in a network and
computing task are distributed across these resources.
➢ Sharing of task increases speed and efficiency
➢ Suitable to process use amounts of data in a limited time
Parallel computing:
➢ Add additional computational resources to computer system to
increase processing capability
➢ Divide complex computations to subtask, which can be
handled individually by processing units Presented by : A.H.Shanthakumara
Big Data

Distributed and parallel computing for big data:

Presented by : A.H.Shanthakumara
Big Data

Distributed and parallel computing for big data:


➢ Organizations use both parallel and distributed computing techniques to
process big data
➢ The common procedure followed by software applications:
1. Break up the given task into smaller components
2. Serving the available resources at hand
3. Assign the subtask to the nodes or components that are
interconnected in a network
➢ Major problem that often hampers data storage and processing activities
is latency.
➢ It is defined as the aggregate delay in the system because of delays in the
completion of individual task.
➢ It leads to Slow Down in system performance
➢ Effects on productivity and profitability of the organization
➢ Implementing distributed and parallel computing methodologies helps in
getting both latency and data related problems. Presented by : A.H.Shanthakumara
Big Data

Distributed and parallel computing for big data:

➢ Nodes are arranged within a system along with the elements


that form the core of computing resource
➢ Nodes are more beneficial for adding scalability to the big data
environment
➢ It makes use of virtualization and load balancing feature
Presented by : A.H.Shanthakumara
Big Data

Distributed and parallel computing for big data:


Techniques of parallel computing:
➢ Uses number of techniques to process and manage huge
amounts of data
➢ Cluster or grid computing (used in Hadoop):
➢ Server share the workload among clusters
➢ Cluster can be either homogeneous or heterogeneous
➢ Massively parallel processing(MPP):
➢ single machine working as a grid
➢ Handles storage, memory and computing
➢ High performance computing(HPP):
➢ Offer performance and scalability by using IMC
➢ Suitable for processing floating point data at high speed
Presented by : A.H.Shanthakumara
Big Data

Distributed and parallel computing for big data:

Presented by : A.H.Shanthakumara
Big Data

How data models and computer models are different?:


Data Models:

Distributed databases: Hadoop:


➢ Deals with the tables and ➢ Deals with flat files in any format
relations ➢ Operates on no schema for data
➢ must have a schema for data ➢ Divides files automatically into
➢ implements data fragmentation blocks
and partitioning
Presented by : A.H.Shanthakumara
Big Data

How data models and computer models are different?:


Computing Models in distributed databases:
➢ Generate notations of transaction
➢ Implement ACID transaction
properties
➢ allows distributed transaction

➢ Generate notations of
job divided into tasks
➢ Implements MapReduce
computing model
➢ Consider every task as
either a map or a reduce
Presented by : A.H.Shanthakumara
Big Data

Agenda
• Introducing Hadoop
• Hadoop multinode cluster architecture
• Important features of Hadoop
• How does hadoop function?

Presented by : A.H.Shanthakumara
Big Data

Introducing Hadoop:
➢ Traditional technologies are incapable to handle large data
➢ Combined number of Technologies and products into system
that can overcome the challenges faced by traditional
processing systems (Hadoop)
➢ Hadoop is an open source platform that provides Analytical
Technologies and computational power
➢ Provides an improved programming model
➢ There are two main components: Hadoop distributed file
system(HDFS) and the MapReduce
➢ Hdfs is used for storage and mapreduce used for processing

Presented by : A.H.Shanthakumara
Big Data

Hadoop multinode cluster architecture:

➢HDFS is a cluster of storage solution that is highly reliable, more


efficient and economical
➢Provides facility to manage files containing related data across
machines Presented by : A.H.Shanthakumara
Big Data

Hadoop multinode cluster architecture:


➢ Cluster consists of single master node and multiple worker nodes
➢ Master node contains a Namenode and jobtracker
➢ Namenode is a single point for storage and management of metadata
➢ Worker node acts as both Datanode and Tasktracker
➢ HDFS is managed through Namenode server which keeps snapshots
of other Namenodes
➢ The secondary Namenode take snapshots of primary Namenode
periodically.
➢ Job tracker assigns task to Tasktracker.
➢ Each Datanode sends a Heartbeat signal to Namenode after every few
minutes(Heartbeat mechanism)
Presented by : A.H.Shanthakumara
Big Data

MapReduce:
➢ A Framework that helps developers to write programs to process
large volume of unstructured data in parallel over a distributed
architecture
➢ Programmers use mapreduce libraries to build talk without
communication or coordination between nodes
➢ Performs all mathematical computations
➢ Parallel and distributed implementation provides high
performance
➢ Each node will periodically report it’s status to masternode
➢ If a node does not respond as expected the Masternode re-

assigns that job to other available nodes in the cluster


Presented by : A.H.Shanthakumara
Big Data

MapReduce:

Presented by : A.H.Shanthakumara
Big Data

MapReduce:
➢ Mapreduce consists of several components, few are
➢ Jobtracker: Masternode that manages all jobs and resources in a
cluster
➢ Tasktrackers: agents deployed at each machine in the cluster to run
the map and reduce task at the terminal
➢ JobHistoryServer: component that tracks completed jobs

Presented by : A.H.Shanthakumara
Big Data

Important features of Hadoop:


1. Performs well with several nodes without requiring shared memory
2. Follows the client-server architecture
3. Data stored across various nodes can be tracked in Hadoop
Namenode
4. Improve data processing by running computing tasks on all available
processors in parallel.
5. Keeps multiple copies of data to improve fault tolerance

Presented by : A.H.Shanthakumara
Big Data
How does hadoop function?:
➢ Hadoop cluster are created from the racks of commodity machines
➢ Tasks are created and distributed across these nodes.
➢ Nodes are allowed to work independently and provide their responses
to the starting node.
➢ Hadoop can add or remove node dynamically in a cluster.
➢ Accomplishes its operations with mapreduce model.
➢ Mapreduce model comprises two functions: mapper and reducer
➢ Mapper maps the computational subtask to different nodes, handles
load balancing and managing failure recovery.
➢ Reducer reduces the responses from compute nodes to a single result.
➢ Aggregate all the elements together after the completion of the
distributed computation Presented by : A.H.Shanthakumara
Big Data
How does hadoop function?:

Presented by : A.H.Shanthakumara
Big Data

Agenda
• Cloud computing and big data
• Cloud Computing model
• Features of Cloud computing
• Cloud deployment models
• Cloud delivery models
• In-memory computing Technology for big
data

Presented by : A.H.Shanthakumara
Big Data

Cloud computing and big data:


➢ Storage and management of big data requires hardware setup
and software packages
➢ Utilization requirement of these resources varying overtime.
➢ Share the resource to overcome utilization requirement
challenge using cloud computing.
➢ Cloud computing saves cost related to infrastructure in an
organisation by providing a framework.
➢ Cloud implementation requires common standardized
processes and their automation

Presented by : A.H.Shanthakumara
Big Data

Cloud Computing model:


➢ Applications can easily obtain the resources to perform for
computing tasks.
➢ The cost to be paid as per the acquired resources and their
use
➢ Resource acquisition is in accordance with the requirements
and payment of cost (elasticity)
➢ Dynamically regulate the use of computing resources and
access them as per the need.
➢ Organisation needs to plan, monitor and control its resource
utilization carefully.

Presented by : A.H.Shanthakumara
Big Data

Cloud Computing model:

➢ Cloud Computing uses data canters to collect data and ensure the data
backup and recovery automatically performed.
➢ Both cloud computing and Big Data Analytics use the distributed

computing model
Presented by : A.H.Shanthakumara
Big Data

Features of Cloud computing:


Scalability: means addition of new resources to existing
infrastructure
➢ Need to replace the existing hardware with a new set of
hardware components to improve data management and
processing
➢ Some issues like hardware incompatibility maybe resolved
using cloud services
Elasticity: means hiring resources as and when required and
paying for the resource that have been used.
➢ Customers need not to declare their resource requirements in
advance
Presented by : A.H.Shanthakumara
Big Data

Features of Cloud computing:


Resource pooling: means multiple organisations have no need to
individually hire all the resources
➢ Sharing of resource is allowed to cut the cost.
Self service: simple user interface that helps customer to directly
access the cloud services.
➢ The selection of needed services requires no intervention from
human beings and can be accessed automatically.

Presented by : A.H.Shanthakumara
Big Data

Features of Cloud computing:


Low cost: a careful planning, use, management and control of
resources helps organisations to reduce the cost.
➢ Customized solutions can help the organisations that cannot
offer too much initial investment.
➢ The pay- as-you- use option benefits the economies of scale
and providing benefit to customers in terms of cost reduction.
Fault tolerance: cloud offers uninterrupted services to customers
➢ In case of failure, the responsibility of handling the workload
will be shifted to other components of the cloud

Presented by : A.H.Shanthakumara
Big Data

Cloud deployment models:


➢ Services are offered in the form of various deployment models
➢ Depends on an architecture used in forming the network,
services and applications used and also a target customers
➢ Most commonly;
public cloud, private cloud, community cloud and hybrid cloud

Presented by : A.H.Shanthakumara
Big Data

Public cloud(end user level cloud):

➢Cloud that is owned and managed by a company than the


one(individual or a company) using it.
➢The resources are managed and controlled by a third party.
➢The resources are owned or hosted by the cloud service providers
and the services are sold to other companies
➢The process of computing becomes very flexible and scalable
through customized resources
➢Examples: Amazon Web Services, Verizon, Rackspace
Presented by : A.H.Shanthakumara
Big Data

Private cloud(enterprise level cloud):


➢The cloud that remains entirely in the ownership of the
organisation using it
➢The infrastructure is designed specific to a single organisation.
➢That organisation may allow this cloud to be used by its
employees, partners and customers.
➢A private cloud, you can automate several processes and
operations that require manual handling in a public cloud
➢A private cloud can be either on premises or hosted externally
➢Designed the solution concerned with latency and security issues.
➢Get the advantages of cloud architecture without providing the
privileges to manage your own data
Presented by : A.H.Shanthakumara
Big Data

Private cloud(enterprise level cloud):

Presented by : A.H.Shanthakumara
Big Data

Community cloud:

➢Cloud services are shared among various organisations with a


common tie.
➢Generally managed by a third party offering the cloud services and
can be made available on or off premises.
Presented by : A.H.Shanthakumara
Big Data

Hybrid cloud:
➢Cloud environment in which
various internal or external
service providers offer
services to many
organisations
➢An organisation can use
both private and public cloud
together
➢The organisation can
manage an internal private
cloud for general use and may
access public cloud during the
peak periods
Presented by : A.H.Shanthakumara
Big Data

Cloud delivery models:


➢Provides computational resources in the form of hardware, software
and platform
Infrastructure as a service(IaaS): combination of various elements
such as network, storage and hardware components
➢made of highly scalable and automated compute resources.
➢IaaS is fully self-service for accessing and monitoring computers,
networking, storage, and other services.
➢allows businesses to purchase resources on-demand and as-needed
instead of having to buy hardware outright.
➢User of IaaS can outsource and build a "virtual data centre" in the
cloud and have access to many of the same technologies and
resources without having to invest and manage physical infrastructure
Presented by : A.H.Shanthakumara
Big Data

Cloud delivery models:


Platform as a Service(PaaS):
➢Provide cloud components to certain software while being used mainly for
applications.
➢PaaS delivers a framework for developers that they can build upon and use
to create customized applications.
➢Platform is delivered via the web, giving developers the freedom to
concentrate on building the software without having to worry about operating
systems, software updates, storage, or infrastructure.
➢Builds on virtualization technology, so resources can easily be scaled up or
down as your business changes
➢Provides a variety of services to assist with the development, testing, and
deployment of apps
Presented by : A.H.Shanthakumara
Big Data

Cloud delivery models:


Software as a Service(SaaS):
➢Provides software applications that are accessible from wherever the user is
➢Customers do not require purchasing the software or installing it on their own
devices
➢A majority of SaaS applications run directly through the web browser.
➢SaaS vendors manage all potential technical issues, such as data,
middleware, servers, and storage, resulting in streamlined maintenance and
support for the business.
➢Greatly reducing the time and money spent on tedious tasks such as installing,
managing, and upgrading software
➢Suitable for applications that aren’t needed too often, such as tax software
➢and some applications that need both web and mobile access
Presented by : A.H.Shanthakumara
Big Data

Cloud services for big data:


➢Cloud is an ideal computing environment for handling big data
➢The Huge storage and computational power requirements for big data is
fulfilled by IaaS cloud
➢Various popular big data platforms including Mapreduce and Hadoop offered
by PaaS venders
➢The social media data and platform for analyzing the data provided by SaaS
vendors

Presented by : A.H.Shanthakumara
Big Data

Cloud providers in Big Data market:


➢Assignment: at least 10 cloud service providers rendering service that are
relevant to Big data analytics.

In-memory computing Technology for big data:


➢In-Memory Computing provides super-fast performance (thousands of times
faster) and scale of never-ending quantities of data, and simplifies access to
increasing numbers of data sources.
➢By storing data in RAM and processing it in parallel, it supplies real-time
insights that enable businesses to deliver immediate actions and responses.
➢In-Memory Computing is all about how much data can be ingested and
analyzed, and how fast the analysis can be performed
➢The volume related issues of big data addressed by using IMC and the
diversity of big data is taken care by NoSQL database
Presented by : A.H.Shanthakumara
Big Data

In-memory computing Technology for big data:

Presented by : A.H.Shanthakumara
Big Data

In-memory computing Technology for big data:

Presented by : A.H.Shanthakumara

You might also like