You are on page 1of 22

SYSTEMATICS

Prep Session 1
AGENDA

• Cloud computing
• Big Data Analytics
CLOUD COMPUTING

Cloud computing is the on-demand delivery of Why is Cloud Computing required?


compute power, database, storage, applications,
• With cloud computing, you don’t need to make large
and other IT resources via the internet. For ex.
upfront investments in hardware and spend a lot of time
• Google Cloud on the heavy lifting of managing that hardware. Instead,
• Amazon Web Services (AWS) you can provision exactly the right type and size of
computing resources you need to power your newest
• Microsoft Azure idea or operate your IT department.You can access as
• IBM Cloud many resources as you need, almost instantly, and only
pay for what you use.

• Rather than keeping files on a proprietary hard drive or


local storage device, cloud-based storage makes it
possible to save them to a remote database.
• For cost savings, increased productivity, speed and
efficiency and performance
How does Cloud computing work?
• Cloud computing gives you access to servers, storage, databases, and a broad set of application
services over the Internet. A cloud services provider such as Amazon Web Services, owns and
maintains the network-connected hardware required for these application services, while you
provision and use what you need via a web application.

• Cloud computing is named as such because the information being accessed is found remotely in the
cloud or a virtual space. Companies that provide cloud services enable users to store files and
applications on remote servers and then access all the data via the Internet.
ADVANTAGES AND DISADVANTAGES OF CLOUD COMPUTING
Advantages Disadvantages
• Scalability – Easily scalable • Vulnerability to attacks - Storing data in cloud may pose
serious challenge of information theft since in cloud every
• Cost efficient - It takes considerably lesser cost than an on-
data of your company is online. Storing confidential data in
premise technology
cloud can be a risky affair.
• High Speed – Deployment of service quickly in fewer clicks
• Network connectivity dependency - You need a reliable
• Excellent accessibility- Storing the information in cloud allows and consistent internet service as well as a good connection
you to access it anywhere and anytime regardless of the machine speed and bandwidth for your business to reap the benefits of
• Manageability- Eliminates the need for IT infrastructure updates cloud computing.
and maintenance since the service provider ensures timely, • Downtime
guaranteed and seamless delivery of services and takes care of all
• Vendor lock in - Support issues, configuration complexities
the maintenance
and additional expenses.Your data might also be left
• Pay as you go - Lets you add or subtract resources and services vulnerable to security attacks due to compromises that might
according to your needs have been made during migrations.
• Strategic edge - It helps you to access the latest and applications
any time without spending your time and money on installations
TYPES OF CLOUD

• Public Cloud – In this model, the cloud resources are owned and operated by a third-party cloud service provider.
Same hardware, storage and other services are shared with other organizations

• Private Cloud – In this model, computing resources are used exclusively by one business organization and are always
maintained on a private network. It can be physically located at your organization’s on site data-center or can be hosted
by a third-party service provider. It allows organizations to easily customize its resources to meet IT requirements. E.g
Government agencies, financial institutions and business critical operations seeking enhanced security

• Hybrid Cloud - Hybrid clouds combine on-premises infrastructure, or private clouds, with public clouds so
organizations can reap the advantages of both. In a hybrid cloud, data and applications can move between private and
public clouds for greater flexibility and more deployment options. For instance, you can use the public cloud for high-
volume, lower-security needs such as web-based email and the private cloud (or other on-premises infrastructure) for
sensitive, business-critical operations like financial reporting
CLOUD COMPUTING SERVICES
Software as a Service (SaaS) Platform as a Service(PaaS) Infrastructure as a Service (IaaS)

• Software hosted on machines • Development environment hosted by • Raw building blocks of data
provided by third party environment provided by third party
third party
• Applications accessed remotely via • Processing capacity, storage,
client and/or the Web • Targeted at developers connectivity, security, etc.

• Targeted at application end users • Microsoft Azure, Google App Engine • Microsoft Azure, AWS

• Google Apps, Dropbox, Cisco WebEx


BIG DATA AND ANALYTICS

Data, in the context of computing, refers to distinct pieces of digital information. Data is
Data usually formatted in a specific way and can exist in a variety of forms, such as numbers, text,
etc.

Big data - Big data is a term applied to data sets whose size or type is beyond the ability of
Big Data traditional relational databases to capture, manage and process the data with low latency.
Big data has one or more of the following characteristics: high volume, high velocity or high
variety
4V’S OF BIG DATA

• Volume - Refers to the amount of data


• Variety - Refers to the many sources and
types of data. Data today comes in many
different formats: structured data, semi-
structured data, unstructured data
• Velocity - Refers to the speed of generation
of data. How fast the data is generated and
processed to meet the demands.
• Veracity - Refers to the uncertainty with the
data
TYPE OF DATA

Structured - Structured data Unstructured data - Any Semi-structured - Semi-


is highly-organized and data with unknown form or the structured data is
formatted in a way so it's easily structure is classified as information that doesn’t
searchable in relational unstructured data., making it reside in a relational database
databases. Examples of much more difficult to collect, but that does have some
structured data include names, process, and analyze. Examples organizational properties that
dates, addresses, credit card of unstructured data include make it easier to analyze. E.g
numbers, stock information, video, audio, mobile activity, XML data
geolocation, and more. social media activity, satellite
imagery, surveillance imagery
• Further reading
Effectively storing huge data
which is increasing at a rapid
pace

CHALLENGES
WITH BIG DATA Cleaning of data(making it
relevant)

Keeping up with technology


and tools to extract
information and insights
WHAT IS BIG DATA ANALYTICS?
Data Analysis vs Data Analytics
Big data analytics is the process of collecting,
Analytics is defined as “a process of transforming data into
organizing and analyzing large volumes of data to actions through analysis and insight in the context of
find out the required information and to reveal organizational decision making and problem-solving.”
hidden patterns from the data which might be
Data analysis is a broader term that refers to the process of
helpful to take profitable business decisions compiling and analyzing data in order to present findings to
management to help inform business decision making.
Data analytics is a subcomponent of data analysis that involves
the use of technical tools and data analysis techniques.
TYPES OF ANALYTICS
• Descriptive - Answers the question of what happened. Descriptive analytics juggles raw data from multiple
data sources to give valuable insights into the past. However, these findings simply signal that something is
What happened
wrong or right, without explaining why. E.g. A healthcare provider will learn how many patients were
hospitalized last month; a retailer – the average weekly sales volume

• Diagnostic - Answers the question of why something happened. It gives in-depth insights into a problem.
Why something
E.g. a retailer can drill the sales and gross profit down to categories to find out why they missed their net happened
profit target

• Predictive - Predictive analytics tells what is likely to happen. It uses the findings of descriptive and
diagnostic analytics to detect tendencies, clusters and exceptions, and to predict future trends, which What is likely to
makes it a valuable tool for forecasting. E.g. Improving operations to better manage inventory and other happen
resources, or to set prices for services based on things like seasonality.

• Prescriptive – To prescribe what action to take to eliminate a future problem or take full advantage of a
promising trend. E.g. to maximize profits for an airline, prescriptive analytics can do this by automatically What action to
adjusting ticket price and availability based on numerous factors, including customer demand, weather, and take
gasoline prices.
USE CASES OF BIG DATA ANALYTICS

Utilizing the right data will allow companies to:


• Reduce operational costs.
• Track current metrics and create new ones.
• Understand its customers on a far deeper level.
• Unveil smarter and more targeted marketing campaigns.
• Find new product opportunities and offerings.
How Analytics can help in ?
Marketing Supply chain
• How are our marketing initiatives performing today? Analytics helps monetize and optimize:
• Which of them are viable in the long run? • Current inventory status
• How can we improve those which are not effective? • Forecasts
• How do our marketing activities compare with our • Demand planning
competitors’? • Sourcing
• What can we learn from our competition? • Production
• Are our marketing resources properly allocated? • Improved worker productivity measurement
• Transportation routing
• Assessing manufacturing quality
Finance • Using data from sensors on manufacturing equipment
Detecting fraud, risk management, data security,
finding unexplored strategic opportunities Human Resource
Typically financial analytics includes • Talent acquisition and retention
• Risk analysis • Attrition
• Working capital management • Headcount Management and Workforce
• Fraud detection and prevention Optimization
• Shareholder metric analysis • Optimization of Compensation and Benefits
ROLES IN ORGANIZATIONS FOR BIG DATA ANALYTICS
Tools used by Data Analysts
• Tableau Public
• OpenRefine
• KNIME
• RapidMiner
• Google Fusion Tables
• NodeXL
• Wolfram Alpha
• Google Search Operators
• SAS
• R

You might also like