Welcome to Scribd!

1 Designing Data-Intensive Apps - CH 1

Uploaded by

0% found this document useful (0 votes)

2 views2 pages

This chapter introduces the topics that will be covered in the book, which are foundations of data systems, distributed data, and derived data. It discusses that most applications are data-intensive and this book will talk about design principles for reliable, scalable, and maintainable data systems. The main concerns for data systems are reliability in the face of hardware/software faults and human errors, scalability to large loads and performance needs, and maintainability through operability, simplicity and evolvability.

Original Description:

Original Title

1 Designing Data-Intensive Apps - Ch 1

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

2 views2 pages

1 Designing Data-Intensive Apps - CH 1

Uploaded by

anjaneyaprasad nidubrolu

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Designing Data-Intensive Applications

Chapter 1: Reliable, Scalable, and Maintainable Applications

Book Organization
● This book is organized into three parts
○ Part I is Foundations of Data Systems, and introduces the topic and building
blocks
○ Part II is Distributed Data, and discusses the challenges when scaling to multiple
machines
○ Part III is Derived Data, and discusses integrating data, including batch and
stream processing

Introduction
● Most applications are data-intensive, not compute-intensive
● Data systems are such a successful abstraction that we use them all of the time without
thinking about it
● This book will talk about the principles and practicalities of data systems. What do they
have in common, what distinguishes them, and how do they achieve their characteristics
● The three main concerns for data systems are reliability, scalability, and maintainability

Reliability
● Simple definition of reliability is continuing to work correctly even when things go wrong
● Hardware Faults
○ Single machines are made more resilient through redundant hardware
○ Larger data and computing demands have driven move to multi-machine
redundancy, often using software fault-tolerance techniques
● Software Errors
○ Hard to detect, can lie dormant until unusual set of circumstances
○ Can have a systematic error or cascading failures which cause multiple system
failures, unlike hardware faults
○ No quick solution -- a set of small things each help: planning, thorough testing,
process isolation, allowing crash & restart, monitoring system behavior
● Human Errors
○ Can combine several approaches to deal with human error, including:
■ Minimize opportunities for human error
■ Decouple places where people make the most mistakes from places
where mistakes can cause failures
■ Allow quick & easy recovery from problems
■ Use detailed and clear monitoring
■ Provide good management & training

Scalability
● Even a system working well won’t necessarily be reliable with 10x users
● Planning for scalability means asking if the system grows in a particular way, what are
the options for coping with that growth
● Describing Load
○ Use metrics called load parameters, e.g. post tweet averages 4.6k requests/sec,
peak of 12k requests/sec
○ Example of Twitter’s different designs to deal with updating home timelines
● Describing Performance
○ Common concerns are response time and throughput
○ Metrics like response time are reported in percentiles, e.g. median, 95th, 99th
● Approaches for Coping with Load
○ Can scale up or scale out
○ Stateless services are easy to scale out, but scaling out stateful systems can
introduce a lot of complexity

Maintainability
● Operability: Making Life Easy for Operations
○ Good operability includes:
■ Providing visibility into the runtime behavior and internals
■ Support for automation and integration with standard tools
■ Self-healing
● Simplicity: Managing Complexity
○ Good to remove accidental complexity (not inherent in problem, just the
implementation)
○ A good abstraction hides implementation details behind easy to understand
interface
● Evolvability: Making Change Easy
○ Agile working patterns provide a framework for adapting to change
○ Evolvability is defined as agility at a large data system level
○ Simplicity and good abstractions can go a long way toward evolvability

Requirements Analysis Document Template
Document11 pages
Requirements Analysis Document Template
Kathleen Butler
100% (11)
Kali Linux Tutorial
Document121 pages
Kali Linux Tutorial
Vijay Agrawal
78% (9)
Network Troubleshooting PDF
Document28 pages
Network Troubleshooting PDF
stanley umoh
50% (2)
An Introduction To Software Architecture - Summary
Document15 pages
An Introduction To Software Architecture - Summary
John Ortiz
100% (1)
DC Module 1
Document136 pages
DC Module 1
2021.shreya.pawaskar
No ratings yet
CP 211 Lecture 1 Introduction To System Administration
Document47 pages
CP 211 Lecture 1 Introduction To System Administration
MELKIZEDEKI IGWE
No ratings yet
Uncertainty in Modeling
Document25 pages
Uncertainty in Modeling
Anil Yogi
No ratings yet
(6ASI) Week 7
Document25 pages
(6ASI) Week 7
Rashley Baloyo
No ratings yet
System Design - ML Design 1 PDF
Document24 pages
System Design - ML Design 1 PDF
Abhishek Bhowmick
100% (1)
DDIA in Concise
Document106 pages
DDIA in Concise
Watan Sahu
100% (1)
Destributed System Lecture Note Finale
Document148 pages
Destributed System Lecture Note Finale
gemchis dawo
No ratings yet
Chapters 1-6: Group 15
Document40 pages
Chapters 1-6: Group 15
kv
No ratings yet
Lecture 10
Document31 pages
Lecture 10
ahmeddhshory077
No ratings yet
Communication Middleware For Iot: T.Keerthana (2018614036) T.Pradeesshma (2018614040) M.Gaysalya (2018614032)
Document33 pages
Communication Middleware For Iot: T.Keerthana (2018614036) T.Pradeesshma (2018614040) M.Gaysalya (2018614032)
anahtreek T
No ratings yet
LS1.1 - V1 Reliable, Scalable and Maintainable Data Applications
Document10 pages
LS1.1 - V1 Reliable, Scalable and Maintainable Data Applications
R Krish
No ratings yet
Network Troubleshooting
Document28 pages
Network Troubleshooting
stanley umoh
100% (1)
Week 6 - Integration Methodologies
Document64 pages
Week 6 - Integration Methodologies
Bono, Pedro Jr. S.
No ratings yet
TSA2151 Lec01
Document32 pages
TSA2151 Lec01
Steven Nimal
No ratings yet
Layered
Document3 pages
Layered
Romisaa Khaled
No ratings yet
Unit 7-10
Document47 pages
Unit 7-10
lucky dolphin
No ratings yet
Unit 7 - Systems Development LifeCycle
Document12 pages
Unit 7 - Systems Development LifeCycle
lucky dolphin
No ratings yet
Building Blocks of Effective Data Management
Document25 pages
Building Blocks of Effective Data Management
Kyla Yumie Abang
No ratings yet
Mbse U1
Document15 pages
Mbse U1
Ashwin George
No ratings yet
CH 1 Distributed System
Document13 pages
CH 1 Distributed System
Desire Maharjan
No ratings yet
Microservices
Document14 pages
Microservices
Nilesh yadav
No ratings yet
2.operating System - Basics2
Document18 pages
2.operating System - Basics2
Shashank Rah
No ratings yet
Chapter One Introduction To System Network Administration
Document25 pages
Chapter One Introduction To System Network Administration
mesh
No ratings yet
M22 Comp Sci Notes
Document10 pages
M22 Comp Sci Notes
Adithan A.
No ratings yet
CHAPTER1 Update
Document20 pages
CHAPTER1 Update
md5fxz9ths
No ratings yet
Streamlining Network Ops With AI
Document10 pages
Streamlining Network Ops With AI
d22it198
No ratings yet
Technical Essentials of HP Servers, Rev. 11.41
Document72 pages
Technical Essentials of HP Servers, Rev. 11.41
Anonymous CZVjyUz
No ratings yet
Revision For Midterm 1
Document31 pages
Revision For Midterm 1
san marco
No ratings yet
Cis PDF
Document60 pages
Cis PDF
Jan Luna
No ratings yet
Session 1 and 2 Complete
Document72 pages
Session 1 and 2 Complete
david nduati
No ratings yet
Security, Backup, Recovery, Tuning, Testing of Data Mining and Warehousing
Document16 pages
Security, Backup, Recovery, Tuning, Testing of Data Mining and Warehousing
deepika2804
No ratings yet
Data Structure
Document54 pages
Data Structure
shyamalhp001
No ratings yet
Maintainability: Operability: Making Life Easy For Operations
Document2 pages
Maintainability: Operability: Making Life Easy For Operations
Reena Verma
No ratings yet
NetDe Sign 10 Develop Management Strategies
Document25 pages
NetDe Sign 10 Develop Management Strategies
phamthientrung
No ratings yet
Lec - 5
Document25 pages
Lec - 5
Akash Satpute
No ratings yet
Network Design and Administration-Lecture Notes PDF
Document100 pages
Network Design and Administration-Lecture Notes PDF
jaq hit cool
67% (3)
Module 2-CLOUD Technologies: by Bindu Madavi K P
Document61 pages
Module 2-CLOUD Technologies: by Bindu Madavi K P
Narmatha Thiyagarajan
No ratings yet
Chapter Six Design Rules and Implementation Support
Document28 pages
Chapter Six Design Rules and Implementation Support
Aschenaki
No ratings yet
Intelligent Decision Support Methods
Document79 pages
Intelligent Decision Support Methods
myles
No ratings yet
IT Infrastructure Management Challenges & Their Solutions
Document9 pages
IT Infrastructure Management Challenges & Their Solutions
SADY MOHAMMED
No ratings yet
Fundamentals of IS-Module I
Document96 pages
Fundamentals of IS-Module I
Eden
No ratings yet
Week 7 The Five Nines Concept - Part 1 2
Document32 pages
Week 7 The Five Nines Concept - Part 1 2
Very dangger
No ratings yet
Business Driven Technology: Viewing and Protecting Organizational Information
Document18 pages
Business Driven Technology: Viewing and Protecting Organizational Information
RiaMelinda
No ratings yet
Enterprise Information Security-14
Document10 pages
Enterprise Information Security-14
Guillermo Valdès
No ratings yet
DBT Unit4 PDF
Document152 pages
DBT Unit4 PDF
Chaitanya Madhav
No ratings yet
DA Qns
Document8 pages
DA Qns
narakatlas1987
No ratings yet
PHASE 5 - Chapter 12 Managing Systems Support & Security
Document34 pages
PHASE 5 - Chapter 12 Managing Systems Support & Security
scm39
100% (2)
Grade 11: Unit One
Document54 pages
Grade 11: Unit One
Desalegn Nigussie
No ratings yet
Information Systems: Definitions and Components
Document20 pages
Information Systems: Definitions and Components
subhash221103
No ratings yet
Distributed System Requirements: Remco Hobo December 1, 2004
Document4 pages
Distributed System Requirements: Remco Hobo December 1, 2004
Khalid Kn3
No ratings yet
Document 15
Document15 pages
Document 15
Imaad Abdullah
No ratings yet
Os Module1 My Notes-Pages-6
Document8 pages
Os Module1 My Notes-Pages-6
Athira Ka
No ratings yet
Software Design
Document40 pages
Software Design
surangauor
No ratings yet
Distributed Operating Systems
Document17 pages
Distributed Operating Systems
Arindam Sarkar
No ratings yet
Flowcharts 1
Document68 pages
Flowcharts 1
amanrajsinghars31
No ratings yet
Management Information System "Infrastructure"
Document13 pages
Management Information System "Infrastructure"
maba2610
67% (3)
Big Data Modeling and Management Systems
From Everand
Big Data Modeling and Management Systems
Alexander Afriyie
No ratings yet
Software Engineering
From Everand
Software Engineering
Marc Thorin
Rating: 3 out of 5 stars
3/5 (2)
Java Project File Sem 6
Document39 pages
Java Project File Sem 6
Am Alone
No ratings yet
K.vasu Resume
Document3 pages
K.vasu Resume
Vasu Devan
No ratings yet
Unit06 Hibernate Caching
Document28 pages
Unit06 Hibernate Caching
Long Trần
No ratings yet
CPIN239 S05Assign
Document2 pages
CPIN239 S05Assign
Jon
0% (1)
(Bookflare - Net) - Javascript Basics and Advanced
Document330 pages
(Bookflare - Net) - Javascript Basics and Advanced
Tabasamu Fiber
No ratings yet
10 G Perf Tuning 1
Document444 pages
10 G Perf Tuning 1
molcb76
No ratings yet
The Guide To Easily: Integrating Document Management Software
Document30 pages
The Guide To Easily: Integrating Document Management Software
Nguyễn Thế Đạt
No ratings yet
Ygfbuchujnf
Document20 pages
Ygfbuchujnf
an21"q
No ratings yet
X X X X X X X X: Lexical Units Identifiers
Document2 pages
X X X X X X X X: Lexical Units Identifiers
Ralu-Ardelean
No ratings yet
Concepts: Chkconfig - List Oracleasm /etc/init.d/oracleasm
Document7 pages
Concepts: Chkconfig - List Oracleasm /etc/init.d/oracleasm
Murali Palepu
No ratings yet
Technology Architecture For NginX, postgreSQL, postgREST
Document5 pages
Technology Architecture For NginX, postgreSQL, postgREST
karelvdwalt9366
No ratings yet
Forms With Formik + TypeScript
Document6 pages
Forms With Formik + TypeScript
jmanzura
No ratings yet
NSX-T Data Center Migration Coordinator Guide
Document91 pages
NSX-T Data Center Migration Coordinator Guide
Alessandro
No ratings yet
Referencias Generales de AWS
Document523 pages
Referencias Generales de AWS
Arturo M Bárcenas Yáñez
No ratings yet
Iometer User's Guide
Document82 pages
Iometer User's Guide
sree4u86
No ratings yet
Operating System Chapter 4
Document6 pages
Operating System Chapter 4
Meitei opubunq
No ratings yet
RSD ICBS v3
Document35 pages
RSD ICBS v3
Glutton Arch
No ratings yet
Automated Emerging Cyber Threat Identification and Profiling Based On Natural Language Processing
Document10 pages
Automated Emerging Cyber Threat Identification and Profiling Based On Natural Language Processing
Venkatesh Rachamalla
No ratings yet
Offshore Delivery
Document2 pages
Offshore Delivery
Bharti Kalra
No ratings yet
CrXIwin en Sp3
Document149 pages
CrXIwin en Sp3
miguelganilho
No ratings yet
Declaration and Access Modifiers PDF
Document69 pages
Declaration and Access Modifiers PDF
Murthy Mallikarjuna
No ratings yet
Symantec DLP 15.7 MP2 Release Notes
Document22 pages
Symantec DLP 15.7 MP2 Release Notes
Hassan Azeez Oladipupo
No ratings yet
Wedding Management System Update PDF
Document72 pages
Wedding Management System Update PDF
prafulla kute
No ratings yet
025.37 - Lab Guide Kata 3.7, Kedr 1.7
Document168 pages
025.37 - Lab Guide Kata 3.7, Kedr 1.7
Nizar Hajji karwi
No ratings yet
Gym Management System
Document4 pages
Gym Management System
Editor IJTSRD
No ratings yet
E 10133
Document704 pages
E 10133
pg8450
No ratings yet
Thread Dump
Document84 pages
Thread Dump
ksudeer
No ratings yet
Agile Vs Waterfall
Document2 pages
Agile Vs Waterfall
Dogther
No ratings yet