L6 Dcs
and Management
• Wikipedia:
“A data center is a facility used to house
computer systems and associated components,
such as telecommunications and storage
systems. It generally includes redundant or
backup power supplies, redundant data
communications connections, environmental
controls (e.g., air conditioning, fire suppression)
and various security devices. Large data centers
are industrial scale operations using as much
electricity as a small town and sometimes are
a significant source of air pollution in the form of
diesel exhaust.”
What is a Data Center?
– Servers
– Network
– Storage
– Power
– Cooling
– Energy-efficiency
– … and a building to keep everything in…
Data Center Architecture
• Treat the entire data center as a computer
- Air flow analysis
- Cooling architecture (thermal management)
- Power/energy management
- Focus on ease of system and network management
- What cannot be managed/monitored does not get
deployed
• Modular and Scalable
- Card to Rack
- Rack to Container
- Container to Warehouse
• Explore low power, commodity CPU as a
building block
Data Center - Tiering
Agenda
• Standard servers
• Standard networks
• Standard storage
• But at a very large scale
[Figure: management components: intra-virtual-cluster load balancing, power management, virtual machine management]
Management
• Virtualization Platform (virtualize everything)
– CPUs
– Storage (Filesystems)
– Network
• Resource Management
– Provisioning of virtual clusters
– Physical machine load balancing
– Network traffic load balancing
• Power Management
• Security
– Hypervisor protection
– Isolation between clusters
• System Management
• High Availability
– Physical component failure should not
interrupt availability of virtual resources
• Cloud Applications management
– Monitor VM performance agents
– Virtual storage provisioning
– Intra-Virtual Cluster load balancing
– Scalable data center network
– Isolation between virtual clusters
– Virtual machine migration
[Figure: Cloud OS layered over compute nodes, service nodes, and data nodes, which run on physical node servers and physical storage servers]
Virtual Machine Management
• Objective
– Power Management
– Physical Machine Load Balancing
• Monitor runtime VM statistics
– Heuristic calculation to predict workloads
• Determine power down/up of machines
– Multi-dimensional bin packing (knapsack)
• CPU, network, disk
– VM migration algorithm
• Physical machine load balancing
– Migration of VMs to other physical machines
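The power-down decision above can be sketched as a multi-dimensional bin-packing problem: pack the predicted VM demands onto as few physical machines as possible, then power off the rest. The sketch below uses first-fit decreasing, a common heuristic chosen here for illustration and not necessarily the one the slides have in mind. The `Host` class, the `consolidate` function, the VM names, and the demand figures are all hypothetical; demands are assumed to be normalized so that 1.0 is one host's full capacity in each dimension.

```python
from dataclasses import dataclass, field

# Resource dimensions named on the slide; each demand is a fraction of one
# host's capacity (an assumption made for this sketch).
DIMENSIONS = ("cpu", "network", "disk")


@dataclass
class Host:
    name: str
    used: dict = field(default_factory=lambda: {d: 0.0 for d in DIMENSIONS})
    vms: list = field(default_factory=list)

    def fits(self, demand):
        # A VM fits only if every dimension stays within capacity (1.0).
        return all(self.used[d] + demand[d] <= 1.0 + 1e-9 for d in DIMENSIONS)

    def place(self, vm_name, demand):
        for d in DIMENSIONS:
            self.used[d] += demand[d]
        self.vms.append(vm_name)


def consolidate(vm_demands):
    """First-fit decreasing: place the largest VMs first and open a new
    host only when no already-open host can take the VM."""
    hosts = []
    # Order VMs by their largest single-dimension demand, biggest first.
    ordered = sorted(vm_demands.items(),
                     key=lambda kv: max(kv[1].values()),
                     reverse=True)
    for vm_name, demand in ordered:
        target = next((h for h in hosts if h.fits(demand)), None)
        if target is None:
            target = Host(name=f"host{len(hosts)}")
            hosts.append(target)
        target.place(vm_name, demand)
    return hosts


# Hypothetical predicted workloads for four VMs.
predicted = {
    "vm-a": {"cpu": 0.6, "network": 0.2, "disk": 0.1},
    "vm-b": {"cpu": 0.2, "network": 0.4, "disk": 0.2},
    "vm-c": {"cpu": 0.5, "network": 0.1, "disk": 0.3},
    "vm-d": {"cpu": 0.1, "network": 0.3, "disk": 0.6},
}
hosts = consolidate(predicted)
for h in hosts:
    print(h.name, h.vms)
```

Hosts that end up with no VMs in the resulting plan are candidates for powering down. A real system would also have to weigh the cost of each migration and leave headroom for prediction error.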
Distributed System Group
research overview
Power Usage Effectiveness (PUE)
• Typical: 2.0
• State-of-the-art: ~1.2
- “Free” electricity
- Less carbon emissions
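The figures 2.0 and ~1.2 read like power usage effectiveness (PUE) values, defined as total facility power divided by IT equipment power. Assuming that reading, the short sketch below shows what the two values imply for a hypothetical 1000 kW IT load. The function names and the load figure are illustrative only.

```python
def facility_power_kw(it_power_kw, pue):
    """Total facility power implied by a PUE value (PUE = total / IT power)."""
    if pue < 1.0:
        raise ValueError("PUE cannot be below 1.0")
    return it_power_kw * pue


def overhead_fraction(pue):
    """Share of total facility power that does not reach IT equipment."""
    return (pue - 1.0) / pue


it_load_kw = 1000.0  # hypothetical IT load, not taken from the slides
for label, pue in (("typical", 2.0), ("state-of-the-art", 1.2)):
    total = facility_power_kw(it_load_kw, pue)
    print(f"{label}: total {total:.0f} kW, "
          f"overhead {overhead_fraction(pue):.0%} of total")
```

Under these assumptions, moving from 2.0 to 1.2 cuts the facility's total draw from 2000 kW to 1200 kW for the same IT work.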
• 5k Google servers (6 months)
IT Energy-efficiency
• Overbooking – Admission Control
• Consolidate workloads
– Power servers off
– Or slow servers down
• Dynamic Voltage and Frequency Scaling (DVFS)
– Very hard to assess impact for
bursty (rapidly changing) workloads
• Oscillations and unwanted correlations
• More next time…
• Consolidation requires software support
– Must be able to start/stop instances and
autoscale
– Stateless services preferable
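To make the "slow servers down" option concrete, the sketch below works through the standard first-order model of dynamic CMOS power, P = C · V² · f. It rests on simplifying assumptions that are not from the slides: static (leakage) power is ignored, supply voltage is scaled in proportion to frequency, and the workload is CPU-bound so run time stretches by the inverse of the frequency scale. The function names are illustrative.

```python
def dynamic_power(capacitance, voltage, frequency):
    """First-order dynamic power of CMOS logic: P = C * V^2 * f."""
    return capacitance * voltage ** 2 * frequency


def relative_power(scale):
    """Power relative to nominal when voltage and frequency are both
    multiplied by `scale` (assumes voltage tracks frequency linearly)."""
    return dynamic_power(1.0, scale, scale) / dynamic_power(1.0, 1.0, 1.0)


def relative_energy_per_task(scale):
    """Energy for a fixed CPU-bound task: power times run time, where the
    run time grows by 1/scale at the lower frequency."""
    return relative_power(scale) / scale


for scale in (1.0, 0.8, 0.5):
    print(f"scale {scale:.1f}: power x{relative_power(scale):.3f}, "
          f"energy per task x{relative_energy_per_task(scale):.3f}")
```

In this idealized model power falls with the cube of the scale factor and energy per task with its square. Real servers save less, because leakage and non-CPU components do not scale this way, which is part of why the impact is hard to assess for bursty workloads.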
IT Energy-efficiency
2. Move to cloud-hosting
– To handle large scale-up of user base
• Example: Zynga