Professional Documents
Culture Documents
Scaling Twitter
John Adams
Twitter Operations
Web API
75%
• Site Availability
• Configuration Management
• Security
• Faster performance
Move to
Find Take
Next
Weakest Corrective
Weakest
Point Action
Point
Metrics +
Logs + Science = Process Repeatability
Analysis
HTTP
Congestion Network More LB’s
Latency
Timeline Update Better
Storage
Delay Delay algorithm
Status Flock
Database Delays
Growth Cassandra
Updates Algorithm Latency Algorithms
HTTP
Congestion Network More LB’s
Latency
Timeline Update Better
Storage
Delay Delay algorithm
Status Flock
Database Delays
Growth Cassandra
Updates Algorithm Latency Algorithms
• Logs
• SCIENCE!
status_id
r2=0.99
http://dev.twitter.com/status
Wednesday, May 5, 2010
What’s a Robot ?
• Actual error in the Rails stack (HTTP 500)
• Uncaught Exception
• Shows up in Reports
• We’re on it!
• Timeout
Apache mod_proxy
Rails (Unicorn)
MySQL Cassandra
Daemons
Wednesday, May 5, 2010
unicorn
• A single socket Rails application Server
• Queue Latency
• Replication Lag
• Written in Scala.
• Bad:
• 13 billion edges,
100K reads/second
Mysql Mysql Mysql
• Recently Open Sourced
• http://twitter.com/jobs
• http://twitter.com/about/opensource