Professional Documents
Culture Documents
• Real-‐6me
sugges6ons
based
on
your
social
graph.
Shouts Tips/To-‐dos
Log
Files
Jus6n
Moore
-‐
@injust
3/22/2011
Machine
Learning
Meetup
Ma;hew
Rathbone
-‐
@rathboma
About
Hadoop
and
Hive
Hadoop:
• Distributed
Data
processing
framework
(map-‐reduce).
• Wri;en
in
Java
Hive:
• SQL
layer
on
top
of
hadoop
• Lets
us
do
“select
count(1)
from
checkins”
instead
of
having
to
write
our
own
map-‐reduce
java
classes.
• That
means
we
store
all
of
our
data
in
flat
files
in
Amazon
S3
(which
keeps
things
simple)
“rake
cluster:start[30]”
=>
starts
a
30
node
cluster,
just
like
that
Jus6n
Moore
-‐
@injust
3/22/2011
Machine
Learning
Meetup
Ma;hew
Rathbone
-‐
@rathboma
Our
Dashboard
• Define
and
schedule
reports
through
it
venues
in
zurich
hauptbahnhof
zurich
1780
sony
ericsson
football
hotspot
basel
773
basel
bahnhof
sbb
basel
761
QUESTIONS?
Jus6n
Moore
-‐
@injust
3/22/2011
Machine
Learning
Meetup
Ma;hew
Rathbone
-‐
@rathboma
foursquare
3.0:
Explore
4
3
2
“Must
See”
1
0
Unique
Users