Professional Documents
Culture Documents
Computational Journalism
Columbia Journalism School
Week 5: Social Filtering
October 9, 2015
User
x
x
x
x
x
x
ltering
User
data from
SocialReach,
who works with
many publishers
John McDermo>, Why Facebook is for ice buckets, TwiBer is for Ferguson
Information ow on Facebook
Classify Users
Classic machine learning problem. Classify each user
as one of:
journalist/blogger
organization
ordinary individual
First, need to encode as a vector / select features...
# of followers / following
# of posts, favorites
percentage of posts that are RTs, @replies, links
presence/absence of named entities
topic distribution of tweets (IPTC top level topics)
Classier Accuracy
Eyewitness classier
Word Aspects
Other dimensions
This gives you context you have the context for whether or not
you think theyre reputable or whether or not theyre worth
reaching out to.
Its giving me a lot of context which is really useful when youre
trying to verify if someone is reputable or not.
I would tend to focus on the eyewitnesses and journalists/
bloggers. Eventually Id look at everyone else but Id want to start
my search with those two groups because they would normally
provide me with the most information.
Unpopular features:
Entity extraction not helpful, no ability to filter by location and eyewitness
status, focus on users instead of content
Social Software
Basic assumption: structure of software influences how
groups use it.
Design problem...
What do we want the users to accomplish together?
How do we encourage this?
We can write the code, but the culture is a separate
issue.