Professional Documents
Culture Documents
• What is Twitter?
• Malicious activity on Twitter
• URL shortening services
• The Twarfing architecture
• Examples
• Conclusions
• Devices
• From buoys to power
meters
• Search for Twitter on
instructables.com
user
<id> information
<user>
<text>
<created_at> Tweet text
<source>
<truncated>
<in_reply_to_status_id>
<in_reply_to_user_id>
<favorited>
<in_reply_to_screen_name>
• topics
• August 2009
• Twitter began filtering malicious URLs
• Mikko Hyppönen:
• seemed to indicate Google SB API!
• After more testing, we discovered it used
some additional filtering
<id>
<name>
<screen_name>
<location>
<favourites_count>
<description>
<utc_offset>
<profile_image_url>
<time_zone>
<url>
<profile_background_image_url>
<protected>
<profile_background_tile>
<followers_count>
<statuses_count>
<profile_background_color>
<notifications>
<profile_text_color>
<following>
<profile_link_color>
<verified>
<profile_sidebar_fill_color>
<profile_sidebar_border_color>
<friends_count>
<created_at>
• Goal
• Avoid inventing own vocabulary
• Makes it easier to mix-in data later
• eg. Facebook, Tumblr, ...
• Most Twitter data fit readily into the SIOC
ontology
• The rest used DublinCore
• Proprietary ontology for internal data
• DublinCore, http://dublincore.org/
• SIOC, http://sioc-project.org/
• describe information from online
communities
• FOAF, http://www.foaf-project.org/
• machine-readable pages describing people
• GeoOWL
<http://twitter.com/float3r/status/3492845110> a sioc:Post ;
dc:created "Sun, 23 Aug 2009 15:10:31 +0000" ;
sioc:has_creator <http://twitter.com/float3r> ;
sioc:content "RT @THErealDVORAK http://www.usdoj.gov/ndic/
pubs31/31379/31379p.pdf"@no ;
.
<http://twitter.com/float3r> a sioc:User ;
sioc:id "251294"^^xsd:integer ;
rdfs:label "float3r" ;
sioc:avatar "http://a1.twimg.com/profile_images/53008680/
Picture_022_normal.jpg"^^xsd:anyURI ;
.
White Twarf
tweet
Twitter Stream
processing
couchDB
processing Converter
URLs
4Store
RDF
Domain RDF
Graph
Converter
Reputation
Thursday, January 21, 2010
Tweet Processing
drs:hasFQDN
tw:posts
mal.com
mal drs:hasRating
malicious
tw:posts
Note that the
examples in this tw:hasURL
presentation are all tweet/5678 http://unk.com/what.exe
radically simplified
for clarity.
Thursday, January 21, 2010
Matching this in SPARQL
tw:hasURL
?t1 ?u1
drs:hasFQDN
tw:posts
?f1
drs:hasRating
?m
malicious
tw:posts
tw:hasURL
?t2 ?u2
tw:posts tw:hasURL
http://cool.com/ice.html
iceman tweet/1001
tw:hasTextSignature thislinkiscool
tw:posts
notniceman tweet/1005 http://c001.com/ice.exe
tw:hasURL
tw:posts tw:hasURL
?u1
?m1 ?t1
tw:hasTextSignature ?ts ≠
tw:posts
?m2 ?t2 ?u2
tw:hasURL