This document summarizes experimental data from Facebook profiles in the Dallas/Fort Worth area, including over 167,000 profiles with over 3 million links between them. It also provides general properties of the data, such as the diameter of the largest connected component and the number of unique traits listed by users. Finally, it describes three inference methods - Naive Bayes classification using profile details only, link structure only, or an average of the two - for predicting user attributes from the data.
This document summarizes experimental data from Facebook profiles in the Dallas/Fort Worth area, including over 167,000 profiles with over 3 million links between them. It also provides general properties of the data, such as the diameter of the largest connected component and the number of unique traits listed by users. Finally, it describes three inference methods - Naive Bayes classification using profile details only, link structure only, or an average of the two - for predicting user attributes from the data.
This document summarizes experimental data from Facebook profiles in the Dallas/Fort Worth area, including over 167,000 profiles with over 3 million links between them. It also provides general properties of the data, such as the diameter of the largest connected component and the number of unique traits listed by users. Finally, it describes three inference methods - Naive Bayes classification using profile details only, link structure only, or an average of the two - for predicting user attributes from the data.
• 167,000 profiles from the Facebook online social
network • Restricted to public profiles in the Dallas/Fort Worth network • Over 3 million links General Data Properties Lindamood et al. 09 & Heatherly et al. 09
Diameter of the largest component 16
Number of nodes 167,390 Number of friendship links 3,342,009 Total number of listed traits 4,493,436 Total number of unique traits 110,407 Number of components 18 Probability Liberal .45 Probability Conservative .55 Inference Methods Lindamood et al. 09 & Heatherly et al. 09
• Details only: Uses Naïve Bayes classifier to predict attribute
• Links Only: Uses only the link structure to predict attribute • Average: Classifies based on an average of the probabilities computed by Details and Links