Professional Documents
Culture Documents
Introduction
and then combining this information with components of network graph theory. This approach
allows us to understand the underlying data generating processes and the statistical properties of
the explanatory variables that expose a users interests. This paper describes the empirical and
theoretical foundations of our approach.
1. Social Networks
Social media networks share many common components. This section describes the participants
in these networks, their motivations, behaviors,
and how one can use these qualities to obtain relevant information from the interest graph.
1.1 Users
A commonly used approach for building interest groups is to cluster users with similar characteristics, often by their usage of keywords, but
often these approaches preclude falsifiability and
rely heavily on intense computation as a substitute for acquiring subject area expertise on social
media interests. While such techniques are capable of maximizing a well-defined objective function, such as the number of Facebook Likes over
a period of time, they can often lead to conclusions that suggest relationships between variables
that are nonsensical or prove to be unstable over
time. The Relevance Engine represents a departure from such prevailing techniques by providing
a framework for acquiring subject area expertise
from experiments on social media engagement
1.2 Interests
For the purposes of our investigation, interests
are defined as a collection of subjects, activities,
1
1.3 Engagement
Social networks have several mechanisms to observe a users response to specific content in the
social stream. Some of the most common examples include the Like button on Facebook, the
Favorite or Retweet options on Twitter, and the
option to +1 on Google+. The development of
the Relevance Engine does not rely exclusively on
a specific engagement mechanism, but requires
that interest-specific or interest-neutral content
be presented in a manner that allows one to unambiguously engaged exclusively with the stimulating content.
1.4 Content
Social networks offer multiple channels in which
a user is presented with content. Tweets, wall
posts, and links are some common examples of
these content distribution mechanism. Many of
these channels offer a way in which both interestspecific and interest-neutral content can be delivered.
A
1 X
Aj
NA
(1)
Uu,Ii B(NA , Ii )
(2)
Ii =
j=1
E[Uu,Ii ] = NA Ii
(3)
V[Uu,Ii ] = NA Ii (1 Ii )
(4)
Similarly, we define C to be a matrix with cardinality NC to represent the interest-neutral control group of the experiment that are presented
to all network users. It is again assumed that
each piece of content is independent and can be
presented in random order to the user without
introducing any pronounced confounds.
NC
1 X
Cj
NC
(5)
Uu,C B(NC , C )
(6)
C =
j=1
E[Uu,C ] = NC C
(7)
V[Uu,C ] = NC C (1 C )
(8)
(9)
Under the assumptions of independence described by Meeker (1981), the nominal excepted
3
nk 2
nk1 2
=
, k = 1, . . . , K (16)
2 nmax
nmax
(10)
Given a predetermined Type II error power estimate of and ck (), k = 1, . . . , K, under the
assumption of normality, the conditions for stopping becomes:
p
P r{ 2nk 2 1 (1 /2) <
p
W < 2nk 2 1 (1 /2)| = } = (17)
nk
X
i=1
XIi
nk
X
1
1
nk = 2
1
+ (1 )
(18)
2
One complication in the analysis of social media
content is that the proportion of successes can be
quite low even when presenting the most engaging content available. Low proportions make it
challenging to arrive at unbiased parameter estimates and thus making it difficult to make valid
inferences during the assumption of normality.
To address this issue, we explore the calculation
of bias-adjusted Maximum-Likelihood Estimates
as described in (Brown 2002). Alternative binomial proportion confidence intervals for engagement rate can also be obtained using the Wald
Interval approach, RCIs, and the Agresti-Coull
approach.
i=1
(13)
Wk > ck () 2kn 2 , k = 1, . . . , K
(14)
(15)
Wk < ck () 2kn 2 , k = 1, . . . , K
In our sequential testing framework, users consistently receive interest-neutral content and over
time the standard errors around the parameter
estimate of the expected proportion of successful
engagements with interest-neutral content should
4
decline while still providing a framework to account for the changing engagement behavior of
users on the network. With this approach, the
control content provides a quality control check
and a mechanism to monitor a users changing interests. If a given piece of interest-neutral content
is generally less compelling than interest-specific
content, then users are likely to be erroneously
classified as harboring a particular interest, leading to false discovery. To control for this effect, we
continually assess if a particular piece of content
is universally more or less compelling by looking
at the distributional properties of each piece of
interest-neutral social media content across the
network. If the engagement rate is abnormally
high or low relative to the network response rate,
then we exclude it from interest identification.
This helps to ensure that interest-neutral content
is truly interest neutral. As more sequential trial
are conducted such techniques for quality control
will see improvement.
(1)
(21)
(19)
(1) Time-Series Analysis: Future models will attempt to address the time-series component of interests. One such model would propose a weighting scheme to emphasize more recent engagement
behavior. The existing approach presumes the interests are stationary creating a need to periodically run trials on the user to measure changing
interests over time. One such example is a user
that is a new parent engaging with baby related
social media content when previously that user
did not align with such interests.
(2) Assignment Mechanisms: By acknowledging
the possibility that the the response rate at t
is potentially influenced by the response rate at
t 1, we can compare alternating assignment,
random assignment, adaptive sequencing assignments. A major assumption behind this experimental design is that the decision to engage with
the control is independent of the treatment and
the decision to engage with one piece of content
does not preclude the ability to engage with another piece of content. As long as treatment
and control do not compete with one another for
the users attention, this independence can generally be assumed, but by controlling for the order
in which the content is received we can address
concerns about the impact of fatigue from being
shown irrelevant content on engagement behavior.
(22)
trial experiments on a fresh subset of the network (2) perform an out-of-sample validation of
our second stage social graph user-interest classification on a second set of users.
y i = x i + i
0
Ui = wi + ui
4.1 Methodology
To validate our interest graph inferences, the population of network users is divided randomly into
two groups A and B in which we restrict the samples to users that have participated in the social
network for a comparable period of time to control for adoption differences. After defining a set
of N reasonable interests, the engagement-based
sequential tests are performed on group A. The
network associations of the users in group A are
defined for each interest and a Zi matrix is created for each interest from i = 1 . . . N . We apply
the logit model fit on group A onto group B to arrive at expected probabilities of engagement YP 2 .
Interest-specific content is then shown to users in
group B and the forecasted engagement rate is
then compared to the observed engagement rate.
(23)
(24)
This is in many ways similar to an omittedvariable misspecification problem in and OLS regression. Where is the standard normal distribution and is the cumulative standard normal
distribution.
"
#
0
( wi )
0
E(yi ) = xi +
(25)
( 0 wi )
4.2 Evaluation
3.2.2 Equivalence Relations
The objective of this evaluation is to see if the
Relevance Engine produces results in which the
users in the out-of-sample group that have been
classified as sharing have a higher engagement
rate for interest-specific content than interestneutral content. We evaluate our performance
by comparing the root-mean-squared error of
the out-of-sample data with the in-sample data,
where P 1 corresponds to estimated engagement
based on the in-sample logit estimate and P 2 corresponds to the out-of-sample observed value.
q
2
4. Model Validation
Falsifiability is a key feature of the Relevance Engine and this section outlines the ways in which
we can validate the success of our classification
mechanism. There are two main types of validations that need to be performed are: (1) assess
the stability of content-based engagement as determined by the ability to replicate the sequential
References
[1] C. Jennison and B.W. Turnbull Group sequential tests and repeated confidence intervals, with Applications to Normal and Binary
7