Bayesian Analysis of Caustic-Crossing Microlensing Events: A. Cassan, K. Horne, N. Kains, Y. Tsapras, P. Browne

Astronomy & Astrophysics manuscript no.
ucausfix
c ESO 2009
December 23, 2009
Bayesian analysis of caustic-crossing microlensing events

A. Cassan1 , K. Horne2 , N. Kains2 , Y. Tsapras3,4 , P. Browne2
1
Institut d’Astrophysique de Paris, UMR 7095 CNRS – Université Pierre & Marie Curie, 98 bis boulevard Arago, 75014 Paris, France
2
Scottish Universities Physics Alliance, School of Physics & Astronomy, University of St Andrews, North Haugh, KY169SS, UK
arXiv:0911.5285v1 [astro-ph.EP] 27 Nov 2009
3
Las Cumbres Observatory, 6740B Cortona Dr, suite 102, Goleta, CA 93117, USA
4
School of Mathematical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
Received ; accepted
ABSTRACT
Aims. Caustic-crossing binary-lens microlensing events are important anomalous events, because they may reveal an extrasolar planet
companion orbiting the lens star. Fast and robust modelling methods are thus of prime interest to quickly conclude on the possible planetary
nature of the event. Cassan (2008) introduced a new set of parameters to model binary-lens events, which are closely related to the features
observed in the light curve. In this work, we explain how Bayesian priors can be added in this framework, and investigate on possible
interesting choices.
Methods. We develop a mathematical formulation that allows to compute analytically priors on the new parameters, given some prior
knowledge on other physical quantities. We explicitely compute the priors for a number of interesting cases, and show how this can be
implemented in a fully Bayesian, Markov-Chain Monte-Carlo algorithm.
Results. Using Bayesian priors can speed up microlens fitting codes by reducing time spent on physically implausible models, and helps to
discriminate among alternative models based on the physical plausibility of their parameters.
Key words. gravitational lensing, extrasolar planets, methods: analytical
1. Introduction bits of several astronomical units. It thus probes in the planet

mass-separation plane a region beyond reach of any other tech-
Mao & Paczynski (1991) first suggested that observations nique, as demonstrated by the detection of the very first cool
of Galactic gravitational microlensing events could lead to super-Earth, OGLE 2005-BLG-390Lb (Beaulieu et al. 2006;
the discovery of extrasolar planets. Microlensing involves the Kubas et al. 2008).
time-dependant brightening and then dimming of a back-
ground source star as an intervening massive object (the lens) A number of microlensing events exhibit anomalous be-
crosses the observer line-of-sight. Light rays from the source haviour (i.e. they cannot be adequately modelled by the stan-
bend in the vicinity of the lens, focusing them toward the dard single-lens light curve, e.g. Paczynski 1986) and some
observer. Beginning in 1994, survey teams such as OGLE1 of these anomalies can be attributed to lensing by binary ob-
(OGLE III, Udalski 2003) and MOA2 (Bond et al. 2001) jects. The types of light curves that result from binary lensing
have reported more than four thousand microlensing events form a rich tapestry but, in general, binary systems with equal
toward the Galactic bulge to date. Several hundreds of these masses of the two components tend to produce pronounced,
events have been carefully selected and densely sampled by long anomalies in the light curves, whereas when the secondary
follow-up networks such as PLANET3 , µFUN4 , RoboNet5 and companion is only a small fraction of the total mass, the anoma-
MiNDSTEp6 . Although microlensing teams have so far pub- lies can be quite short and subtle. It is primarily these latter
lished only nine exoplanet detections, the method itself stands types of anomalies that may be caused by star-planet binaries
out because of its high sensitivity to low-mass planets with or- (Gould & Loeb 1992). Nevertheless, because the true nature of
the anomaly cannot always be established while the microlens-
1
ing event is still ongoing, every binary-lens microlensing event
http://www.astrouw.edu.pl/∼ogle
2 constitutes a prime target for planet hunting.
http://www.phys.canterbury.ac.nz/moa
3
http://planet.iap.fr In binary lensing, the lens system configuration gives rise
4
http://www.astronomy.ohio-state.edu/∼microfun to regions of space on the source plane that are bound by gravi-
5
http://robonet.lcogt.net tational caustics. Caustics are closed curves, with concave seg-
6
http://www.mindstep-science.org ments meeting in outward pointing cusps, defined by the loca-
2 A. Cassan et al.: Bayesian analysis of caustic-crossing microlensing events
tion where the Jacobian determinant of the lens mapping equa- scribing caustic-crossing events. In this formalism, the caus-
tion vanishes, i.e. are lines of infinite point-source magnifica- tic curve in the source plane is parameterised by a curvilin-
tion. There are three kinds of caustic topologies which depend ear abscissa (or arc length) running from 0 to 2. The trajec-
on the values of the binary lens mass ratio q and the two com- tory of a source crossing a caustic, which is classically pa-
ponent projected separation d in angular Einstein ring radius θE rameterised by its impact parameter u0 and position angle α,
(Einstein 1936): can alternatively be defined by giving the values sin at ingress
s ! and sout at egress7 . The two parameters timing the trajectory, tE
4GM DS − DL (time to cross one Einstein radius) and t0 (date at minimum im-
θE = , (1)
c2 DS DL pact parameter u0 ), are then replaced by the ingress and egress
where DS , DL are the observer-source and observer-lens dis- times tin and tout . The caustic curve is specified in the source
tances and M the lens total mass. In the close separation regime (i.e. caustic) plane by a complex function ζ(s) = ξ(s) + iη(s)
(cf. Fig. 1 of Cassan 2008), there are three caustics: one central (see Sec. 3.2), and once sin and sout are specified, the source
(4-cusp) and two (3-cusp) planetary caustics; in the intermedi- trajectory is fully defined. This bijective switch of parameters,
ate regime, there is only one (6-cusp) caustic, and in the wide (u0 , α, tE , t0 ) 7→ (sin , sout , tin , tout ), takes advantage of the rel-
separation regime there is one central and one planetary caustic atively high precision with which tin and tout can be guessed
(both with 4 cusps). from the observations (Kains et al. 2009; Kubas et al. 2005).
In many cases, the source trajectory happens to cross a Using these new parameters, Kains et al. (2009) anal-
caustic. As the source crosses the caustic curve and enters the ysed the caustic crossing event OGLE 2007-BLG-472. The ap-
enclosed area, a new pair of images appears, causing a sudden proach taken was a maximum likelihood procedure, quantify-
increase in the observed brightness. Similarly, when the source ing the “goodness-of-fit” by a χ2 statistic, and minimising the
exits the area defined by the caustics, the two images merge χ2 to optimise the fit. A grid search in (d, q) with even spac-
and disappear, causing a rapid drop in the observed bright- ing in log d and log q was conducted. For each (d, q) caustic
ness. These dramatic changes in magnification result in readily configuration, a genetic algorithm was used to explore widely
recognisable jumps in microlensing light curves. As empha- the remaining parameter space. While (sin , sout ) covered the full
sised by Cassan (2008), the ingress and egress times tin and range of possibilities, [0, 2] × [0, 2], tin and tout evolved in very
tout may be restricted within very tight intervals when caustic tight intervals based on the values guessed from the light curve
crossing features have been identified in the light curve, and features (caustic crossing magnification peaks). These first fits
thus advantageously be used as alternative modelling parame- were refined using a Markov-Chain Monte-Carlo (MCMC) al-
ters. gorithm, again holding (d, q) fixed while optimising over all
The new set of binary-lens modelling parameters intro- other parameters. The best-fit models in each of the identified
duced by Cassan (2008) have the advantage that two of these best-fit regions were then found by allowing all parameters to
parameters are very closely related to features that can be vary.
directly identified in the light curve. Using this new formu- As expected for binary lens events, the resulting χ2 (d, q)
lation for the analysis of OGLE 2007-BLG-472 in its most maps revealed a variety of widely-separated model parame-
straightforward implementation as a maximum likelihood anal- ter regions where a relatively low χ2 could be achieved. In
ysis (“minimising χ2 ”), Kains et al. (2009) unveiled a subtle fact the lowest χ2 models corresponded to very low q, in the
aspect of binary-lens modelling: relatively improbable physi- planet-mass regime. But with a short duration between the
cal models with very large values of tE were found with χ2 val- caustic entry and exit, and a planetary caustic size scaling
ues lower than other more plausible models. In order to avoid as q1/2 , these models implied extremely large Einstein time
finding parameter combinations which are physically unlikely, tE ∼ (tout − tin )/q1/2 ∼ 104 days, which are very unlikely ac-
a comprehensive change is to switch to a Bayesian analysis. cording to kinematics of stars motions within the Milky Way.
This is desirable as the Bayesian approach makes use of prior These best-fit maximum likelihood models were therefore re-
information on the underlying physical parameters, while χ2 jected on this physical argument. This need to reject the lowest
says nothing about parameter plausibility. χ2 models highlights a weakness in the maximum likelihood
In this article, we show how to derive Bayesian priors approach, which neglects prior distributions on the parameter
for the Cassan (2008) caustic-crossing binary-lens parameters. space. Bayesian parameter estimation, on the other hand, takes
These are based on physical priors on quantities that can be proper account of prior distributions in the parameter space.
estimated from Galactic models or calculated from already ob- In a Bayesian analysis, the posterior probability distribution
served events (Sec. 2 and 3). In Sec. 4, we describe an imple- over the model parameters θ is a function of the data D:
mentation of this Bayesian formalism within a Markov-Chain P(D|θ)P(θ)
Monte-Carlo fitting scheme, using in particular priors on the P(θ|D) = R , (2)
P(D|θ)P(θ) dθ
Einstein time tE (time for the source to travel an angular dis-
tance θE ). where P(θ) is the prior probability distribution on the parame-
ters, and the denominator partition function ensures proper nor-
2. Maximum likelihood vs. Bayesian fitting malisation of the posterior as a probability distribution over the
Cassan (2008) introduced a new parameterisation of binary 7
We use the notations “in” and “out” in place of “entry” and “exit”
lens microlens light curve model that is well suited to de- of Cassan (2008) to write more condensed formulae.
A. Cassan et al.: Bayesian analysis of caustic-crossing microlensing events 3
parameters θ. The likelihood P(D|θ) is a function of the pa-

rameters θ and a probability distribution over the data D. For
Gaussian measurement errors, with N data points having indi-
vidual standard deviations σi , the likelihood is
n o
exp − 12 χ2
L(θ) ≡ P(D|θ) = . (3)
σi
QN
(2π)N/2 i=1
Since maximising the likelihood corresponds to minimising
N
X N
−2 ln L(θ) = χ2 + 2 ln σi + ln 2π , (4)
i=1
2
a maximum likelihood is equivalent to a minimum in χ2 when
the error bars σi are known, and is essentially a Bayesian anal-
ysis that implicitly assumes a prior that is uniform on the cho-
sen parameters intervals. As we shall see in the next section,
assuming more realistic priors could substantially affect the fit-
ting process.
3. A Bayesian prior for (sin , sout )

3.1. Distribution of (sin , sout ) for isotropic trajectories
A uniform prior probability distribution in the parameter
square (sin , sout ) is implicit in the maximum likelihood analysis.
Because of the non-linear correspondence between the two sets
of parameters, it should correspond to a rather unlikely prior
for the (u0 , α) source trajectory parameters. A more plausible
prior would for example arrange for the source trajectories to
be uniformly distributed and isotropic in orientation. Fig. 1. The top pannel illustrates the three kinds of possible
In Fig. 1, the top pannel shows an intermediate caustic with source trajectories crossing a caustic: the black line has a sin-
d = 1.1 and q = 0, 1 (i.e. six cusps, in orange) with sev- gle pair of ingress and egress points, while the red and green
eral crossing trajectories. It can be seen that a straight line lines have respectively two and three ingress/egress points. The
may cross the caustic at two (black line), four (red line) or bottom pannel shows in the (sin , sout ) square the locations of
six (green line) locations, depending on the number and ori- a random distribution of ∼ 104 such trajectories crossing a
entation of the cusps. In the bottom pannel, ∼ 104 of such (d = 1.1, q = 0, 1)-caustic, with the same colour convention.
trajectories were randomly shot and their corresponding po- The solid vertical and horizontal black line mark the s-locations
sition in the (sin , sout ) square reported, using the same colour of the caustic cusps.
convention. Trajectories with a single pair of ingress/egress
map into unique black points, while for red and green trajec-
tory lines, there are respectively two and three possible pairs of equation for a binary lens with separation d and mass ratio q
ingress/egress points. defines the mapping of the position of a point-source ζ on the
We can understand some of the latter figure structures as source plane to the positions of its three or five images at z on
follows: vertical and horizontal lines marking the s values at the lens plane:
the cusps divide the (sin , sout ) square into boxes. No trajecto- !
1 1 q
ries appear in the boxes along the diagonal because the caustic ζ =z− + , (5)
curves are concave outward. It is thus impossible for a line that 1+q z z+d
enters at some position between two cusps to exit at any point where the more massive body is at the center and the compan-
between those two cusps. Similarly, other empty regions corre- ion on the left-hand side. Following Witt (1990), the caustic
spond to ingress/egress pairs that cannot be realised by straight lines ζ are parametrised by a parameter φ ∈ [0, 2π]:
lines crossing the caustic. The trajectories are seen to bunch up " #
1 1 q
around ingress/egress pairs occurring close to a cusp. This oc- + = e−iφ , (6)
curs because any trajectory that enters near a cusp is very likely 1 + q z2 (z + d)2
(for a wide range of angles) to exit also near the same cusp. with z and ζ satisfying Eq. (5). For a given angle φ, it is pos-
sible to solve a fourth order polynomial equation in z to obtain
the corresponding caustic points ζ. In practice, while we use
3.2. Analytical formulation
φ to derive the useful formulae, we use instead the equivalent
In this section, we develop a mathematical formulation which parameter s = s(φ) (bijection) introduced by Cassan (2008),
allows to compute analytically priors on (sin , sout ). The lens which has the advantage to sample the caustics evenly.
∂t0 1 ζout + ζin ζout ∧ ζin

" #
To write more condensed formulae, we will be using nota- 1
= + +i , (19)
tions that resemble two-dimensional vector operations. Given ∂tin 2 2 ζout − ζin |ζout − ζin |2
two complex numbers ζ1 = ξ1 + iη1 and ζ2 = ξ2 + iη2 , we write: ∂t0 1
"
1 ζout + ζin ζout ∧ ζin
#
ζ1 ∧ζ2 = ξ1 η2 −η1 ξ2 (“wedge product”) and ζ1 ·ζ2 = ξ1 ξ2 +η1 η2 = − +i , (20)
∂tout 2 2 ζout − ζin |ζout − ζin |2
(“scalar product”), which are both real numbers. Moreover, a
quantity related to a caustic entry (respectively exit) will be in- so that
dicated by a subscript “in” (resp. “out”). Using the usual con-
(ζout − ζin ) ∧ dζin (ζout − ζin ) ∧

dζout
vention that u0 > 0 when the origin of the coordinate system ∂ (u0 , α) =

dφin dφout
∂ (φin , φout ) , (21)
stays on the right-hand side of the source trajectory, one can |ζout − ζin |3
write: ∂t0 ∂t0
∂ (tE , t0 ) = ∂tin + ∂tout =

1
ζout ∧ ζin ∂ (tin , tout ) , (22)
u0 = , (7) |ζout − ζin | |ζout − ζin |
|ζout − ζin |
which gives
ηout − ηin
!
α = arctan + π H (ξin − ξout ) ,

(8) (ζout − ζin ) ∧ dζin (ζout − ζin ) ∧ dζout
ξout − ξin dφin dφout
J= . (23)
tout − tin |ζout − ζin |4
tE = , (9)
|ζout − ζin | The derivatives dζ/dφ evaluated at the caustic entry and exit
are given by (Cassan 2008):
tout + tin 1 ζout + ζin ζout ∧ ζin
" #
t0 = − (tout − tin ) +i , (10)
2 2 ζout − ζin |ζout − ζin |2 dζ dz dz
= + eiφ , (24)
dφ dφ dφ
where H is the Heaviside step function, and α =
π
2 sign out − ηin if ξout = ξin . The transformation between the
(η ) with
two sets of parameters is given by the Jacobian dz i (z + d)2 + q z2
= (z + d) z . (25)
∂ (u0 , α, tE , t0 ) dφ 2 (z + d)3 + q z3

J = ; (11)
∂ (sin , sout , tin , tout ) In the limit of cusp-crossing trajectories, ζout − ζin → 0, J be-
since the dependancies of the classical parameters with respect haves like 1/|ζout − ζin |2 .
to the new ones are u0 (sin , sout ), α(sin , sout ), tE (sin , sout , tin , tout ) As expected, the Jacobian J is a function of the two pa-
and t0 (sin , sout , tin , tout ), J reads (using φ instead of s): rameters (sin , sout ), while the bijection between the two set of
∂u ∂u parameters was possible by involving (tin , tout ). J, however, is
∂φin0 ∂φout0 0 0 not yet the Bayesian prior P(sin , sout ) we are seeking. In fact,
∂α ∂α
∂φin ∂φout 0 0 ∂ (u0 , α) ∂ (tE , t0 ) we have yet to consider two further aspects: firstly, the fact that

J = ∂tE ∂tE ∂tE ∂tE =
× . (12) (u0 , α, tE , t0 ) are themselves affected by prior probability distri-
∂ (φin , φout ) ∂ (tin , tout )

∂φin ∂φout ∂tin ∂tout
∂t0 ∂t0 ∂t0 ∂t0 butions ; this is discussed in the end of this section, and is the
∂φ ∂φ ∂t ∂t
in out in out
topic of Sec. 4.1. Secondly, caustic crossing points are either
After some algebra, we find for the components of the two lat- entries or exits since the trajectory is orientated from tin to tout
ter Jacobians: (tout ≥ tin ), which is not accounted for in Eq. (23). To solve
this last issue, we calculate the outward normal vector to the
∂u0 ∂u0 dξin ∂u0 dηin
= + caustics at point ζ,
∂φin ∂ξin dφin ∂ηin dφin ,
(ζout − ζin ) · ζout dζ dζ
" #
dζin Nc = i
= (ζout − ζin ) ∧ , (13)
dφ dφ
(26)
|ζout − ζin |3 dφin
∂u0 (ζout − ζin ) · ζin
" #
dζout as well as the normalised and orientated trajectory vector
= − (ζout − ζin ) ∧ , (14)
∂φout |ζout − ζin |3 dφout
ζout − ζin
∂α ∂α dξin ∂α dηin Nt = , (27)
= + |ζout − ζin |
∂φin ∂ξin dφin ∂ηin dφin
dζin and check whether Nc,in · Nt,in < 0 (inward motion at ζin ) and
(ζout − ζin ) ∧
= −
dφin
, (15) Nc,out · Nt,out > 0 (outward motion at ζout ). If these conditions
|ζout − ζin | 2
are fulfilled, we write P(sin , sout ) = J, and 0 otherwise.
(ζout − ζin ) ∧ dφdζout Defined like this, P(sin , sout ) is thus the prior on (sin , sout )
∂α
= out
, (16) we are seeking, in the special case of isotropic source trajec-
∂φout |ζout − ζin |2 tories (uniform distributions for u0 and α), uniform microlens-
∂tE 1 ing events rate (t0 is a random number) and uniform Einstein
= − , (17)
∂tin |ζout − ζin | time tE ≥ 0. In Fig. 2, we have plotted P(sin , sout ) for various
∂tE 1 (d, q) configurations as a function of sin (horizontal axis) and
= , (18) sout (vertical axis) ; higher values of P appear in white (linear
∂tout |ζout − ζin |
Fig. 2. Bayesian prior P(sin , sout ) as a function of sin (horizontal axis) and sout (vertical axis), where we have assumed isotropic
source trajectories and uniform distributions for tE and event rate. Higher values of P appear in white (linear scale). From left to
right, the caustic configurations are: (a) intermediate with d = 1.1 and q = 0.1 ; (b) wide+central and (c) wide+secondary caustic,
both for d = 2 and q = 0.1 ; (d) close+central and (e) close+secondary caustic, both for d = 0.5 and q = 0.1.
scale). From left to right, these configurations are: intermediate crossing, since its rough value can be guessed from the light
with d = 1.1 and q = 0.1, wide+central and wide+secondary, curve, contrary to ρ∗ . In practice, we choose the caustic cross-
both for d = 2 and q = 0.1, close+central and close+secondary ing which has the best data coverage and which best looks like
caustic, both for d = 0.5 and q = 0.1. One can compare the in- a straight line caustic crossing to extract the starting value for
termediate case plot with Fig. 1. In Sec. 4.1, we will investigate fitting ∆tcc .
how assuming different priors on the Einstein time tE affect the
prior on (sin , sout ).
4. Markov-Chain Monte-Carlo
3.3. Extended sources 4.1. Examples of prior probability distributions
When the source approaches the caustic curves (typically less For a given set of fitting parameters (sin , sout , tin , tout , ∆tcc ), the
than three projected source radii), one needs to take into ac- prior of the probed model reads:
count extended source effects in the modelling. As for tin and
tout , it is usually possible to extract from the light curve a new P(model) = P(sin , sout ) P(tin , tout , ∆tcc ) . (29)
parameter that can be used instead of the source radius.
The prior P(sin , sout ) is computed as explained in Sec. 3.2, and
In fact, it is well-known that when the source crosses a
may include priors that have been set on the other parame-
straight line caustic (which is in many cases a good approxi-
ters u0 , α, tE , t0 or ρ∗ by properly weighting P(sin , sout ). Given
mation of a real caustic), one can easily guess the duration of
Eqs. (9) and (10), a prior P(tin , tout ) is equivalent to a prior
the crossing from the shape of the caustic crossing feature itself
P(t0 , tE ) with a corresponding change in the prior P(sin , sout ).
(Cassan et al. 2004; Albrow et al. 1999; Schneider & Wagoner
In this section, we discuss different priors on the different pa-
1987). Here, we define this duration as the time for the source
rameters that could realistically be used in the Bayesian analy-
to cross the caustic line by its full radius (i.e. from centre to
sis. In Fig. 2 for example, we have shown the case of isotropic
limb), so that ∆tcc = ρ∗ /v⊥ ; in this definition, ρ∗ is the source
trajectories, which corresponds to uniform priors in the param-
radius in Einstein ring radius units, v⊥ is the component of the
eters u0 and α. This choice is well-justified, since the direction
source velocity perpendicular to the caustic, and the subscript
of the binary lens axis is random.
“cc” refers to either the caustic entry (“in”) or exit (“out”).
The first class of priors we can use are uninformative priors.
Obviously, for a given absolute source velocity 1/tE , the source
Since the prior expresses information about the values of pa-
will take longer if it crosses the caustic with a tangential angle.
rameters before any data has been taken, we know that param-
More precisely, the normal velocity is proportional to the co-
eters such as t0 , tE or ∆tcc must have uninformative priors, as
sine of the angle between the trajectory and the caustic normal:
we can only estimate their values by looking at the lightcurve.
v⊥ = |Nc,cc · Nt,cc |/tE . Inserting in this equation the expressions
While for t0 , α or u0 , it is natural to use uniform priors, for
of tE , Nc,cc and Nt,cc (Eqs. 9, 26 and 27 respectively), we can
strictly positive parameters like ∆tcc or tE , a better and com-
compute the source radius ρ∗ as a function of ∆tcc :
monly adopted choice is to use an uninformative prior which is
uniform in the log of the parameter.

(ζout − ζin ) ∧ dζcc
dφcc
ρ∗ = ∆tcc . (28) We illustrate the use of an uninformative prior (uniform pri-
dζcc
(tout − tin ) dφ ors in log tE , in u0 , α and t0 ) by computing P(sin , sout ) for the so-
cc
lution configuration of the binary lens event OGLE 2002-BLG-
This expression would be exact if the crossed caustic were 069 (Kubas et al. 2005; Cassan et al. 2004). The configuration
a perfect and infinite straight line. In reality, however, caus- for that event was that of a source crossing the central caustic
tic curves always have a curvature, and sometimes the source of a close binary lens, with parameter d = 0.46, q = 0.58 and
partly crosses a cusp. Nevertheless, there is no arguing that ∆tcc tout −tin ' 14.5 days. The resulting prior P(sin , sout ) is plotted on
is better-suited than ρ∗ to parameterise the observed caustic Fig. 3; the red cross shows the location of the caustic crossings
Fig. 3. Prior P(sin , sout ) for the solution configuration of caus-

tic crossing event OGLE 2002-BLG-069 (close+central caus-
tic), with an underlying (uninformative) uniform prior in log tE .
The red cross shows the location of the found caustic crossing,
(1.3, 0.3), which falls in a region of relative high probability.
at sin ' 1.3, sout ' 0.3. This falls within a region of high prob-
ability, meaning that the corresponding P(sin , sout ) prior would
have been a reasonable choice for this event.
The second class of priors are those which we can derive
using information known before the event is observed. In mi-
crolensing, most of the physical information is contained in a
Fig. 4. On the top pannel, the histogram (blue rectangles) shows
degenerate way in the Einstein timescale tE . This parameter de-
the distribution of tE found after fitting 788 single-lens mi-
pends on the relative distances between the source, the lens and
crolensing events from OGLE 2006-2007 seasons. The solid
the observer, the kinematics of the lens and the source as well
black line shows the model prediction of Wood & Mao (2005),
as on the lens’ mass function. Therefore the combination of this
which is in good agreement with the data. The bottom pannel
information can inform us on which ranges of values of tE are
displays the prior P(sin , sout ) for the same intermediate caustic
more likely to be observed. In the event OGLE-2007-BLG-472
configuration as for Fig. 2 (d = 1.1, q = 0.1), but assuming an
(Kains et al. 2009), no prior information was included on tE
underlying prior on tE given by the above distribution.
(or the prior was assumed to be uninformative), leading to the
best-fit models having unrealistically large timescales.
Empirical distributions of tE can be obtained by modelling a
large number of observed microlensing events. The top panel of
Fig. 4 shows a histogram (blue rectangles) of tE values found by now ruled out by the prior8 . This effect can be seen directly
fitting 788 single-lens microlensing events from the 2006-2007 on the plot of P(sin , sout ), where strong ”wing” features at the
OGLE seasons (including blending). As expected, the distribu- cusps disappear, and other features arise (compare with Fig. 2).
tion is far from uniform but instead appears roughly log-normal
with a peak around log tE ' 1.32 and σlog tE ' 0.4. Theoretical
4.2. Posterior probability distributions: MCMC fitting
distributions of tE can also be obtained based on Galactic mod-
els predictions. An example of such a distribution is the one In practice these and other statistics relating to the poste-
advocated by Wood & Mao (2005). It is plotted as a solid black rior parameter distribution can be evaluated efficiently us-
line on top of our histogram (Fig. 4, top panel), and is seen to ing a Markov-Chain Monte-Carlo (MCMC) to evaluate the
match the empirical distribution very well. probability-weighted integrals in Bayes’ theorem. A random
Using the Wood & Mao (2005) distribution as a prior, we walk in the parameter space is undertaken by making random
compute and plot (Fig. 4, bottom panel) the corresponding dis- steps drawn from a distribution over the parameters θ. Each
tribution P(sin , sout ), assuming (tout − tin ) = 20 days, d = 1.1 proposed step is accepted or rejected based on the probability
and q = 0.1 (the same intermediate configuration as Fig. 2). of the new point relative to the old one exceeding some thresh-
Fig. 4 (bottom panel) shows that with this prior, cusp-crossing old, which is adjusted to tune the acceptance rate to roughly
trajectories are much less likely to happen. This is because for 20 − 30%. The resulting chain locates and wanders around a
a trajectory near the cusps, the source only has a very short dis-
tance to travel between the entry and exit, while (tout − tin ) is 8
More precisely, when tE → ∞, Wood & Mao (2005) tE distribu-
constant, which means that the source’s motion has to be very tion behaves like 1/tE 3 ∼ |ζout − ζin |3 , and since J ∼ 1/|ζout − ζin |2 , the
slow, leading to large values of the timescale tE , and these are net result is that near cusps, J ∼ |ζout − ζin | → 0.
local minimum, sampling the parameters with a weight pro- References

portional to the posterior probability.
Albrow, M. D., Beaulieu, J.-P., Caldwell, J. A. R., et al. 1999,
While for a maximum likelihood analysis the n relativeo prob- ApJ, 522, 1022
ability used to accept or reject new steps is exp −∆χ2 /2 only,
Beaulieu, J.-P., Bennett, D. P., Fouqué, P., et al. 2006, Nature,
taking into account the difference in χ2 between the old and 439, 437
new points, in a full Bayesian analysis, we multiply this factor Bond, I. A., Abe, F., Dodd, R. J., et al. 2001, MNRAS, 327,
by the ratio of new and old values of the prior P(model), fol- 868
lowing Eq. (29). The posterior probability that the parameters Cassan, A. 2008, A&A, 491, 587
θ lie in a defined region Θ is then Cassan, A., Beaulieu, J. P., Brillant, S., et al. 2004, A&A, 419,
Z L1
P(θ ∈ Θ) = P(θ|D) dθ . (30) Einstein, A. 1936, Science, 84, 506
Θ Gould, A. & Loeb, A. 1992, ApJ, 396, 104
Horne, K., Snodgrass, C., & Tsapras, Y. 2009, MNRAS, 396,
The expected value of any function of parameters, g(θ), is
2087
Z Kains, N., Cassan, A., Horne, K., et al. 2009, MNRAS, 395,
hgi ≡ g(θ) P(θ|D) dθ , (31) 787
Kubas, D., Cassan, A., Beaulieu, J. P., et al. 2005, A&A, 435,
and the variance around that expected value is 941
Z Kubas, D., Cassan, A., Dominik, M., et al. 2008, A&A, 483,
317
(g(θ) − hgi)2 P(θ|D) dθ .

Var g(θ) ≡ (32)
Mao, S. & Paczynski, B. 1991, ApJ, 374, L37
Paczynski, B. 1986, ApJ, 304, 1
Similarly, confidence intervals, parameter covariances and con- Schneider, P. & Wagoner, R. V. 1987, ApJ, 314, 154
fidence intervals can all be evaluated easily in the usual man- Tsapras, Y., Street, R., Horne, K., et al. 2009, AN, 330, 4
ner given the posterior probability distribution found with the Udalski, A. 2003, Acta Astronomica, 53, 291
MCMC algorithm, providing us with a complete statistical pic- Witt, H. J. 1990, A&A, 236, 311
ture of the parameter space we explore. Wood, A. & Mao, S. 2005, MNRAS, 362, 945
5. Conclusion
We have investigated plausible priors for Bayesian analysis of
caustic-crossing microlensing light curves, based on the al-
ternative parameterisation introduced by Cassan (2008). We
have demonstrated a mathematical formulation which allows
to compute analytically Bayesian priors on these parameters,
given the knowledge we have on physical quantities on which
they depend. A number of relevant priors that may be used in
a Bayesian, Markov-Chain Monte-Carlo implementation of the
given equations have been explored.
In the context of fast development of a new generation of
networks of classical and robotic telescopes (eg. Tsapras et al.
2009), a current challenge facing the microlens planet search
community is to fully automate the fitting of binary lens light
curves in real time, after having detected an anomaly (Horne
et al. 2009). This would enable anomalies that are detected in
the observed light curves to be characterised as quickly as pos-
sible to determine whether the anomalous behaviour is caused
by a planet-mass companion to the lens star. Identifying param-
eters that could be automatically guessed by seeing the light
curve (eg. a magnification jump due to a caustic crossing) is
already a step ahead to speed up fitting codes by exploring a
much better bounded parameter space. This was the motiva-
tion of Cassan (2008) to define a new set of parameters. In this
work, we have added the possibility to include Bayesian priors
in the analysis, which avoid exploring combinations of param-
eters that are very unlikely to happen before the light curve has
been observed.

Bayesian Analysis of Caustic-Crossing Microlensing Events: A. Cassan, K. Horne, N. Kains, Y. Tsapras, P. Browne

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Bayesian Analysis of Caustic-Crossing Microlensing Events: A. Cassan, K. Horne, N. Kains, Y. Tsapras, P. Browne

Uploaded by

Copyright:

Available Formats

Astronomy & Astrophysics manuscript no.

Bayesian analysis of caustic-crossing microlensing events

Key words. gravitational lensing, extrasolar planets, methods: analytical

1. Introduction bits of several astronomical units. It thus probes in the planet

parameters θ. The likelihood P(D|θ) is a function of the pa-

3. A Bayesian prior for (sin , sout )

∂t0 1 ζout + ζin ζout ∧ ζin

Fig. 3. Prior P(sin , sout ) for the solution configuration of caus-

local minimum, sampling the parameters with a weight pro- References

You might also like