Professional Documents
Culture Documents
ucausfix
c ESO 2009
December 23, 2009
1
Institut d’Astrophysique de Paris, UMR 7095 CNRS – Université Pierre & Marie Curie, 98 bis boulevard Arago, 75014 Paris, France
2
Scottish Universities Physics Alliance, School of Physics & Astronomy, University of St Andrews, North Haugh, KY169SS, UK
arXiv:0911.5285v1 [astro-ph.EP] 27 Nov 2009
3
Las Cumbres Observatory, 6740B Cortona Dr, suite 102, Goleta, CA 93117, USA
4
School of Mathematical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
Received ; accepted
ABSTRACT
Aims. Caustic-crossing binary-lens microlensing events are important anomalous events, because they may reveal an extrasolar planet
companion orbiting the lens star. Fast and robust modelling methods are thus of prime interest to quickly conclude on the possible planetary
nature of the event. Cassan (2008) introduced a new set of parameters to model binary-lens events, which are closely related to the features
observed in the light curve. In this work, we explain how Bayesian priors can be added in this framework, and investigate on possible
interesting choices.
Methods. We develop a mathematical formulation that allows to compute analytically priors on the new parameters, given some prior
knowledge on other physical quantities. We explicitely compute the priors for a number of interesting cases, and show how this can be
implemented in a fully Bayesian, Markov-Chain Monte-Carlo algorithm.
Results. Using Bayesian priors can speed up microlens fitting codes by reducing time spent on physically implausible models, and helps to
discriminate among alternative models based on the physical plausibility of their parameters.
tion where the Jacobian determinant of the lens mapping equa- scribing caustic-crossing events. In this formalism, the caus-
tion vanishes, i.e. are lines of infinite point-source magnifica- tic curve in the source plane is parameterised by a curvilin-
tion. There are three kinds of caustic topologies which depend ear abscissa (or arc length) running from 0 to 2. The trajec-
on the values of the binary lens mass ratio q and the two com- tory of a source crossing a caustic, which is classically pa-
ponent projected separation d in angular Einstein ring radius θE rameterised by its impact parameter u0 and position angle α,
(Einstein 1936): can alternatively be defined by giving the values sin at ingress
s ! and sout at egress7 . The two parameters timing the trajectory, tE
4GM DS − DL (time to cross one Einstein radius) and t0 (date at minimum im-
θE = , (1)
c2 DS DL pact parameter u0 ), are then replaced by the ingress and egress
where DS , DL are the observer-source and observer-lens dis- times tin and tout . The caustic curve is specified in the source
tances and M the lens total mass. In the close separation regime (i.e. caustic) plane by a complex function ζ(s) = ξ(s) + iη(s)
(cf. Fig. 1 of Cassan 2008), there are three caustics: one central (see Sec. 3.2), and once sin and sout are specified, the source
(4-cusp) and two (3-cusp) planetary caustics; in the intermedi- trajectory is fully defined. This bijective switch of parameters,
ate regime, there is only one (6-cusp) caustic, and in the wide (u0 , α, tE , t0 ) 7→ (sin , sout , tin , tout ), takes advantage of the rel-
separation regime there is one central and one planetary caustic atively high precision with which tin and tout can be guessed
(both with 4 cusps). from the observations (Kains et al. 2009; Kubas et al. 2005).
In many cases, the source trajectory happens to cross a Using these new parameters, Kains et al. (2009) anal-
caustic. As the source crosses the caustic curve and enters the ysed the caustic crossing event OGLE 2007-BLG-472. The ap-
enclosed area, a new pair of images appears, causing a sudden proach taken was a maximum likelihood procedure, quantify-
increase in the observed brightness. Similarly, when the source ing the “goodness-of-fit” by a χ2 statistic, and minimising the
exits the area defined by the caustics, the two images merge χ2 to optimise the fit. A grid search in (d, q) with even spac-
and disappear, causing a rapid drop in the observed bright- ing in log d and log q was conducted. For each (d, q) caustic
ness. These dramatic changes in magnification result in readily configuration, a genetic algorithm was used to explore widely
recognisable jumps in microlensing light curves. As empha- the remaining parameter space. While (sin , sout ) covered the full
sised by Cassan (2008), the ingress and egress times tin and range of possibilities, [0, 2] × [0, 2], tin and tout evolved in very
tout may be restricted within very tight intervals when caustic tight intervals based on the values guessed from the light curve
crossing features have been identified in the light curve, and features (caustic crossing magnification peaks). These first fits
thus advantageously be used as alternative modelling parame- were refined using a Markov-Chain Monte-Carlo (MCMC) al-
ters. gorithm, again holding (d, q) fixed while optimising over all
The new set of binary-lens modelling parameters intro- other parameters. The best-fit models in each of the identified
duced by Cassan (2008) have the advantage that two of these best-fit regions were then found by allowing all parameters to
parameters are very closely related to features that can be vary.
directly identified in the light curve. Using this new formu- As expected for binary lens events, the resulting χ2 (d, q)
lation for the analysis of OGLE 2007-BLG-472 in its most maps revealed a variety of widely-separated model parame-
straightforward implementation as a maximum likelihood anal- ter regions where a relatively low χ2 could be achieved. In
ysis (“minimising χ2 ”), Kains et al. (2009) unveiled a subtle fact the lowest χ2 models corresponded to very low q, in the
aspect of binary-lens modelling: relatively improbable physi- planet-mass regime. But with a short duration between the
cal models with very large values of tE were found with χ2 val- caustic entry and exit, and a planetary caustic size scaling
ues lower than other more plausible models. In order to avoid as q1/2 , these models implied extremely large Einstein time
finding parameter combinations which are physically unlikely, tE ∼ (tout − tin )/q1/2 ∼ 104 days, which are very unlikely ac-
a comprehensive change is to switch to a Bayesian analysis. cording to kinematics of stars motions within the Milky Way.
This is desirable as the Bayesian approach makes use of prior These best-fit maximum likelihood models were therefore re-
information on the underlying physical parameters, while χ2 jected on this physical argument. This need to reject the lowest
says nothing about parameter plausibility. χ2 models highlights a weakness in the maximum likelihood
In this article, we show how to derive Bayesian priors approach, which neglects prior distributions on the parameter
for the Cassan (2008) caustic-crossing binary-lens parameters. space. Bayesian parameter estimation, on the other hand, takes
These are based on physical priors on quantities that can be proper account of prior distributions in the parameter space.
estimated from Galactic models or calculated from already ob- In a Bayesian analysis, the posterior probability distribution
served events (Sec. 2 and 3). In Sec. 4, we describe an imple- over the model parameters θ is a function of the data D:
mentation of this Bayesian formalism within a Markov-Chain P(D|θ)P(θ)
Monte-Carlo fitting scheme, using in particular priors on the P(θ|D) = R , (2)
P(D|θ)P(θ) dθ
Einstein time tE (time for the source to travel an angular dis-
tance θE ). where P(θ) is the prior probability distribution on the parame-
ters, and the denominator partition function ensures proper nor-
2. Maximum likelihood vs. Bayesian fitting malisation of the posterior as a probability distribution over the
Cassan (2008) introduced a new parameterisation of binary 7
We use the notations “in” and “out” in place of “entry” and “exit”
lens microlens light curve model that is well suited to de- of Cassan (2008) to write more condensed formulae.
A. Cassan et al.: Bayesian analysis of caustic-crossing microlensing events 3
Fig. 2. Bayesian prior P(sin , sout ) as a function of sin (horizontal axis) and sout (vertical axis), where we have assumed isotropic
source trajectories and uniform distributions for tE and event rate. Higher values of P appear in white (linear scale). From left to
right, the caustic configurations are: (a) intermediate with d = 1.1 and q = 0.1 ; (b) wide+central and (c) wide+secondary caustic,
both for d = 2 and q = 0.1 ; (d) close+central and (e) close+secondary caustic, both for d = 0.5 and q = 0.1.
scale). From left to right, these configurations are: intermediate crossing, since its rough value can be guessed from the light
with d = 1.1 and q = 0.1, wide+central and wide+secondary, curve, contrary to ρ∗ . In practice, we choose the caustic cross-
both for d = 2 and q = 0.1, close+central and close+secondary ing which has the best data coverage and which best looks like
caustic, both for d = 0.5 and q = 0.1. One can compare the in- a straight line caustic crossing to extract the starting value for
termediate case plot with Fig. 1. In Sec. 4.1, we will investigate fitting ∆tcc .
how assuming different priors on the Einstein time tE affect the
prior on (sin , sout ).
4. Markov-Chain Monte-Carlo
3.3. Extended sources 4.1. Examples of prior probability distributions
When the source approaches the caustic curves (typically less For a given set of fitting parameters (sin , sout , tin , tout , ∆tcc ), the
than three projected source radii), one needs to take into ac- prior of the probed model reads:
count extended source effects in the modelling. As for tin and
tout , it is usually possible to extract from the light curve a new P(model) = P(sin , sout ) P(tin , tout , ∆tcc ) . (29)
parameter that can be used instead of the source radius.
The prior P(sin , sout ) is computed as explained in Sec. 3.2, and
In fact, it is well-known that when the source crosses a
may include priors that have been set on the other parame-
straight line caustic (which is in many cases a good approxi-
ters u0 , α, tE , t0 or ρ∗ by properly weighting P(sin , sout ). Given
mation of a real caustic), one can easily guess the duration of
Eqs. (9) and (10), a prior P(tin , tout ) is equivalent to a prior
the crossing from the shape of the caustic crossing feature itself
P(t0 , tE ) with a corresponding change in the prior P(sin , sout ).
(Cassan et al. 2004; Albrow et al. 1999; Schneider & Wagoner
In this section, we discuss different priors on the different pa-
1987). Here, we define this duration as the time for the source
rameters that could realistically be used in the Bayesian analy-
to cross the caustic line by its full radius (i.e. from centre to
sis. In Fig. 2 for example, we have shown the case of isotropic
limb), so that ∆tcc = ρ∗ /v⊥ ; in this definition, ρ∗ is the source
trajectories, which corresponds to uniform priors in the param-
radius in Einstein ring radius units, v⊥ is the component of the
eters u0 and α. This choice is well-justified, since the direction
source velocity perpendicular to the caustic, and the subscript
of the binary lens axis is random.
“cc” refers to either the caustic entry (“in”) or exit (“out”).
The first class of priors we can use are uninformative priors.
Obviously, for a given absolute source velocity 1/tE , the source
Since the prior expresses information about the values of pa-
will take longer if it crosses the caustic with a tangential angle.
rameters before any data has been taken, we know that param-
More precisely, the normal velocity is proportional to the co-
eters such as t0 , tE or ∆tcc must have uninformative priors, as
sine of the angle between the trajectory and the caustic normal:
we can only estimate their values by looking at the lightcurve.
v⊥ = |Nc,cc · Nt,cc |/tE . Inserting in this equation the expressions
While for t0 , α or u0 , it is natural to use uniform priors, for
of tE , Nc,cc and Nt,cc (Eqs. 9, 26 and 27 respectively), we can
strictly positive parameters like ∆tcc or tE , a better and com-
compute the source radius ρ∗ as a function of ∆tcc :
monly adopted choice is to use an uninformative prior which is
uniform in the log of the parameter.
(ζout − ζin ) ∧ dζcc
dφcc
ρ∗ = ∆tcc . (28) We illustrate the use of an uninformative prior (uniform pri-
dζcc
(tout − tin ) dφ ors in log tE , in u0 , α and t0 ) by computing P(sin , sout ) for the so-
cc
lution configuration of the binary lens event OGLE 2002-BLG-
This expression would be exact if the crossed caustic were 069 (Kubas et al. 2005; Cassan et al. 2004). The configuration
a perfect and infinite straight line. In reality, however, caus- for that event was that of a source crossing the central caustic
tic curves always have a curvature, and sometimes the source of a close binary lens, with parameter d = 0.46, q = 0.58 and
partly crosses a cusp. Nevertheless, there is no arguing that ∆tcc tout −tin ' 14.5 days. The resulting prior P(sin , sout ) is plotted on
is better-suited than ρ∗ to parameterise the observed caustic Fig. 3; the red cross shows the location of the caustic crossings
6 A. Cassan et al.: Bayesian analysis of caustic-crossing microlensing events
at sin ' 1.3, sout ' 0.3. This falls within a region of high prob-
ability, meaning that the corresponding P(sin , sout ) prior would
have been a reasonable choice for this event.
The second class of priors are those which we can derive
using information known before the event is observed. In mi-
crolensing, most of the physical information is contained in a
Fig. 4. On the top pannel, the histogram (blue rectangles) shows
degenerate way in the Einstein timescale tE . This parameter de-
the distribution of tE found after fitting 788 single-lens mi-
pends on the relative distances between the source, the lens and
crolensing events from OGLE 2006-2007 seasons. The solid
the observer, the kinematics of the lens and the source as well
black line shows the model prediction of Wood & Mao (2005),
as on the lens’ mass function. Therefore the combination of this
which is in good agreement with the data. The bottom pannel
information can inform us on which ranges of values of tE are
displays the prior P(sin , sout ) for the same intermediate caustic
more likely to be observed. In the event OGLE-2007-BLG-472
configuration as for Fig. 2 (d = 1.1, q = 0.1), but assuming an
(Kains et al. 2009), no prior information was included on tE
underlying prior on tE given by the above distribution.
(or the prior was assumed to be uninformative), leading to the
best-fit models having unrealistically large timescales.
Empirical distributions of tE can be obtained by modelling a
large number of observed microlensing events. The top panel of
Fig. 4 shows a histogram (blue rectangles) of tE values found by now ruled out by the prior8 . This effect can be seen directly
fitting 788 single-lens microlensing events from the 2006-2007 on the plot of P(sin , sout ), where strong ”wing” features at the
OGLE seasons (including blending). As expected, the distribu- cusps disappear, and other features arise (compare with Fig. 2).
tion is far from uniform but instead appears roughly log-normal
with a peak around log tE ' 1.32 and σlog tE ' 0.4. Theoretical
4.2. Posterior probability distributions: MCMC fitting
distributions of tE can also be obtained based on Galactic mod-
els predictions. An example of such a distribution is the one In practice these and other statistics relating to the poste-
advocated by Wood & Mao (2005). It is plotted as a solid black rior parameter distribution can be evaluated efficiently us-
line on top of our histogram (Fig. 4, top panel), and is seen to ing a Markov-Chain Monte-Carlo (MCMC) to evaluate the
match the empirical distribution very well. probability-weighted integrals in Bayes’ theorem. A random
Using the Wood & Mao (2005) distribution as a prior, we walk in the parameter space is undertaken by making random
compute and plot (Fig. 4, bottom panel) the corresponding dis- steps drawn from a distribution over the parameters θ. Each
tribution P(sin , sout ), assuming (tout − tin ) = 20 days, d = 1.1 proposed step is accepted or rejected based on the probability
and q = 0.1 (the same intermediate configuration as Fig. 2). of the new point relative to the old one exceeding some thresh-
Fig. 4 (bottom panel) shows that with this prior, cusp-crossing old, which is adjusted to tune the acceptance rate to roughly
trajectories are much less likely to happen. This is because for 20 − 30%. The resulting chain locates and wanders around a
a trajectory near the cusps, the source only has a very short dis-
tance to travel between the entry and exit, while (tout − tin ) is 8
More precisely, when tE → ∞, Wood & Mao (2005) tE distribu-
constant, which means that the source’s motion has to be very tion behaves like 1/tE 3 ∼ |ζout − ζin |3 , and since J ∼ 1/|ζout − ζin |2 , the
slow, leading to large values of the timescale tE , and these are net result is that near cusps, J ∼ |ζout − ζin | → 0.
A. Cassan et al.: Bayesian analysis of caustic-crossing microlensing events 7
5. Conclusion
We have investigated plausible priors for Bayesian analysis of
caustic-crossing microlensing light curves, based on the al-
ternative parameterisation introduced by Cassan (2008). We
have demonstrated a mathematical formulation which allows
to compute analytically Bayesian priors on these parameters,
given the knowledge we have on physical quantities on which
they depend. A number of relevant priors that may be used in
a Bayesian, Markov-Chain Monte-Carlo implementation of the
given equations have been explored.
In the context of fast development of a new generation of
networks of classical and robotic telescopes (eg. Tsapras et al.
2009), a current challenge facing the microlens planet search
community is to fully automate the fitting of binary lens light
curves in real time, after having detected an anomaly (Horne
et al. 2009). This would enable anomalies that are detected in
the observed light curves to be characterised as quickly as pos-
sible to determine whether the anomalous behaviour is caused
by a planet-mass companion to the lens star. Identifying param-
eters that could be automatically guessed by seeing the light
curve (eg. a magnification jump due to a caustic crossing) is
already a step ahead to speed up fitting codes by exploring a
much better bounded parameter space. This was the motiva-
tion of Cassan (2008) to define a new set of parameters. In this
work, we have added the possibility to include Bayesian priors
in the analysis, which avoid exploring combinations of param-
eters that are very unlikely to happen before the light curve has
been observed.