You are on page 1of 15

Glitch veto based on unphysical gravitational wave binary inspiral templates

1, ∗ 1, 2, †
Raghav Girgaonkar, and Soumya D. Mohanty,
1
Department of Physics and Astronomy, University of Texas Rio Grande Valley,
One West University Blvd., Brownsville, Texas 78520
2
Department of Physics, IIT Hyderabad, Kandi, Telangana-502284, India
Transient signals arising from instrumental or environmental factors, commonly referred to as
glitches, constitute the predominant background of false alarms in the detection of gravitational
waves in data collected from ground-based detectors. Therefore, effective data analysis methods for
vetoing glitch-induced false alarms are crucial to enhancing the sensitivity of a search for gravita-
tional waves. We present a veto method for glitches that impact matched filtering-based searches
for binary inspiral signals. The veto uses unphysical sectors in the space of chirp time parameters as
well as an unphysical extension including negative chirp times to efficiently segregate glitches from
arXiv:2401.15237v1 [gr-qc] 26 Jan 2024

gravitational wave signals in data from a single detector. Inhabited predominantly by glitches but
nearly depopulated of genuine gravitational wave signals, these unphysical sectors can be efficiently
explored using Particle Swarm Optimization. In a test carried out on data taken from both LIGO
detectors spanning multiple observation runs, the veto was able to reject 99.9% of glitches with no
loss of injected signals detected with a signal-to-noise ratio ≥ 9.0. Our results show that extending
a matched filter search to unphysical parts of a signal parameter space promises to be an effective
strategy for mitigating glitches.

I. INTRODUCTION as glitches, of instrumental or environmental origin. In


some cases, glitches can also overlap with GW signals,
The direct detection of a binary black hole (BBH) as in the case of GW170817, and cause a CBC search
merger by the twin LIGO detectors [1] in 2015 [2] ushered pipeline to discard genuine GW signals. Glitches appear-
in the new era of gravitational wave (GW) astronomy. ing in close proximity to GW signals is not a negligible
Since then, the three detector LIGO-Virgo [3] network occurrence: In O3, a total of 18 GW events were affected
has detected over 90 GW signals from compact binary by the presence of a nearby or overlapping glitch, ac-
coalescences (CBCs) across three observing runs (O1, counting for ≈ 20% of the total number of detected GW
O2 [4] and O3 [5]). The catalog of detected CBC events events [9].
includes GW170817 [6], a binary neutron star (BNS) Although glitches come in a variety of shapes, many of
merger, amongst other BBH and neutron star-black hole them are observed to fall into distinct classes that cor-
(NSBH) mergers. The worldwide network of comparable respond to their distinct origins in the hardware or the
sensitivity GW detectors has grown with the recent ad- environment of a detector. The presence of glitch classes
dition of KAGRA [7] and it will expand further in the has been demonstrated using a variety of machine learn-
coming years with the planned LIGO-India [8]. The ex- ing methods such as Deep Convolutional Neural Net-
pansion of the network will result in a higher combined works [10], Support vector machines [11], t-Sne [12], ran-
sensitivity and an increased rate of CBC detections. dom forests [11], and S-means [13]. The Gravity Spy
Searches for CBC signals use the Generalized Likeli- project [14] has provided a high-quality data set for ma-
hood Ratio Test (GLRT) in which the log-likelihood ratio chine learning-based classifiers and has found about 22
of given data is maximized over the space of parameters different glitch classes over multiple observing runs of
characterizing the target family of signal waveforms. The the LIGO detectors [12].
GLRT is implemented using cross-correlations between Although glitch classification is invaluable for tracking
the data and candidate signal waveforms, called tem- down and eliminating their sources, this has seen limited
plates, in a strategy known as matched filtering. While success. For the majority of glitches present in real GW
the mathematical form of the GLRT, and its derivative data, one must supplement the GLRT with additional al-
the matched filter, are derived from idealized assump- gorithms, called vetoes, to mitigate the adverse effects of
tions of stationarity and Gaussianity of the noise in the glitches. One such method that has been used widely in
output of a GW detector, real data have many features different forms in CBC searches is the χ2 -veto [15]. Un-
that deviate from these assumptions and cause a degra- der a unified formalism [16], a χ2 -veto consists of choos-
dation in its performance. CBC searches, in particular, ing a particular subspace in the null space of a given
incur a higher false alarm rate and hence lower sensitiv- template, projecting the given data onto this subspace,
ity, due to the presence of transient signals, also known and computing the norm of the projection. By construc-
tion, the projection norm should be consistent with that
of pure noise if the template happens to match the un-
known signal in the data. However, it may be sizeable
∗ raghav.girgaonkar01@utrgv.edu if the data contains a glitch, thereby serving as a veto
† soumya.mohanty@utrgv.edu statistic. A limitation of this approach is that CBC sig-
2

nals do not constitute a vector space in themselves, in the not. However, unlike the χ2 -veto, we do not pick the un-
sense that a linear combination of two CBC signals is not physical waveforms from the null subspace of a candidate
another CBC signal, which makes the null space (and its signal or tune them for specific glitch classes. In addi-
subspaces) dependent on the unknown parameters of a tion, besides using the norm of the data projection on
signal in given data. In addition, computational consid- unphysical templates as done in χ2 -vetoes, the method
erations severely limit the dimensionality of the chosen also uses the times of arrival of a signal estimated with
subspace, which creates a vast number of possible choices physical and unphysical templates.
for the subspaces. It has been shown [17, 18] that the To create unphysical waveforms, we use the space of
freedom of choosing the subspace can be exploited to chirp time parameters that are obtained by an injec-
maximize projections for specific classes of glitches, re- tive but non-surjective mapping of the binary component
sulting in improvements in the detection sensitivity for masses. This makes the chirp time space larger than the
high-mass CBC signals [4]. However, this could also re- mass parameter space and, hence, splits it into physi-
quire re-tuning the method for different glitch classes or cal and unphysical regions. Furthermore, the unphysical
when the classes themselves change due to changes in the sector can be significantly enlarged using negative chirp
detectors and their environment across observation runs. time values, allowing a larger population of glitches to be
A different veto strategy is implemented in the GstLAL- trapped and vetoed. The segregation of the estimated pa-
based inspiral pipeline [19] that uses a gating method to rameters of glitches and injected GW signals using physi-
remove glitches that are strong enough to significantly cal and unphysical templates is seen to be strong enough
increase the estimated variance of whitened data from to veto glitch events in real data across a wide range
the expected value of unity. of Signal-to-Noise Ratios (SNRs). The lack of any as-
Additionally, GstLAL also employs a signal- sumptions in the method about glitch classes ensures its
consistency veto in which the time series of the usefulness irrespective of changes in the makeup of the
matched filter output around its peak value is compared glitch population from one observation run to another.
with that expected from a real signal. In [20], it was Searching over unphysical regions of the parameter
shown that several classes of glitches occupy extreme space incurs a significant additional computational bur-
parts of the parameter space of BBH mergers under den in a conventional matched filtering strategy based
matched filtering where no GW signals have been on a bank of pre-computed waveforms. However, this
observed, thereby suggesting a possible veto based on problem is mitigated in a stochastic search strategy for
mapping the distribution of glitches in this space. binary inspiral signals implemented with Particle Swarm
In addition to the above vetoes that are applied to Optimization (PSO) [27–29]. In fact, the inclusion of
the output of a search pipeline, several approaches have unphysical regions in positive chirp time space is an in-
been explored in which the waveform of a glitch is esti- herent part of such a search that incurs no additional
mated and subtracted from the data before conducting computational cost. This is because PSO works best for
a GW search. Glitch subtraction using BayesWave [21], hypercubical search spaces while the physical region in
a wavelet-based method that uses data from a detector positive chirp time space is non-hypercubical. Hence,
network to distinguish between a GW signal and a glitch, the search space for PSO must necessarily extend to the
was used in parameter estimation for GW170817 [22] as bounding hypercube of the physical region and include
well as several subsequent signals [9]. Adaptive spline non-physical sectors.
fitting implemented in SHAPES [23] can fit out different The search over the negative chirp time space, on
types of glitches [24] in the presence of an overlapping the other hand, increases the computational cost of
CBC signal such as GW170817. GRITCLEAN but not strongly, since it is activated only
Glitschen [25] uses parameterized waveform models to when a candidate event bypasses the veto in the positive
describe identified glitch classes using principal compo- chirp time search.
nent analysis. The Antiglitch method[26] uses empirical We tested the performance of GRITCLEAN on ≈ 131
waveform models for the four most commonly occurring hours of data, free of any observed GW signals, taken
classes of short-duration glitches to implement a matched from the Hanford (H1) and Livingston (L1) detectors
filter for detecting and subtracting them out. across different observing runs. We used a PSO-based
In this paper we present a novel glitch veto called search with templates and injected signals belonging to
GRITCLEAN (Glitch Rejection using Illegal Templates and the restricted 2PN waveform family [30]. Using a detec-
Cross-Linking of Event Attribute Numbers) that is based tion threshold of SNR = 9.0, we found that GRITCLEAN
on extending a matched filtering search to unphysical was able to reject 99.9% of candidate events, most of
parts of the CBC signal parameter space where the sig- them glitches since none were GW signals, that crossed
nal waveforms can never arise from a gravitational radi- this threshold. The safety of GRITCLEAN was tested on
ation driven inspiral of two point particles. Similar to CBC signals covering a wide mass and SNR range in-
the χ2 -veto, it exploits the key idea that a CBC signal is jected in the same data. It was found that GRITCLEAN did
expected to present very different responses when corre- not reject any of the injections that passed the detection
lated with physical and unphysical CBC templates while threshold. It is important to note that this performance
a glitch, which does not resemble a CBC signal, may is achieved with a single-detector analysis alone. The
3

effectiveness of the veto would increase manyfold when tion [31] as


supplemented by coincidence requirements for events de- Z ∞
tected across multiple detectors. h̃+,× (f ) = dt h+,× (t)e−2πif t , (2)
The rest of the paper is organized as follows. Section −∞
II contains a brief review of the GLRT formalism used 1 + cos ι − 7
h̃+ (f ) = A f 6 exp[−iΨ(f )] , (3)
in our analysis, the waveform model used, and the PSO- 2
based search. An outline of our glitch veto scheme is 7 π
h̃× (f ) = A cos ιf − 6 exp[−i(Ψ(f ) + )] , (4)
given in Section III, followed by the results in Section 2
IV. Our conclusions and discussions for future work are where A is the overall distance-dependent amplitude, ι is
presented in Section V. the inclination angle of the orbital plane of the binary to
the line of sight from the detector, and e h+,× (f ) = 0 for
f∈ / [f∗ , fISCO ]. Here, fISCO is the highest instantaneous
frequency of the GW signal associated with the inspiral,
II. SIGNAL DETECTION AND ESTIMATION corresponding to the innermost stable circular orbit be-
fore plunge, and f∗ is the low-frequency cut-off, assumed
In this section, we present a brief overview of the math- to be 30 Hz in this study, caused by the sharp rise in seis-
ematical formalism underlying the GLRT for restricted mic noise below which a GW signal cannot be observed.
2PN signals followed by the implementation details of a Although fISCO depends on the component masses of a
PSO-based search. binary system, we simplify our code implementation by
setting fISCO = 700 Hz, which corresponds to the lowest
mass systems considered in our study. The phase Ψ(f )
is given by,
A. Data and Signal Models 4  (j−5)/3
π X f
Ψ(f ) = 2πf (ta + τ ) − φ0 − + αj , (5)
4 j=0 f∗
The data from a GW detector is of the form,
where the coefficients αj are defined as,
y(t) = s(t) + n(t) , (1)
3τ0
α0 = 2πf∗ ,
5
where s(t) could be a GW signal or a glitch, and n(t) is α1 = 0,
noise. In this study, we assume that the GW signal be-
longs to the restricted 2PN waveform family in which the α2 = 2πf∗ τ1 ,
phase evolution of the signal includes corrections due to 3τ1.5
α3 = −2πf∗ ,
GW radiation up to orders of (v/c)4 , while the modula- 5
tion of the amplitude is calculated up to only the Newto- α4 = 2πf∗ 3τ2 ,
nian order [30]. The spins of the binary components are
neglected at this order and the orbit is assumed to have and {τ0 , τ1 , τ1.5 , τ2 }, called the chirp time parameters, are
circularized by the time the instantaneous frequency of obtained from the binary component masses m1 and m2
the signal enters the sensitive band of ground-based de- as,
tectors. At this order, the time evolution of the signal  − 53
frequency is determined solely by the masses of the binary 5 GM
τ0 = f −1 πf∗ η −1 ,
components. We do not make any assumptions about the 256π ∗ c3
signal waveforms of glitches.  −1  
5 GM 743 11
The single detector strain response, h(t), is expressed τ1 = f∗−1 πf ∗ η −1
+ η ,
192π c3 336 4
in terms of h+,× (t), the polarization waveforms in the  − 23
Transverse Traceless gauge, as follows, 1 GM
τ1.5 = f∗−1 πf ∗ η −1 ,
8 c3
 − 13
h(t) = F+ (α, δ, ψ)h+ (t) + F× (α, δ, ψ)h× (t) . 5 −1 GM
τ2 = f πf∗
128π ∗ c3
Here, F+ (α, δ, ψ) and F× (α, δ, ψ) are the detector an-  
3058673 5429 617 2
tenna patterns that depend on the sky location of the × η −1 + η+ η ,
1016064 1008 144
source, given by the azimuthal angle α and the polar an- m1 m2 µ
gle δ, and the orientation, given by the polarization angle M = (m1 + m2 ), µ = ,η= .
M M
ψ, of the binary orbit projected on the sky. The GW po-
larization waveforms, h+,× (t), can be expressed in the The time at which the instantaneous frequency of the
Fourier domain using the stationary phase approxima- signal crosses f∗ is designated as its time of arrival (TOA)
4

ta and the phase of the signal at t = ta is the initial phase In GLRT, the detection statistic is defined as,
φ0 . The combination
LG (y) = max LLR(y, Θ) . (14)
Θ
τ = τ0 − τ1.5 + τ1 + τ2 , (6)
Formally, a signal detection is declared if LG (y) > η for
called the chirp length, approximates the observed dura- a preset threshold η. In such a case,
tion of the signal and is a key quantity in GRITCLEAN:
For a physically valid waveform, one must have τ > 0. b = argmax LLR(y, Θ) ,
Θ (15)
The restricted 2PN signals can be parameterized either Θ
in terms of the binary component masses or any two of
the chirp times. It is convenient to choose τ0 and τ1.5 as provides the Maximum Likelihood Estimate (MLE) of
the independent parameters since they can be inverted the unknown parameters of the signal present in the data.
analytically to obtain M and µ, hence m1 and m2 , b for a given
Henceforth, we will call the pair (LG (y), Θ)
data segment y an event. In real data, which may have
  13  −1 non-GW signals or non-Gaussian noise, the result of the
1 5 G
µ= 2 , (7) GLRT is not immediately deemed to be a detection even
16f∗2
4π 4 τ0 τ1.5 c3
   −1 if LG (y) > η. Instead, the occurence of LG (y) > η pro-
5 τ1.5 G vides only a candidate event for the given y that must
M= 2
, (8)
32f∗ π τ0 c3 pass additional checks, namely, the vetoes before it can
p be declared to be a GW detection.
M − M 2 − 4µM
m1 = , (9) For the single detector case considered in this paper,
p 2 F+,× , ι, and A fold into an overall amplitude and a re-
M + M 2 − 4µM definition of the initial phase φ0 . Splitting the set of
m2 = , (10)
2 parameters as Θ = Θ′ ∪ {ρ, φ0 }, where Θ′ = {τ0 , τ1.5 , ta }
and ρ = ks(Θ)k, the signal can be expressed as,
where we choose the convention m1 < m2 . The values
of M and µ above can in turn be used to derive the s(Θ) = ρq(Θ′ , φ0 ) , (16)
remaining chirp times τ1 and τ2 .
s(Θ)
q(Θ′ , φ0 ) = , ⇒ kq(Θ′ , φ0 )k = 1 . (17)
ks(Θ)k
B. The GLRT and MLE
Here, ρ defines the SNR s(Θ) and q(Θ′ , φ0 ) is called the
template waveform.
While not true in general for real data, the assumption In terms of these parameters, the GLRT can be com-
of n(t) being a Gaussian noise process allows us to derive puted as,
the log-likelihood ratio of a finite duration segment of the
 
data as, ′ 1 2
LG (y) = max max ρhy, q(Θ , φ0 )i − ρ . (18)
1 Θ′ ρ,φ0 2
LLR(y, Θ) = hy, s(Θ)i − ||s(Θ)||2 , (11)
2 Substituting the solution of the inner maximization,
where for any continuous time function x(t), x = namely,
(x0 , x1 , . . . , xN −1 ) is the row vector of sampled values
xk = x(tk ), with tk = k/fs and fs being the sampling ρb = hy, q(Θ′ , φb0 )i , (19)
frequency, and hy, q 0 (Θ′ )i
φb0 = arctan , (20)
hy, q 1 (Θ′ )i
−1 T
hx, yi = xC y , (12)
q 0 (Θ′ ) = q(Θ′ , 0) , (21)
is the inner product based on the covariance matrix C of q 1 (Θ′ ) = q(Θ′ , π/2) , (22)
the noise n in y, with Cij = E[ni , nj ], and kxk2 = hx, xi
is the norm induced by the inner product. Here, Θ is the in Eq. 18 and using another split, Θ′ = θ ∪ {ta } and θ =
set of parameters that describe the 2PN signal. Under {τ0 , τ1.5 }, the expression for the GLRT statistic reduces
the assumption of wide-sense stationarity of the noise, to
the inner product above is conveniently expressed as
LG (y) = max λ(θ) , (23)
† θ

hx, yi = x
e ye./S n , (13) λ(θ) = max hy, q 0 (θ, ta )i2 +
ta

where x e is the Discrete Fourier Transform (DFT) of x, hy, q 1 (θ, ta )i2 ] . (24)
‘./’ denotes element-wise division, and S n is the two-sided
power spectral density (PSD) Sn (f ) sampled at the DFT For fixed θ, the maximization over ta can be performed
frequencies. by simply evaluating hy, q 0,1 (θ, ta )i at values of ta equal
5

to the sampling times tk = k/fs , k = 0, 1, . . . , N − 1.


This is a cross-correlation operation between the data 8
and each of the quadrature templates, q 0 (Θ′ ) and q 1 (Θ′ ),
which can be performed efficiently [32, 33] using the Fast 6
Fourier Transform (FFT). In this paper, the remaining
maximization of λ(θ), called the fitness function, over θ 4
is carried out using PSO.
Note that, from Eq. 19 and Eq. 20, the square root of 2
LG (y) is equivalent to the SNR ofpthe estimated signal.
As such, from here on, we refer to λ(θ) as the estimated
SNR at the location θ. 1 2
10 10

C. PSO-based search FIG. 1: Plot illustrating performance of PSO. Each


point corresponds to one data segment containing an
In the specific context of the optimization problem in injected GW signal. Here, ρbtrue and ρbPSO denote the
Eq. 23, PSO searches for the global maximum of the fit- estimated SNRs at the true parameters of the injected
ness function λ(θ) iteratively over a rectangle τa,min ≤ signal and the best location found by PSO, respectively.
τa ≤ τa,max , a ∈ 0, 1.5 called the search space. In each The Y-axis shows the relative difference in these two
iteration, a fixed number of sampled values are obtained quantities, with the vertical blue line showing
for the fitness function. The locations of these samples in SNR = 9.0 for reference. In all cases, ρbPSO > ρbtrue ,
the search space are called particles and the set of parti- indicating that PSO performed well.
cles is called a swarm. Based on their fitness values, the
particles are moved to new locations in the next itera-
tion. Each particle keeps track of the best location in its out as a particularly robust algorithm, at least in its ap-
own history, called its personal best, as well as the best plications to GW data analysis problems, and the only
location found by its neighbors, called its local best. The parameters that typically need to be tuned are Niter and
specification of the neighborhood of each particle sets the the number, Nruns , of independent runs.
topology of the swarm. Each particle carries with it a A simple strategy to assess the tuning of PSO is to
vector, called its velocity, that specifies its displacement take a data realization with an injected GW signal and
from one iteration to the next. compare the estimated SNR found by PSO, ρbPSO , with
The heart of the PSO algorithm is the dynamical equa- the value, ρbtrue , at the true parameters of the injection.
tion for updating the velocity. In standard variants Since, in the presence of noise, the global maximum of
of PSO, it has contributions from (i) an inertia term the fitness function must shift away from the injection
that simply scales the current velocity, (ii) a randomly parameters, one expects to have ρbPSO ≥ ρbtrue if PSO is
weighted cognitive term that attracts the particle towards working well. A given choice of Nruns and Niter is deemed
its personal best location, and (iii) a randomly weighted acceptable if this condition is satisfied in a significant
social term that attracts the particle towards the local fraction of independent data realizations. In this paper,
best location. The final result of the search is the best we have set Niter = 500 and Nrun = 8, and as shown in
of all local bests at termination. A simple termination Fig. 1, the condition ρbPSO ≥ ρbtrue is satisfied in all the
condition, followed in our work, is to fix the number of data segments with injected signals (see Sec. IV B) used
iterations, Niter , in advance. One also needs to specify in our study. The other parameters of PSO have been
the dynamical rule for a particle that breaches the search set at the same values as in [29]. In particular, we use 40
space boundary. We use the let-them-fly boundary con- particles and the ring topology where the neighborhood
dition [34] in which no change is made to the dynamical of each particle consists of the two particles adjacent to it
equation but the fitness is set to −∞, thereby ensuring when the indices of the particles are placed sequentially
that the attractions of the local and personal bests even- around a circle.
tually pull the particle back in. Further details about PSO is known to perform best when the search space
PSO are discussed in pedagogical introductions such as is a hypercube. Non-hypercubical search spaces lead to
[35, 36]. an excessive loss of particles under the boundary con-
A stochastic search algorithm, such as PSO, is not dition used here. Thus, when run over the rectangular
guaranteed to converge to the global maximum in a finite chirp time search space described above, the search nat-
number of iterations. However, the probability of conver- urally extends to its unphysical sectors. The PSO-based
gence can be improved by tuning the algorithm for a spe- search was carried out using a pipeline that was devel-
cific problem. Further improvement can be obtained by oped in MATLAB R2022 [37]. For each data segment in our
using multiple runs of the algorithm, with independent study, the search ranges for PSO were set as τ0 ∈ [0, 90],
pseudorandom streams, on the same fitness function and τ1.5 ∈ [0, 2] for the positive chirp time quadrant, and
selecting the best solution among them [36]. PSO stands τ0 ∈ [−90, 0], τ1.5 ∈ [−2, 0] for the negative chirp time
6

quadrant. This is not possible in a search over the mass B. Negative chirp time Quadrant
parameter space, since every point corresponds to a phys-
ical signal waveform. We use this property of the chirp While the chirp time parameters are never negative
time space in our veto scheme as described in the follow- by construction, nothing prevents us from using negative
ing section. values for them in the phase Ψ(f ) in Eq. 5. The resulting
waveform in the time domain does not correspond to any
physical binary inspiral but remains a valid real-valued
III. GLITCH VETO function of time in itself. Therefore, extending a PSO-
based search to SN provides a significant enlargement
of the space of unphysical template waveforms for use
As outlined earlier, GRITCLEAN uses physical and un- in GRITCLEAN. At the same time, no additional ad hoc
physical template waveforms to probe the nature of a parameters are introduced in the overall veto strategy,
candidate event. In this section, we first define physi- such as assumptions about specific glitch classes, besides
cal and unphysical templates, and then present the veto the search range in SN .
strategy. While the negative quadrant of the chirp time We define the template waveforms in SN using a few
space, τ0 < 0, τ1,5 < 0, denoted as SN , consists entirely modifications to the expressions in Sec.( II A). To calcu-
of unphysical templates, the positive quadrant, τ0 > 0, late the values of τ1 and τ2 from negative values of τ0 and
τ1.5 > 0, denoted as SP , has both types of templates. At τ1.5 , M and µ are obtained as,
the same time, a part of SN must be treated specially
  13  −1
because the templates in that part can resemble binary 1 5 G
inspiral signals despite being unphysical. These details µ= , (26)
16f∗2 4π 4 |τ0 ||τ1.5 |2 c3
are presented separately for the two quadrants below.    −1
5 |τ1.5 | G
M= 2
. (27)
32f∗ π |τ0 | c3
A. Positive chirp time Quadrant The chirp times τ1 and τ2 are obtained from M and µ as
before (c.f., Eq. 9 and Eq. 10) but both are multiplied by
The quadrant SP , has two different types of unphys- −1, thus making all the chirp time parameters negative.
ical sectors. The first is one in which the inversion of In addition, we modify Ψ(f ) to,
(τ0 , τ1.5 ) to the masses of binary components (m1 , m2 ) 4  (j−5)/3
π X f
yields complex values. From Eq. 9 and Eq.10, this hap- Ψ(f ) = 2πf ta − φ0 − + αj (28)
pens when 4 j=0 f∗

to ensure that the TOA, ta , refers to the start of the


M < 4µ . (25)
waveform.
Almost all of the template waveforms thus defined have
Expressing this condition in terms of the chirp times us- negative chirp lengths (cf., Eq. 6) and instantaneous fre-
ing Eqs. 7 and 8, we see that points in the (τ0 , τ1.5 ) space quencies that decrease with time. However, we find that
that lie below the curve there exists a small region in SN , with an origin similar
  1/5 to that of the negative chirp time region in SP , in which
1 128π 2 the waveforms have positive chirp lengths and increasing
τ1.5 = 2 3 ,
τ0 f∗ 25 instantaneous frequencies. Such waveforms can have a
strong correlation with GW signals, especially with high-
which corresponds to the M = 4µ or m1 = m2 curve, mass and short-duration ones. We address this issue sim-
satisfy this condition. The second unphysical region in ply by replacing the fitness value for a point (τ0 , τ1.5 ) in
SP is the one where the chirp length of a signal, given by this region with the value of the point with the two (still
Eq. 6, becomes negative. This happens due to the fact negative) chirp times swapped, that is, (τ1.5 , τ0 ). The de-
that τ1.5 appears with an opposite sign to all the other generacy introduced in the fitness function by this swap
chirp times in the expression for the chirp length and, in does not have a significant impact on the search in SN
this region, τ1.5 acquires a comparatively large value. due to the smallness of the region over which this occurs.
The physical and unphysical regions of SP are shown Figure 3 illustrates the effect of swapping the chirp times
in Fig. 2. While the unphysical region corresponding to of an increasing frequency chirp in SN .
negative chirp length is a very small in area, we find it
to be an important one since many glitches are observed
to fall in this region while injected signals never do. The C. Veto Strategy
template waveforms for this sector are chirps with in-
stantaneous frequencies that decrease with time, as il- For any candidate event found in the positive quad-
lustrated in Figure 2, instead of increasing like a normal rant (SP ) search, the application of GRITCLEAN involves
GW radiation-driven inspiral. a sequence of three veto steps as described below.
7

0%

25%

50%

90%

FIG. 2: The physical (blue) and unphysical (yellow) regions in the positive chirp time quadrant SP . The
wedge-shaped unphysical region on the left corresponds to negative chirp lengths. The decreasing instantaneous
frequency of signals in this sector is illustrated in the spectrogram shown as an inset. The remaining unphysical
region corresponds to masses with imaginary components. The black curves denote the boundaries for different
values of ζ (Eq. 29). The kite-shaped boundary corresponds to component masses in the range [1.4, 30]M⊙.

a. Chirp length veto – In the first step, we use the crosses a preset threshold. In terms of chirp times, this
estimated chirp times (τ0 , τ1.5 ) to calculate the chirp is expressed as,
length τ using Eq. 6. If the chirp length is negative, the
v
candidate is vetoed outright as a glitch. This hard cri- u  2 2 1/3
u 8 5π τ0
terion is motivated by our observation that an injected ζ =t 1− , (29)
5
5f∗ 4τ1.5
GW signal, regardless of its SNR or other parameters,
never appears in this small sector of SP , while it is well
populated with glitches. with the expression for the boundary for a particular
value of ζ written as,
b. Complex mass veto – If τ > 0, the next veto step
consists of examining the estimated binary component "  −3 #1/5
masses. Recall that a significant part of SP corresponds 5π 2 τ02 5f∗ 2
τ1.5 = (1 + ζ ) . (30)
to complex masses. Although a CBC signal will never 4 8
have chirp time parameters in this unphysical sector, the
global maximum of the fitness function and the corre- The corresponding curves for different values of ζ are
sponding estimated signal parameters may fall in this re- shown in Fig. 2.
gion because of the presence of noise in the data. There-
fore, in order to reduce the chances of falsely dismissing c. SN veto – For any event which bypasses the two
a GW signal, one should not reject outright a candidate previous vetoes, a second search is launched for the global
that falls in this unphysical region but adopt a softer cri- maximum of LG (y) in the negative chirp time quadrant
terion. Accordingly, in the second veto step, we reject SN . This furnishes a new set of estimated parameters,
b N , from the SN search, with Θ
Θ b P being estimated from
a candidate only if the ratio, ζ, of the imaginary to real
part in the inferred masses, the SP run, for a given data segment. Of these parame-
ters, we use the estimated TOA, b ta,P and b
ta,N , and the
SNR, ρbP and ρbN , to construct the TOA difference mag-
nitude
r

ζ= 1− , |∆ta | = |b
ta,P − b
ta,N | , (31)
M
8

10-23 10-23
1 1

h(t)
0
-1

-1 -2

Frequency (Hz) 1000 1000

500 500

0 0
88.2 88.6 89 89.4 88.5 89 89.5 90 90.5
Time (s) Time (s)

FIG. 3: (Left column) An increasing frequency chirp signal in the negative chirp time quadrant SN with chirp
length τ > 0 with the top and bottom panels showing the time series and spectrogram, respectively. The chirp times
for this signal are [τ0 , τ1.5 ] = [−0.6, −1.5]. (Right column) The time series (top) and spectrogram (bottom) of the
signal obtained by swapping the values of τ0 and τ1.5 . This yields a decreasing frequency chirp.

and the magnitude of the relative SNR difference Observing Start End Detector
Run GPS GPS
|b
ρP − ρbN | (sec) (sec)
|∆ρ| = . (32)
ρbP O1 1127002112 1127047168 H1

A candidate signal is vetoed in GRITCLEAN if it falls in O2 1171222528 1171263488 L1


a predefined area of the (|∆ρ|, |∆ta |) plane. The area
is obtained, as described in Sec. IV, using the observed 1243394048 1243508736 H1
O3a
distribution of injected signals in the (|∆ρ|, |∆ta |) plane. 1246453760 1246543872 L1
The intuitive motivation for using |∆ta | and |∆ρ| is
1258631168 1258749952 H1
that a GW signal is not expected to correlate well with O3b
1260441600 1260539904 L1
any template in SN since the latter are all decreasing fre-
quency chirps. This should lead to a large difference in
the estimated SNRs as well as other parameters. How- TABLE I: The detectors, the GPS start and end times,
ever, the only other parameter that is well-estimated and and the observing runs of the data used in this study.
that can be compared across the searches in SP and SN
is the TOA since the estimated chirp times have oppo-
site signs, leading to real and complex masses in the two dataset used in our study, the preconditioning applied to
quadrants, while the initial phase is a poorly estimated the data, and the GW signals that were injected to study
parameter subject to large errors. For a glitch, on the the safety of GRITCLEAN.
other hand, that does not closely resemble any of the bi-
nary inspiral templates in the physical sector of SP , one
expects no particular preference for a high correlation A. Data description
in either SP or SN . Therefore, the gap between the esti-
mated SNR and TOA for a glitch is likely to be small. As For this study, we used single detector LIGO data
we shall see in Sect. IV our results bear out this intuitive from the Gravitational Wave Open Science Center
argument quite well. (GWOSC) [38]. A total of ≈ 141 hours of data was
used, distributed across all available observing runs (O1,
O2, O3a and O3b), from the Hanford (H1) and Liv-
IV. RESULTS ingston (L1) detectors. All of the selected data passes
the CBC CAT3 flag, which means that the detectors were
We present here our main results on the performance operating in science-mode. More details about the data,
of GRITCLEAN. This is preceded by a description of the namely, the GPS times, the observing runs, and detec-
9

tors are summarized in Table I. By using data from all distributed in SNR and other parameters as described
of the observing runs and the two LIGO detectors, we below.
were able to test the performance of GRITCLEAN across a The low mass injections had each mass chosen indepen-
broad cross-section of the observed glitch population. dently from a uniform distribution over [1.4, 3]M⊙. For
The PSO-based GLRT, LG (y), was applied to the data intermediate mass injections, the lower and higher masses
above in were drawn from uniform distributions over [1.4, 3] M⊙
512 sec long segments overlapped by 64 sec. The last and [5, 10] M⊙ , respectively. For the high mass injec-
64 sec of the cross-correlation of a segment with a tem- tions, each mass was selected from a uniform mass dis-
plate are discarded to account for the corruption caused tribution over [10, 25]M⊙. For each signal injection, the
by the circular wrapping of the template in an FFT-based initial phase was uniformly sampled from a range of
cross-correlation. Of the resulting 1135 segments, three [0, 2π] and the TOA was randomly selected to be between
were dropped because they contained confirmed GW [50, 350] sec. The TOA range ensured that the injected
signals, namely, GW190706 222641, GW190707 093326, signal did not have a time of coalescence that leaked into
and GW191215 223052. Assuming that our search the overlap region between consecutive segments. The
method is not more sensitive than the flagship CBC SNRs of each signal was drawn independently of the other
search pipelines, this ensured that all candidate events parameters from uniform distributions over three differ-
detected by the PSO-based search in the remaining seg- ent ranges: 198 injections were drawn from [10, 40], which
ments, in the absence of injected signals, were either is representative of the SNR range of the GW signals that
glitches or false alarms arising from the Gaussian noise have been observed so far; 196 injections had a lower
component. SNR range of [10, 13]; 60 were injected with unrealisti-
For some of the segments, the candidate signal had cally high SNRs in [100, 500]. The last set of injections
a time of arrival ta in the part of the cross-correlation was intended to stress test the assumption in GRITCLEAN
output that was not discarded but a time of coalescence, that a GW signal will always show a large SNR contrast
ta + τ (c.f., Eq. 6), that fell in the discarded part. In in searches with physical and unphysical waveforms. It is
most of these cases, this happened due to a strong glitch conceivable that this assumption is violated for very loud
in the last 64 sec of a segment triggering a high cross- GW signals since they may induce a strong response from
correlation with a long-duration (> 64 sec) template. unphysical waveforms just as strong glitches can trigger
Clearly, the estimated ta , which is integral to GRITCLEAN, a high-SNR response from physical waveforms.
is invalid for such a candidate and the corresponding seg-
ment was dropped from further consideration. However,
this did not result in the loss of a glitch because it was C. Veto performance
caught, along with valid estimated parameters, in the
overlapped section at the start of the subsequent seg- Candidate events were obtained from the data seg-
ment. As a result of this cut, an additional 68 segments ments described in Sec. IV A using a detection threshold
were removed, reducing the duration of the analyzed data of ρbP ≥ 9.0 on the estimated SNR in the positive chirp
to ≈ 131 hours. Prior to carrying out the PSO-based time quadrant (SP ) search. (This threshold is indepen-
search, each data segment was high-pass filtered with a dent of the one used to select the segments for signal
cutoff frequency of 30 Hz to remove the seismic noise con- injection as described in Sec. IV B.) The probability of a
tribution. For every 512 sec data segment, each run (SP false alarm from Gaussian stationary noise alone would
and SN ) took ≈ 46 min on 8 cores of an AMD EPYC be very low (< 1 event/year) at such a high threshold [39]
7763 64-Core Processor running at 2.45 GHz. although real broadband noise (coupled with imperfect
whitening) may have a non-Gaussian tail [40] that ele-
vates this probability somewhat. Hence, from the seg-
B. Signal injections ments free of signal injections, most candidate events
are expected to be glitches and it is convenient to re-
Each segment in the final set above was first analyzed fer to them as such in the following. The objective of
with a positive quadrant, SP , search without any signal any veto method should be the rejection of as many of
injection and the candidate event, with one event per these glitches as possible. The set of candidate events
segment, was recorded. Next, to study the veto safety of found had 708 glitches and 450 injected signals. The
GRITCLEAN, simulated GW signals were injected in a ran- 4 injections that were lost, corresponding to a false dis-
domly chosen subset of segments in which the estimated missal probability of 0.8%, had injected SNRs in the low-
SNRs of the candidate event fell below a threshold of est range ([10, 13]) and estimated SNRs of 8.02, 8.31,
8.0, thereby ensuring that these segments either did not 8.90 and 7.86. While GRITCLEAN is applied in stages as
have glitches or had weak and inconsequential ones at described in Sec. III C, it is informative to apply the neg-
most. From the 321 such segments that were found, we ative chirp time quadrant (SN ) search to all the glitches
randomly picked, with replacement, 454 segments for sig- and obtain their |∆ta | (Eq. 31) and |∆ρ| (Eq. 32) val-
nal injection. The set of injected signals was comprised ues. Fig. 4 shows the distribution of all the glitches
of low, intermediate, and high mass systems that were in the (|∆ta |, |∆ρ|) plane along with the distribution of
10
14454
20

102
2291 10

Whitened strain
0
10
363 0

-2
10 57 -10

-4 9
-20
10 -8 -6 -4 -2 0
10 10 10 10 10 0 100 200 300 400 500
Time
FIG. 4: Scatterplot of glitches in the (|∆ρ|, |∆ta |) plane.
FIG. 5: Whitened time-series of a data segment
The color of a point represents, following the map
containing multiple glitches (high amplitude spikes).
shown in the colorbar, the estimated SNR (b ρP ) in SP of
The SP search detects the glitch located near 388 sec,
a glitch. The colormap is based on the range of
while the SN search detects the glitch near 268 sec. The
observed log(b
ρP ) divided into 5 bins.
estimated SNRs of both glitches were ≈ 66 in the
respective searches.

their SP estimated SNRs. In agreement with the intu-


itive motivation for choosing |∆ta | and |∆ρ| presented in accounting of broadband non-Gaussianity could mitigate
Sec. III C, most of the glitches cluster in the region with these types of glitches better.
small values of these quantities. However, a few distinct Fig. 6 presents the main results on the performance of
outliers are observed in the region defined by |∆ta | & 1. GRITCLEAN. They are conveniently summarized again in
We also see from Fig. 4 that there is no notable depen- terms of a scatterplot in the |∆ρ|, |∆ta | plane, with the
dence on SNR of the joint distribution in |∆ta | and |∆ρ| detected injections now included in addition to glitches.
in the main cluster of glitches. On the other hand, most The points have also been differentiated on the basis of
of the outliers have low SNRs. the different veto steps in GRITCLEAN, with glitches ve-
The origins of these outliers, deciphered by closer in- toed by the three steps described in Sec. III C shown with
spection of the data segments that contain them, were different colors. We have also shown the different types
found to be of 4 different types. Type-1– Multiple widely of outliers. With the two SP vetoes alone, namely chirp
separated strong glitches in the same data segment cause length and complex mass, 505 out of the 708 glitches, or
the SP and SN searches to detect different ones. An 71.4% of the total number of glitches, were vetoed with
example is shown in Fig. 5. Type-2– The SP and SN 0% rejection of injected signals. Out of these 505 glitches,
searches find events associated with a single but wide 351, or 49.6% of the total number of glitches, were vetoed
duration (O(10) sec) disturbance that causes them to by the chirp length veto alone. These two vetoes also re-
estimate similar SNRs but widely different TOAs. Type- jected 5 out of the 9 outliers discussed above. Fig. 7
3– The data segment has no obvious glitch, but creates shows the fraction of glitches that survive after the ap-
a false alarm in the SP search, possibly due to broad- plication of the chirp length and complex mass vetoes as
band non-Gaussianity mentioned earlier. Since there is a function of their estimated SNR ρbP . We see that each
no glitch, only broadband noise, the SN search finds an of the two SP vetoes had nearly a constant effectiveness,
event that is widely separated in TOA. Type-4– The SP in terms of the fraction of vetoed glitches, across nearly
search detects a weak glitch but the SN search does not the entire range of glitch SNRs.
and finds an event associated with broadband noise. This For the third veto step in GRITCLEAN based on the SN
causes a large discrepancy in both SNR and TOA. In the search, the observed distribution of detected injections
set of outlier glitches, there were 4, 2, 2, and 1 Type-1 to in the (|∆ρ|, |∆ta |) plane can be used to define an exclu-
Type-4 events, respectively. Type-1 and Type-2 events sion area such that any candidate event falling outside it
arise from a technical limitation of the current PSO-based is vetoed. In the present case, letting this area enclose
search since it only finds the global maximum of the fit- all detected injections defines its boundary as |∆ρ| ≥ 0.1
ness function and, hence, identifies only one candidate and |∆ta | ≥ 0.3. As seen in Fig. 6, imposing this cri-
per data segment in a SP or SN search. Strictly speaking, terion rejects all remaining glitches except for the sin-
Type-3 glitches are simply false alarms whose prevalence gle Type-4 outlier. This event was a marginal detection
depends on the detection threshold used for the SP search in the SP search with an estimated SNR = 9.12 and,
and the non-Gaussianity of the broadband noise distribu- as shown in Fig. 8, arises from a visible low-frequency
tion. Future improvements to the PSO-based search that glitch at the edge of the high-pass filter cutoff. It by-
allow multiple candidate events per segment and better passed both the chirp length and complex mass vetoes
11

FIG. 6: Results obtained by implementing GRITCLEAN on candidate events obtained with a detection threshold of
SNR = 9.0. Glitches vetoed by the chirp length and complex mass veto steps of GRITCLEAN in the positive chirp
time quadrant (SP ) are shown with pink and magenta colors, respectively. The remaining glitch events that bypass
the SP vetoes are shown in red. The exclusion area for the SN veto step, defined by |∆ρ| ≥ 0.1 and |∆ta | ≥ 0.3 is
shown as a rectangle shaded in green. Different marker shapes for the glitches correspond to different types of
outliers, as noted in the legend of the plot. Candidate events from injected signals are shown in dark blue, yellow,
and cyan colors, corresponding to the high, realistic, and low SNR ranges, respectively. Different marker shapes for
these events correspond to different mass ranges, as noted in the legend. The single glitch that falls into the
exclusion zone and is not vetoed by the SP vetoes is a Type-4 outlier shown by the red asterisk.

in SP , with estimated masses of 4.33M⊙ and 32.75M⊙.


Such glitches, although rare, may be an important con-
taminant in CBC searches, and it would be worth charac-
10
2
terizing their population with much larger datasets. In
addition to the 2-dimensional distribution of candidate
Counts

events in the (|∆ρ|, |∆ta |) plane, it is illuminating to ex-


amine their 3-dimensional distribution, shown in Fig. 9,
10
1 with the third dimension being log (b ρP ). The segrega-
tion of the injections in SNR is more apparent whereas
they overlap in the 2-dimensional plot. The glitches are
well separated from the injections and form a diffuse and
10
0 unstructured cloud of points. On the other hand, the
1 1.5 2 2.5 3 3.5 4
cloud of injection points is highly structured. This is re-
vealed more clearly in Fig. 10 where the 3-dimensional
FIG. 7: The survival function for estimated SNRs in the distribution is projected onto the (|∆ρ|, log(b ρP )) plane:
SP search, ρbP , of all glitches (blue), glitches that bypass we see that the injections lie in the neighborhood of
the chirp length veto step (orange), and glitches that a 2-dimensional surface embedded in the 3-dimensional
bypass all positive chirp time quadrant (SP ) vetoes space. The elucidation of the origin of this structured
(yellow). For any given value of ρbP , the survival distribution is left to future work.
function provides the number of glitches with estimated
SNRs above this value. With 99.9% of glitches vetoed successfully, our results
demonstrate that GRITCLEAN is highly effective. This
12

140
ζ Glitches missed Injections rejected
120 (%) (%) (%)
10 18.7 15.1
100
25 19.7 4.4
Frequency

80 50 22.1 0.4
60
90 28.6 0.0
100 31.9 0.0
40

20 TABLE II: Effect of the threshold on ζ (Eq. 29) on the


0 fraction of glitches (second column) not rejected and
135 140 145 150 155 the fraction of detected injections (third column)
Time
rejected by the complex mass veto.
FIG. 8: Logarithm of the spectrogram of a short
segment of whitened data containing the glitch, a
Type-4 outlier, that survived GRITCLEAN. The positive
was achieved with with 0% loss of detected injections,
chirp time quadrant (SP ) search detects the
which shows that GRITCLEAN also has a high level of veto
low-frequency low-SNR glitch near 145 sec while the
safety. In terms of the inverse false alarm rate (IFAR)
negative chirp time quadrant (SN ) search produces only
metric used for detection confidence in GW discoveries,
a broadband noise-induced event that is widely
the IFAR was reduced by the application of GRITCLEAN
separated in time.
from 1 false alarm every 11.10 min to 1 false alarm
every 5.45 days. The breakdown of the veto fraction
across the three veto steps shows that the SN veto step
in GRITCLEAN is essential as it is required to eliminate
≈ 30% of glitches. At the same time, this shows that
it is not needed for the majority of candidate events,
thereby considerably reducing its impact on the overall
computational cost of the search. The tunable param-
eters in GRITCLEAN are (i) the threshold on ζ (Eq. 29)
in the complex mass veto step, (ii) ranges of τ0 and τ1.5
defining the SN space, and (iii) the thresholds on |∆ρ|
and |∆ta | in the SN veto step. (The chirp length veto
step has no free parameters.) For a PSO-based search,
it is straightforward to set the ranges of τ0 and τ1.5 in
SN to be the same as those for SP since there is no im-
FIG. 9: Distribution of candidate events in the pact of smaller ranges on the computational cost of the
3-dimensional (|∆ta |, |∆ρ|, log (b
ρP )) space. Red dots SN search. The remaining parameters can be fixed using
show glitches while injected signals are shown in the the distribution of injections alone. For example, we set
same color scheme as used in Fig. 6. the thresholds on |∆ρ| and |∆ta | above such that there
was 0% loss of detected injections. Similarly, as shown
in Table II, we determined the fraction of detected injec-
tions vetoed by different thresholds on ζ and determined
that it should be set at 90% in order to retain all injec-
tions. Table II also shows the fraction of glitches that are
missed by the complex mass veto for different thresholds
on ζ. One can imagine a scenario in which the threshold
is lowered to tolerate a higher fraction of rejected injec-
tions while reducing computational costs due to a less
frequent application of the SN search.
With an effective glitch veto strategy in hand, the de-
tection threshold itself can be lowered to catch weaker
signals. Fig. 11 shows the results when the detection
FIG. 10: Distribution of candidate events in threshold is lowered to ρbP ≥ 8.0. As intended, this
(log (b
ρP ), |∆ρ|) space. Red dots show glitches while allows the detection of 3 out of the 4 injected signals
injected signals are shown in the same color scheme as that had been missed with the higher threshold. The
used in Fig. 6. The shaded region is bounded from number of crossings of the detection threshold now in-
below by the detection threshold and from left by the crease to 762 candidate events from the segments with-
threshold on |∆ρ|. out signal injections. (One expects that there will be
more contamination of this set by false alarms arising
13

from the broadband noise, but we will continue to call negative chirp time quadrant, SN , to separate glitches
them all glitches.) GRITCLEAN was applied as before, from CBC signals in a matched filter search. We have
with all settings kept the same except for the exclusion tested the effectiveness of our strategy using ≈ 131 hours
area in the (|∆ρ|, |∆ta |) plane, which must be enlarged of single-detector LIGO data. Our results show that
to |∆ρ| ≥ 0.07 and |∆ta | ≥ 0.1 in order to retain all the GRITCLEAN is a highly effective veto method that can suc-
detected injections. Now, the number of glitches that cessfully reject glitches that impact CBC searches while
are missed rises to 6 from 1, reducing the fraction of ve- maintaining a high level of veto safety. The search over
toed glitches to 99.2%. The loss of detected injections the unphysical sectors of SP is already integrated in a
remains at 0% after the application of GRITCLEAN, main- PSO-based approach and does not add any computa-
taining its high safety. While the IFAR of the weakest tional burden. The search over SN incurs additional com-
detected injection increases in correspondence with the putational cost but need only be used for ≈ 30% of can-
larger number of glitches, the reduction in IFAR due to didate events. For the detection threshold of SNR = 9.0,
GRITCLEAN is still quite large: from 1 false alarm every GRITCLEAN eliminated all glitches except one and brought
10.31 min before vetoing to 1 false alarm every 21.8 hours down the IFAR due to glitches by a factor of ≈ 707. It
after. Fig. 11 also shows the events that did not cross should be emphasized here that this large reduction was
the detection threshold. Most of such events arise from obtained from just a single-detector search. Supplement-
the broadband noise rather than glitches, like the Type-3 ing with a multi-detector coincidence veto will reduce the
outlier discussed earlier. We see that such noise-induced IFAR even further. The gain from using GRITCLEAN in
events occupy the same region of the (|∆ρ|, |∆ta |) plane such a scenario will be quantified in future work.
as the outlier glitches and that this is well separated In this study, a common set of injected signals was used
from the main cluster of glitches. Lowering the detec- for tuning the parameters of GRITCLEAN. However, the
tion threshold further and expanding the exclusion zone injections needed to tune the complex mass veto thresh-
to retain weaker injected signals will uncover more of this old ζ should be more specialized. This is because the
population of noise-induced candidate events, which are problem of complex estimated masses mainly affects the
more likely to bypass GRITCLEAN and, for that matter, signals located on the boundary of the unphysical region
any other glitch veto method. of complex masses, which is given by the curve of equal
One way in which GRITCLEAN differs from χ2 -vetoes is mass binaries (c.f Fig. 2). Therefore, a more precise de-
that the latter depend primarily on the norm of the data termination of ζ for a given fractional loss of injected
projection on the null space of a given template, while signals requires a much larger number of equal-mass in-
GRITCLEAN uses both the projection norms, encapsulated jections than were present in the common injection set.
in |∆ρ|, as well as the estimated TOAs, through |∆ta |, As a result, we used a conservative value for ζ that in-
of the best fit physical and unphysical templates. Fig. 6 curred a larger fraction of missed glitches in the complex
and Fig. 11 illustrate the importance of using |∆ta | in ad- mass veto step. While future applications of GRITCLEAN
dition to |∆ρ| in GRITCLEAN. If the threshold on |∆ta | is will improve in this aspect, our current results are not
removed, one can see that the number of glitches missed significantly affected given that the SN veto step would
(red colored points) increase by 2 and 7 in Fig. 6 and have trapped any glitch that escaped the complex mass
Fig. 11, respectively, thereby increasing the false alarm veto due to a lower ζ.
rate due to glitches without a concomitant increase in the From the outliers in the distribution of glitches across
number of detected injections. Another clear impact of the (|∆ρ|, |∆ta |) plane, we found that the majority are
using |∆ta |, seen in Fig. 11, is the separation of glitches associated with a technical limitation of the current PSO-
from below-threshold events (gray dots), the majority of based search that allows only one event to be identified
which would be false alarms from broadband noise and per data segment. This limitation will be lifted in ongo-
not real glitches. The separation includes both strong ing improvements to the PSO-based CBC search that will
and weak (but above threshold) glitches since, as dis- enable the identification of all local maxima of the fitness
cussed in the context of Fig. 4, their distribution in the function above a given detection threshold rather than
(|∆ρ|, |∆ta |) plane does not depend strongly on their es- just the global maximum. To aid reproducibility of our
timated SNR. As the SNR of GW signals is reduced, they work, we have provided the dataset of candidate events
will migrate in this plane to the left, as can be seen by the found in our study, the codes for PSO-based matched-
progression of the colored bands of the injected signals, filtering, and the scripts for producing some of the main
but not towards the bottom where most of the glitches plots in a public data release on Zenodo [41].
reside.

ACKNOWLEDGMENTS
V. CONCLUSIONS
This work is supported by NSF Grant No. PHY-
We have presented a veto method called GRITCLEAN 2207935. We acknowledge the Texas Advanced Comput-
that exploits the unphysical regions of the positive chirp ing Center (TACC) at the University of Texas at Austin
time quadrant, SP , and unphysical waveforms in the (www.tacc.utexas.edu) for providing high performance
14

FIG. 11: Results obtained by implementing GRITCLEAN on candidate events obtained with a detection threshold of
SNR = 8.0. Glitches vetoed by the chirp length and complex mass veto steps of GRITCLEAN in the positive chirp
time quadrant (SP ) are shown with pink and magenta colors, respectively. The remaining glitch events that bypass
the SP vetoes are shown in red. The exclusion area for the SN veto step, defined by |∆ρ| ≥ 0.07 and |∆ta | ≥ 0.1 is
shown as the green shaded rectangle. Candidate events from injected signals are shown in dark blue, yellow, and
cyan colors, corresponding to the high, realistic, and low SNR ranges, respectively. Events that fell below the
detection threshold are shown in gray color.

computing resources. This research has made use of data Science Center (gwosc.org), a service of the LIGO Sci-
or software obtained from the Gravitational Wave Open entific Collaboration, the Virgo Collaboration, and KA-
GRA.

[1] B. Abbott et al., “Ligo: the laser interferometer 2021.


gravitational-wave observatory,” Reports on Progress in [6] B. P. Abbott et al., “GW170817: Observation of gravita-
Physics, vol. 72, p. 076901, 2009. tional waves from a binary neutron star inspiral,” Phys-
[2] B. P. Abbott et al., “Observation of gravitational waves ical Review Letters, vol. 119, p. 161101, Oct 2017.
from a binary black hole merger,” Physical Review Let- [7] “Kagra: 2.5 generation interferometric gravitational
ters, vol. 116, p. 061102, Feb 2016. wave detector,” Nature Astronomy, vol. 3, no. 1, pp. 35–
[3] F. Acernese et al., “Status of Virgo,” Classical and 40, 2019.
Quantum Gravity, vol. 25, p. 114045, 2008. [8] C. S. Unnikrishnan, “IndIGO and Ligo-India Scope and
[4] B. Abbott, R. Abbott, T. Abbott, S. Abraham, F. Acer- Plans for Gravitational Wave Research and Precision
nese, K. Ackley, C. Adams, R. Adhikari, V. Adya, C. Af- Metrology in India,” International Journal of Modern
feldt, et al., “Gwtc-1: A gravitational-wave transient Physics D, vol. 22, p. 1341010, Jan. 2013.
catalog of compact binary mergers observed by ligo and [9] D. Davis, T. Littenberg, I. Romero-Shaw, M. Millhouse,
virgo during the first and second observing runs,” Phys- J. McIver, F. Di Renzo, and G. Ashton, “Subtracting
ical Review X, vol. 9, no. 3, p. 031040, 2019. glitches from gravitational-wave detector data during the
[5] The LIGO Scientific Collaboration, the Virgo Collabo- third observing run,” arXiv preprint arXiv:2207.03429,
ration, the KAGRA Collaboration, R. Abbott, et al., 2022.
“GWTC-3: Compact Binary Coalescences Observed by [10] D. George, H. Shen, and E. A. Huerta, “Classification
LIGO and Virgo During the Second Part of the Third and unsupervised clustering of ligo data with deep trans-
Observing Run,” arXiv e-prints, p. arXiv:2111.03606, fer learning,” Phys. Rev. D, vol. 97, p. 101501, 2018.
15

[11] R. Biswas, L. Blackburn, J. Cao, R. Essick, K. A. spline fitting,” Classical and Quantum Gravity, vol. 40,
Hodge, E. Katsavounidis, K. Kim, Y.-M. Kim, E.-O. p. 125001, may 2023.
Le Bigot, C.-H. Lee, J. J. Oh, S. H. Oh, E. J. Son, [25] J. Merritt, B. Farr, R. Hur, B. Edelman, and Z. Doc-
Y. Tao, R. Vaulin, and X. Wang, “Application of ma- tor, “Transient glitch mitigation in advanced ligo data,”
chine learning algorithms to the study of noise arti- Physical Review D, vol. 104, no. 10, p. 102004, 2021.
facts in gravitational-wave data,” Phys. Rev. D, vol. 88, [26] R. Bondarescu, A. Lundgren, and R. Macas, “Antiglitch:
p. 062003, Sep 2013. a quasi-physical model for removing short glitches from
[12] S. Bahaadini, V. Noroozi, N. Rohani, S. Coughlin, ligo and virgo data,” 2023.
M. Zevin, J. Smith, V. Kalogera, and A. Katsaggelos, [27] J. Kennedy and R. C. Eberhart, “Particle swarm op-
“Machine learning for gravity spy: Glitch classification timization,” in Proceedings of the IEEE International
and dataset,” Information Sciences, vol. 444, pp. 172– Conference on Neural Networks: Perth, WA, Australia,
186, 2018. vol. 4, p. 1942, IEEE, 1995.
[13] S. Mukherjee, R. Obaid, and B. Matkarimov, “Classifi- [28] T. S. Weerathunga and S. D. Mohanty, “Performance of
cation of glitch waveforms in gravitational wave detector particle swarm optimization on the fully-coherent all-sky
characterization,” Journal of Physics: Conference Series, search for gravitational waves from compact binary coa-
vol. 243, p. 012006, aug 2010. lescences,” Physical Review D, vol. 95, no. 12, p. 124030,
[14] M. Zevin, S. Coughlin, S. Bahaadini, E. Besler, N. Ro- 2017.
hani, S. Allen, M. Cabero, K. Crowston, A. K. Kat- [29] M. E. Normandin and S. D. Mohanty, “Towards a
saggelos, S. L. Larson, et al., “Gravity spy: integrating real-time fully-coherent all-sky search for gravitational
advanced ligo detector characterization, machine learn- waves from compact binary coalescences using particle
ing, and citizen science,” Classical and Quantum Gravity, swarm optimization,” Physical Review D, vol. 101, no. 8,
vol. 34, no. 6, p. 064003, 2017. p. 082001, 2020.
[15] B. Allen, “χ2 time-frequency discriminator for gravi- [30] L. Blanchet, T. Damour, B. R. Iyer, C. M. Will, and
tational wave detection,” Physical Review D, vol. 71, A. G. Wiseman, “Gravitational-radiation damping of
p. 062001, Mar 2005. compact binary systems to second post-newtonian or-
[16] I. W. Harry and S. Fairhurst, “Targeted coherent search der,” Physical Review Letters, vol. 74, pp. 3515–3518,
for gravitational waves from compact binary coales- May 1995.
cences,” Phys. Rev. D, vol. 83, p. 084002, Apr 2011. [31] B. S. Sathyaprakash and S. V. Dhurandhar, “Choice of
[17] A. H. Nitz, “Distinguishing short duration noise tran- filters for the detection of gravitational waves from coa-
sients in ligo data to improve the pycbc search for grav- lescing binaries,” Physical Review D, vol. 44, pp. 3819–
itational waves from high mass binary black hole merg- 3834, Dec 1991.
ers,” arXiv preprint arXiv:1709.08974, 2017. [32] E. O. Brigham and R. E. Morrow, “The fast fourier trans-
[18] S. Choudhary, S. Bose, P. Joshi, and S. Dhurandhar, “Im- form,” IEEE Spectrum, vol. 4, no. 12, pp. 63–70, 1967.
proved binary black hole searches through better discrim- [33] B. F. Schutz, Data processing, analysis, and storage for
ination against noise transients,” 12 2022. interferometric antennas, p. 406–452. Cambridge Univer-
[19] S. Sachdev, S. Caudill, H. Fong, R. K. Lo, C. Messick, sity Press, 1991.
D. Mukherjee, R. Magee, L. Tsukada, K. Blackburn, [34] D. Bratton and J. Kennedy, “Defining a standard for par-
P. Brady, et al., “The gstlal search analysis methods for ticle swarm optimization,” in Swarm Intelligence Sympo-
compact binary mergers in advanced ligo’s second and sium, 2007. SIS 2007. IEEE, pp. 120–127, IEEE, 2007.
advanced virgo’s first observing runs,” arXiv preprint [35] J. Robinson and Y. Rahmat-Samii, “Particle swarm op-
arXiv:1901.08580, 2019. timization in electromagnetics,” IEEE Transactions on
[20] G. Ashton, S. Thiele, Y. Lecoeuche, J. McIver, and L. K. Antennas and Propagation, vol. 52, no. 2, pp. 397–407,
Nuttall, “Parameterised population models of transient 2004.
non-gaussian noise in the ligo gravitational-wave detec- [36] S. D. Mohanty, Swarm Intelligence Methods for Statisti-
tors,” Classical and Quantum Gravity, vol. 39, p. 175004, cal Regression. Chapman and Hall/CRC, 2018.
Aug. 2022. [37] The Mathworks, Inc., Natick, Massachusetts, MATLAB
[21] N. J. Cornish and T. B. Littenberg, “Bayeswave: version 9.12.0.2327980 (R2022a) Update 7, 2022.
Bayesian inference for gravitational wave bursts and [38] R. A. et. al, “Open data from the third observing run of
instrument glitches,” Classical and Quantum Gravity, ligo, virgo, kagra, and geo,” The Astrophysical Journal
vol. 32, no. 13, p. 135012, 2015. Supplement Series, vol. 267, p. 29, jul 2023.
[22] C. Pankow, K. Chatziioannou, E. A. Chase, T. B. Litten- [39] S. D. Mohanty and S. V. Dhurandhar, “Hierarchi-
berg, M. Evans, J. McIver, N. J. Cornish, C.-J. Haster, cal search strategy for the detection of gravitational
J. Kanner, V. Raymond, S. Vitale, and A. Zimmer- waves from coalescing binaries,” Phys. Rev. D, vol. 54,
man, “Mitigation of the instrumental noise transient in pp. 7108–7128, Dec 1996.
gravitational-wave data surrounding gw170817,” Phys. [40] T. Yamamoto, K. Hayama, S. Mano, Y. Itoh, and
Rev. D, vol. 98, p. 084016, Oct 2018. N. Kanda, “Characterization of non-gaussianity in grav-
[23] S. D. Mohanty and E. Fahnestock, “Adaptive spline fit- itational wave detector noise,” Phys. Rev. D, vol. 93,
ting with particle swarm optimization,” Computational p. 082005, Apr 2016.
Statistics, pp. 1–37, 2020. [41] R. Girgaonkar, “Gritclean code and files – part 1,” 2024.
[24] S. D. Mohanty and M. A. T. Chowdhury, “Glitch sub- Zenodo. DOI: 10.5281/zenodo.10547476
traction from gravitational wave data using adaptive

You might also like