Professional Documents
Culture Documents
1, ∗ 1, 2, †
Raghav Girgaonkar, and Soumya D. Mohanty,
1
Department of Physics and Astronomy, University of Texas Rio Grande Valley,
One West University Blvd., Brownsville, Texas 78520
2
Department of Physics, IIT Hyderabad, Kandi, Telangana-502284, India
Transient signals arising from instrumental or environmental factors, commonly referred to as
glitches, constitute the predominant background of false alarms in the detection of gravitational
waves in data collected from ground-based detectors. Therefore, effective data analysis methods for
vetoing glitch-induced false alarms are crucial to enhancing the sensitivity of a search for gravita-
tional waves. We present a veto method for glitches that impact matched filtering-based searches
for binary inspiral signals. The veto uses unphysical sectors in the space of chirp time parameters as
well as an unphysical extension including negative chirp times to efficiently segregate glitches from
arXiv:2401.15237v1 [gr-qc] 26 Jan 2024
gravitational wave signals in data from a single detector. Inhabited predominantly by glitches but
nearly depopulated of genuine gravitational wave signals, these unphysical sectors can be efficiently
explored using Particle Swarm Optimization. In a test carried out on data taken from both LIGO
detectors spanning multiple observation runs, the veto was able to reject 99.9% of glitches with no
loss of injected signals detected with a signal-to-noise ratio ≥ 9.0. Our results show that extending
a matched filter search to unphysical parts of a signal parameter space promises to be an effective
strategy for mitigating glitches.
nals do not constitute a vector space in themselves, in the not. However, unlike the χ2 -veto, we do not pick the un-
sense that a linear combination of two CBC signals is not physical waveforms from the null subspace of a candidate
another CBC signal, which makes the null space (and its signal or tune them for specific glitch classes. In addi-
subspaces) dependent on the unknown parameters of a tion, besides using the norm of the data projection on
signal in given data. In addition, computational consid- unphysical templates as done in χ2 -vetoes, the method
erations severely limit the dimensionality of the chosen also uses the times of arrival of a signal estimated with
subspace, which creates a vast number of possible choices physical and unphysical templates.
for the subspaces. It has been shown [17, 18] that the To create unphysical waveforms, we use the space of
freedom of choosing the subspace can be exploited to chirp time parameters that are obtained by an injec-
maximize projections for specific classes of glitches, re- tive but non-surjective mapping of the binary component
sulting in improvements in the detection sensitivity for masses. This makes the chirp time space larger than the
high-mass CBC signals [4]. However, this could also re- mass parameter space and, hence, splits it into physi-
quire re-tuning the method for different glitch classes or cal and unphysical regions. Furthermore, the unphysical
when the classes themselves change due to changes in the sector can be significantly enlarged using negative chirp
detectors and their environment across observation runs. time values, allowing a larger population of glitches to be
A different veto strategy is implemented in the GstLAL- trapped and vetoed. The segregation of the estimated pa-
based inspiral pipeline [19] that uses a gating method to rameters of glitches and injected GW signals using physi-
remove glitches that are strong enough to significantly cal and unphysical templates is seen to be strong enough
increase the estimated variance of whitened data from to veto glitch events in real data across a wide range
the expected value of unity. of Signal-to-Noise Ratios (SNRs). The lack of any as-
Additionally, GstLAL also employs a signal- sumptions in the method about glitch classes ensures its
consistency veto in which the time series of the usefulness irrespective of changes in the makeup of the
matched filter output around its peak value is compared glitch population from one observation run to another.
with that expected from a real signal. In [20], it was Searching over unphysical regions of the parameter
shown that several classes of glitches occupy extreme space incurs a significant additional computational bur-
parts of the parameter space of BBH mergers under den in a conventional matched filtering strategy based
matched filtering where no GW signals have been on a bank of pre-computed waveforms. However, this
observed, thereby suggesting a possible veto based on problem is mitigated in a stochastic search strategy for
mapping the distribution of glitches in this space. binary inspiral signals implemented with Particle Swarm
In addition to the above vetoes that are applied to Optimization (PSO) [27–29]. In fact, the inclusion of
the output of a search pipeline, several approaches have unphysical regions in positive chirp time space is an in-
been explored in which the waveform of a glitch is esti- herent part of such a search that incurs no additional
mated and subtracted from the data before conducting computational cost. This is because PSO works best for
a GW search. Glitch subtraction using BayesWave [21], hypercubical search spaces while the physical region in
a wavelet-based method that uses data from a detector positive chirp time space is non-hypercubical. Hence,
network to distinguish between a GW signal and a glitch, the search space for PSO must necessarily extend to the
was used in parameter estimation for GW170817 [22] as bounding hypercube of the physical region and include
well as several subsequent signals [9]. Adaptive spline non-physical sectors.
fitting implemented in SHAPES [23] can fit out different The search over the negative chirp time space, on
types of glitches [24] in the presence of an overlapping the other hand, increases the computational cost of
CBC signal such as GW170817. GRITCLEAN but not strongly, since it is activated only
Glitschen [25] uses parameterized waveform models to when a candidate event bypasses the veto in the positive
describe identified glitch classes using principal compo- chirp time search.
nent analysis. The Antiglitch method[26] uses empirical We tested the performance of GRITCLEAN on ≈ 131
waveform models for the four most commonly occurring hours of data, free of any observed GW signals, taken
classes of short-duration glitches to implement a matched from the Hanford (H1) and Livingston (L1) detectors
filter for detecting and subtracting them out. across different observing runs. We used a PSO-based
In this paper we present a novel glitch veto called search with templates and injected signals belonging to
GRITCLEAN (Glitch Rejection using Illegal Templates and the restricted 2PN waveform family [30]. Using a detec-
Cross-Linking of Event Attribute Numbers) that is based tion threshold of SNR = 9.0, we found that GRITCLEAN
on extending a matched filtering search to unphysical was able to reject 99.9% of candidate events, most of
parts of the CBC signal parameter space where the sig- them glitches since none were GW signals, that crossed
nal waveforms can never arise from a gravitational radi- this threshold. The safety of GRITCLEAN was tested on
ation driven inspiral of two point particles. Similar to CBC signals covering a wide mass and SNR range in-
the χ2 -veto, it exploits the key idea that a CBC signal is jected in the same data. It was found that GRITCLEAN did
expected to present very different responses when corre- not reject any of the injections that passed the detection
lated with physical and unphysical CBC templates while threshold. It is important to note that this performance
a glitch, which does not resemble a CBC signal, may is achieved with a single-detector analysis alone. The
3
ta and the phase of the signal at t = ta is the initial phase In GLRT, the detection statistic is defined as,
φ0 . The combination
LG (y) = max LLR(y, Θ) . (14)
Θ
τ = τ0 − τ1.5 + τ1 + τ2 , (6)
Formally, a signal detection is declared if LG (y) > η for
called the chirp length, approximates the observed dura- a preset threshold η. In such a case,
tion of the signal and is a key quantity in GRITCLEAN:
For a physically valid waveform, one must have τ > 0. b = argmax LLR(y, Θ) ,
Θ (15)
The restricted 2PN signals can be parameterized either Θ
in terms of the binary component masses or any two of
the chirp times. It is convenient to choose τ0 and τ1.5 as provides the Maximum Likelihood Estimate (MLE) of
the independent parameters since they can be inverted the unknown parameters of the signal present in the data.
analytically to obtain M and µ, hence m1 and m2 , b for a given
Henceforth, we will call the pair (LG (y), Θ)
data segment y an event. In real data, which may have
13 −1 non-GW signals or non-Gaussian noise, the result of the
1 5 G
µ= 2 , (7) GLRT is not immediately deemed to be a detection even
16f∗2
4π 4 τ0 τ1.5 c3
−1 if LG (y) > η. Instead, the occurence of LG (y) > η pro-
5 τ1.5 G vides only a candidate event for the given y that must
M= 2
, (8)
32f∗ π τ0 c3 pass additional checks, namely, the vetoes before it can
p be declared to be a GW detection.
M − M 2 − 4µM
m1 = , (9) For the single detector case considered in this paper,
p 2 F+,× , ι, and A fold into an overall amplitude and a re-
M + M 2 − 4µM definition of the initial phase φ0 . Splitting the set of
m2 = , (10)
2 parameters as Θ = Θ′ ∪ {ρ, φ0 }, where Θ′ = {τ0 , τ1.5 , ta }
and ρ = ks(Θ)k, the signal can be expressed as,
where we choose the convention m1 < m2 . The values
of M and µ above can in turn be used to derive the s(Θ) = ρq(Θ′ , φ0 ) , (16)
remaining chirp times τ1 and τ2 .
s(Θ)
q(Θ′ , φ0 ) = , ⇒ kq(Θ′ , φ0 )k = 1 . (17)
ks(Θ)k
B. The GLRT and MLE
Here, ρ defines the SNR s(Θ) and q(Θ′ , φ0 ) is called the
template waveform.
While not true in general for real data, the assumption In terms of these parameters, the GLRT can be com-
of n(t) being a Gaussian noise process allows us to derive puted as,
the log-likelihood ratio of a finite duration segment of the
data as, ′ 1 2
LG (y) = max max ρhy, q(Θ , φ0 )i − ρ . (18)
1 Θ′ ρ,φ0 2
LLR(y, Θ) = hy, s(Θ)i − ||s(Θ)||2 , (11)
2 Substituting the solution of the inner maximization,
where for any continuous time function x(t), x = namely,
(x0 , x1 , . . . , xN −1 ) is the row vector of sampled values
xk = x(tk ), with tk = k/fs and fs being the sampling ρb = hy, q(Θ′ , φb0 )i , (19)
frequency, and hy, q 0 (Θ′ )i
φb0 = arctan , (20)
hy, q 1 (Θ′ )i
−1 T
hx, yi = xC y , (12)
q 0 (Θ′ ) = q(Θ′ , 0) , (21)
is the inner product based on the covariance matrix C of q 1 (Θ′ ) = q(Θ′ , π/2) , (22)
the noise n in y, with Cij = E[ni , nj ], and kxk2 = hx, xi
is the norm induced by the inner product. Here, Θ is the in Eq. 18 and using another split, Θ′ = θ ∪ {ta } and θ =
set of parameters that describe the 2PN signal. Under {τ0 , τ1.5 }, the expression for the GLRT statistic reduces
the assumption of wide-sense stationarity of the noise, to
the inner product above is conveniently expressed as
LG (y) = max λ(θ) , (23)
† θ
hx, yi = x
e ye./S n , (13) λ(θ) = max hy, q 0 (θ, ta )i2 +
ta
where x e is the Discrete Fourier Transform (DFT) of x, hy, q 1 (θ, ta )i2 ] . (24)
‘./’ denotes element-wise division, and S n is the two-sided
power spectral density (PSD) Sn (f ) sampled at the DFT For fixed θ, the maximization over ta can be performed
frequencies. by simply evaluating hy, q 0,1 (θ, ta )i at values of ta equal
5
quadrant. This is not possible in a search over the mass B. Negative chirp time Quadrant
parameter space, since every point corresponds to a phys-
ical signal waveform. We use this property of the chirp While the chirp time parameters are never negative
time space in our veto scheme as described in the follow- by construction, nothing prevents us from using negative
ing section. values for them in the phase Ψ(f ) in Eq. 5. The resulting
waveform in the time domain does not correspond to any
physical binary inspiral but remains a valid real-valued
III. GLITCH VETO function of time in itself. Therefore, extending a PSO-
based search to SN provides a significant enlargement
of the space of unphysical template waveforms for use
As outlined earlier, GRITCLEAN uses physical and un- in GRITCLEAN. At the same time, no additional ad hoc
physical template waveforms to probe the nature of a parameters are introduced in the overall veto strategy,
candidate event. In this section, we first define physi- such as assumptions about specific glitch classes, besides
cal and unphysical templates, and then present the veto the search range in SN .
strategy. While the negative quadrant of the chirp time We define the template waveforms in SN using a few
space, τ0 < 0, τ1,5 < 0, denoted as SN , consists entirely modifications to the expressions in Sec.( II A). To calcu-
of unphysical templates, the positive quadrant, τ0 > 0, late the values of τ1 and τ2 from negative values of τ0 and
τ1.5 > 0, denoted as SP , has both types of templates. At τ1.5 , M and µ are obtained as,
the same time, a part of SN must be treated specially
13 −1
because the templates in that part can resemble binary 1 5 G
inspiral signals despite being unphysical. These details µ= , (26)
16f∗2 4π 4 |τ0 ||τ1.5 |2 c3
are presented separately for the two quadrants below. −1
5 |τ1.5 | G
M= 2
. (27)
32f∗ π |τ0 | c3
A. Positive chirp time Quadrant The chirp times τ1 and τ2 are obtained from M and µ as
before (c.f., Eq. 9 and Eq. 10) but both are multiplied by
The quadrant SP , has two different types of unphys- −1, thus making all the chirp time parameters negative.
ical sectors. The first is one in which the inversion of In addition, we modify Ψ(f ) to,
(τ0 , τ1.5 ) to the masses of binary components (m1 , m2 ) 4 (j−5)/3
π X f
yields complex values. From Eq. 9 and Eq.10, this hap- Ψ(f ) = 2πf ta − φ0 − + αj (28)
pens when 4 j=0 f∗
0%
25%
50%
90%
FIG. 2: The physical (blue) and unphysical (yellow) regions in the positive chirp time quadrant SP . The
wedge-shaped unphysical region on the left corresponds to negative chirp lengths. The decreasing instantaneous
frequency of signals in this sector is illustrated in the spectrogram shown as an inset. The remaining unphysical
region corresponds to masses with imaginary components. The black curves denote the boundaries for different
values of ζ (Eq. 29). The kite-shaped boundary corresponds to component masses in the range [1.4, 30]M⊙.
a. Chirp length veto – In the first step, we use the crosses a preset threshold. In terms of chirp times, this
estimated chirp times (τ0 , τ1.5 ) to calculate the chirp is expressed as,
length τ using Eq. 6. If the chirp length is negative, the
v
candidate is vetoed outright as a glitch. This hard cri- u 2 2 1/3
u 8 5π τ0
terion is motivated by our observation that an injected ζ =t 1− , (29)
5
5f∗ 4τ1.5
GW signal, regardless of its SNR or other parameters,
never appears in this small sector of SP , while it is well
populated with glitches. with the expression for the boundary for a particular
value of ζ written as,
b. Complex mass veto – If τ > 0, the next veto step
consists of examining the estimated binary component " −3 #1/5
masses. Recall that a significant part of SP corresponds 5π 2 τ02 5f∗ 2
τ1.5 = (1 + ζ ) . (30)
to complex masses. Although a CBC signal will never 4 8
have chirp time parameters in this unphysical sector, the
global maximum of the fitness function and the corre- The corresponding curves for different values of ζ are
sponding estimated signal parameters may fall in this re- shown in Fig. 2.
gion because of the presence of noise in the data. There-
fore, in order to reduce the chances of falsely dismissing c. SN veto – For any event which bypasses the two
a GW signal, one should not reject outright a candidate previous vetoes, a second search is launched for the global
that falls in this unphysical region but adopt a softer cri- maximum of LG (y) in the negative chirp time quadrant
terion. Accordingly, in the second veto step, we reject SN . This furnishes a new set of estimated parameters,
b N , from the SN search, with Θ
Θ b P being estimated from
a candidate only if the ratio, ζ, of the imaginary to real
part in the inferred masses, the SP run, for a given data segment. Of these parame-
ters, we use the estimated TOA, b ta,P and b
ta,N , and the
SNR, ρbP and ρbN , to construct the TOA difference mag-
nitude
r
4µ
ζ= 1− , |∆ta | = |b
ta,P − b
ta,N | , (31)
M
8
10-23 10-23
1 1
h(t)
0
-1
-1 -2
500 500
0 0
88.2 88.6 89 89.4 88.5 89 89.5 90 90.5
Time (s) Time (s)
FIG. 3: (Left column) An increasing frequency chirp signal in the negative chirp time quadrant SN with chirp
length τ > 0 with the top and bottom panels showing the time series and spectrogram, respectively. The chirp times
for this signal are [τ0 , τ1.5 ] = [−0.6, −1.5]. (Right column) The time series (top) and spectrogram (bottom) of the
signal obtained by swapping the values of τ0 and τ1.5 . This yields a decreasing frequency chirp.
and the magnitude of the relative SNR difference Observing Start End Detector
Run GPS GPS
|b
ρP − ρbN | (sec) (sec)
|∆ρ| = . (32)
ρbP O1 1127002112 1127047168 H1
tors are summarized in Table I. By using data from all distributed in SNR and other parameters as described
of the observing runs and the two LIGO detectors, we below.
were able to test the performance of GRITCLEAN across a The low mass injections had each mass chosen indepen-
broad cross-section of the observed glitch population. dently from a uniform distribution over [1.4, 3]M⊙. For
The PSO-based GLRT, LG (y), was applied to the data intermediate mass injections, the lower and higher masses
above in were drawn from uniform distributions over [1.4, 3] M⊙
512 sec long segments overlapped by 64 sec. The last and [5, 10] M⊙ , respectively. For the high mass injec-
64 sec of the cross-correlation of a segment with a tem- tions, each mass was selected from a uniform mass dis-
plate are discarded to account for the corruption caused tribution over [10, 25]M⊙. For each signal injection, the
by the circular wrapping of the template in an FFT-based initial phase was uniformly sampled from a range of
cross-correlation. Of the resulting 1135 segments, three [0, 2π] and the TOA was randomly selected to be between
were dropped because they contained confirmed GW [50, 350] sec. The TOA range ensured that the injected
signals, namely, GW190706 222641, GW190707 093326, signal did not have a time of coalescence that leaked into
and GW191215 223052. Assuming that our search the overlap region between consecutive segments. The
method is not more sensitive than the flagship CBC SNRs of each signal was drawn independently of the other
search pipelines, this ensured that all candidate events parameters from uniform distributions over three differ-
detected by the PSO-based search in the remaining seg- ent ranges: 198 injections were drawn from [10, 40], which
ments, in the absence of injected signals, were either is representative of the SNR range of the GW signals that
glitches or false alarms arising from the Gaussian noise have been observed so far; 196 injections had a lower
component. SNR range of [10, 13]; 60 were injected with unrealisti-
For some of the segments, the candidate signal had cally high SNRs in [100, 500]. The last set of injections
a time of arrival ta in the part of the cross-correlation was intended to stress test the assumption in GRITCLEAN
output that was not discarded but a time of coalescence, that a GW signal will always show a large SNR contrast
ta + τ (c.f., Eq. 6), that fell in the discarded part. In in searches with physical and unphysical waveforms. It is
most of these cases, this happened due to a strong glitch conceivable that this assumption is violated for very loud
in the last 64 sec of a segment triggering a high cross- GW signals since they may induce a strong response from
correlation with a long-duration (> 64 sec) template. unphysical waveforms just as strong glitches can trigger
Clearly, the estimated ta , which is integral to GRITCLEAN, a high-SNR response from physical waveforms.
is invalid for such a candidate and the corresponding seg-
ment was dropped from further consideration. However,
this did not result in the loss of a glitch because it was C. Veto performance
caught, along with valid estimated parameters, in the
overlapped section at the start of the subsequent seg- Candidate events were obtained from the data seg-
ment. As a result of this cut, an additional 68 segments ments described in Sec. IV A using a detection threshold
were removed, reducing the duration of the analyzed data of ρbP ≥ 9.0 on the estimated SNR in the positive chirp
to ≈ 131 hours. Prior to carrying out the PSO-based time quadrant (SP ) search. (This threshold is indepen-
search, each data segment was high-pass filtered with a dent of the one used to select the segments for signal
cutoff frequency of 30 Hz to remove the seismic noise con- injection as described in Sec. IV B.) The probability of a
tribution. For every 512 sec data segment, each run (SP false alarm from Gaussian stationary noise alone would
and SN ) took ≈ 46 min on 8 cores of an AMD EPYC be very low (< 1 event/year) at such a high threshold [39]
7763 64-Core Processor running at 2.45 GHz. although real broadband noise (coupled with imperfect
whitening) may have a non-Gaussian tail [40] that ele-
vates this probability somewhat. Hence, from the seg-
B. Signal injections ments free of signal injections, most candidate events
are expected to be glitches and it is convenient to re-
Each segment in the final set above was first analyzed fer to them as such in the following. The objective of
with a positive quadrant, SP , search without any signal any veto method should be the rejection of as many of
injection and the candidate event, with one event per these glitches as possible. The set of candidate events
segment, was recorded. Next, to study the veto safety of found had 708 glitches and 450 injected signals. The
GRITCLEAN, simulated GW signals were injected in a ran- 4 injections that were lost, corresponding to a false dis-
domly chosen subset of segments in which the estimated missal probability of 0.8%, had injected SNRs in the low-
SNRs of the candidate event fell below a threshold of est range ([10, 13]) and estimated SNRs of 8.02, 8.31,
8.0, thereby ensuring that these segments either did not 8.90 and 7.86. While GRITCLEAN is applied in stages as
have glitches or had weak and inconsequential ones at described in Sec. III C, it is informative to apply the neg-
most. From the 321 such segments that were found, we ative chirp time quadrant (SN ) search to all the glitches
randomly picked, with replacement, 454 segments for sig- and obtain their |∆ta | (Eq. 31) and |∆ρ| (Eq. 32) val-
nal injection. The set of injected signals was comprised ues. Fig. 4 shows the distribution of all the glitches
of low, intermediate, and high mass systems that were in the (|∆ta |, |∆ρ|) plane along with the distribution of
10
14454
20
102
2291 10
Whitened strain
0
10
363 0
-2
10 57 -10
-4 9
-20
10 -8 -6 -4 -2 0
10 10 10 10 10 0 100 200 300 400 500
Time
FIG. 4: Scatterplot of glitches in the (|∆ρ|, |∆ta |) plane.
FIG. 5: Whitened time-series of a data segment
The color of a point represents, following the map
containing multiple glitches (high amplitude spikes).
shown in the colorbar, the estimated SNR (b ρP ) in SP of
The SP search detects the glitch located near 388 sec,
a glitch. The colormap is based on the range of
while the SN search detects the glitch near 268 sec. The
observed log(b
ρP ) divided into 5 bins.
estimated SNRs of both glitches were ≈ 66 in the
respective searches.
FIG. 6: Results obtained by implementing GRITCLEAN on candidate events obtained with a detection threshold of
SNR = 9.0. Glitches vetoed by the chirp length and complex mass veto steps of GRITCLEAN in the positive chirp
time quadrant (SP ) are shown with pink and magenta colors, respectively. The remaining glitch events that bypass
the SP vetoes are shown in red. The exclusion area for the SN veto step, defined by |∆ρ| ≥ 0.1 and |∆ta | ≥ 0.3 is
shown as a rectangle shaded in green. Different marker shapes for the glitches correspond to different types of
outliers, as noted in the legend of the plot. Candidate events from injected signals are shown in dark blue, yellow,
and cyan colors, corresponding to the high, realistic, and low SNR ranges, respectively. Different marker shapes for
these events correspond to different mass ranges, as noted in the legend. The single glitch that falls into the
exclusion zone and is not vetoed by the SP vetoes is a Type-4 outlier shown by the red asterisk.
140
ζ Glitches missed Injections rejected
120 (%) (%) (%)
10 18.7 15.1
100
25 19.7 4.4
Frequency
80 50 22.1 0.4
60
90 28.6 0.0
100 31.9 0.0
40
from the broadband noise, but we will continue to call negative chirp time quadrant, SN , to separate glitches
them all glitches.) GRITCLEAN was applied as before, from CBC signals in a matched filter search. We have
with all settings kept the same except for the exclusion tested the effectiveness of our strategy using ≈ 131 hours
area in the (|∆ρ|, |∆ta |) plane, which must be enlarged of single-detector LIGO data. Our results show that
to |∆ρ| ≥ 0.07 and |∆ta | ≥ 0.1 in order to retain all the GRITCLEAN is a highly effective veto method that can suc-
detected injections. Now, the number of glitches that cessfully reject glitches that impact CBC searches while
are missed rises to 6 from 1, reducing the fraction of ve- maintaining a high level of veto safety. The search over
toed glitches to 99.2%. The loss of detected injections the unphysical sectors of SP is already integrated in a
remains at 0% after the application of GRITCLEAN, main- PSO-based approach and does not add any computa-
taining its high safety. While the IFAR of the weakest tional burden. The search over SN incurs additional com-
detected injection increases in correspondence with the putational cost but need only be used for ≈ 30% of can-
larger number of glitches, the reduction in IFAR due to didate events. For the detection threshold of SNR = 9.0,
GRITCLEAN is still quite large: from 1 false alarm every GRITCLEAN eliminated all glitches except one and brought
10.31 min before vetoing to 1 false alarm every 21.8 hours down the IFAR due to glitches by a factor of ≈ 707. It
after. Fig. 11 also shows the events that did not cross should be emphasized here that this large reduction was
the detection threshold. Most of such events arise from obtained from just a single-detector search. Supplement-
the broadband noise rather than glitches, like the Type-3 ing with a multi-detector coincidence veto will reduce the
outlier discussed earlier. We see that such noise-induced IFAR even further. The gain from using GRITCLEAN in
events occupy the same region of the (|∆ρ|, |∆ta |) plane such a scenario will be quantified in future work.
as the outlier glitches and that this is well separated In this study, a common set of injected signals was used
from the main cluster of glitches. Lowering the detec- for tuning the parameters of GRITCLEAN. However, the
tion threshold further and expanding the exclusion zone injections needed to tune the complex mass veto thresh-
to retain weaker injected signals will uncover more of this old ζ should be more specialized. This is because the
population of noise-induced candidate events, which are problem of complex estimated masses mainly affects the
more likely to bypass GRITCLEAN and, for that matter, signals located on the boundary of the unphysical region
any other glitch veto method. of complex masses, which is given by the curve of equal
One way in which GRITCLEAN differs from χ2 -vetoes is mass binaries (c.f Fig. 2). Therefore, a more precise de-
that the latter depend primarily on the norm of the data termination of ζ for a given fractional loss of injected
projection on the null space of a given template, while signals requires a much larger number of equal-mass in-
GRITCLEAN uses both the projection norms, encapsulated jections than were present in the common injection set.
in |∆ρ|, as well as the estimated TOAs, through |∆ta |, As a result, we used a conservative value for ζ that in-
of the best fit physical and unphysical templates. Fig. 6 curred a larger fraction of missed glitches in the complex
and Fig. 11 illustrate the importance of using |∆ta | in ad- mass veto step. While future applications of GRITCLEAN
dition to |∆ρ| in GRITCLEAN. If the threshold on |∆ta | is will improve in this aspect, our current results are not
removed, one can see that the number of glitches missed significantly affected given that the SN veto step would
(red colored points) increase by 2 and 7 in Fig. 6 and have trapped any glitch that escaped the complex mass
Fig. 11, respectively, thereby increasing the false alarm veto due to a lower ζ.
rate due to glitches without a concomitant increase in the From the outliers in the distribution of glitches across
number of detected injections. Another clear impact of the (|∆ρ|, |∆ta |) plane, we found that the majority are
using |∆ta |, seen in Fig. 11, is the separation of glitches associated with a technical limitation of the current PSO-
from below-threshold events (gray dots), the majority of based search that allows only one event to be identified
which would be false alarms from broadband noise and per data segment. This limitation will be lifted in ongo-
not real glitches. The separation includes both strong ing improvements to the PSO-based CBC search that will
and weak (but above threshold) glitches since, as dis- enable the identification of all local maxima of the fitness
cussed in the context of Fig. 4, their distribution in the function above a given detection threshold rather than
(|∆ρ|, |∆ta |) plane does not depend strongly on their es- just the global maximum. To aid reproducibility of our
timated SNR. As the SNR of GW signals is reduced, they work, we have provided the dataset of candidate events
will migrate in this plane to the left, as can be seen by the found in our study, the codes for PSO-based matched-
progression of the colored bands of the injected signals, filtering, and the scripts for producing some of the main
but not towards the bottom where most of the glitches plots in a public data release on Zenodo [41].
reside.
ACKNOWLEDGMENTS
V. CONCLUSIONS
This work is supported by NSF Grant No. PHY-
We have presented a veto method called GRITCLEAN 2207935. We acknowledge the Texas Advanced Comput-
that exploits the unphysical regions of the positive chirp ing Center (TACC) at the University of Texas at Austin
time quadrant, SP , and unphysical waveforms in the (www.tacc.utexas.edu) for providing high performance
14
FIG. 11: Results obtained by implementing GRITCLEAN on candidate events obtained with a detection threshold of
SNR = 8.0. Glitches vetoed by the chirp length and complex mass veto steps of GRITCLEAN in the positive chirp
time quadrant (SP ) are shown with pink and magenta colors, respectively. The remaining glitch events that bypass
the SP vetoes are shown in red. The exclusion area for the SN veto step, defined by |∆ρ| ≥ 0.07 and |∆ta | ≥ 0.1 is
shown as the green shaded rectangle. Candidate events from injected signals are shown in dark blue, yellow, and
cyan colors, corresponding to the high, realistic, and low SNR ranges, respectively. Events that fell below the
detection threshold are shown in gray color.
computing resources. This research has made use of data Science Center (gwosc.org), a service of the LIGO Sci-
or software obtained from the Gravitational Wave Open entific Collaboration, the Virgo Collaboration, and KA-
GRA.
[11] R. Biswas, L. Blackburn, J. Cao, R. Essick, K. A. spline fitting,” Classical and Quantum Gravity, vol. 40,
Hodge, E. Katsavounidis, K. Kim, Y.-M. Kim, E.-O. p. 125001, may 2023.
Le Bigot, C.-H. Lee, J. J. Oh, S. H. Oh, E. J. Son, [25] J. Merritt, B. Farr, R. Hur, B. Edelman, and Z. Doc-
Y. Tao, R. Vaulin, and X. Wang, “Application of ma- tor, “Transient glitch mitigation in advanced ligo data,”
chine learning algorithms to the study of noise arti- Physical Review D, vol. 104, no. 10, p. 102004, 2021.
facts in gravitational-wave data,” Phys. Rev. D, vol. 88, [26] R. Bondarescu, A. Lundgren, and R. Macas, “Antiglitch:
p. 062003, Sep 2013. a quasi-physical model for removing short glitches from
[12] S. Bahaadini, V. Noroozi, N. Rohani, S. Coughlin, ligo and virgo data,” 2023.
M. Zevin, J. Smith, V. Kalogera, and A. Katsaggelos, [27] J. Kennedy and R. C. Eberhart, “Particle swarm op-
“Machine learning for gravity spy: Glitch classification timization,” in Proceedings of the IEEE International
and dataset,” Information Sciences, vol. 444, pp. 172– Conference on Neural Networks: Perth, WA, Australia,
186, 2018. vol. 4, p. 1942, IEEE, 1995.
[13] S. Mukherjee, R. Obaid, and B. Matkarimov, “Classifi- [28] T. S. Weerathunga and S. D. Mohanty, “Performance of
cation of glitch waveforms in gravitational wave detector particle swarm optimization on the fully-coherent all-sky
characterization,” Journal of Physics: Conference Series, search for gravitational waves from compact binary coa-
vol. 243, p. 012006, aug 2010. lescences,” Physical Review D, vol. 95, no. 12, p. 124030,
[14] M. Zevin, S. Coughlin, S. Bahaadini, E. Besler, N. Ro- 2017.
hani, S. Allen, M. Cabero, K. Crowston, A. K. Kat- [29] M. E. Normandin and S. D. Mohanty, “Towards a
saggelos, S. L. Larson, et al., “Gravity spy: integrating real-time fully-coherent all-sky search for gravitational
advanced ligo detector characterization, machine learn- waves from compact binary coalescences using particle
ing, and citizen science,” Classical and Quantum Gravity, swarm optimization,” Physical Review D, vol. 101, no. 8,
vol. 34, no. 6, p. 064003, 2017. p. 082001, 2020.
[15] B. Allen, “χ2 time-frequency discriminator for gravi- [30] L. Blanchet, T. Damour, B. R. Iyer, C. M. Will, and
tational wave detection,” Physical Review D, vol. 71, A. G. Wiseman, “Gravitational-radiation damping of
p. 062001, Mar 2005. compact binary systems to second post-newtonian or-
[16] I. W. Harry and S. Fairhurst, “Targeted coherent search der,” Physical Review Letters, vol. 74, pp. 3515–3518,
for gravitational waves from compact binary coales- May 1995.
cences,” Phys. Rev. D, vol. 83, p. 084002, Apr 2011. [31] B. S. Sathyaprakash and S. V. Dhurandhar, “Choice of
[17] A. H. Nitz, “Distinguishing short duration noise tran- filters for the detection of gravitational waves from coa-
sients in ligo data to improve the pycbc search for grav- lescing binaries,” Physical Review D, vol. 44, pp. 3819–
itational waves from high mass binary black hole merg- 3834, Dec 1991.
ers,” arXiv preprint arXiv:1709.08974, 2017. [32] E. O. Brigham and R. E. Morrow, “The fast fourier trans-
[18] S. Choudhary, S. Bose, P. Joshi, and S. Dhurandhar, “Im- form,” IEEE Spectrum, vol. 4, no. 12, pp. 63–70, 1967.
proved binary black hole searches through better discrim- [33] B. F. Schutz, Data processing, analysis, and storage for
ination against noise transients,” 12 2022. interferometric antennas, p. 406–452. Cambridge Univer-
[19] S. Sachdev, S. Caudill, H. Fong, R. K. Lo, C. Messick, sity Press, 1991.
D. Mukherjee, R. Magee, L. Tsukada, K. Blackburn, [34] D. Bratton and J. Kennedy, “Defining a standard for par-
P. Brady, et al., “The gstlal search analysis methods for ticle swarm optimization,” in Swarm Intelligence Sympo-
compact binary mergers in advanced ligo’s second and sium, 2007. SIS 2007. IEEE, pp. 120–127, IEEE, 2007.
advanced virgo’s first observing runs,” arXiv preprint [35] J. Robinson and Y. Rahmat-Samii, “Particle swarm op-
arXiv:1901.08580, 2019. timization in electromagnetics,” IEEE Transactions on
[20] G. Ashton, S. Thiele, Y. Lecoeuche, J. McIver, and L. K. Antennas and Propagation, vol. 52, no. 2, pp. 397–407,
Nuttall, “Parameterised population models of transient 2004.
non-gaussian noise in the ligo gravitational-wave detec- [36] S. D. Mohanty, Swarm Intelligence Methods for Statisti-
tors,” Classical and Quantum Gravity, vol. 39, p. 175004, cal Regression. Chapman and Hall/CRC, 2018.
Aug. 2022. [37] The Mathworks, Inc., Natick, Massachusetts, MATLAB
[21] N. J. Cornish and T. B. Littenberg, “Bayeswave: version 9.12.0.2327980 (R2022a) Update 7, 2022.
Bayesian inference for gravitational wave bursts and [38] R. A. et. al, “Open data from the third observing run of
instrument glitches,” Classical and Quantum Gravity, ligo, virgo, kagra, and geo,” The Astrophysical Journal
vol. 32, no. 13, p. 135012, 2015. Supplement Series, vol. 267, p. 29, jul 2023.
[22] C. Pankow, K. Chatziioannou, E. A. Chase, T. B. Litten- [39] S. D. Mohanty and S. V. Dhurandhar, “Hierarchi-
berg, M. Evans, J. McIver, N. J. Cornish, C.-J. Haster, cal search strategy for the detection of gravitational
J. Kanner, V. Raymond, S. Vitale, and A. Zimmer- waves from coalescing binaries,” Phys. Rev. D, vol. 54,
man, “Mitigation of the instrumental noise transient in pp. 7108–7128, Dec 1996.
gravitational-wave data surrounding gw170817,” Phys. [40] T. Yamamoto, K. Hayama, S. Mano, Y. Itoh, and
Rev. D, vol. 98, p. 084016, Oct 2018. N. Kanda, “Characterization of non-gaussianity in grav-
[23] S. D. Mohanty and E. Fahnestock, “Adaptive spline fit- itational wave detector noise,” Phys. Rev. D, vol. 93,
ting with particle swarm optimization,” Computational p. 082005, Apr 2016.
Statistics, pp. 1–37, 2020. [41] R. Girgaonkar, “Gritclean code and files – part 1,” 2024.
[24] S. D. Mohanty and M. A. T. Chowdhury, “Glitch sub- Zenodo. DOI: 10.5281/zenodo.10547476
traction from gravitational wave data using adaptive