One flexible model for multiclass gravitational wave signal and glitch generation
Abstract
Simulating realistic time-domain observations of gravitational waves (GWs) and other events of interest in GW detectors, such as transient noise bursts called glitches, can help in advancing GW data analysis. Simulated data can be used in downstream data analysis tasks by augmenting datasets for signal searches, balancing datasets for machine learning applications, validating detection schemes, and constructing mock data challenges. In this work, we present a conditional derivative GAN (cDVGAN), a novel conditional model in the generative adversarial network framework for simulating multiple classes of time-domain observations that represent gravitational waves (GWs) and detector glitches. cDVGAN can also generate generalized hybrid samples that span the variation between classes through class interpolation in the conditioned class vector. cDVGAN introduces an additional player into the typical 2-player adversarial game of GANs, where an auxiliary discriminator analyzes the first-order derivative time series. Our results show that this provides synthetic data that better capture the features of the original data. cDVGAN conditions on three classes in the time domain, two denoized from LIGO blip and tomte glitch events from its third observing run (O3), and the third representing binary black hole (BBH) mergers. Our proposed cDVGAN outperforms four different baseline GAN models in replicating the features of the three classes. Specifically, our experiments show that training convolutional neural networks (CNNs) with our cDVGAN-generated data improves the detection of samples embedded in detector noise beyond the synthetic data from other state-of-the-art GAN models. Our best synthetic dataset yields as much as a 4.2% increase in area-under-the-curve (AUC) performance, maintaining the same CNN architecture, compared to synthetic datasets from baseline GANs. Moreover, training the CNN with class-interpolated hybrid samples from our cDVGAN outperforms CNNs trained only on the standard classes, when identifying real samples embedded in LIGO detector background between signal-to-noise ratios ranging from 1 to 16 (4% AUC improvement for cDVGAN). We also illustrate an application of cDVGAN in a data augmentation example, showing that it is competitive with a traditional augmentation approach. Lastly, we test cDVGAN's BBH signals in a fitting-factor study, showing that the synthetic signals are generally consistent with the semianalytical model used to generate the training signals and the corresponding parameter space.
- Publication:
-
Physical Review D
- Pub Date:
- July 2024
- DOI:
- 10.1103/PhysRevD.110.022004
- arXiv:
- arXiv:2401.16356
- Bibcode:
- 2024PhRvD.110b2004D
- Keywords:
-
- Physics - Instrumentation and Detectors;
- Computer Science - Machine Learning;
- General Relativity and Quantum Cosmology
- E-Print:
- 20 pages, 17 figures, 5 tables