PHD Proposal

Instrumentation & signal processing

Uploaded by

Chandrakanta Nayak



Rémi Bardenet, CR CNRS,

PI of ERC grant Blackjack

PI of Artificial Intelligence chair Baccarat
Centre de recherche en informatique, signal et automatique de Lille (CRIStAL)
Université de Lille
rbardenet.github.io
remi.bardenet@gmail.com

Subhroshekhar Ghosh, Assistant professor,

Department of mathematics,
National University of Singapore
https://subhro-ghosh.github.io/

Proposal for PhD thesis

Keywords. Negative dependence in large-scale machine learning, time-frequency signal
processing, Monte Carlo integration.

Funding. This thesis will be funded by the AI chair “Baccarat”, managed by Rémi Bardenet.

Starting date. To be discussed with the advisors. A PhD in France takes 3 years, typically
extended by 6 months if needed. It is possible, and actually recommended, to arrange a master’s
level internship before the thesis starts.

Supervision. To tackle this interdisciplinary project, we offer the co-supervision of a
computational statistician developing applications of repulsive point processes (Rémi Bardenet,
main supervisor; Univ. Lille, CNRS, École Centrale de Lille), and an expert in probability and
statistical physics (Subhro Ghosh: National Univ. Singapore).

Environment. Time will be shared between Lille (France) and Singapore. There is ample
funding for travel between the two sites.
In more detail, the first site is CRIStAL, the department of computer science of the
University of Lille, where co-supervisor Rémi Bardenet is based. Besides roughly 100 people working
on data science and signal processing, CRIStAL hosts a group of 7 to 10 people dedicated precisely
to the applications of repulsive point processes in data science, funded either by the Baccarat
AI chair or ERC grant Blackjack. Moreover, we have strong links to the department of
mathematics in Laboratoire Painlevé, a few minutes away on foot. A weekly workgroup between the
two labs is further devoted to point processes and their applications, with a focus on interacting
point processes. This makes Lille a rich scientific environment for the thesis.
The second site is the National University of Singapore (NUS), with co-supervisor Subhro
Ghosh at the department of mathematics and the department of statistics. The two
departments are very strong across the wide spectrum of mathematics for data science.

Context. A point process is a random discrete set of points in a generic space. A broad
interest has recently emerged around point processes that exhibit a regular arrangement of
their points in a wide sense. Figure 1(b) shows an example of such a regular arrangement
in 2D, while Figure 1(a) shows the same number of i.i.d. points drawn uniformly on the
same rectangle, for reference. In condensed matter physics, it has been observed that particle
systems like Figure 1(b) are actually so regularly spread that the variance of the number of
points in a large window is lower than it would be for independent points, a phenomenon
called suppressed fluctuations, or hyperuniformity. This has triggered a line of research in
mathematical probability around so-called repulsive point processes, like zeros of Gaussian
analytic functions (GAFs; Hough et al. (2009)) and determinantal point processes (DPPs;
Hough et al. (2006)). In less than 10 years, machine learners and statisticians have turned
some of these repulsive point processes into powerful subsampling tools (Kulesza and Taskar,
2012; Derezinski and Mahoney, 2021; Belhadji, Bardenet, and Chainais, 2020b; Bardenet and
Hardy, 2020) and statistical models (Kulesza and Taskar, 2012; Lavancier, Møller, and Rubak,
2014).

Figure 1: (a) An i.i.d. uniform draw, along with two draws of repulsive point processes: (b) a
determinantal point process, and (c) the zeros of the planar Gaussian analytic function.
Panel (c) has axes Time and Frequency.

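The suppressed fluctuations described above can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it uses a jittered grid (one uniform point per grid cell) as a simple stand-in for a regular arrangement, not a DPP or GAF zero set, and the function names, grid size, and window size are hypothetical choices. It compares the variance of the number of points falling in a fixed window for i.i.d. uniform points and for the jittered grid.

```python
import numpy as np

rng = np.random.default_rng(0)

def count_in_window(points, half_width):
    """Number of points in the centred square [-half_width, half_width]^2."""
    return int(np.sum(np.all(np.abs(points) <= half_width, axis=1)))

def iid_uniform(n_side, rng):
    """n_side**2 i.i.d. uniform points on [-1, 1]^2."""
    return rng.uniform(-1.0, 1.0, size=(n_side**2, 2))

def jittered_grid(n_side, rng):
    """One uniform point per cell of an n_side x n_side grid on [-1, 1]^2.

    Stand-in for a regular arrangement (assumption: not a DPP).
    """
    cell = 2.0 / n_side
    ix, iy = np.meshgrid(np.arange(n_side), np.arange(n_side), indexing="ij")
    corners = np.stack([ix.ravel(), iy.ravel()], axis=1) * cell - 1.0
    return corners + rng.uniform(0.0, cell, size=corners.shape)

def count_variance(sampler, n_side=20, half_width=0.53, n_rep=2000, rng=rng):
    """Empirical variance of the window count over n_rep independent draws."""
    counts = [count_in_window(sampler(n_side, rng), half_width)
              for _ in range(n_rep)]
    return float(np.var(counts))

var_iid = count_variance(iid_uniform)
var_grid = count_variance(jittered_grid)
print(f"count variance, i.i.d. points: {var_iid:.2f}")
print(f"count variance, jittered grid: {var_grid:.2f}")
```

For the jittered grid, only the cells cut by the window boundary contribute randomness to the count, so its variance scales with the perimeter of the window rather than its area.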
For instance, Belhadji, Bardenet, and Chainais (2020a) propose to perform feature selection
in linear regression using DPPs. A DPP is a point process in which points all interact with
each other, and where the interaction is encoded by a kernel, in the same sense as the kernel of
support vector machines. It can reasonably be argued that DPPs are the kernel machine of point
processes. In the case of Belhadji, Bardenet, and Chainais (2020a), the point process draws a
set of indices of jointly dissimilar columns of the (wide) feature matrix. How well spread the
selected columns are is measured by how well their span approximates the
span of the first principal directions. We showed in particular that the resulting set of columns
led to performance similar to PCA, while leading to more interpretable features.
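To make the idea of a kernel-encoded interaction concrete, here is a minimal sketch of the standard spectral sampling algorithm for a finite DPP (described, e.g., in Hough et al. (2006) and Kulesza and Taskar (2012)). It is not the specific DPP of Belhadji, Bardenet, and Chainais (2020a): the kernel, taken here as the Gram matrix of the normalised columns of a random wide matrix, and the names `sample_dpp`, `X`, and `L` are assumptions made for illustration.

```python
import numpy as np

def sample_dpp(L, rng):
    """Draw one sample from the L-ensemble DPP with PSD kernel matrix L."""
    eigvals, eigvecs = np.linalg.eigh(L)
    eigvals = np.clip(eigvals, 0.0, None)  # remove tiny negative round-off
    # Phase 1: keep eigenvector k with probability lambda_k / (1 + lambda_k).
    keep = rng.random(len(eigvals)) < eigvals / (1.0 + eigvals)
    V = eigvecs[:, keep]
    sample = []
    # Phase 2: select one item per kept eigenvector.
    while V.shape[1] > 0:
        probs = np.sum(V**2, axis=1)
        probs = probs / probs.sum()
        i = int(rng.choice(len(probs), p=probs))
        sample.append(i)
        # Project span(V) onto the vectors that vanish at coordinate i.
        j = int(np.argmax(np.abs(V[i, :])))
        Vj = V[:, j].copy()
        V = np.delete(V, j, axis=1)
        V = V - np.outer(Vj, V[i, :] / Vj[i])
        if V.shape[1] > 0:
            V, _ = np.linalg.qr(V)  # re-orthonormalise the remaining basis
    return sorted(sample)

rng = np.random.default_rng(0)
n_rows, n_cols = 5, 30                 # a small "wide" feature matrix
X = rng.standard_normal((n_rows, n_cols))
X[:, 1] = X[:, 0]                      # columns 0 and 1 are duplicates
X /= np.linalg.norm(X, axis=0)         # normalise each column
L = X.T @ X                            # similarity kernel between columns

selected = sample_dpp(L, rng)
print("selected column indices:", selected)
```

Since the probability of a subset is proportional to the determinant of the corresponding submatrix of `L`, the duplicated columns 0 and 1 should essentially never be selected together, and a sample cannot contain more columns than the rank of `L` (here at most 5).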
As another example, repulsive point processes have helped characterize the behaviour of
the zeros of time-frequency transforms of white noises (Bardenet, Flamant, and Chainais, 2018;
Bardenet and Hardy, 2019) in signal processing. Time-frequency transforms are the
mathematical equivalent of musical scores, i.e. they map a signal (say, the sound of an instrument
recorded in a noisy environment) to a function of time and frequency that encodes what
frequency is present at what time, like notes on a score. One such transform is the spectrogram,
and Figure 1(c) shows the spectrogram of a white Gaussian noise sample. The zeros of the
spectrogram, shown as white dots, are a repulsive point process. It turns out that this point
process has the same law as the zeros of the planar GAF, a Gaussian process of particular
importance in probability (Hough et al., 2009). The mathematical knowledge we have of this
point process in turn leads to improved signal reconstruction algorithms (Bardenet, Flamant,
and Chainais, 2018).
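As a rough illustration of the spectrogram zeros discussed above, the following sketch computes a discrete Gaussian-window spectrogram of real white Gaussian noise and locates approximate zeros as strict local minima over the eight neighbouring time-frequency bins. The signal length, FFT size, window width, and the local-minimum rule are assumptions made for this example; they are not taken from the cited papers.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_fft = 512, 128
noise = rng.standard_normal(n_samples)       # real white Gaussian noise

# Gaussian window; the width is an illustrative choice meant to roughly
# balance time and frequency resolution on the discrete grid.
sigma = np.sqrt(n_fft / (2.0 * np.pi))
t = np.arange(n_fft) - n_fft // 2
window = np.exp(-0.5 * (t / sigma) ** 2)

# Short-time Fourier transform with a hop of one sample.
frames = np.lib.stride_tricks.sliding_window_view(noise, n_fft) * window
stft = np.fft.rfft(frames, axis=1)
spec = np.abs(stft) ** 2                     # shape: (time frames, freq bins)

# Approximate zeros: interior bins strictly smaller than their 8 neighbours.
n_t, n_f = spec.shape
centre = spec[1:-1, 1:-1]
is_min = np.ones(centre.shape, dtype=bool)
for dt in (-1, 0, 1):
    for df in (-1, 0, 1):
        if dt == 0 and df == 0:
            continue
        neighbour = spec[1 + dt : n_t - 1 + dt, 1 + df : n_f - 1 + df]
        is_min &= centre < neighbour
zeros = np.argwhere(is_min) + 1              # (time index, frequency index)

print("spectrogram shape:", spec.shape)
print("number of approximate zeros:", len(zeros))
```

Plotting `spec` on a logarithmic colour scale with the entries of `zeros` overlaid should give a picture in the spirit of Figure 1(c), with the zeros spread fairly evenly over the time-frequency plane.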

Objectives. To get acquainted with the interdisciplinary topic of repulsive point processes,
we shall start with a project that fits into the ongoing collaboration between the two supervisors.
Ideally, this project shall be tackled during a master’s level internship prior to starting the PhD.
Depending on the student’s background and taste, this can be, e.g., (i) topological data analysis
applied to the zeros of random spectrograms like Figure 1(c) in statistical signal processing.
Alternatively, the internship could revolve around (ii) negatively dependent subsampling for
large-scale machine learning. For instance, how can we tailor a repulsive point process to
sample a coreset (Tremblay, Barthelmé, and Amblard, 2019) for some specific ML task? In
other words, is there a natural repulsive point process that subsamples a large dataset, while
guaranteeing that learning on the subsample leads to the same generalization properties as
learning on the initial dataset? This is related to recent work by the two supervisors (Bardenet,
Ghosh, and Lin, 2021) on determinantal subsampling for stochastic gradient descent.
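One basic ingredient of subsampling for stochastic gradient descent is that the minibatch gradient estimator should be unbiased. The sketch below checks this by brute force for a small L-ensemble DPP, using the generic estimator that weights each selected gradient by the inverse of its inclusion probability. This is an illustration under assumptions, not necessarily the construction of the cited paper: the kernel `L`, the per-example gradients `grads`, and the problem size are all hypothetical.

```python
import itertools
import numpy as np

rng = np.random.default_rng(0)
n = 6                                    # tiny dataset: all subsets are enumerable
A = rng.standard_normal((n, n))
L = A @ A.T                              # positive definite L-ensemble kernel
K = L @ np.linalg.inv(L + np.eye(n))     # marginal kernel: P(i in S) = K[i, i]
incl_prob = np.diag(K)

grads = rng.standard_normal((n, 3))      # hypothetical per-example gradients
full_gradient = grads.sum(axis=0)

# Exact expectation of sum_{i in S} grads[i] / K[i, i] under the DPP,
# where P(S) = det(L_S) / det(L + I).
normaliser = np.linalg.det(L + np.eye(n))
expected = np.zeros(3)
total_prob = 0.0
for size in range(n + 1):
    for subset in itertools.combinations(range(n), size):
        if size == 0:
            total_prob += 1.0 / normaliser   # empty set: det of empty matrix is 1
            continue
        idx = np.array(subset)
        prob = np.linalg.det(L[np.ix_(idx, idx)]) / normaliser
        estimate = (grads[idx] / incl_prob[idx][:, None]).sum(axis=0)
        total_prob += prob
        expected += prob * estimate

print("subset probabilities sum to:", total_prob)
print("max abs bias:", np.max(np.abs(expected - full_gradient)))
```

Unbiasedness holds for any sampling scheme with positive inclusion probabilities; what a repulsive process can change is the variance of the estimator.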
After this first project, the three of us will pick an ambitious open problem in line with the
objectives of the Baccarat AI chair, according to the student’s interest. Candidate problems
include identifying and studying repulsive point processes for high-dimensional Monte Carlo
integration, fast sampling algorithms for determinantal point processes in machine learning,
dictionary learning for signal processing, or studying zeros of wavelet transforms of random
signals to use them in filtering tasks.

References
[1] R. Bardenet, J. Flamant, and P. Chainais. “On the zeros of the spectrogram of white
noise”. In: Applied and Computational Harmonic Analysis (2018).
[2] R. Bardenet, S. Ghosh, and M. Lin. “Determinantal point processes based on orthogonal
polynomials for sampling minibatches in SGD”. In: Advances in Neural Information
Processing Systems (NeurIPS). 2021.
[3] R. Bardenet and A. Hardy. “Monte Carlo with Determinantal Point Processes”. In:
Annals of Applied Probability (2020).
[4] R. Bardenet and A. Hardy. “Time-frequency transforms of white noises and Gaussian
analytic functions”. In: Applied and Computational Harmonic Analysis (2019).
[5] A. Belhadji, R. Bardenet, and P. Chainais. “A determinantal point process for column
subset selection”. In: Journal of Machine Learning Research (JMLR) (2020a).
[6] A. Belhadji, R. Bardenet, and P. Chainais. “Kernel interpolation with continuous volume
sampling”. In: International Conference on Machine Learning (ICML). 2020b.
[7] M. Derezinski and M. W. Mahoney. “Determinantal point processes in randomized
numerical linear algebra”. In: Notices of the American Mathematical Society 68.1 (2021).
[8] J. B. Hough et al. “Determinantal processes and independence”. In: Probability Surveys
(2006).
[9] J. B. Hough et al. Zeros of Gaussian analytic functions and determinantal point processes.
Vol. 51. American Mathematical Society, 2009.
[10] A. Kulesza and B. Taskar. “Determinantal point processes for machine learning”. In:
Foundations and Trends in Machine Learning (2012).
[11] F. Lavancier, J. Møller, and E. Rubak. “Determinantal point process models and
statistical inference”. In: Journal of the Royal Statistical Society, Series B (2014).
[12] N. Tremblay, S. Barthelmé, and P.-O. Amblard. “Determinantal Point Processes for
Coresets”. In: Journal of Machine Learning Research 20.168 (2019), pp. 1–70.
