Forschungsseminar Mathematische Statistik

Institut für Mathematik | Forschungsseminar Mathematische Statistik

Forschungsseminar Mathematische Statistik

Für den Bereich Statistik

A. Carpentier, S. Greven, W. Härdle, M. Reiß, V. Spokoiny

Ort

Weierstrass-Institut für Angewandte Analysis und Stochastik
ESH, Mohrenstraße 39 bitte beachten Sie die Raumänderung!!!
Mohrenstrasse 39
10117 Berlin

Zeit

mittwochs, 10.00 - 12.00 Uhr

Programm

16. April 2025

N.N.

23. April 2025

N.N.

30. April 2025

Ratmir Miftachov (HU Berlin)

Early Stopping for Regression Trees

Abstract: We develop early stopping rules for growing regression tree estimators. The fully data-driven stopping rule is based on monitoring the global residual norm. The best-first search and the breadth-first search algorithms together with linear interpolation give rise to generalized projection or regularization flows. A general theory of early stopping is established. Oracle inequalities for the early-stopped regression tree are derived without any smoothness assumption on the regression function, assuming the original CART splitting rule, yet with a much broader scope. The remainder terms are of smaller order than the best achievable rates for Lipschitz functions in dimension . In real and synthetic data the early stopping regression tree estimators attain the statistical performance of cost-complexity pruning while significantly reducing computational costs.

07. Mai 2025

N.N.

14. Mai 2025

Vladimir Spokoiny (WIAS/ HU)

Estimation and inference for Deep Neuronal Networks and inverse problems

Abstract: The talk discusses two important issues in modern high-dimensional statistics. Success of DNN in practical applications is at the same time a great challenge for statistical theory due to the "curse of dimensionality" problem. Manifold type assumptions are not really helpful and do not explain the double descent phenomenon when the DNN accuracy improves with of overparametrization. We offer a different view on the problem based on the notion of effective dimension and a calming device. The idea is to decouple the structural DNN relation by extending the parameter space and use a proper regularization without any substantial increase of the effective dimension.The other related issue is the choice of regulation in inverse problems.

We show that a simple ridge penalty (Tikhonov regularization) does a good job in any inverse problem for which the operator is more regular than the unknown signal. In the opposite case, one should use a model reduction technique like spectral cut-off. Estimation and inference for Deep Neuronal Networks and inverse problems

21. Mai 2025

Yi Yu (University of Warwick)

Contextual Dynamic Pricing: Algorithms, Optimality and Local Differential Privacy Constraints

Abstract: We study contextual dynamic pricing problems where a firm sells products to $T$ sequentially-arriving consumers, behaving according to an unknown demand model. The firm aims to minimize its regret over a clairvoyant that knows the model in advance. The demand follows a generalized linear model (GLM), allowing for stochastic feature vectors in $\mathbb R^d$ encoding product and consumer information. We first show the optimal regret is of order~$\sqrt{dT}$, up to logarithmic factors, improving existing upper bounds by a~$\sqrt{d}$ factor. This optimal rate is materialized by two algorithms: an explore-then-commit~(ETC) algorithm and a confidence bound-type algorithm. A key insight is an intrinsic connection between dynamic pricing and contextual multi-armed bandit problems with many arms with a careful discretization. We further extend our study to adversarial contexts and propose algorithms that are statistically and computationally more efficient than existing methods in the literature. We further study contextual dynamic pricing under local differential privacy~(LDP) constraints. We propose a stochastic gradient descent-based ETC algorithm achieving regret upper bounds of order $d\sqrt{T}/\epsilon$, up to logarithmic factors, where $\epsilon>0$ is the privacy parameter. The upper bounds with and without LDP constraints are matched by newly constructed minimax lower bounds, characterizing costs of privacy. Moreover, we extend our study to dynamic pricing under mixed privacy constraints, improving the privacy-utility tradeoff by leveraging public data. This is the first time such setting is studied in the dynamic pricing literature and our theoretical results seamlessly bridge dynamic pricing with and without LDP. Extensive numerical experiments and real data applications are conducted to illustrate the efficiency and practical value of our algorithms.

28. Mai 2025

Jason Klusowski (Princeton University)

Statistical-computational Trade-offs for Recursive Adaptive Partitioning Estimators

Abstract: Recursive adaptive partitioning estimators, like decision trees and their ensembles, are effective for high-dimensional regression but usually rely on greedy training, which can become stuck at suboptimal solutions. We study this phenomenon in estimating sparse regression functions over binary features, showing that when the true function satisfies a certain structural property, greedy training achieves low estimation error with only a logarithmic number of samples in the feature count. However, when this property is absent, estimation becomes exponentially more difficult. Interestingly, this dichotomy between efficient and inefficient estimation resembles the behavior of two-layer neural networks trained with SGD in the mean-field regime. Meanwhile, ERM-trained recursive adaptive partitioning estimators always achieve low estimation error with logarithmically many samples, revealing a fundamental statistical-computational trade-off for greedy training.

04. Juni 2025

Sebastian Kassing (TU Berlin)

Stochastic Optimization: From Quadratic Programs to the Training of Neural Networks

Abstract: Many foundational results in (stochastic) optimization—ranging from convergence guarantees and rates to asymptotic normality—have been derived under strong convexity assumptions or even for quadratic objective functions. However, such assumptions often fail to hold in modern machine learning applications, where objectives are typically non-convex. This talk explores a recent line of research that extends classical results in stochastic gradient-based optimization to broader classes of functions satisfying the Polyak–Łojasiewicz (PL) inequality, a condition that is significantly more relevant for practical deep learning models. We consider typical acceleration techniques such as Polyak’s Heavy Ball and Ruppert-Polyak averaging and use a geometric interpretation of the PL-inequality to show that many algorithmic properties extend to a more general and realistic class of objectives.

11. Juni 2025

Dimitri Konen (University of Cambridge)

Data assimilation with the 2D Navier-Stokes equations: Optimal Gaussian asymptotics for the posterior measure

Abstract: A functional Bernstein von Mises theorem is proved for posterior measures arising in a data assimilation problem with the two-dimensional Navier-Stokes equation where a Gaussian process prior is assigned to the initial condition of the system. The posterior measure, which provides the update in the space of all trajectories arising from a discrete sample of the dynamics, is shown to be approximated by a Gaussian random function arising from the solution to a linear parabolic PDE with Gaussian initial condition. The approximation holds in the strong sense of the supremum norm on the regression functions, showing that predicting future states of Navier-Stokes systems admits root(N)-consistent estimators even for commonly used nonparametric models. Consequences to credible bands and uncertainty quantification are discussed, and a functional minimax theorem is derived that describes the Cramer-Rao lower bound for estimating the state of the non-linear system, which is attained by the data assimilation algorithm.

18. Juni 2025

Lasse Vuursteen (University of Pennsylvania)

Adaptive Estimation under Differential Privacy Constraints

Abstract: Estimation guarantees in nonparametric models typically depend on underlying function classes (or hyperparameters) that are seldom known in practice. Adaptive estimators provide simultaneous near-optimal performance across multiple such function classes.

In this talk, I will discuss recent work with co-authors Tony Cai and Abhinav Chakraborty, in which we study adaptation under differential privacy constraints. Differential privacy fundamentally limits the information that can be revealed about each individual datum by each data holder.

We develop a general theory for adaptation under differential privacy in the context of estimating linear functionals of a density. Our framework characterizes the difficulty of private adaptation problems through a specific "between-class modulus of continuity" that exactly describes the optimal achievable performance for private estimators that must adapt across two or more function classes.

Our theory reveals and quantifies the extent to which adaptation between specific function classes suffers as a consequence of imposing differential privacy constraints.

25. Juni 2025

CANCELLED

02. Juli 2025: Merle Munko (Otto-von-Guericke University Magdeburg); Multiple Tests for Mean Functions of Functional Data; Abstract: Functional data analysis is becoming increasingly popular to study data from real-valued random functions. Nevertheless, there is a lack of multiple testing procedures for such data. These are particularly important in factorial designs for comparing different groups or inferring factor effects. We propose a new class of testing procedures for arbitrary linear hypotheses in general factorial designs with functional data. Our methods allow global as well as multiple inference of both univariate and multivariate mean functions without assuming particular error distributions or homoscedasticity. That is, we allow for different structures of the covariance functions between groups. We analyse the (joint) asymptotic behaviour of suitable test statistics and propose a resampling approach to approximate the limit distributions. The resulting global and multiple testing procedures are asymptotically valid under weak conditions and applicable in general functional MANOVA settings. We evaluate their small-sample performance in extensive simulations and finally illustrate their applicability by analysing a data set.



09. Juli 2025
CANCELLED

16. Juli 2025

Claudia Strauch (Universität Heidelberg)


 Interessenten sind herzlich eingeladen.

Für Rückfragen wenden Sie sich bitte an:

Frau Marina Filatova

Mail: marina.filatova@hu-berlin.de
Telefon: +49-30-2093-45460
Humboldt-Universität zu Berlin
Institut für Mathematik
Unter den Linden 6
10099 Berlin, Germany

Humboldt-Universität zu Berlin - Mathematisch-Naturwissenschaftliche Fakultät - Institut für Mathematik

Für den Bereich Statistik

A. Carpentier, S. Greven, W. Härdle, M. Reiß, V. Spokoiny

Ort

Zeit

Programm

Frau Marina Filatova