false

Dynamic Inverse Optimization under Drift and Shocks: Theory, Regret Bounds, and Applications

Dynamic Inverse Optimization under Drift and Shocks: Theory, Regret Bounds, and Applications ArXiv ID: 2509.14080 “View on arXiv” Authors: JINHO CHA Abstract The growing prevalence of drift and shocks in modern decision environments exposes a gap between classical optimization theory and real-world practice. Standard models assume fixed objectives, yet organizations from hospitals to power grids routinely adapt to shifting priorities, noisy data, and abrupt disruptions. To address this gap, this study develops a dynamic inverse optimization framework that recovers hidden, time-varying preferences from observed allocation trajectories. The framework unifies identifiability analysis with regret guarantees conditions are established for existence and uniqueness of recovered parameters, and sharp static and dynamic regret bounds are derived to characterize responsiveness to gradual drift and sudden shocks. Methodologically, a drift-aware estimator grounded in convex analysis and online learning theory is introduced, with finite-sample guarantees on recovery accuracy. Computational experiments in healthcare, energy, logistics, and finance reveal heterogeneous recovery patterns, ranging from rapid resilience to persistent vulnerability. Overall, dynamic inverse optimization emerges as both a theoretical contribution and a broadly applicable diagnostic tool for benchmarking resilience, uncovering hidden behavioral shifts, and guiding policy interventions in complex stochastic systems. ...

September 17, 2025 · 2 min · Research Team

Market Making without Regret

Market Making without Regret ArXiv ID: 2411.13993 “View on arXiv” Authors: Unknown Abstract We consider a sequential decision-making setting where, at every round $t$, a market maker posts a bid price $B_t$ and an ask price $A_t$ to an incoming trader (the taker) with a private valuation for one unit of some asset. If the trader’s valuation is lower than the bid price, or higher than the ask price, then a trade (sell or buy) occurs. If a trade happens at round $t$, then letting $M_t$ be the market price (observed only at the end of round $t$), the maker’s utility is $M_t - B_t$ if the maker bought the asset, and $A_t - M_t$ if they sold it. We characterize the maker’s regret with respect to the best fixed choice of bid and ask pairs under a variety of assumptions (adversarial, i.i.d., and their variants) on the sequence of market prices and valuations. Our upper bound analysis unveils an intriguing connection relating market making to first-price auctions and dynamic pricing. Our main technical contribution is a lower bound for the i.i.d. case with Lipschitz distributions and independence between prices and valuations. The difficulty in the analysis stems from the unique structure of the reward and feedback functions, allowing an algorithm to acquire information by graduating the “cost of exploration” in an arbitrary way. ...

November 21, 2024 · 2 min · Research Team

Logarithmic regret in the ergodic Avellaneda-Stoikov market making model

Logarithmic regret in the ergodic Avellaneda-Stoikov market making model ArXiv ID: 2409.02025 “View on arXiv” Authors: Unknown Abstract We analyse the regret arising from learning the price sensitivity parameter $κ$ of liquidity takers in the ergodic version of the Avellaneda-Stoikov market making model. We show that a learning algorithm based on a maximum-likelihood estimator for the parameter achieves the regret upper bound of order $\ln^2 T$ in expectation. To obtain the result we need two key ingredients. The first is the twice differentiability of the ergodic constant under the misspecified parameter in the Hamilton-Jacobi-Bellman (HJB) equation with respect to $κ$, which leads to a second–order performance gap. The second is the learning rate of the regularised maximum-likelihood estimator which is obtained from concentration inequalities for Bernoulli signals. Numerical experiments confirm the convergence and the robustness of the proposed algorithm. ...

September 3, 2024 · 2 min · Research Team

Regret-Optimal Federated Transfer Learning for Kernel Regression with Applications in American Option Pricing

Regret-Optimal Federated Transfer Learning for Kernel Regression with Applications in American Option Pricing ArXiv ID: 2309.04557 “View on arXiv” Authors: Unknown Abstract We propose an optimal iterative scheme for federated transfer learning, where a central planner has access to datasets ${"\cal D"}1,\dots,{"\cal D"}N$ for the same learning model $f_θ$. Our objective is to minimize the cumulative deviation of the generated parameters ${“θ_i(t)"}{“t=0”}^T$ across all $T$ iterations from the specialized parameters $θ^\star{“1”},\ldots,θ^\star_N$ obtained for each dataset, while respecting the loss function for the model $f_{“θ(T)”}$ produced by the algorithm upon halting. We only allow for continual communication between each of the specialized models (nodes/agents) and the central planner (server), at each iteration (round). For the case where the model $f_θ$ is a finite-rank kernel regression, we derive explicit updates for the regret-optimal algorithm. By leveraging symmetries within the regret-optimal algorithm, we further develop a nearly regret-optimal heuristic that runs with $\mathcal{“O”}(Np^2)$ fewer elementary operations, where $p$ is the dimension of the parameter space. Additionally, we investigate the adversarial robustness of the regret-optimal algorithm showing that an adversary which perturbs $q$ training pairs by at-most $\varepsilon>0$, across all training sets, cannot reduce the regret-optimal algorithm’s regret by more than $\mathcal{“O”}(\varepsilon q \bar{“N”}^{“1/2”})$, where $\bar{“N”}$ is the aggregate number of training pairs. To validate our theoretical findings, we conduct numerical experiments in the context of American option pricing, utilizing a randomly generated finite-rank kernel. ...

September 8, 2023 · 2 min · Research Team