Publications | Vincent Pacelli

Feedback Schrödinger Bridge Matching

Panagiotis Theodoropoulos, Nikolaos Komianos, Vincent Pacelli, Guan-Horng Liu, and Evangelos A. Theodorou

In Proc. Intl. Conf. on Learning Representations, 2025

Recent advancements in diffusion bridges for distribution transport problems have heavily relied on matching frameworks, yet existing methods often face a trade-off between scalability and access to optimal pairings during training. Fully unsupervised methods make minimal assumptions but incur high computational costs, limiting their practicality. On the other hand, imposing full supervision of the matching process with optimal pairings improves scalability; however, it can be infeasible in most applications. To strike a balance between scalability and minimal supervision, we introduce Feedback Schrödinger Bridge Matching (FSBM), a novel semi-supervised matching framework that incorporates a small portion (% of the entire dataset) of pre-aligned pairs as state feedback to guide the transport map of non-coupled samples, thereby significantly improving efficiency. This is achieved by formulating a static Entropic Optimal Transport (EOT) problem with an additional term capturing the semi-supervised guidance. The generalized EOT objective is then recast into a dynamic formulation to leverage the scalability of matching frameworks. Extensive experiments demonstrate that FSBM accelerates training and enhances generalization by leveraging coupled pairs’ guidance, opening new avenues for training matching frameworks with partially aligned datasets.
@inproceedings{Theodoropoulos25, title = {Feedback Schr{\"o}dinger Bridge Matching}, author = {Theodoropoulos, Panagiotis and Komianos, Nikolaos and Pacelli, Vincent and Liu, Guan-Horng and Theodorou, Evangelos A.}, booktitle = {Proc. Intl. Conf. on Learning Representations}, year = {2025}, url = {https://openreview.net/forum?id=k3tbMMW8rH}, doi = {10.48550/arXiv.2410.14055} }
Deep Distributed Optimization for Large-Scale Quadratic Programming

Augustinos D. Saravanos, Hunter Kuperman, Alex Oshin, Arshiya Taj Abdul, Vincent Pacelli, and Evangelos Theodorou

In Proc. Intl. Conf. on Learning Representations, 2025

Quadratic programming (QP) forms a crucial foundation in optimization, encompassing a broad spectrum of domains and serving as the basis for more advanced algorithms. Consequently, as the scale and complexity of modern applications continue to grow, the development of efficient and reliable QP algorithms is becoming increasingly vital. In this context, this paper introduces a novel deep learning-aided distributed optimization architecture designed for tackling large-scale QP problems. First, we combine the state-of-the-art Operator Splitting QP (OSQP) method with a consensus approach to derive DistributedQP, a new method tailored for network-structured problems, with convergence guarantees to optimality. Subsequently, we unfold this optimizer into a deep learning framework, leading to DeepDistributedQP, which leverages learned policies to accelerate convergence to a desired accuracy within a restricted number of iterations. Our approach is also theoretically grounded through Probably Approximately Correct (PAC)-Bayes theory, providing generalization bounds on the expected optimality gap for unseen problems. The proposed framework, as well as its centralized version DeepQP, significantly outperform their standard optimization counterparts on a variety of tasks such as randomly generated problems, optimal control, linear regression, transportation networks and others. Notably, DeepDistributedQP demonstrates strong generalization by training on small problems and scaling to solve much larger ones (up to 50K variables and 150K constraints) using the same policy. Moreover, it achieves orders-of-magnitude improvements in wall-clock time compared to OSQP. The certifiable performance guarantees of our approach are also demonstrated, ensuring higher-quality solutions over traditional optimizers.
@inproceedings{Saravanos25, title = {Deep Distributed Optimization for Large-Scale Quadratic Programming}, author = {Saravanos, Augustinos D. and Kuperman, Hunter and Oshin, Alex and Abdul, Arshiya Taj and Pacelli, Vincent and Theodorou, Evangelos}, booktitle = {Proc. Intl. Conf. on Learning Representations}, year = {2025}, url = {https://openreview.net/forum?id=hzuumhfYSO}, doi = {10.48550/arXiv.2412.12156} }
Operator Splitting Covariance Steering for Safe Stochastic Nonlinear Control

Akash Ratheesh†, Vincent Pacelli†, Augustinos D. Saravanos, and Evangelos A. Theodorou

In Proc. Conf. on Decision and Control (To Appear), 2025

Most robotics applications are accompanied by safety restrictions that need to be satisfied with a high degree of confidence even in environments under uncertainty. Controlling the state distribution of a system and enforcing such specifications as distribution constraints is a promising approach for meeting such requirements. In this direction, covariance steering (CS) is an increasingly popular stochastic optimal control (SOC) framework for designing safe controllers via explicit constraints on the system covariance. Nevertheless, a major challenge in applying CS methods to systems with the nonlinear dynamics and chance constraints common in robotics is that the approximations needed are conservative and highly sensitive to the point of approximation. This can cause sequential convex programming methods to converge to poor local minima or incorrectly report problems as infeasible due to shifting constraints. This paper presents a novel algorithm for solving chance-constrained nonlinear CS problems that directly addresses this challenge. Specifically, we propose an operator-splitting approach that temporarily separates the main problem into subproblems that can be solved in parallel. The benefit of this relaxation lies in the fact that it does not require all iterates to satisfy all constraints simultaneously prior to convergence, thus enhancing the exploration capabilities of the algorithm for finding better solutions. Simulation results verify the ability of the proposed method to find higher quality solutions under stricter safety constraints than standard methods on a variety of robotic systems. Finally, the applicability of the algorithm on real systems is confirmed through hardware demonstrations.
@inproceedings{Ratheesh25, title = {Operator Splitting Covariance Steering for Safe Stochastic Nonlinear Control}, author = {Ratheesh, Akash and Pacelli, Vincent and Saravanos, Augustinos D. and Theodorou, Evangelos A.}, year = {2025}, booktitle = {Proc. Conf. on Decision and Control (To Appear)}, organization = {IEEE}, doi = {10.48550/arXiv.2411.11211}, }
Fundamental Limits for Sensor-Based Robot Control

Anirudha Majumdar, Zhiting Mei, and Vincent Pacelli

Intl. J. of Robotics Research, 2023

Our goal is to develop theory and algorithms for establishing fundamental limits on performance imposed by a robot’s sensors for a given task. In order to achieve this, we define a quantity that captures the amount of task-relevant information provided by a sensor. Using a novel version of the generalized Fano’s inequality from information theory, we demonstrate that this quantity provides an upper bound on the highest achievable expected reward for one-step decision-making tasks. We then extend this bound to multi-step problems via a dynamic programming approach. We present algorithms for numerically computing the resulting bounds, and demonstrate our approach on three examples: (i) the lava problem from the literature on partially observable Markov decision processes, (ii) an example with continuous state and observation spaces corresponding to a robot catching a freely-falling object, and (iii) obstacle avoidance using a depth sensor with non-Gaussian noise. We demonstrate the ability of our approach to establish strong limits on achievable performance for these problems by comparing our upper bounds with achievable lower bounds (computed by synthesizing or learning concrete control policies).
@article{Majumdar23, title = {Fundamental Limits for Sensor-Based Robot Control}, author = {Majumdar, Anirudha and Mei, Zhiting and Pacelli, Vincent}, journal = {Intl. J. of Robotics Research}, volume = {42}, number = {12}, pages = {1051--1069}, year = {2023}, publisher = {SAGE}, url = {https://journals.sagepub.com/doi/full/10.1177/02783649231190947}, doi = {10.1177/02783649231190947} }
Fundamental Performance Limits for Sensor-Based Robot Control and Policy Learning

Anirudha Majumdar and Vincent Pacelli

In Proc. of Robotics: Science and Systems, 2022

Our goal is to develop theory and algorithms for establishing fundamental limits on performance for a given task imposed by a robot’s sensors. In order to achieve this, we define a quantity that captures the amount of task-relevant information provided by a sensor. Using a novel version of the generalized Fano inequality from information theory, we demonstrate that this quantity provides an upper bound on the highest achievable expected reward for one-step decision making tasks. We then extend this bound to multi-step problems via a dynamic programming approach. We present algorithms for numerically computing the resulting bounds, and demonstrate our approach on three examples: (i) the lava problem from the literature on partially observable Markov decision processes, (ii) an example with continuous state and observation spaces corresponding to a robot catching a freely-falling object, and (iii) obstacle avoidance using a depth sensor with non-Gaussian noise. We demonstrate the ability of our approach to establish strong limits on achievable performance for these problems by comparing our upper bounds with achievable lower bounds (computed by synthesizing or learning concrete control policies).
@inproceedings{Majumdar22, title = {Fundamental Performance Limits for Sensor-Based Robot Control and Policy Learning}, author = {Majumdar, Anirudha and Pacelli, Vincent}, booktitle = {Proc. of Robotics: Science and Systems}, year = {2022}, url = {https://roboticsconference.org/2022/program/papers/036/}, doi = {10.15607/rss.2022.xviii.036} }
Robust Control Under Uncertainty via Bounded Rationality and Differential Privacy

Vincent Pacelli and Anirudha Majumdar

In Proc. Intl. Conf. on Robotics and Automation, 2022

The rapid development of affordable and compact high-fidelity sensors (e.g., cameras and LIDAR) allows robots to construct detailed estimates of their states and environments. However, the availability of such rich sensor information introduces two challenges: (i) the lack of analytic sensing models, which makes it difficult to design controllers that are robust to sensor failures, and (ii) the computational expense of processing the high-dimensional sensor information in real time. This paper addresses these challenges using the theory of differential privacy, which allows us to (i) design controllers with bounded sensitivity to errors in state estimates, and (ii) bound the amount of state information used for control (i.e., to impose decision-making under bounded rationality). The resulting framework approximates the separation principle and allows us to derive an upper-bound on the cost incurred with a faulty state estimator in terms of three quantities: the cost incurred using a perfect state estimator, the magnitude of state estimation errors, and the level of differential privacy. We demonstrate the efficacy of our framework numerically on different robotics problems, including nonlinear system stabilization and motion planning.
@inproceedings{Pacelli22, title = {Robust Control Under Uncertainty via Bounded Rationality and Differential Privacy}, author = {Pacelli, Vincent and Majumdar, Anirudha}, booktitle = {Proc. Intl. Conf. on Robotics and Automation}, pages = {3467--3474}, year = {2022}, organization = {IEEE}, url = {https://ieeexplore.ieee.org/abstract/document/9811557}, doi = {10.1109/icra46639.2022.9811557} }
Invariant Policy Optimization: Towards Stronger Generalization in Reinforcement Learning

Anoopkumar Sonar, Vincent Pacelli, and Anirudha Majumdar

In Proc. Conf. on Learning for Dynamics and Control, 2021

A fundamental challenge in reinforcement learning is to learn policies that generalize beyond the operating domains experienced during training. In this paper, we approach this challenge through the following invariance principle: an agent must find a representation such that there exists an action-predictor built on top of this representation that is simultaneously optimal across all training domains. Intuitively, the resulting invariant policy enhances generalization by finding causes of successful actions. We propose a novel learning algorithm, Invariant Policy Optimization (IPO), that implements this principle and learns an invariant policy during training. We compare our approach with standard policy gradient methods and demonstrate significant improvements in generalization performance on unseen domains for linear quadratic regulator and grid-world problems, and an example where a robot must learn to open doors with varying physical properties.
@inproceedings{Sonar2021, title = {Invariant Policy Optimization: Towards Stronger Generalization in Reinforcement Learning}, author = {Sonar, Anoopkumar and Pacelli, Vincent and Majumdar, Anirudha}, booktitle = {Proc. Conf. on Learning for Dynamics and Control}, pages = {21--33}, year = {2021}, organization = {PMLR}, url = {https://proceedings.mlr.press/v144/sonar21a.html}, doi = {10.48550/arXiv.2006.01096} }
Systems of Stacking Interlocking Blocks

Rahul Mangharam, Matthew Edward O’Kelly, Vincent Scott Pacelli, and 1 more author

Jan 2022
Learning Task-Driven Control Policies via Information Bottlenecks

Vincent Pacelli and Anirudha Majumdar

In Proc. of Robotics: Science and Systems, Jul 2020

This paper presents a reinforcement learning approach to synthesizing task-driven control policies for robotic systems equipped with rich sensory modalities (e.g., vision or depth). Standard reinforcement learning algorithms typically produce policies that tightly couple control actions to the entirety of the system’s state and rich sensor observations. As a consequence, the resulting policies can often be sensitive to changes in task-irrelevant portions of the state or observations (e.g., changing background colors). In contrast, the approach we present here learns to create a task-driven representation that is used to compute control actions. Formally, this is achieved by deriving a policy gradient-style algorithm that creates an information bottleneck between the states and the task-driven representation; this constrains actions to only depend on task-relevant information. We demonstrate our approach in a thorough set of simulation results on multiple examples including a grasping task that utilizes depth images and a ball-catching task that utilizes RGB images. Comparisons with a standard policy gradient approach demonstrate that the task-driven policies produced by our algorithm are often significantly more robust to sensor noise and task-irrelevant changes in the environment.
@inproceedings{Pacelli20, author = {Pacelli, Vincent and Majumdar, Anirudha}, title = {{Learning Task-Driven Control Policies via Information Bottlenecks}}, booktitle = {Proc. of Robotics: Science and Systems}, year = {2020}, address = {Corvallis, Oregon, USA}, month = jul, url = {https://www.roboticsproceedings.org/rss16/p101.html}, doi = {10.15607/rss.2020.xvi.101} }
Task-driven Estimation and Control via Information Bottlenecks

Vincent Pacelli and Anirudha Majumdar

In Proc. Intl. Conf. on Robotics and Automation, 2019

Our goal is to develop a principled and general algorithmic framework for task-driven estimation and control for robotic systems. State-of-the-art approaches for controlling robotic systems typically rely heavily on accurately estimating the full state of the robot (e.g., a running robot might estimate joint angles and velocities, torso state, and position relative to a goal). However, full state representations are often excessively rich for the specific task at hand and can lead to significant computational inefficiency and brittleness to errors in state estimation. In contrast, we present an approach that eschews such rich representations and seeks to create task-driven representations. The key technical insight is to leverage the theory of information bottlenecks to formalize the notion of a “task-driven representation” in terms of information theoretic quantities that measure the minimality of a representation. We propose novel iterative algorithms for automatically synthesizing (offline) a task-driven representation (given in terms of a set of task-relevant variables (TRVs)) and a performant control policy that is a function of the TRVs. We present online algorithms for estimating the TRVs in order to apply the control policy. We demonstrate that our approach results in significant robustness to unmodeled measurement uncertainty both theoretically and via thorough simulation experiments including a spring-loaded inverted pendulum running to a goal location.
@inproceedings{Pacelli19, title = {Task-driven Estimation and Control via Information Bottlenecks}, author = {Pacelli, Vincent and Majumdar, Anirudha}, booktitle = {Proc. Intl. Conf. on Robotics and Automation}, pages = {2061--2067}, year = {2019}, organization = {IEEE}, url = {https://ieeexplore.ieee.org/abstract/document/8794213}, doi = {10.1109/icra.2019.8794213} }
Integration of Local Geometry and Metric Information in Sampling-Based Motion Planning

Vincent Pacelli, Omur Arslan, and Daniel E. Koditschek

In Proc. Intl. Conf. on Robotics and Automation, 2018

The efficiency of sampling-based motion planning algorithms is dependent on how well a steering procedure is capable of capturing both system dynamics and configuration space geometry to connect sample configurations. This paper considers how metrics describing local system dynamics may be combined with convex subsets of the free space to describe the local behavior of a steering function for sampling-based planners. Subsequently, a framework for using these subsets to extend the steering procedure to incorporate this information is introduced. To demonstrate our framework, three specific metrics are considered: the LQR cost-to-go function, a Gram matrix derived from system linearization, and the Mahalanobis distance of a linear-Gaussian system. Finally, numerical tests are conducted for a second-order linear system, a kinematic unicycle, and a linear-Gaussian system to demonstrate that our framework increases the connectivity of sampling-based planners and allows them to better explore the free space.
@inproceedings{Pacelli18, title = {Integration of Local Geometry and Metric Information in Sampling-Based Motion Planning}, author = {Pacelli, Vincent and Arslan, Omur and Koditschek, Daniel E.}, booktitle = {Proc. Intl. Conf. on Robotics and Automation}, pages = {3061--3068}, year = {2018}, organization = {IEEE}, url = {https://ieeexplore.ieee.org/abstract/document/8460739}, doi = {10.1109/icra.2018.8460739} }
Sensory Steering for Sampling-Based Motion Planning

Omur Arslan, Vincent Pacelli, and Daniel E. Koditschek

In Proc. Intl. Conf. on Intelligent Robots and Systems, 2017

Sampling-based algorithms offer computationally efficient, practical solutions to the path finding problem in high-dimensional complex configuration spaces by approximately capturing the connectivity of the underlying space through a (dense) collection of sample configurations joined by simple local planners. In this paper, we address a long-standing bottleneck associated with the difficulty of finding paths through narrow passages. Whereas most prior work considers the narrow passage problem as a sampling issue (and the literature abounds with heuristic sampling strategies) very little attention has been paid to the design of new effective local planners. Here, we propose a novel sensory steering algorithm for sampling-based motion planning that can “feel” a configuration space locally and significantly improve the path planning performance near difficult regions such as narrow passages. We provide computational evidence for the effectiveness of the proposed local planner through a variety of simulations which suggest that our proposed sensory steering algorithm outperforms the standard straight-line planner by significantly increasing the connectivity of random motion planning graphs.
@inproceedings{Arslan17, title = {Sensory Steering for Sampling-Based Motion Planning}, author = {Arslan, Omur and Pacelli, Vincent and Koditschek, Daniel E.}, booktitle = {Proc. Intl. Conf. on Intelligent Robots and Systems}, pages = {3708--3715}, year = {2017}, organization = {IEEE}, url = {https://ieeexplore.ieee.org/abstract/document/8206218}, doi = {10.1109/iros.2017.8206218} }