Publications
- Feedback Schrödinger Bridge MatchingPanagiotis Theodoropoulos, Nikolaos Komianos, Vincent Pacelli, and 2 more authorsIn Proc. Intl. Conf. on Learning Representations, 2025
Recent advancements in diffusion bridges for distribution transport problems have heavily relied on matching frameworks, yet existing methods often face a trade-off between scalability and access to optimal pairings during training. Fully unsupervised methods make minimal assumptions but incur high computational costs, limiting their practicality. On the other hand, imposing full supervision of the matching process with optimal pairings improves scalability, however, it can be infeasible in most applications. To strike a balance between scalability and minimal supervision, we introduce Feedback Schrödinger Bridge Matching (FSBM), a novel semi-supervised matching framework that incorporates a small portion (% of the entire dataset) of pre-aligned pairs as state feedback to guide the transport map of non-coupled samples, thereby significantly improving efficiency. This is achieved by formulating a static Entropic Optimal Transport (EOT) problem with an additional term capturing the semi-supervised guidance. The generalized EOT objective is then recast into a dynamic formulation to leverage the scalability of matching frameworks. Extensive experiments demonstrate that FSBM accelerates training and enhances generalization by leveraging coupled pairs’ guidance, opening new avenues for training matching frameworks with partially aligned datasets.
@inproceedings{Theodoropoulos25, title = {Feedback Schr{\"o}dinger Bridge Matching}, author = {Theodoropoulos, Panagiotis and Komianos, Nikolaos and Pacelli, Vincent and Liu, Guan-Horng and Theodorou, Evangelos A.}, booktitle = {Proc. Intl. Conf. on Learning Representations}, year = {2025}, url = {https://openreview.net/forum?id=k3tbMMW8rH}, doi = {10.48550/arXiv.2410.14055} }
- Deep Distributed Optimization for Large-Scale Quadratic ProgrammingAugustinos D. Saravanos, Hunter Kuperman, Alex Oshin, and 3 more authorsIn Proc. Intl. Conf. on Learning Representations, 2025
Quadratic programming (QP) forms a crucial foundation in optimization, encompassing a broad spectrum of domains and serving as the basis for more advanced algorithms. Consequently, as the scale and complexity of modern applications continue to grow, the development of efficient and reliable QP algorithms is becoming increasingly vital. In this context, this paper introduces a novel deep learning-aided distributed optimization architecture designed for tackling large-scale QP problems. First, we combine the state-of-the-art Operator Splitting QP (OSQP) method with a consensus approach to derive DistributedQP, a new method tailored for network-structured problems, with convergence guarantees to optimality. Subsequently, we unfold this optimizer into a deep learning framework, leading to DeepDistributedQP, which leverages learned policies to accelerate reaching to desired accuracy within a restricted amount of iterations. Our approach is also theoretically grounded through Probably Approximately Correct (PAC)-Bayes theory, providing generalization bounds on the expected optimality gap for unseen problems. The proposed framework, as well as its centralized version DeepQP, significantly outperform their standard optimization counterparts on a variety of tasks such as randomly generated problems, optimal control, linear regression, transportation networks and others. Notably, DeepDistributedQP demonstrates strong generalization by training on small problems and scaling to solve much larger ones (up to 50K variables and 150K constraints) using the same policy. Moreover, it achieves orders-of-magnitude improvements in wall-clock time compared to OSQP. The certifiable performance guarantees of our approach are also demonstrated, ensuring higher-quality solutions over traditional optimizers.
@inproceedings{Saravanos25, title = {Deep Distributed Optimization for Large-Scale Quadratic Programming}, author = {Saravanos, Augustinos D. and Kuperman, Hunter and Oshin, Alex and Abdul, Arshiya Taj and Pacelli, Vincent and Theodorou, Evangelos}, booktitle = {Proc. Intl. Conf. on Learning Representations}, year = {2025}, url = {https://openreview.net/forum?id=hzuumhfYSO}, doi = {10.48550/arXiv.2412.12156} }
- Operator Splitting Covariance Steering for Safe Stochastic Nonlinear ControlAkash Ratheesh, Vincent Pacelli, Augustinos D. Saravanos, and 1 more authorarXiv preprint arXiv:2411.11211, 2025
Most robotics applications are typically accompanied with safety restrictions that need to be satisfied with a high degree of confidence even in environments under uncertainty. Controlling the state distribution of a system and enforcing such specifications as distribution constraints is a promising approach for meeting such requirements. In this direction, covariance steering (CS) is an increasingly popular stochastic optimal control (SOC) framework for designing safe controllers via explicit constraints on the system covariance. Nevertheless, a major challenge in applying CS methods to systems with the nonlinear dynamics and chance constraints common in robotics is that the approximations needed are conservative and highly sensitive to the point of approximation. This can cause sequential convex programming methods to converge to poor local minima or incorrectly report problems as infeasible due to shifting constraints. This paper presents a novel algorithm for solving chance-constrained nonlinear CS problems that directly addresses this challenge. Specifically, we propose an operator-splitting approach that temporarily separates the main problem into subproblems that can be solved in parallel. The benefit of this relaxation lies in the fact that it does not require all iterates to satisfy all constraints simultaneously prior to convergence, thus enhancing the exploration capabilities of the algorithm for finding better solutions. Simulation results verify the ability of the proposed method to find higher quality solutions under stricter safety constraints than standard methods on a variety of robotic systems. Finally, the applicability of the algorithm on real systems is confirmed through hardware demonstrations.
@article{Ratheesh25, title = {Operator Splitting Covariance Steering for Safe Stochastic Nonlinear Control}, author = {Ratheesh, Akash and Pacelli, Vincent and Saravanos, Augustinos D. and Theodorou, Evangelos A.}, journal = {arXiv preprint arXiv:2411.11211}, year = {2025}, doi = {10.48550/arXiv.2411.11211} }
- Fundamental Limits for Sensor-Based Robot ControlAnirudha Majumdar, Zhiting Mei, and Vincent PacelliIntl. J. of Robotics Research, 2023
Our goal is to develop theory and algorithms for establishing fundamental limits on performance imposed by a robot’s sensors for a given task. In order to achieve this, we define a quantity that captures the amount of task-relevant information provided by a sensor. Using a novel version of the generalized Fano’s inequality from information theory, we demonstrate that this quantity provides an upper bound on the highest achievable expected reward for one-step decision-making tasks. We then extend this bound to multi-step problems via a dynamic programming approach. We present algorithms for numerically computing the resulting bounds, and demonstrate our approach on three examples: (i) the lava problem from the literature on partially observable Markov decision processes, (ii) an example with continuous state and observation spaces corresponding to a robot catching a freely-falling object, and (iii) obstacle avoidance using a depth sensor with non-Gaussian noise. We demonstrate the ability of our approach to establish strong limits on achievable performance for these problems by comparing our upper bounds with achievable lower bounds (computed by synthesizing or learning concrete control policies).
@article{Majumdar23, title = {Fundamental Limits for Sensor-Based Robot Control}, author = {Majumdar, Anirudha and Mei, Zhiting and Pacelli, Vincent}, journal = {Intl. J. of Robotics Research}, volume = {42}, number = {12}, pages = {1051--1069}, year = {2023}, publisher = {SAGE}, url = {https://journals.sagepub.com/doi/full/10.1177/02783649231190947}, doi = {https://doi.org/10.1177/02783649231190} }
- Fundamental Performance Limits for Sensor-Based Robot Control and Policy LearningAnirudha Majumdar and Vincent PacelliIn Proc. of Robotics: Science and Systems, 2022
Our goal is to develop theory and algorithms for establishing fundamental limits on performance for a given task imposed by a robot’s sensors. In order to achieve this, we define a quantity that captures the amount of task-relevant information provided by a sensor. Using a novel version of the generalized Fano inequality from information theory, we demonstrate that this quantity provides an upper bound on the highest achievable expected reward for one-step decision making tasks. We then extend this bound to multi-step problems via a dynamic programming approach. We present algorithms for numerically computing the resulting bounds, and demonstrate our approach on three examples: (i) the lava problem from the literature on partially observable Markov decision processes, (ii) an example with continuous state and observation spaces corresponding to a robot catching a freely-falling object, and (iii) obstacle avoidance using a depth sensor with non-Gaussian noise. We demonstrate the ability of our approach to establish strong limits on achievable performance for these problems by comparing our upper bounds with achievable lower bounds (computed by synthesizing or learning concrete control policies).
@inproceedings{Majumdar22, title = {Fundamental Performance Limits for Sensor-Based Robot Control and Policy Learning}, author = {Majumdar, Anirudha and Pacelli, Vincent}, booktitle = {Proc. of Robotics: Science and Systems}, year = {2022}, url = {https://roboticsconference.org/2022/program/papers/036/}, doi = {https://doi.org/10.15607/rss.2022.xviii.036} }
- Robust Control Under Uncertainty via Bounded Rationality and Differential PrivacyVincent Pacelli and Anirudha MajumdarIn Proc. Intl. Conf on Robotics and Automation, 2022
The rapid development of affordable and compact high-fidelity sensors (e.g., cameras and LIDAR) allows robots to construct detailed estimates of their states and environments. However, the availability of such rich sensor information introduces two challenges: (i) the lack of analytic sensing models, which makes it difficult to design controllers that are robust to sensor failures, and (ii) the computational expense of processing the high-dimensional sensor information in real time. This paper addresses these challenges using the theory of differential privacy, which allows us to (i) design controllers with bounded sensitivity to errors in state estimates, and (ii) bound the amount of state information used for control (i.e., to impose decision-making under bounded rationality). The resulting framework approximates the separation principle and allows us to derive an upper-bound on the cost incurred with a faulty state estimator in terms of three quantities: the cost incurred using a perfect state estimator, the magnitude of state estimation errors, and the level of differential privacy. We demonstrate the efficacy of our framework numerically on different robotics problems, including nonlinear system stabilization and motion planning.
@inproceedings{Pacelli22, title = {Robust Control Under Uncertainty via Bounded Rationality and Differential Privacy}, author = {Pacelli, Vincent and Majumdar, Anirudha}, booktitle = {Proc. Intl. Conf on Robotics and Automation}, pages = {3467--3474}, year = {2022}, organization = {IEEE}, url = {https://ieeexplore.ieee.org/abstract/document/9811557}, doi = {10.1109/icra46639.2022.9811557} }
- Invariant Policy Optimization: Towards Stronger Generalization in Reinforcement LearningAnoopkumar Sonar, Vincent Pacelli, and Anirudha MajumdarIn Proc. Conf. on Learning for Dynamics and Control, 2021
A fundamental challenge in reinforcement learning is to learn policies that generalize beyond the operating domains experienced during training. In this paper, we approach this challenge through the following invariance principle: an agent must find a representation such that there exists an action-predictor built on top of this representation that is simultaneously optimal across all training domains. Intuitively, the resulting invariant policy enhances generalization by finding causes of successful actions. We propose a novel learning algorithm, Invariant Policy Optimization (IPO), that implements this principle and learns an invariant policy during training. We compare our approach with standard policy gradient methods and demonstrate significant improvements in generalization performance on unseen domains for linear quadratic regulator and grid-world problems, and an example where a robot must learn to open doors with varying physical properties.
@inproceedings{Sonar2021, title = {Invariant Policy Optimization: Towards Stronger Generalization in Reinforcement Learning}, author = {Sonar, Anoopkumar and Pacelli, Vincent and Majumdar, Anirudha}, booktitle = {Proc. Conf. on Learning for Dynamics and Control}, pages = {21--33}, year = {2021}, organization = {PMLR}, url = {https://proceedings.mlr.press/v144/sonar21a.html}, doi = {10.48550/arXiv.2006.01096} }
- Systems of Stacking Interlocking BlocksRahul Mangharam, Matthew Edward O’Kelly, Vincent Scott Pacelli, and 1 more authorJan 2022
- Learning Task-Driven Control Policies via Information BottlenecksVincent Pacelli and Anirudha MajumdarIn Proc. of Robotics: Science and Systems, Jul 2020
This paper presents a reinforcement learning approach to synthesizing task-driven control policies for robotic systems equipped with rich sensory modalities (e.g., vision or depth). Standard reinforcement learning algorithms typically produce policies that tightly couple control actions to the entirety of the system’s state and rich sensor observations. As a consequence, the resulting policies can often be sensitive to changes in task-irrelevant portions of the state or observations (e.g., changing background colors). In contrast, the approach we present here learns to create a task-driven representation that is used to compute control actions. Formally, this is achieved by deriving a policy gradient-style algorithm that creates an information bottleneck between the states and the task-driven representation; this constrains actions to only depend on task-relevant information. We demonstrate our approach in a thorough set of simulation results on multiple examples including a grasping task that utilizes depth images and a ball-catching task that utilizes RGB images. Comparisons with a standard policy gradient approach demonstrate that the task-driven policies produced by our algorithm are often significantly more robust to sensor noise and task-irrelevant changes in the environment.
@inproceedings{Pacelli20, author = {Pacelli, Vincent and Majumdar, Anirudha}, title = {{Learning Task-Driven Control Policies via Information Bottlenecks}}, booktitle = {Proc. of Robotics: Science and Systems}, year = {2020}, address = {Corvalis, Oregon, USA}, month = jul, url = {https://www.roboticsproceedings.org/rss16/p101.html}, doi = {10.15607/rss.2020.xvi.101} }
- Integration of Local Geometry and Metric Information in Sampling-Based Motion PlanningVincent Pacelli, Omur Arslan, and Daniel E. KoditschekIn Proc. Intl. Conf on Robotics and Automation, Jul 2018
The efficiency of sampling-based motion planning algorithms is dependent on how well a steering procedure is capable of capturing both system dynamics and configuration space geometry to connect sample configurations. This paper considers how metrics describing local system dynamics may be combined with convex subsets of the free space to describe the local behavior of a steering function for sampling-based planners. Subsequently, a framework for using these subsets to extend the steering procedure to incorporate this information is introduced. To demonstrate our framework, three specific metrics are considered: the LQR cost-to-go function, a Gram matrix derived from system linearization, and the Mahalanobis distance of a linear-Gaussian system. Finally, numerical tests are conducted for a second-order linear system, a kinematic unicycle, and a linear-Gaussian system to demonstrate that our framework increases the connectivity of sampling-based planners and allows them to better explore the free space.
@inproceedings{Pacelli18, title = {Integration of Local Geometry and Metric Information in Sampling-Based Motion Planning}, author = {Pacelli, Vincent and Arslan, Omur and Koditschek, Daniel E.}, booktitle = {Proc. Intl. Conf on Robotics and Automation}, pages = {3061--3068}, year = {2018}, organization = {IEEE}, url = {https://ieeexplore.ieee.org/abstract/document/8460739}, doi = {10.1109/icra.2018.8460739} }
- Sensory Steering for Sampling-Based Motion PlanningOmur Arslan, Vincent Pacelli, and Daniel E. KoditschekIn Proc. Intl. Conf. on Intelligent Robots and Systems, Jul 2017
Sampling-based algorithms offer computationally efficient, practical solutions to the path finding problem in high-dimensional complex configuration spaces by approximately capturing the connectivity of the underlying space through a (dense) collection of sample configurations joined by simple local planners. In this paper, we address a long-standing bottleneck associated with the difficulty of finding paths through narrow passages. Whereas most prior work considers the narrow passage problem as a sampling issue (and the literature abounds with heuristic sampling strategies) very little attention has been paid to the design of new effective local planners. Here, we propose a novel sensory steering algorithm for sampling-based motion planning that can “feel” a configuration space locally and significantly improve the path planning performance near difficult regions such as narrow passages. We provide computational evidence for the effectiveness of the proposed local planner through a variety of simulations which suggest that our proposed sensory steering algorithm outperforms the standard straight-line planner by significantly increasing the connectivity of random motion planning graphs.
@inproceedings{Arslan17, title = {Sensory Steering for Sampling-Based Motion Planning}, author = {Arslan, Omur and Pacelli, Vincent and Koditschek, Daniel E.}, booktitle = {Proc. Intl. Conf. on Intelligent Robots and Systems}, pages = {3708--3715}, year = {2017}, organization = {IEEE}, url = {https://ieeexplore.ieee.org/abstract/document/8206218}, doi = {10.1109/iros.2017.8206218} }