Lifelong Learning (to Plan) by Abstraction
Overview
This project opens the door to an exciting, novel direction in robotics research. Its goal is to unlock a human-like capability in long-lived robots, allowing them to autonomously and improve their problem-solving capabilities throughout their lifetime. To achieve this, we developed a new paradigm for robot learning, called Lifelong Learning by Abstraction (LLbA). This paradigm does not involve statistical learning nor training of policies by trial and error before deployment. Instead, here, learning is done after deployment, by gradually, from one problem to the next, enriching the planner with useful, generalized conclusions, automatically extracted from individual successful experiences; these can later be dynamically matched to, adapted for, and reused in new planning problems, to guide and accelerate their solution. Thus far, we applied this paradigm in the context of geometric path-finding, symbolic task (“AI”) planning (commonly defined in PDDL), and multi-manipulator object rearrangement. Work-in-progress seeks to apply this to various additional contexts, including motion planning for high degree-of-freedom robot arms, and Multi-Agent Path Finding (MAPF).
Unpacking
With a series of publications, we built the theoretical and computational framework underpinning this paradigm. The first paper (Elimelech et al., 2022) introduced the theoretical basis for transfer by abstraction and showed that solutions of successful planning experiences can be abstracted into reusable “planning strategies,” encoded as “Abstract Road Maps (ARMs)”. The second paper (Elimelech et al., 2022) took this into a practical direction and formulated basic algorithms for strategy-accelerated planning, given a library of such planning strategies (encoded as ARMs). The third paper (Elimelech et al., 2023), complementary to the second one, presented practical algorithms for automatically abstracting solution paths into ARMs, i.e., building the planning-strategy library; this importantly introduced the idea of path segmentation through Abstraction Critical State Detection (ACSD). The second and third papers were presented in the context of geometric path-finding problems. The fourth paper (Elimelech et al., 2024), acted as direct continuation of the second one; where the earlier work only leveraged a single strategy from the library in each new solution (meaning, learning to solve similar problems), the strategy-accelerated planning algorithm presented here allowed for chaining multiple strategies, in order to find the complete solution (meaning, learning to solve harder problems, by combining experiences). This paper was presented in the context of task planning, in a “Blocksworld” PDDL domain.
New work (Elimelech et al., 2024) suggested this paradigm applies naturally to multi-manipulator object rearrangement problems, by integrating with the multi-robot planning framework DaSH (for hypergraph-based multi-robot planning).
Note. This paper series is currently being compiled into a single comprehensive publication. A preprint is expected to be released soon.
Note. Thanks to feedback from the community, some of the terminology has been revised throughout the lifetime of this project, in order to avoid confusion. Earlier publications used the term “abstract skills” to refer to the learned objects; this was later revised to “abstract strategies.” Further, the usage of the terms “public/private abstraction key” was dropped; current work simply use the term “abstraction key.”