Robust Reinforcement Learning for Linear Temporal Logic Specifications with Finite Trajectory Duration

MortazaviMoghaddam, SeyyedSoroush

Robust Reinforcement Learning for Linear Temporal Logic Specifications with Finite Trajectory Duration

dc.contributor.author	MortazaviMoghaddam, SeyyedSoroush
dc.date.accessioned	2024-09-23T18:59:13Z
dc.date.available	2024-09-23T18:59:13Z
dc.date.issued	2024-09-23
dc.date.submitted	2024-09-18
dc.description.abstract	Linear Temporal Logic (LTL) is a formal behavioral specification language that offers a mathematically unambiguous and succinct way to represent operating requirements for a wide variety of systems, including autonomous and robotic systems. Traditional methods in this domain rely on model-checking approaches to ensure that a devised policy adheres to the provided specification. However, these methods are limited in the scope of problems they can solve and often lack generalizability to novel specifications and environments. Despite progress in synthesizing satisfying policies for LTL specifications under different operating conditions, learning policies that reliably satisfy complex LTL specifications in challenging environments remains an open problem. With the emergence of Machine Learning (ML) approaches, researchers have explored the use of ML-based techniques with LTL policy synthesis. Among the various approaches investigated, Reinforcement Learning (RL) has garnered particular attention for this task. While LTL specifications are evaluated over infinite-length trajectories, this work focuses on satisfying a class of specifications within a finite number of steps, as is to be expected in most real-world applications involving robotic or autonomous systems where the run-time of the robot is limited before it needs to recharge itself, e.g., a robot vacuum which has to perform certain cleaning tasks before recharge. Therefore, in this work, an RL-based technique is developed for the problem of generating trajectories of a system that satisfy a given LTLf specification in a system with finite (discrete) states and actions and a priori unknown transition probabilities modeled as a Markov Decision Process (MDP). The proposed approach builds upon the popular AlphaGo Zero Reinforcement Learning (RL) framework, which has found great success in the two-player game of Go, to learn policies that can satisfy an LTLf specification given a limit on the trajectory duration. In this thesis, first the motivation and the necessary background on the problem are provided, followed by a brief overview of existing methods. Then the problem statement is introduced, the proposed methodology and its variants are presented, and extensive simulations of complex robot motion planning problems are conducted and their results are explained. These simulations demonstrate how the approach achieves higher success rates under time constraints compared to state-of-the-art methods. The thesis concludes with a section discussing potential directions for future work and examining the results and their implications for the work completed.
dc.identifier.uri	https://hdl.handle.net/10012/21061
dc.language.iso	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.relation.uri	https://github.com/CL2-UWaterloo/RL-LTL
dc.title	Robust Reinforcement Learning for Linear Temporal Logic Specifications with Finite Trajectory Duration
dc.type	Master Thesis
uws-etd.degree	Master of Applied Science
uws-etd.degree.department	Electrical and Computer Engineering
uws-etd.degree.discipline	Electrical and Computer Engineering
uws-etd.degree.grantor	University of Waterloo	en
uws-etd.embargo.terms	0
uws.contributor.advisor	Vardhan Pant, Yash
uws.contributor.advisor	Fischmeister, Sebastian
uws.contributor.affiliation1	Faculty of Engineering
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Mortazavimoghaddam_Seyyedsoroush.pdf
Size:: 3.74 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.4 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Electrical and Computer Engineering