A Reinforcement Learning Framework for Simultaneous Chemical Process Flowsheet Generation, Design and Control

Reynoso Donzelli, Simone

A Reinforcement Learning Framework for Simultaneous Chemical Process Flowsheet Generation, Design and Control

dc.contributor.advisor	Ricardez Sandoval, Luis Alberto
dc.contributor.author	Reynoso Donzelli, Simone
dc.date.accessioned	2024-12-12T21:08:44Z
dc.date.available	2024-12-12T21:08:44Z
dc.date.issued	2024-12-12
dc.date.submitted	2024-12-06
dc.description.abstract	Integration of process design and control of chemical process flowsheets (CPFs) is a key focus in chemical engineering, receiving extensive research attention. The main objective in this area is to identify optimal process design and control variables for a CPF, ensuring both economic viability and dynamic feasibility of plant operations. This integration presents a complex optimization problem, which is challenging to solve using traditional optimization methods. Additionally, the problem becomes even more intricate when discrete decisions or logical constraints, which give rise to Boolean variables, are considered—common in integrated design and control of CPFs problems. Therefore, the development of new methodologies is needed to effectively address these challenges. The emerging trend in Machine Learning (ML), particularly in Reinforcement Learning (RL), for solving such problems serves as the foundation for this thesis. The limited studies regarding the solution of the integrated problem using RL techniques motivates the exploration and development of novel methodologies. Before addressing the integrated problem, it is important to understand the potential of RL as a tool for solving design and optimization problems of CPFs under steady-state conditions. A RL methodology that introduces two novel RL agents: a discrete masked Proximal Policy Optimization (mPPO) and a hybrid masked Proximal Policy Optimization (mHPPO) has been proposed. In this framework, the agents are capable of autonomously generate, design and optimize CPFs utilizing an inlet flowrate and a set of unit operations (UOs) as initial information. A key feature of this approach is the use of masking – an underexplored yet promising area for solving the present problem – which involves the incorporation of expert knowledge or design rules to exclude certain actions from the agent's decision-making process, enhancing the agent’s performance. Adding to that, this method stands out by seamlessly integrating masked agents with rigorous models of UOs, including advanced thermodynamic and conservation equations, within its simulation environment. The effectiveness of these agents was evaluated through several case studies, including two that utilized commercial simulation suites as part of the RL environment. The resulting CPFs generated by the RL agents present viable flowsheet designs that meet the pre-specified design requirements. Recognizing the potential of RL for designing CPF, this thesis also introduces a novel RL approach for generating, designing, and controlling CPFs. Similar to the previous methodology, the proposed framework generates CPFs directly from an inlet stream, eliminating the need for predefined arrangements of UOs. Furthermore, the framework leverages surrogate models, specifically Neural Networks (NNs), to accelerate the learning process of the RL agent and avoid dependence on mechanistic dynamic models. These surrogate models approximate key process variables and closed-loop performance metrics for complex dynamic UO models. The results obtained using this methodology were compared with model-based optimization results to assess the accuracy and validity of the proposed approach in approximating well-established methodologies. Consistency with the model-based approach was assessed. Additional case studies involved formulations with multiple UOs to further demonstrate the approach’s flexibility to deal with various scenarios. Results from those case studies demonstrate that the RL agent can effectively learn to maintain the dynamic operability of the UOs under disturbances, adhere to equipment design and operational constraints, and generate viable and economically attractive CPFs. The high adaptability offered by the surrogate models enables this methodology to approximate the dynamic behavior of the most common UO. As a result, the proposed framework is sufficiently explicit and flexible to be used in more intricate design and control problems involving multiple UOs.
dc.identifier.uri	https://hdl.handle.net/10012/21246
dc.language.iso	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	Reinforcement Learning
dc.subject	Process system engineering
dc.subject	Machine Learning
dc.subject	Flowsheet generation
dc.subject	Flowsheet design
dc.subject	Flowsheet control
dc.title	A Reinforcement Learning Framework for Simultaneous Chemical Process Flowsheet Generation, Design and Control
dc.type	Master Thesis
uws-etd.degree	Master of Applied Science
uws-etd.degree.department	Chemical Engineering
uws-etd.degree.discipline	Chemical Engineering
uws-etd.degree.grantor	University of Waterloo	en
uws-etd.embargo.terms	0
uws.contributor.advisor	Ricardez Sandoval, Luis Alberto
uws.contributor.affiliation1	Faculty of Engineering
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: ReynosoDonzelli_Simone.pdf
Size:: 5.07 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.4 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Chemical Engineering