In this paper, we develop a Markov decision process (MDP) formulation for the low--temperature metastable Ising model evolving according to Kawasaki dynamics in a finite box of the two--dimensional square lattice. We analyze how an external controller can guide the system to the all--occupied state by appropriately adding and moving particles at specified moments in time. To this end, we construct a reduced MDP on a constrained family of configurations having a single cluster, a regime where particle attachment is more likely than detachment. We investigate two reward structures: one that depends solely on the time to reach the target configuration, and another that incorporates action--dependent energy costs. Within this MDP framework, we characterize the exact optimal policies under both reward structures, which turn out to have a different behavior: while a purely efficiency--based criterion promotes the growth from the boundary centers of the cluster, an energy--based reward function favours the growth at the corners of the cluster.

Optimal strategies for controlled growth in metastable Kawasaki dynamics

Simone Baldassarri
;
2026-01-01

Abstract

In this paper, we develop a Markov decision process (MDP) formulation for the low--temperature metastable Ising model evolving according to Kawasaki dynamics in a finite box of the two--dimensional square lattice. We analyze how an external controller can guide the system to the all--occupied state by appropriately adding and moving particles at specified moments in time. To this end, we construct a reduced MDP on a constrained family of configurations having a single cluster, a regime where particle attachment is more likely than detachment. We investigate two reward structures: one that depends solely on the time to reach the target configuration, and another that incorporates action--dependent energy costs. Within this MDP framework, we characterize the exact optimal policies under both reward structures, which turn out to have a different behavior: while a purely efficiency--based criterion promotes the growth from the boundary centers of the cluster, an energy--based reward function favours the growth at the corners of the cluster.
2026
Mathematics - Optimization and Control
Mathematics - Optimization and Control
Mathematical Physics
Mathematics - Probability
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12571/38484
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact