PONTRYAGIN-GUIDED DIRECT POLICY OPTIMIZATION FOR CONTINUOUS-TIME PORTFOLIO PROBLEM
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

. We present Pontryagin-Guided Direct Policy Optimization (PGDPO), a framework for solving continuous-time portfolio optimization problems involving both consumption and investment decisions. Integrating Pontryagin's Maximum Principle (PMP) within a neural network pipeline, PGDPO bypasses traditional value function approximation and directly optimizes policy parameters using adjoint processes associated with the current policy, computed via automatic differentiation. An optional alignment penalty, explicitly derived from PMP conditions, significantly accelerates convergence and improves policy stability during training. Numerical experiments validate the framework's efficacy: PG-DPO accurately recovers the closed-form solution for the classical Merton problem and, crucially, demonstrates its capability to handle more realistic, state-dependent dynamics involving stochastic factors, effectively capturing intertemporal hedging demands. These results highlight that the PMP-guided deep learning approach offers an effective and potentially efficient pathway for direct policy optimization in complex continuous-time stochastic control settings within finance.

키워드

Merton portfolio problemConsumption-Investment ProblemPon-tryagin's Maximum PrincipleDirect Policy OptimizationStochastic ControlNeural NetworkDIFFERENTIAL-EQUATIONSCONSUMPTION
제목
PONTRYAGIN-GUIDED DIRECT POLICY OPTIMIZATION FOR CONTINUOUS-TIME PORTFOLIO PROBLEM
저자
Huh, JeonggyuJeong, SeungwonJeon, Jaegi
DOI
10.3934/jimo.2025110
발행일
2025-09
유형
Article; Early Access
저널명
Journal of Industrial and Management Optimization
21
9
페이지
5687 ~ 5715