Advances in Consumer Research
Issue 3 : 1150-1156
Original Article
Reinforcement Learning for Dynamic Portfolio Optimization in Financial Markets
 ,
 ,
 ,
 ,
 ,
1
Associate Professor, School of Business and Management, Christ University, Bangalore Bannerghatta Road Campus, Bannerghatta Main Road, Hulimavu, Bangalore, Karnataka 560076, India.
2
Associate Professor, Department of Zoology, K. K. Shah Jarodwala Maninagar Science College, BJLT Campus, Rambaug, Maninagar, Ahmedabad, Gujarat-380008, India.
3
Research Scholar, Faculty of Management and Commerce, K.M.(KRISHNA MOHAN) UNIVERSITY, Mathura, Pali Dungra Sonkh Road Govardhan, Uttar Pradesh-281123, India.
4
Assistant Professor, Department of Management, Global Business School and Research Centre, Dr. D. Y. Patil Vidyapeeth (Deemed to be University) Pimpri, Pune-411033, India.
5
Assistant Professor of Finance, Dept of Finance & Economics, Dhofar University, Oman.
6
Assistant Professor of Finance, Indira School of Business Studies, India.
Abstract

This research introduces a dynamic portfolio optimization framework based on the Proximal Policy Optimization (PPO) reinforcement learning algorithm that is known to be stable and perform optimally in continuous decision making. The proposed approach seeks to maximize long term returns in the portfolio while taking care of risks and transaction expenses in a volatile financial market. Utilizing the open source framework FinRL, the framework incorporates historical market data, technical indicators, and transaction cost constraints into a Markov Decision Process (MDP). There are rolling window features of asset returns and portfolio allocations in the state space, whereas the action space in determining optimal weight distributions in several assets. The aim is to represent the risk adjusted return of the portfolio by the reward function. PPO’s concisely defined objective and entropy regularization induces optimal efficient policy updates and exploration exploitation behavior. The experimental results demonstrate that the model has superior cumulative return and Sharpe ratio vs. traditional benchmarks and, therefore, have white paper potential in actual, AI-driven investment strategy in a trading environment.

Keywords
Recommended Articles
Research Article
Employees’ Perceptions of Job Evaluation Practices: Evidence from the Textile Industry in Uttar Pradesh
Published: 30/09/2025
Research Article
Publishing Of Reports Via Camunda Workflow Orchestration for A Financial Institute
Published: 30/09/2025
Research Article
E-Commerce vs. Traditional Retail: A Data-Driven Comparison of Profitability and Sustainability
Published: 30/09/2025
Research Article
Strategic Patient-Centric Brand Management in Pharma: Transforming Value Creation through VRIO Analysis
Published: 30/08/2025
Loading Image...
Volume 2, Issue 3
Citations
242 Views
62 Downloads
Share this article
© Copyright Advances in Consumer Research