A Reinforcement Learning Approach for Adaptive Budget Allocation in Pharmaceutical Digital Marketing: Maximizing ROI Across Patient Journey Touchpoints

Main Article Content

Zhenghao Pan

Abstract

This paper introduces a novel reinforcement learning framework for dynamic budget allocation in pharmaceutical digital advertising, addressing critical challenges in optimizing marketing resources across patient journey touchpoints. Traditional budget allocation methods in pharmaceutical marketing fail to adapt to complex, multi-channel environments and regulatory constraints, resulting in suboptimal ROI. We propose a comprehensive reinforcement learning approach that models budget allocation as a sequential decision-making problem, with a state space encompassing channel performance metrics, audience characteristics, and regulatory parameters. The framework incorporates a multi-objective reward function balancing immediate conversion metrics with long-term value generation while maintaining compliance requirements. Experimental validation using 24 months of real-world pharmaceutical marketing data across five therapeutic areas demonstrates significant performance improvements over conventional methodologies. The reinforcement learning framework achieved an average ROI increase of 42.3% compared to baseline methods, with particularly strong performance in rare disease categories (69.9% improvement). The system demonstrates effective learning convergence across diverse therapeutic contexts while maintaining regulatory compliance. This research provides both theoretical contributions to AI applications in healthcare marketing and practical implementation strategies for pharmaceutical companies seeking to optimize digital advertising investments across increasingly complex patient journey touchpoints.

Article Details

Section

Articles

How to Cite

A Reinforcement Learning Approach for Adaptive Budget Allocation in Pharmaceutical Digital Marketing: Maximizing ROI Across Patient Journey Touchpoints. (2025). Journal of Sustainability, Policy, and Practice, 1(4), 1-15. https://schoalrx.com/index.php/jspp/article/view/51

References

1. S. Zhang, Z. Feng, and B. Dong, "LAMDA: Low-latency anomaly detection architecture for real-time cross-market financial decision support," Academia Nexus Journal, vol. 3, no. 2, 2024.

2. Z. Wang, X. Wang, and H. Wang, "Temporal graph neural networks for money laundering detection in cross-border transactions," Academia Nexus Journal, vol. 3, no. 2, 2024.

3. A. Kang, J. Xin, and X. Ma, "Anomalous cross-border capital flow patterns and their implications for national economic security: An empirical analysis," Journal of Advanced Computing Systems, vol. 4, no. 5, pp. 42-54, 2024. doi: 10.69987/jacs.2024.40504

4. J. Liang, C. Zhu, and Q. Zheng, "Developing evaluation metrics for cross-lingual LLM-based detection of subtle sentiment manipulation in online financial content," Journal of Advanced Computing Systems, vol. 3, no. 9, pp. 24-38, 2023. doi: 10.69987/jacs.2023.30903

5. Z. Wang, and J. Liang, "Comparative analysis of interpretability techniques for feature importance in credit risk assessment," Spectrum of Research, vol. 4, no. 2, 2024.

6. B. Dong, and Z. Zhang, "AI-driven framework for compliance risk assessment in cross-border payments: Multi-jurisdictional challenges and response strategies," Spectrum of Research, vol. 4, no. 2, 2024.

7. J. Wang, L. Guo, and K. Qian, "LSTM-based heart rate dynamics prediction during aerobic exercise for elderly adults," 2025. doi: 10.20944/preprints 202504.1692.v1

8. D. Ma, M. Shu, and H. Zhang, "Feature selection optimization for employee retention prediction: A machine learning approach for human resource management," 2025. doi: 10.20944/preprints 202504.1549.v1

9. M. Li, D. Ma, and Y. Zhang, "Improving database anomaly detection efficiency through sample difficulty estimation," 2025. doi: 10.20944/preprints 202504.1527.v1

10. K. Yu, Y. Chen, T. K. Trinh, and W. Bi, "Real-time detection of anomalous trading patterns in financial markets using generative adversarial networks," 2025. doi: 10.54254/2755-2721/2025.22016

11. X. Xiao, H. Chen, Y. Zhang, W. Ren, J. Xu, and J. Zhang, "Anomalous payment behavior detection and risk prediction for SMEs based on LSTM-attention mechanism," Academic Journal of Sociology and Management, vol. 3, no. 2, pp. 43-51, 2025. doi: 10.70393/616a736d.323733

12. X. Xiao, Y. Zhang, H. Chen, W. Ren, J. Zhang, and J. Xu, "A differential privacy-based mechanism for preventing data leakage in large language model training," Academic Journal of Sociology and Management, vol. 3, no. 2, pp. 33-42, 2025. doi: 10.70393/616a736d.323732

13. J. Zhang, X. Xiao, W. Ren, and Y. Zhang, "Privacy-preserving feature extraction for medical images based on fully homomorphic encryption," Journal of Advanced Computing Systems, vol. 4, no. 2, pp. 15-28, 2024.

14. W. Ren, X. Xiao, J. Xu, H. Chen, Y. Zhang, and J. Zhang, "Trojan virus detection and classification based on graph convolutional neural network algorithm," Journal of Industrial Engineering and Applied Science, vol. 3, no. 2, pp. 1-5, 2025. doi: 10.70393/6a69656173.323735

15. S. Ji, Y. Liang, X. Xiao, J. Li, and Q. Tian, "An attitude-adaptation negotiation strategy in electronic market environments," In Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), July, 2007, pp. 125-130. doi: 10.1109/snpd.2007.26

16. X. Xiao, Y. Zhang, J. Xu, W. Ren, and J. Zhang, "Assessment methods and protection strategies for data leakage risks in large language models," Journal of Industrial Engineering and Applied Science, vol. 3, no. 2, pp. 6-15, 2025. doi: 10.70393/6a69656173.323736

17. X. Liu, Z. Chen, K. Hua, M. Liu, and J. Zhang, "An adaptive multimedia signal transmission strategy in cloud-assisted vehicular networks," In 2017 IEEE 5th International Conference on Future Internet of Things and Cloud (FiCloud), August, 2017, pp. 220-226. doi: 10.1109/ficloud.2017.42

18. S. Michael, E. Sohrabi, M. Zhang, S. Baral, K. Smalenberger, A. Lan, and N. Heffernan, "Automatic short answer grading in college mathematics using in-context meta-learning: An evaluation of the transferability of findings," In International Conference on Artificial Intelligence in Education, July, 2024, pp. 409-417. doi: 10.1007/978-3-031-64315-6_38

19. H. McNichols, M. Zhang, and A. Lan, "Algebra error classification with large language models," In International Conference on Artificial Intelligence in Education, June, 2023, pp. 365-376. doi: 10.1007/978-3-031-36272-9_30

20. M. Zhang, Z. Wang, Z. Yang, W. Feng, and A. Lan, "Interpretable math word problem solution generation via step-by-step planning," arXiv preprint arXiv:2306.00784, 2023. doi: 10.18653/v1/2023.acl-long.379

21. M. Zhang, Z. Wang, R. Baraniuk, and A. Lan, "Math operation embeddings for open-ended solution analysis and feedback," arXiv preprint arXiv:2104.12047, 2021.

22. S. Jordan, Y. Chandak, D. Cohen, M. Zhang, and P. Thomas, "Evaluating the performance of reinforcement learning algorithms," In International Conference on Machine Learning, November, 2020, pp. 4962-4973.

23. D. Qi, J. Arfin, M. Zhang, T. Mathew, R. Pless, and B. Juba, "Anomaly explanation using metadata," In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), March, 2018, pp. 1916-1924. doi: 10.1109/wacv.2018.00212

24. M. Zhang, T. Mathew, and B. Juba, "An improved algorithm for learning to perform exception-tolerant abduction," In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 31, No. 1)., February, 2017. doi: 10.1609/aaai. V 31i1.10700

25. J. Jin, T. Zhu, and C. Li, "Graph Neural Network-Based Prediction Framework for Protein-Ligand Binding Affinity: A Case Study on Pediatric Gastrointestinal Disease Targets," Journal of Medicine and Life Sciences, vol. 1, no. 3, pp. 136–142, 2025.