A Reinforcement Learning Approach for Adaptive Budget Allocation in Pharmaceutical Digital Marketing: Maximizing ROI Across Patient Journey Touchpoints
Main Article Content
Abstract
This paper introduces a novel reinforcement learning framework for dynamic budget allocation in pharmaceutical digital advertising, addressing critical challenges in optimizing marketing resources across patient journey touchpoints. Traditional budget allocation methods in pharmaceutical marketing fail to adapt to complex, multi-channel environments and regulatory constraints, resulting in suboptimal ROI. We propose a comprehensive reinforcement learning approach that models budget allocation as a sequential decision-making problem, with a state space encompassing channel performance metrics, audience characteristics, and regulatory parameters. The framework incorporates a multi-objective reward function balancing immediate conversion metrics with long-term value generation while maintaining compliance requirements. Experimental validation using 24 months of real-world pharmaceutical marketing data across five therapeutic areas demonstrates significant performance improvements over conventional methodologies. The reinforcement learning framework achieved an average ROI increase of 42.3% compared to baseline methods, with particularly strong performance in rare disease categories (69.9% improvement). The system demonstrates effective learning convergence across diverse therapeutic contexts while maintaining regulatory compliance. This research provides both theoretical contributions to AI applications in healthcare marketing and practical implementation strategies for pharmaceutical companies seeking to optimize digital advertising investments across increasingly complex patient journey touchpoints.
Article Details
Section
How to Cite
References
1. S. Zhang, Z. Feng, and B. Dong, "LAMDA: Low-latency anomaly detection architecture for real-time cross-market financial decision support," Academia Nexus Journal, vol. 3, no. 2, 2024.
2. Z. Wang, X. Wang, and H. Wang, "Temporal graph neural networks for money laundering detection in cross-border transactions," Academia Nexus Journal, vol. 3, no. 2, 2024.
3. A. Kang, J. Xin, and X. Ma, "Anomalous cross-border capital flow patterns and their implications for national economic security: An empirical analysis," Journal of Advanced Computing Systems, vol. 4, no. 5, pp. 42-54, 2024. doi: 10.69987/jacs.2024.40504
4. J. Liang, C. Zhu, and Q. Zheng, "Developing evaluation metrics for cross-lingual LLM-based detection of subtle sentiment manipulation in online financial content," Journal of Advanced Computing Systems, vol. 3, no. 9, pp. 24-38, 2023. doi: 10.69987/jacs.2023.30903
5. Z. Wang, and J. Liang, "Comparative analysis of interpretability techniques for feature importance in credit risk assessment," Spectrum of Research, vol. 4, no. 2, 2024.
6. B. Dong, and Z. Zhang, "AI-driven framework for compliance risk assessment in cross-border payments: Multi-jurisdictional challenges and response strategies," Spectrum of Research, vol. 4, no. 2, 2024.
7. J. Wang, L. Guo, and K. Qian, "LSTM-based heart rate dynamics prediction during aerobic exercise for elderly adults," 2025. doi: 10.20944/preprints 202504.1692.v1
8. D. Ma, M. Shu, and H. Zhang, "Feature selection optimization for employee retention prediction: A machine learning approach for human resource management," 2025. doi: 10.20944/preprints 202504.1549.v1
9. M. Li, D. Ma, and Y. Zhang, "Improving database anomaly detection efficiency through sample difficulty estimation," 2025. doi: 10.20944/preprints 202504.1527.v1
10. K. Yu, Y. Chen, T. K. Trinh, and W. Bi, "Real-time detection of anomalous trading patterns in financial markets using generative adversarial networks," 2025. doi: 10.54254/2755-2721/2025.22016
11. X. Xiao, H. Chen, Y. Zhang, W. Ren, J. Xu, and J. Zhang, "Anomalous payment behavior detection and risk prediction for SMEs based on LSTM-attention mechanism," Academic Journal of Sociology and Management, vol. 3, no. 2, pp. 43-51, 2025. doi: 10.70393/616a736d.323733
12. X. Xiao, Y. Zhang, H. Chen, W. Ren, J. Zhang, and J. Xu, "A differential privacy-based mechanism for preventing data leakage in large language model training," Academic Journal of Sociology and Management, vol. 3, no. 2, pp. 33-42, 2025. doi: 10.70393/616a736d.323732
13. J. Zhang, X. Xiao, W. Ren, and Y. Zhang, "Privacy-preserving feature extraction for medical images based on fully homomorphic encryption," Journal of Advanced Computing Systems, vol. 4, no. 2, pp. 15-28, 2024.
14. W. Ren, X. Xiao, J. Xu, H. Chen, Y. Zhang, and J. Zhang, "Trojan virus detection and classification based on graph convolutional neural network algorithm," Journal of Industrial Engineering and Applied Science, vol. 3, no. 2, pp. 1-5, 2025. doi: 10.70393/6a69656173.323735
15. S. Ji, Y. Liang, X. Xiao, J. Li, and Q. Tian, "An attitude-adaptation negotiation strategy in electronic market environments," In Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), July, 2007, pp. 125-130. doi: 10.1109/snpd.2007.26
16. X. Xiao, Y. Zhang, J. Xu, W. Ren, and J. Zhang, "Assessment methods and protection strategies for data leakage risks in large language models," Journal of Industrial Engineering and Applied Science, vol. 3, no. 2, pp. 6-15, 2025. doi: 10.70393/6a69656173.323736
17. X. Liu, Z. Chen, K. Hua, M. Liu, and J. Zhang, "An adaptive multimedia signal transmission strategy in cloud-assisted vehicular networks," In 2017 IEEE 5th International Conference on Future Internet of Things and Cloud (FiCloud), August, 2017, pp. 220-226. doi: 10.1109/ficloud.2017.42
18. S. Michael, E. Sohrabi, M. Zhang, S. Baral, K. Smalenberger, A. Lan, and N. Heffernan, "Automatic short answer grading in college mathematics using in-context meta-learning: An evaluation of the transferability of findings," In International Conference on Artificial Intelligence in Education, July, 2024, pp. 409-417. doi: 10.1007/978-3-031-64315-6_38
19. H. McNichols, M. Zhang, and A. Lan, "Algebra error classification with large language models," In International Conference on Artificial Intelligence in Education, June, 2023, pp. 365-376. doi: 10.1007/978-3-031-36272-9_30
20. M. Zhang, Z. Wang, Z. Yang, W. Feng, and A. Lan, "Interpretable math word problem solution generation via step-by-step planning," arXiv preprint arXiv:2306.00784, 2023. doi: 10.18653/v1/2023.acl-long.379
21. M. Zhang, Z. Wang, R. Baraniuk, and A. Lan, "Math operation embeddings for open-ended solution analysis and feedback," arXiv preprint arXiv:2104.12047, 2021.
22. S. Jordan, Y. Chandak, D. Cohen, M. Zhang, and P. Thomas, "Evaluating the performance of reinforcement learning algorithms," In International Conference on Machine Learning, November, 2020, pp. 4962-4973.
23. D. Qi, J. Arfin, M. Zhang, T. Mathew, R. Pless, and B. Juba, "Anomaly explanation using metadata," In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), March, 2018, pp. 1916-1924. doi: 10.1109/wacv.2018.00212
24. M. Zhang, T. Mathew, and B. Juba, "An improved algorithm for learning to perform exception-tolerant abduction," In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 31, No. 1)., February, 2017. doi: 10.1609/aaai. V 31i1.10700
25. J. Jin, T. Zhu, and C. Li, "Graph Neural Network-Based Prediction Framework for Protein-Ligand Binding Affinity: A Case Study on Pediatric Gastrointestinal Disease Targets," Journal of Medicine and Life Sciences, vol. 1, no. 3, pp. 136–142, 2025.