An Improved Time-Delay Grey Verhulst Model Optimized by Multi-Agent Reinforcement Learning for Electricity Market Forecasting

Document Type : Research Article

Authors

1 Department of Applied Mathematics, Faculty of Mathematical Sciences, Ferdowsi University of Mashhad, Mashhad, Iran

2 Alma Mater Studiorum – Università di Bologna, Bologna, Italy

Abstract

Accurate forecasting of electricity prices is still a difficult task because of the volatile and nonlinear nature of energy markets, as well as the limited availability of reliable data. Grey forecasting models are often used for this purpose, but they usually lack enough flexibility to capture delayed effects and complex interactions among several variables. To address these issues, this study aims to present an Improved Time-Delay Grey Multivariable Verhulst Model (ITGMVM), a new grey model developed for short-term electricity price prediction. The model introduces time-delay parameters that represent lagged relationships between variables, helping it respond better to dynamic market behavior. Two optimization methods are designed for parameter calibration: Partial Parameter Estimation (PPE) and Full Parameter Estimation (FPE), where the latter adjusts all parameters at the same time. These methods are supported by a new hybrid optimization framework called MARL-WOA, which combines Multi-Agent Reinforcement Learning (MARL) with Whale Optimization Algorithm (WOA). This combination improves the search process, leading to faster convergence and higher accuracy.  The model is evaluated using real-world data from Australia's National Electricity Market (NEM), specifically focusing on Sundays and Wednesdays between December 2023 and March 2024. Results show that ITGMVM, when optimized with FPE-MARL-WOA, outperforms six existing grey and hybrid models across multiple statistical metrics, achieving exceptional forecasting accuracy and robustness. The obtained results demonstrate the strength of integrating adaptive AI techniques with grey modeling to support decision-making in data-constrained energy environments.

Keywords

Main Subjects


 

Article PDF

[1] Jiang, H., Kong, P., Hu, Y. C., and Jiang, P. Forecasting China’s CO2 emissions by considering interaction of bilateral FDI using the improved grey multivariable Verhulst model. International Journal of Computer Mathematics, 23, 225–240, (2021).
[2] Ding, S. and Li, R. A new multivariable grey convolution model based on Simpson’s rule and its applications. Complexity, 2020(1), 4564653, (2020).
[3] Mirjalili, S. and Lewis, A. The Whale Optimization Algorithm. Advances in Engineering Software, 95, 51–67, (2016).
[4] Watkins, C. J. and Dayan, P. Q-learning. Machine Learning, 8(3-4), 279–292, (1992).
[5] Buoniu, L., Bbuka, R., and De Schutter, B. A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 38(2), 1–18, (2008).
[6] Hu, Z. and Yu, X. Reinforcement learning-based comprehensive learning grey wolf optimizer for feature selection. Applied Soft Computing, 149, 110959, (2023).
[7] Seyyedabbasi, A., Aliyev, R., Kiani, F., Gulle, M. U., Basyildiz, H., and Shah, M. A. Hybrid algorithms based on combining reinforcement learning and metaheuristic methods to solve global optimization problems. Knowledge-Based Systems, 223, 107044, (2021).
[8] Digalakis, J. and Margaritis, K. On benchmarking functions for genetic algorithms. International Journal of Computer Mathematics, 77(4), 481–506, (2001).
[9] Molga, M. and Smutnicki, C. Test functions for optimization needs. Technical Report, (2005).
[10] Yang, X. S. Firefly algorithm, stochastic test functions and design optimisation. International Journal of Bio-Inspired Computation, 2(2), 78–84, (2010).
[11] Yao, X., Liu, Y., and Lin, G. Evolutionary programming made faster. IEEE Transactions on Evolutionary Computation, 3(2), 82–102, (1999).
[12] Weron, R. Electricity price forecasting: A review of the state-of-the-art with a look into the future. International Journal of Forecasting, 30(4), 1030–1081, (2014).
[13] He, Y., Wang, C., and Chen, Z. State-of-the-art electricity load and price forecasting for wholesale markets: A comprehensive review. Energies, 17(22), 5797, (2024).
[14] Li, J., Zhao, S., and Wang, L. Impact of renewable energy on extreme volatility in wholesale electricity prices. Journal of Cleaner Production, 442, 140256, (2024).
[15] Tang, R., Zhou, K., and Yang, Q. Challenges and trends in high-volatility electricity price forecasting. Energies, 17(22), 5802, (2024).
[16] Li, X., Zhou, H., and Liu, J. A hybrid GM(1,N)-ARIMA model for electricity price forecasting. Energy Reports, 7, 2874–2882, (2021).
[17] Zhao, Y., He, Z., and Xu, M. Improved time-delay grey model optimized by GA for power price peak forecasting. Energy, 254, 124363, (2022).
[18] Chen, L., Wang, X., and Zhang, H. A PSO-optimized multivariable grey Verhulst model for energy market forecasting. Expert Systems with Applications, 211, 118635, (2023).
[19] Wang, B., Lin, Z., and Wu, T. Electricity price forecasting using hybrid GM(1,1)-SVR with residual correction. Applied Energy, 309, 118460, (2022).
[20] Liu, M., Zhang, Y., and Xu, D. Grey-based hybrid prediction model for spot electricity price using Firefly Algorithm. Journal of Cleaner Production, 258, 120743, (2020).
[21] Yu, X., Lu, L., Qi, J., Qian, Y., Zhao, L., Tan, C., Chen, Y., and Han, Z. A clustering fractional-order grey model in short-term electrical load forecasting. Scientific Reports, 15(1), 6207, (2025).
[22] Gou, X., Mi, C., Yang, Y., and Zeng, B. A nonlinear mixed-frequency grey prediction model with two-stage lag parameter optimization and its application. Applied Mathematical Modelling, 116360, (2025).
[23] Talbi, E. G. Metaheuristics: From Design to Implementation. Wiley Series on Parallel and Distributed Computing, (2009).
[24] Zheng, J., Zhong, J., Chen, M., and He, K. Reinforced Hybrid Genetic Algorithm for the Traveling Salesman Problem. arXiv preprint arXiv:2107.06870, (2021).
[25] Kiani, F., Seyyedabbasi, A., Aliyev, R., et al. Hybrid algorithms based on combining reinforcement learning and metaheuristic methods to solve global optimization problems. Knowledge-Based Systems, 223, 107044, (2021).
[26] Nguyen, H. T., Nguyen, H. V., and Nguyen, T. T. Proposing a hybrid metaheuristic optimization algorithm for energy consumption prediction in buildings. Scientific Reports, 12, 4923, (2022).
[27] Fu, Y., Wu, D., Boulet, B., and Zinflou, A. Time series forecasting via reinforcement learning-based model combination. IEEE Internet of Things Journal, (2025).
[28] Wu, Z., Fang, G., Ye, J., Zhu, D. Z., and Huang, X. A reinforcement learning-based ensemble forecasting framework for renewable energy forecasting. Renewable Energy, 244, 122692, (2025).
[29] Ghimire, S., Deo, R. C., Casillas-Perez, D., Sharma, E., Salcedo-Sanz, S., Barua, P. D., and Acharya, U. R. Half-hourly electricity price prediction with a hybrid convolution neural network–random vector functional link deep learning approach. Applied Energy, 374, 123920, (2024).
[30] Xu, Y., Huang, X., Zheng, X., Zeng, Z., and Jin, T. VMD-ATT-LSTM electricity price prediction based on grey wolf optimization algorithm in electricity markets considering renewable energy. Renewable Energy, 236, 121408, (2024).
[31] Meng, A., Wang, P., Zhai, G., Zeng, C., Chen, S., Yang, X., and Yin, H. Electricity price forecasting with high penetration of renewable energy using attention-based LSTM network trained by criss-cross optimization. Energy, 254, 124212, (2022).
[32] Forrest, S. Genetic algorithms: Principles of natural selection applied to computation. Science, 261(5123), 872–878, (1993).
[33] Storn, R. and Price, K. V. Differential Evolution—A simple and efficient heuristic for global optimization over continuous spaces. Journal of Global Optimization, 11(4), 341–359, (1997).
[34] Beyer, H.-G. and Schwefel, H.-P. Evolution strategies: A comprehensive introduction. Natural Computing, 1(1), 3–52, (2002).
[35] Poli, R., Kennedy, J., and Blackwell, T. Particle swarm optimization—An overview. Swarm Intelligence, 1(1), 33–57, (2007).
[36] Dorigo, M., Maniezzo, V., and Colorni, A. The Ant System: Optimization by a colony of cooperating agents. IEEE Transactions on Systems, Man, and Cybernetics—Part B, 26(1), 29–41, (1996).
[37] Yang, X.-S. Firefly algorithm, stochastic test functions and design optimisation. International Journal of Bio-Inspired Computation, 2(2), 78–84, (2010).
[38] Kirkpatrick, S., Gelatt, C. D., Jr., and Vecchi, M. P. Optimization by simulated annealing. Science, 220(4598), 671–680, (1983).
[39] Rashedi, E., Nezamabadi-pour, H., and Saryazdi, S. GSA: A gravitational search algorithm. Information Sciences, 179(13), 2232–2248, (2009).
[40] Ma, X. and Liu, Z. The kernel-based nonlinear multivariate grey model. Applied Mathematical Modelling, 56, 217–238, (2018).
[41] Ding, S., Hu, J., and Lin, Q. Accurate forecasts and comparative analysis of Chinese CO2 emissions using a superior time-delay grey model. Energy Economics, 126, 107013, (2023).
[42] Pour, S. H., Fard, O. S., and Zeng, B. A grey prediction model based on Von Bertalanffy equation and its application in energy prediction. Engineering Applications of Artificial Intelligence, 143, 110012, (2025). 
Volume 10, Issue 2
December 2025
Pages 162-190
  • Receive Date: 08 October 2025
  • Revise Date: 29 November 2025
  • Accept Date: 11 December 2025
  • Publish Date: 30 December 2025