Enhancing Progressive Ensemble Learning via Normalized Extra-Gradient Initialization
Published in Neural Networks, 2026
This work develops Normalized Extra-Gradient Initialization to accelerate progressive training of sparse MoE models.
Recommended citation: Zheshun Wu, Yu Pan, Dun Zeng, Zenglin Xu, Qifan Wang, and Jie Liu. (2026). Enhancing Progressive Ensemble Learning via Normalized Extra-Gradient Initialization. Neural Networks.
