Generalization-Aware Structured Regression towards Balancing Bias and Variance

Pavlovski, M., Zhou, F., Arsov, N., Kocarev, L., Obradovic, Z.

Proc. 27th International Joint Conference on Artificial Intelligence (IJCAI)

Abstract

Attaining the proper balance between underfitting and overfitting is one of the central challenges in machine learning. It has been approached mostly by deriving bounds on generalization risks of learning algorithms. Such bounds are, however, rarely controllable. In this study, a novel bias-variance balancing objective function is introduced in order to improve generalization performance. By utilizing distance correlation, this objective function is able to indirectly control a stability-based upper bound on a model’s expected true risk. In addition, the Generalization-Aware Collaborative Ensemble Regressor (GLACER) is developed, a model that bags a crowd of structured regression models, while allowing them to collaborate in a fashion that minimizes the proposed objective function. The experimental results on both synthetic and real-world data indicate that such an objective enhances the overall model’s predictive performance. When compared against a broad range of both traditional and structured regression models GLACER was ~10-56% and ~49-99% more accurate for the task of predicting housing prices and hospital readmissions, respectively.

BibTeX

				
					@inproceedings{pavlovski2018generalization,
  title={Generalization-Aware Structured Regression towards Balancing Bias and Variance.},
  author={Pavlovski, Martin and Zhou, Fang and Arsov, Nino and Kocarev, Ljupco and Obradovic, Zoran},
  booktitle={IJCAI},
  pages={2616--2622},
  year={2018}
}