Foula Vagena – Ensemble Learning: Theory and Techniques | Data Intelligence Institute of Paris

Foula Vagena
December 15, 4pm
online (zoom)

Abstract

Ensemble learning is the process by which multiple models, such as classifiers or experts, are combined to solve a particular computational intelligence problem. Ensemble learning is primarily used to improve the (classification, prediction, function approximation, etc.) performance of a model, or reduce the likelihood of an unfortunate selection of a poor one. By strategically combining multiple models one can produce a new predictive model with reduced variance, bias and improved predictions. In this tutorial we will explain the bias-variance tradeoff and describe how popular ensemble techniques (such as bagging, boosting, stacking etc) handle it. We will conclude the tutorial with an illustrative prediction task using various ensemble models.

The Hands-On Workshop will focus on examples of ensemble models.

References on the Booster subject as recommended during the seminar:

Boosting book: https://mitpress.mit.edu/books/boosting
XGBoost paper: https://arxiv.org/abs/1603.02754

Dr Foula Vagena
(Université Paris Cité, diiP)
Zografoula Vagena is a research associate at the Data Intelligence Institute of Paris (diiP) and affiliated with the Université Paris Cité. She has been a data science researcher and practitioner for over ten years. She has worked on different analytics problems including forecasting, image processing, graph analytics, multidimensional data analysis, text processing, recommendation systems, sequential data analysis and optimization within various fields such as transportation, healthcare, retail, finance/insurance and accounting. She has also performed research in the intersection of data management and analytics, and was a primary contributor of the MCDB/SimSQL systems that blended data management with Bayesian statistics. She holds a PhD in data management from the University of California, Riverside.

Example code

Other seminars

Shen Liang – Knowledge-guided Data Science

2022, diiP, Seminars + Hands-On Workshops

Shen LiangMay 18, 4 PM online (zoom) linkedinAbstract This tutorial presents an overview of knowledge-guided data science, a rising methodology in machine learning which fuses data with domain knowledge. We will present numerous case studies on this methodology...

Shen Liang – Deep Transfer Learning and Multi-task Learning

2022, diiP, Seminars + Hands-On Workshops

Shen LiangJune 15, 4 PMonline (zoom) linkedinAbstract This tutorial provides an overview of two important and correlated (in many cases intersectional) topics in deep learning: transfer learning, and multi-task learning. Transfer learning focuses on...

Foula Vagena – Deep Learning for Sequential Data: Models and Applications

2021, diiP, Seminars + Hands-On Workshops

Foula VagenaApril 13, 4 PMonline (zoom) Abstract Recurrent neural networks (RNNs) are a family of specialized neural networks for processing sequential data. They can scale to much longer sequences than would be practical for networks without sequence-based...

Foula Vagena – Graph Based Data Science: Opportunities Challenges and Techniques

2022, diiP, Seminars + Hands-On Workshops

Foula VagenaJanuary 19, 4pmonline (zoom) Abstract Graph based data science lets us leverage the power of relationships and structure in data to improve model prediction and answer previously intractable questions. In this tutorial we will first introduce the...