Dating

SequeL team

Sequential Learning

Leader: Philippe Preux

PRESENTATION MEMBERS THESES PUBLICATIONS

Presentation

SequeL is a research group working in the field of machine learning; more specifically, SequeL is dedicated to the study of the problem of sequential decision making under uncertainty, that is, the study of how an "agent" having a goal to fullfill can learn an optimal behavior to achieve this goal in an unknown environment. SequeL is composed of two dozens members. Activities range from foundations of learning to algorithm design, and transfer towards companies. Questions are studied such as "What can a Turing machine learn efficiently? and in which conditions?". Or, in a budget context, "Given an amount of computational resource, how close to the optimal behavior can an algorithm reach?", finally application oriented questions such as those related to computational advertizing and recommendation systems for e-commerce websites, are also studied.

SequeL has led to the multi-awarded Crazy Stone go playing program.  Some SequeL PhD students have been awarded  the Gilles Kahn award, the Jacques Neveu award and the ECCAI award. We won the ICML 2011 Exploration vs. Exploitation challenge, and the ACM RecSYS 2014 challenge (both challenges on recommendation systems). SequeL expertize has led to collaborations with international companies like Orange Labs, Intel, Technicolor, Deezer and also with national and local SMEs.

 

Members

Permanent

  • Professor
    • Philippe Preux (Responsable)
  • Research director
    • Rémi Munos
  • Associate professor
    • Christos Dimitrakakis
  • Research scientists
    • Emilie Kaufmann
    • Alessandro Lazaric
    • Odalric-Ambrym Maillard
    • Daniil Ryabko
    • Michal Valko

Temporary

  • Postdoc
    • Matteo Pirotta
  • Phd students
    • Merwan Barlier
    • Alexandre Berard
    • Daniele Calandriello
    • Nicolas Carrara
    • Ronan Fruit
    • Pratik Gajane
    • Guillaume Gautier
    • Julien Perolat
    • Julien Seznec
    • Florian Strub
    • Romain Warlop
  • Engineer
    • Ralph Bourdoukan
  • Others
    • Lilian Besson
    • Georgios Papoudakis

Associated

  • Professor
    • Olivier Pietquin

Marc Abeille

Controle robuste et apprentissage par renforcement, application au problème de construction de portefeuille

Merwan Barlier

Dialogues intelligents basés sur l'écoute de conversation homme/homme

Alexandre Berard

Learning from Post-edition in Machine Translation

Daniele Calandriello

Efficient Sequential Learning in Structured and Constrained Environment

Nicolas Carrara

Apprentissage par renforcement pour optimisation de systèmes de dialogue via l'adaptation à chaque utilisateur

Ronan Fruit

Transfer of Knowledge in reinforcement learning for the improvement of exploration and generalization

Pratik Gajane

Sequential Learning and Decision Making under Partial Monitoring

Guillaume Gautier

Fast sampling of determinantal point processes

Jean-Bastien Grill

Création et analyse d'algorithmes efficaces pour la prise de décision dans un environnement inconnu et incertain

Julien Perolat

Apprentissage par renforcement : cas du jeu à 2 joueurs

Julien Seznec

Sequential Learning for Educationnal System

Florian Strub

Contributions à l'apprentissage séquentiel profond et à son application à l'interaction homme-robot

Romain Warlop

Novel Learning and Exploration-Exploitation Methodes for Effective Recommender Systems

Frédéric Guillou

On Recommendation Systems in Sequential Context 2016-12-02

Tomas Kocak

Sequential learning with similarities 2016-11-28

Vincenzo Musco

Usages of Graphs and Synthetic Data for Software Propagation Analysis 2016-11-03

Hadrien Glaude

Learning rational linear sequential systems using the method of moments 2016-07-08

Marta Soare

Computational and sample complexity of planning and reinforcement learning algorithms 2015-12-14

Amir Sani

Machine Learning for Decision-Making under Uncertainty 2015-05-12

Olivier Nicol

Data-driven evaluation of Contextual Bandit algorithms and applications to Dynamic Recommendation. 2014-12-18

Boris Baldassari

Maisqual : Amélioration de la qualité logicielle par fouille de données. 2014-07-01

Victor Gabillon

Budgeted Classification-based Policy Iteration 2014-06-12

Azadeh Khaleghi

Online Sequence Prediction 2013-11-18

Christophe Salperwyck

Apprentissage incrémental en ligne sur flux de données 2012-11-30

Alexandra Carpentier

De l'échantillonnage optimal en grande et petite dimension 2012-10-05

Jean Francois Hren

Compromis exploration - Exploitation en optimisation et contrôle 2012-06-01

Odalric-Ambrym Maillard

Apprentissage Séquentiel : Bandits, Statistique et Renforcement 2011-10-03

Manuel Loth

Algorithmes d'Ensembles Actifs pour le LASSO 2011-07-08

Sébastien Bubeck

Bandits Games and Clustering Foundations 2010-06-10

Michal Valko

Bandits and graphs and structures 2016-06-15

Jérémie Mary

Data-Driven Recommender Systems - Sequences of Recommendations 2015-11-24

Mohammad Ghavamzadeh

Complexité d’Échantillonnage pour la Prise de Décision Séquentielle 2014-06-11

Daniil Ryabko

Apprenabilité dans les problèmes de l'inférence séquentielle 2011-12-19

Other ' DatInG : Data Intelligence Group ' teams

LINKS MAGNET SIGMA