- Prospective master's students: here are some previous projects I advised.

- Nov. 2017: our work is presented at PGMO days
- Sep. 2017: I will serve as a reviewer for ICLR 2018
- Sep. 2017: I will serve as a reviewer for AISTATS 2018
- Sep. 2017: I will serve as a PC member for MPREF 2018
- Sep. 2017: one paper accepted at COCOA'17
- Aug. 2017: one paper accepted at AETA'17
- Co-organizing the second Asian Workshop on Reinforcement Learning
- Jun. 2017: I will serve as a PC member for AAAI 2018
- May 2017: one paper accepted at ICML 2017
- Apr. 2017: Balázs Szörényi visited us at JIE and JRI.
- Apr. 2017: I will serve as a PC member for MIWAI 2017
- Mar. 2017: I will serve as a PC member for ACML 2017
- Mar. 2017: I will serve as a reviewer for NIPS 2017
- Mar. 2017: new paper accepted at IJAIT
- Feb. 2017: I will serve as a PC member for UAI 2017
- Feb. 2017: I will serve as a PC member for JFPDA 2017
- Dec. 2016: Best paper award at MIWAI
- Dec. 2016: I will serve as a PC member for MODEM at AAMAS 2017
- Nov. 2016: one paper accepted at AAAI 2017
- Nov. 2016: I will serve as a Senior PC member for IJCAI 2017
- Nov. 2016: our work is presented at PGMO days
- Oct. 2016: one paper accepted at DA2PL
- Oct. 2016: I will serve as a PC member for AISTATS 2017
- Sept. 2016: two papers accepted at MIWAI
- July 2016: our project was funded by PGMO
- Co-organizing the first Asian Workshop on Reinforcement Learning
- June 2016: Xiaoliang Fan gave a talk at JRI.
- May 2016: Hugo Gilbert visited us at JIE and JRI.
- May 2016: one paper accepted at UAI
- April 2016: Robert Busa-Fekete visited us at JIE and JRI.
- April 2016: new paper accepted at COMSOC
- February 2016: Hugo Gilbert, a PhD student in our group, got a second prize as young researcher at ROADEF for his work with Olivier Spanjaard.

The high-level goals of our group are to help build decision-support systems (recommender systems, decision aid, search...) and automatic decision systems (autonomous agents, robots, traders...). Our research lies at the intersection of artificial intelligence, decision theory, machine learning, and operations research. In particular, it deals with (sequential) decision making under uncertainty, multiobjective optimization, and preference elicitation/learning.

Currently we focus on two application domains:

- **Robotics**: we aim to investigate and design algorithms that will allow robots to learn from human interactions (demonstrations, queries, feedback…).
- **Energy**: we want to develop adaptive and robust control systems for power systems and smart environments.

Here's a classification of some recent work (downloadable here or send me an email):

- General framework and decision models for sequential decision-making
  - *Solving MDPs with Skew Symmetric Bilinear Utility Functions* - Gilbert, Hugo; Spanjaard, Olivier; Viappiani, Paolo; Weng, Paul; 24th International Joint Conference on Artificial Intelligence (IJCAI-15) 2015
  - *Markov Decision Processes with Functional Rewards* - Spanjaard, Olivier; Weng, Paul; 7th Multi-Disciplinary International Workshop on Artificial Intelligence (MIWAI) 2013 8271 pp. 269-280 Lecture Notes in Computer Science **Best paper award**
  - *Algebraic Markov Decision Processes* - Perny, Patrice; Spanjaard, Olivier; Weng, Paul; 19th International Joint Conference on Artificial Intelligence 2005 pp. 1372-1377

- Preference elicitation/learning in sequential decision-making
  - *Reducing the Number of Queries in Interactive Value Iteration* - Gilbert, Hugo; Spanjaard, Olivier; Viappiani, Paolo; Weng, Paul; ADT 2015 LNCS
  - *Teacher-Student Framework: a Reinforcement Learning Approach* - Zimmer, Matthieu; Viappiani, Paolo; Weng, Paul; AAMAS Workshop Autonomous Robots and Multirobot Systems 2014
  - *Interactive Q-Learning with Ordinal Rewards and Unreliable Tutor* - Weng, Paul; Busa-Fekete, Robert; Hüllermeier, Eyke; ECML/PKDD Workshop Reinforcement Learning with Generalized Feedback 2013
  - *Interactive Value Iteration for Markov Decision Processes with Unknown Rewards* - Weng, Paul; Zanuttini, Bruno; International Joint Conference on Artificial Intelligence 2013 pp. 2415-2421

- Qualitative/ordinal decision-making
  - *Qualitative Multi-Armed Bandits: A Quantile-Based Approach* - Szorenyi, Balazs; Busa-Fekete, Robert; Weng, Paul; Hüllermeier, Eyke; International Conference on Machine Learning 2015
  - *Preference-based Reinforcement Learning: Evolutionary Direct Policy Search using a Preference-based Racing Algorithm* - Busa-Fekete, Robert; Szorenyi, Balazs; Weng, Paul; Cheng, Weiwei; Hüllermeier, Eyke; Machine Learning 2014 97 pp. 327-351
  - *Preference-based Evolutionary Direct Policy Search* - Busa-Fekete, Robert; Szorenyi, Balazs; Weng, Paul; Cheng, Weiwei; Hüllermeier, Eyke; ICRA Autonomous Learning Workshop 2013
  - *Top-k Selection based on Adaptive Sampling of Noisy Preferences* - Busa-Fekete, Robert; Szorenyi, Balazs; Weng, Paul; Cheng, Weiwei; Hüllermeier, Eyke; International Conference on Machine Learning 2013
  - *Ordinal Decision Models for Markov Decision Processes* - Weng, Paul; European Conference on Artificial Intelligence 2012
  - *Committee Selection with a Weight Constraint Based on a Pairwise Dominance Relation* - Delort, Charles; Spanjaard, Olivier; Weng, Paul; International Conference on Algorithmic Decision Theory (ADT) 2011 6992 pp. 28-41 LNAI
  - *Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences* - Weng, Paul; International Conference on Automated Planning and Scheduling 2011 pp. 282-289

- Argumentation as sequential decision-making
  - *Optimization of probabilistic argumentation with Markov Decision Models* - Hadoux, Emmanuel; Beynier, Aurélie; Maudet, Nicolas; Weng, Paul; Hunter, Anthony; International Joint Conference on Artificial Intelligence 2015

- Sequential decision-making in non-stationary environments
  - *Sequential Decision-Making under Non-stationary Environments via Sequential Change-point Detection* - Hadoux, Emmanuel; Beynier, Aurélie; Weng, Paul; First International Workshop on Learning over Multiple Contexts @ ECML 2014
  - *Solving Hidden-Semi-Markov-Mode Markov Decision Problems* - Hadoux, Emmanuel; Beynier, Aurélie; Weng, Paul; International Conference on Scalable Uncertainty Management 2014

- Multiobjective/multicriteria decision-making
  - *A Compromise Programming Approach to Multiobjective Markov Decision Processes* - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Journal of Information Technology and Decision Making 2013 12 pp. 1021-1053
  - *Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes* - Perny, Patrice; Weng, Paul; Goldsmith, Judy; Hanna, Josiah; Conference on Uncertainty in Artificial Intelligence 2013
  - *On WOWA Rank Reversal* - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Conference on Modelling Decisions for Artificial Intelligence 2012 7647 pp. 66-77 LNAI
  - *A Compromise Programming Approach to Multiobjective Markov Decision Processes* - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Conference on Multicriteria Decision Making 2011 pp. 167-167
  - *On Minimizing Ordered Weighted Regrets in Multiobjective Markov Decision Processes* - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Conference on Algorithmic Decision Theory (ADT) 2011 6992 pp. 190-204 LNAI
  - *On finding compromise solutions in multiobjective Markov decision processes* - Perny, Patrice; Weng, Paul; European Conference on Artificial Intelligence Multidisciplinary Workshop on Advances in Preference Handling 2010 pp. 55-60

- Study of qualitative/ordinal decision models
  - *Axiomatic Foundations of Generalized Qualitative Utility* - Weng, Paul; Multi-Disciplinary International Workshop on Artificial Intelligence (MIWAI) 2013 LNAI
  - *An Axiomatic Approach to Qualitative Decision Theory with Binary Possibilistic Utility* - Weng, Paul; European Conference on Artificial Intelligence 2006 pp. 467-471
  - *Axiomatic Foundations for a Class of Generalized Expected Utility: Algebraic Expected Utility* - Weng, Paul; International Conference on Uncertainty in Artificial Intelligence 2006 pp. 520-527
  - *Qualitative Decision-Making Under Possibilistic Uncertainty: Toward More Discriminating Criteria* - Weng, Paul; International Conference on Uncertainty in Artificial Intelligence 2005 pp. 615-622