SYSU-CMU JIE
SYSU-CMU JRI

The high-level goals of our group are to help build decision-support systems (recommender systems, decision aids, search...) and automatic decision systems (autonomous agents, robots, traders...). Our research lies at the intersection of artificial intelligence, decision theory, machine learning and operations research. In particular, it deals with (sequential) decision making under uncertainty, multiobjective optimization, and preference elicitation/learning.

Currently we focus on two application domains:

Here is a classification of some recent work (available for download, or on request by email):

General framework and decision models for sequential decision-making
  • Solving MDPs with Skew Symmetric Bilinear Utility Functions - Gilbert, Hugo; Spanjaard, Olivier; Viappiani, Paolo; Weng, Paul; 24th International Joint Conference on Artificial Intelligence (IJCAI-15) 2015
  • Markov Decision Processes with Functional Rewards - Spanjaard, Olivier; Weng, Paul; 7th Multi-Disciplinary International Workshop on Artificial Intelligence (MIWAI) 2013, Lecture Notes in Computer Science 8271, pp. 269-280 (Best paper award)
  • Algebraic Markov Decision Processes - Perny, Patrice; Spanjaard, Olivier; Weng, Paul; 19th International Joint Conference on Artificial Intelligence 2005 pp. 1372-1377
Preference elicitation/learning in sequential decision-making
  • Reducing the Number of Queries in Interactive Value Iteration - Gilbert, Hugo; Spanjaard, Olivier; Viappiani, Paolo; Weng, Paul; ADT 2015 LNCS
  • Teacher-Student Framework: a Reinforcement Learning Approach - Zimmer, Matthieu; Viappiani, Paolo; Weng, Paul; AAMAS Workshop Autonomous Robots and Multirobot Systems 2014
  • Interactive Q-Learning with Ordinal Rewards and Unreliable Tutor - Weng, Paul; Busa-Fekete, Robert; Hüllermeier, Eyke; ECML/PKDD Workshop Reinforcement Learning with Generalized Feedback 2013
  • Interactive Value Iteration for Markov Decision Processes with Unknown Rewards - Weng, Paul; Zanuttini, Bruno; International Joint Conference on Artificial Intelligence 2013 pp. 2415-2421
Qualitative/ordinal decision-making
  • Qualitative Multi-Armed Bandits: A Quantile-Based Approach - Szorenyi, Balazs; Busa-Fekete, Robert; Weng, Paul; Hüllermeier, Eyke; International Conference on Machine Learning 2015
  • Preference-based Reinforcement Learning: Evolutionary Direct Policy Search using a Preference-based Racing Algorithm - Busa-Fekete, Robert; Szorenyi, Balazs; Weng, Paul; Cheng, Weiwei; Hüllermeier, Eyke; Machine Learning 2014, vol. 97, pp. 327-351
  • Preference-based Evolutionary Direct Policy Search - Busa-Fekete, Robert; Szorenyi, Balazs; Weng, Paul; Cheng, Weiwei; Hüllermeier, Eyke; ICRA Autonomous Learning Workshop 2013
  • Top-k Selection based on Adaptive Sampling of Noisy Preferences - Busa-Fekete, Robert; Szorenyi, Balazs; Weng, Paul; Cheng, Weiwei; Hüllermeier, Eyke; International Conference on Machine Learning 2013
  • Ordinal Decision Models for Markov Decision Processes - Weng, Paul; European Conference on Artificial Intelligence 2012
  • Committee Selection with a Weight Constraint Based on a Pairwise Dominance Relation - Delort, Charles; Spanjaard, Olivier; Weng, Paul; International Conference on Algorithmic Decision Theory (ADT) 2011, LNAI 6992, pp. 28-41
  • Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences - Weng, Paul; International Conference on Automated Planning and Scheduling 2011 pp. 282-289
Argumentation as sequential decision-making
  • Optimization of probabilistic argumentation with Markov Decision Models - Hadoux, Emmanuel; Beynier, Aurélie; Maudet, Nicolas; Weng, Paul; Hunter, Anthony; International Joint Conference on Artificial Intelligence 2015
Sequential decision-making in non-stationary environments
  • Sequential Decision-Making under Non-stationary Environments via Sequential Change-point Detection - Hadoux, Emmanuel; Beynier, Aurélie; Weng, Paul; First International Workshop on Learning over Multiple Contexts @ ECML 2014
  • Solving Hidden-Semi-Markov-Mode Markov Decision Problems - Hadoux, Emmanuel; Beynier, Aurélie; Weng, Paul; International Conference on Scalable Uncertainty Management 2014
Multiobjective/multicriteria decision-making
  • A Compromise Programming Approach to Multiobjective Markov Decision Processes - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Journal of Information Technology and Decision Making 2013, vol. 12, pp. 1021-1053
  • Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes - Perny, Patrice; Weng, Paul; Goldsmith, Judy; Hanna, Josiah; Conference on Uncertainty in Artificial Intelligence 2013
  • On WOWA Rank Reversal - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Conference on Modelling Decisions for Artificial Intelligence 2012, LNAI 7647, pp. 66-77
  • A Compromise Programming Approach to Multiobjective Markov Decision Processes - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Conference on Multicriteria Decision Making 2011 p. 167
  • On Minimizing Ordered Weighted Regrets in Multiobjective Markov Decision Processes - Ogryczak, Wlodzimierz; Perny, Patrice; Weng, Paul; International Conference on Algorithmic Decision Theory (ADT) 2011, LNAI 6992, pp. 190-204
  • On finding compromise solutions in multiobjective Markov decision processes - Perny, Patrice; Weng, Paul; European Conference on Artificial Intelligence Multidisciplinary Workshop on Advances in Preference Handling 2010 pp. 55-60
Study of qualitative/ordinal decision models
  • Axiomatic Foundations of Generalized Qualitative Utility - Weng, Paul; Multi-Disciplinary International Workshop on Artificial Intelligence (MIWAI) 2013 LNAI
  • An Axiomatic Approach to Qualitative Decision Theory with Binary Possibilistic Utility - Weng, Paul; European Conference on Artificial Intelligence 2006 pp. 467-471
  • Axiomatic Foundations for a Class of Generalized Expected Utility: Algebraic Expected Utility - Weng, Paul; International Conference on Uncertainty in Artificial Intelligence 2006 pp. 520-527
  • Qualitative Decision-Making Under Possibilistic Uncertainty: Toward More Discriminating Criteria - Weng, Paul; International Conference on Uncertainty in Artificial Intelligence 2005 pp. 615-622