Paul Weng | Student Projects

News:

Openings for postdocs, PhD and master's students, and research associates: if you're interested in my research, please send me an email in English with your CV.

Jun. 2025: I will serve as an area chair for AAAI 2026
Jun.-Jul. 2025: I'll be an invited professor at Université Paris Dauphine-PSL
May 2025: our survey paper on RLHF has been accepted in TMLR
May 2025: one paper accepted at ICML 2025
Apr. 2025: I'll be an invited speaker at ICASIS 2025
Mar. 2025: one paper accepted in ACM TELO
Jan. 2025: one paper accepted at ICLR 2025
Jan. 2025: We were awarded the Shuangchuang talent program of Kunshan
Dec. 2024: two papers accepted at AAAI 2025
Nov. 2024: I will serve as an area chair for ECAI 2025
Oct. 2024: one paper accepted in NCAA
Sep. 2024: one paper accepted in the journal track of ACML 2024
Jun. 2024: one paper accepted at ICML 2024 Workshop MHFAIA
May 2024: I will serve as an area chair for AAAI 2025
May 2024: one paper accepted at ICML 2024
Apr. 2024: I will serve as a co-chair for MIWAI 2024
Apr. 2024: our survey paper on interpretable RL has been accepted in Machine Learning
Jan. 2024: one paper accepted at ICLR 2024
Jan. 2024: I have moved to DKU
Nov. 2023: I will serve as an area chair for ECAI 2024
Jul. 2023: one paper accepted at ECAI 2023
Jun. 2023: one paper accepted at ECML/PKDD 2023
Jun. 2023: one paper accepted in TMLR
Jun. 2023: We received the best paper award at ALA 2023
Jun. 2023: We got funded by NetEase to work on RL from human feedback
May 2023: I will serve as an area chair for AAAI 2024
Mar. 2023: one paper accepted at ALA 2023
Mar. 2023: one paper accepted at LION 2023
Jan. 2023: one paper accepted at AAMAS 2023
Sep. 2022: one paper accepted at CoRL 2022
Sep. 2022: one paper accepted at ACML 2022
Jul. 2022: I will serve as a senior PC member for AAAI 2023
May 2022: one paper accepted at ICML 2022
Nov. 2021: one paper accepted in the International Journal of Production Research
Nov. 2021: one paper accepted at DAI 2021
Oct. 2021: one paper accepted at ADPRL 2021
Aug. 2021: We got funded by NSFC to work on exploiting equivariance in deep RL
Jul. 2021: I will serve as a senior PC member for AAAI 2022
May 2021: one paper accepted at ICML 2021
Feb. 2021: one paper accepted at ICRA 2021
Aug. 2020: I will serve as a senior PC member for AAAI 2021
Aug. 2020: I will serve as a senior PC member for IJCAI 2021
Jul. 2020: one paper accepted at IROS 2020
Jun. 2020: We got funded by Huawei to work on interpretable RL
Jun. 2020: one paper accepted at ICML 2020
May 2020: our survey chapter on reinforcement learning was published in A Guided Tour of Artificial Intelligence Research
Dec. 2019: I will serve as a PC member for JFPDA 2020
Oct. 2019: I will serve as a PC member for ECAI 2020
Oct. 2019: I was invited to give a talk at AWRL 2019
Oct. 2019: I will serve as a senior PC member for IJCAI 2020
Aug. 2019: I will serve as a reviewer for AISTATS 2020
Aug. 2019: one paper accepted at DAI 2019
Aug. 2019: We got funded by Yahoo Research

Here are some research projects I advised recently at the graduate level at JIE:

2016-2017: Deep reinforcement learning
2016-2017: Adaptive video streaming via reinforcement learning
2016-2017: Reinforcement learning for UAV control
2016-2017: Automatic trading with reinforcement learning
2016-2017: Adaptive navigation in transportation networks
2015-2016: Finding risk-averse shortest path using time-dependent stochastic costs
2015-2016: A multi-objective reinforcement learning approach to traffic light control in nonstationary environments
2015-2016: Discrete and continuous decoding of reaching movements from neural signals
2015-2016: Optimized spaced repetition based on a learner's model
2015-2016: Batch reinforcement learning for optimized spaced repetition
2015-2016: Empirical evaluation of Markov decision process and reinforcement learning with ordinal rewards

Here are some research projects I advised at the graduate level at JRI:

2016-2017: Combining shortest path and recommender system
2016-2017: Algorithmic decision theory in computer networks

Here are some research projects I advised at the graduate level (master MVA) at ENS Cachan:

2016-2017: Inverse reinforcement learning with additional preferential information
2015-2016: Community detection based on mixing times

Here are some research internships I advised at the graduate level (M2) at UPMC:

2013: Solving optimization problems in poor information situations (with Thibaut Lust)
2013: Sequential decision-making with ordinal rewards (with Paolo Viappiani)
2013: Parameterizing preference models for sequential decision-making under uncertainty (with Paolo Viappiani)
2012: Policy learning in an unknown environment by regret minimization (with Aurélie Beynier)
2011: Reinforcement learning for dynamic asset allocation (at Primexis)
2009: Analysis of Schelling's model (mainly advised by Cyril Banderier)

Here are some research projects I proposed at the graduate level (M2) at UPMC:

2014: Combinatorial adversarial multi-armed bandits
2014: Preference-based policy learning
2013: Multiobjective reinforcement learning
2013: CMA-ES (Covariance Matrix Adaptation - Evolution Strategy)
2007: Decision-making with partially consonant belief functions

Here are some programming projects I proposed at the graduate level (M1):

2013: Trading on the bitcoin market
2013: Reinforcement learning agent with a tutor (with Paolo Viappiani)
2012: Application in finance of robust or risk-sensitive reinforcement learning
2011: Backtests of financial investment strategies
2011: Artificial intelligence and Texas Hold'em poker
2010: Pair-trading strategies based on copulas
2009: Study of solution methods for large-scale MDPs (with Aurélie Beynier)