
Management Science and Engineering Academic Lecture Series

Published: 2024-12-10

Lecture title: Sequential decision making: trade-off between optimality and safety

Speaker: Professor Zeyu Zheng (郑泽宇)

Host: Professor Yongjian Li (李勇建)

Time: 16:00, December 16, 2024

Venue: Room A501-2, Business School, Nankai University

 

Abstract:

This talk explores how to integrate safety with efficiency in sequential decision-making for online experimentation, specifically in a stochastic multi-armed bandit setting. Previous work mostly focuses on the “optimality” of sequential decisions, that is, achieving efficiency by minimizing expected regret. However, maintaining “safety”, that is, controlling the tail risk of regret, is essential in critical applications. This work provides a detailed characterization of the trade-off between optimality and safety.

 

About the speaker:

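Zeyu Zheng is a tenured professor in the Department of Industrial Engineering and Operations Research at the University of California, Berkeley, where he leads the Berkeley laboratory for artificial intelligence and simulation. He received his Ph.D. and master's degrees from Stanford University and his bachelor's degree from Peking University. His research focuses on simulation, applied probability, machine learning, and artificial intelligence. He has published more than ten papers in the operations research and management science journals Operations Research and Management Science, and more than ten papers at the machine learning and artificial intelligence conferences NeurIPS, ICML, KDD, and AISTATS. He currently serves as an Associate Editor for journals including Operations Research and Naval Research Logistics.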