管理科学与工程系列学术讲座

发布时间: 2024-12-10

浏览次数: 10

讲座题目：Sequential decision making: trade-off between optimality and safety

讲座人：郑泽宇教授

主持人：李勇建教授

讲座时间：2024年12月16日16：00

讲座地点：南开大学商学院A501-2

讲座摘要：

This talk explores the integration of safety into efficiency in sequential decision-making for online experimentation, specifically within a stochastic multi-armed bandit setting. Previous work mostly focuses on “optimality” of sequential decisions -- achieving efficiency by minimizing regret expectation. However, maintaining “safety” -- controlling the regret tail risk, is essential in critical applications. This work provides a detailed characterization of the trade-off between optimality and safety.

讲座人简介：

郑泽宇，加州大学伯克利工业工程与运筹系教授、获终身教职。主持伯克利人工智能与仿真实验室。博士、硕士毕业于斯坦福大学，本科毕业于北京大学。研究领域集中在仿真、应用概率、机器学习、人工智能。在运筹学、管理科学期刊Operations Research，Management Science发表论文十余篇；在机器学习、人工智能会议NeurIPS, ICML, KDD, AISTATS发表论文十余篇。现担任Operations Research、Naval Research Logistics等期刊Associate Editor。

导航

学术与科研

管理科学与工程系列学术讲座

认证机构

联系方式

学院周边

关注我们