Variational reward estimator bottleneck: Towards robust reward estimator for multidomain task-oriented dialogue
Jeiyoon Park, Chanhee Lee, Chanjun Park, Kuekyeng Kim, Heuiseok Lim
Research output: Contribution to journal › Article › peer-review