[5] Ahmed Touati, Jérémy Rapin, and Yann Ollivier. Does zero-shot reinforcement learning exist? ICLR 2023.
[6] Seohong Park, Dibya Ghosh, Benjamin Eysenbach, and Sergey Levine. HIQL: Offline goal-conditioned RL with latent states as actions. NeurIPS 2023.
[7] Aviral Kumar, Aurick Zhou, George Tucker, and Sergey Levine. Conservative Q-learning for offline reinforcement learning. NeurIPS 2020.