Max Wolf's Second Brain

preference-based RL

Jul 30, 2025 · 1 min read

(with new Obsidian prompt)

https://aistudio.google.com/prompts/19KdvqJcSKmShtXXALJTBeCMWxUxlqgg2


Backlinks

  • Motif - Intrinsic Motivation from Artificial Intelligence Feedback
