1 comments

  • kaushikbokka 2 days ago

    The future of RL observability could look like this:

    you’re working alongside your model, spawning multiple versions of your environment by tweaking components at different points, much like using git worktrees for RL experiments.