2 comments

  • dippogriff 5 hours ago

    Great work showing on how brittle these GUI benchmarks can be! Love the visuals.

    I wonder if SFT is the problem here as opposed to the coordinate discretization; what happens with continuous action space?

    • 5 hours ago
      [deleted]