Traffic Rule Compliant AVs using end-to-end RL
Ongoing project that uses end-to-end RL with reward machines to encode traffic-rule compliance into the learned policy.
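As a rough illustration of the idea, a reward machine can be viewed as a small finite-state machine that consumes high-level events and emits rewards. The sketch below is not the project's code: the red-light rule, the state and event names, the reward values, and the helper `make_red_light_rm` are all hypothetical, chosen only to show the mechanism.

```python
from dataclasses import dataclass, field


@dataclass
class RewardMachine:
    """Finite-state machine mapping (state, event) to (next_state, reward)."""
    initial: str
    transitions: dict  # {(state, event): (next_state, reward)}
    state: str = field(init=False)

    def __post_init__(self):
        self.state = self.initial

    def reset(self):
        """Return to the initial state at the start of an episode."""
        self.state = self.initial

    def step(self, event):
        """Advance on one event and return the reward for that transition.

        Pairs missing from the table self-loop with zero reward.
        """
        next_state, reward = self.transitions.get(
            (self.state, event), (self.state, 0.0)
        )
        self.state = next_state
        return reward


def make_red_light_rm():
    # Hypothetical rule: stop at a red light before crossing the line.
    return RewardMachine(
        initial="driving",
        transitions={
            ("driving", "see_red"): ("at_red", 0.0),
            ("at_red", "stop"): ("stopped", 1.0),
            ("at_red", "cross_line"): ("violation", -10.0),
            ("stopped", "green"): ("driving", 0.0),
            ("stopped", "cross_line"): ("violation", -10.0),
        },
    )
```

In a training loop, a labelling function (assumed here, not shown) would turn raw observations into events such as `see_red`, and the current machine state would typically be appended to the agent's observation so the policy can condition on rule progress.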
Solving the Queens game from LinkedIn using the Z3 theorem prover.