Portfolio

Traffic Rule Compliant AVs using end-to-end RL

Ongoing project on using end-to-end RL with Reward Machines to encode traffic rule compliance into the RL learned policy.

Solving the queens game from linkedin using the Theorem Prover z3.