Gaurav Chaudhary

about
publications
projects
research
services

Announcement_9

Created on March 02, 2026

2026

The preprint of the final work from my PhD, Match or Replay: Self-Imitating Proximal Policy Optimization, is on arXiv.

© Copyright 2026 Gaurav Chaudhary. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash.