[new paper!] QGFN: Controllable Greediness with Action Values