Three f(ρ) Functions in the Off-Policy Family
Each f(ρ) must lie in the shaded valid region between f=1 and f=ρ. Adjust ε and α to explore.
ε (clip range):
0.20