Directly Estimating the Variance of the lambda-Return Using Temporal-Difference Methods
I’ve got a new paper on arXiv on using TD methods to directly estimate the variance of the lambda-return.
I’ve got a new paper on arXiv on using TD methods to directly estimate the variance of the lambda-return.