Thank you for this great code!
Just one question regarding the line 16 of prioritized_memory.py:
Should not the absolute value of the TD-error be taken in return (error + self.e) ** self.a ?
This is what is done in the original paper.
In addition, if not using abs(), I get the error can't convert complex to float due to (error + self.e) being negative.
Thank you for this great code!
Just one question regarding the line 16 of
prioritized_memory.py:Should not the absolute value of the TD-error be taken in
return (error + self.e) ** self.a?This is what is done in the original paper.
In addition, if not using
abs(), I get the errorcan't convert complex to floatdue to(error + self.e)being negative.