You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you again for your wonderful implementation!
I have a question about the implementation: What's the role of the test data during the evaluation? As for as I know, a typical evaluation is the agent interacting with the environment, which doesn't need test data. So is there any difference with your implementation? And how to tell if an episode succeeds or not with test data?