mu_zero/example2 library
Functions
- argMax(List<double> list) → int
- generateMuZeroPure(MuZeroGreedyAgent agent, List<int> prompt, int maxLength, Map<int, String> itos) → void
  Pure MuZero Generation: Only uses Representation ONCE, then relies on Dynamics
- main() → void
- trainMuZero(MuZeroGreedyAgent agent, List<int> targetSequence, int epochs) → void
  Training with Unrolled Dynamics to prevent "Latent Collapse"
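
The listing gives signatures only, so the following is a minimal sketch of what "uses Representation once, then relies on Dynamics" could look like in practice. It is not the library's implementation. The `LatentModel` interface, `generateGreedy`, and `CyclicToyModel` are hypothetical names introduced for illustration, and the `argMax` body is an assumed implementation matching the listed signature (first index wins on ties). Unlike `generateMuZeroPure`, which returns `void` and takes an `itos` map, the sketch returns the generated token ids so the result can be inspected.

```dart
// Hypothetical sketch: none of these bodies or types come from the library.

/// Index of the largest value; the first index wins on ties.
int argMax(List<double> list) {
  if (list.isEmpty) {
    throw ArgumentError('argMax requires a non-empty list');
  }
  var best = 0;
  for (var i = 1; i < list.length; i++) {
    if (list[i] > list[best]) best = i;
  }
  return best;
}

/// Assumed model interface: representation, prediction, and dynamics.
abstract class LatentModel {
  List<double> represent(List<int> tokens);
  List<double> predict(List<double> state);
  List<double> dynamics(List<double> state, int action);
}

/// Greedy generation: encode the prompt once, then roll forward in latent space.
List<int> generateGreedy(LatentModel model, List<int> prompt, int maxLength) {
  final out = List<int>.of(prompt);
  var state = model.represent(prompt); // the only call to represent()
  while (out.length < maxLength) {
    final action = argMax(model.predict(state));
    out.add(action);
    state = model.dynamics(state, action); // tokens are never re-encoded
  }
  return out;
}

/// Toy model over a 3-token vocabulary: the state is a one-hot vector of the
/// last token, and the predicted next token is (last + 1) % 3.
class CyclicToyModel implements LatentModel {
  static const vocab = 3;

  List<double> _oneHot(int i) =>
      List<double>.generate(vocab, (j) => j == i ? 1.0 : 0.0);

  @override
  List<double> represent(List<int> tokens) => _oneHot(tokens.last);

  @override
  List<double> predict(List<double> state) =>
      _oneHot((argMax(state) + 1) % vocab);

  @override
  List<double> dynamics(List<double> state, int action) => _oneHot(action);
}
```

With the toy model, `generateGreedy(CyclicToyModel(), [0], 5)` yields `[0, 1, 2, 0, 1]`: the prompt is encoded once, and every later state comes from `dynamics` alone.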