With TM verbosity turned all the way up, one of the reported indicators is the average sequence length. What I am after is the relation between accuracy and average sequence length for my data.
Is it possible to extract the average sequence length from the network somehow? I have tried extracting other variables from the network, but without success.
Moreover, does a sequence length of 0 mean that only the current input is used to make a prediction, i.e. a first-order prediction, or would that be the case for a sequence length of 1? The average sequence length seems to have a minimum at 0, so the former must be true, right?