A similar question was asked a while back on this thread. Quick summary of what I wrote there: capacity is a bit more complicated than just looking at the two parameters you mentioned. You also have to consider other parameters, such as the activation threshold (which trades capacity against noise tolerance) and how diverse the sequences being learned are (repeating some inputs more often than others lowers capacity).
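To make the threshold trade-off a bit more concrete, here is a rough sketch (not taken from any particular implementation) that estimates how often a random sparse pattern would falsely activate a single segment. The names `n` (total cells), `a` (active cells), `s` (synapses on the segment), and `theta` (activation threshold) are my own labels for illustration, and the model assumes uniformly random patterns, which real sequences usually are not.

```python
from math import comb

def false_match_probability(n, a, s, theta):
    """Chance that a random pattern with `a` active cells out of `n`
    overlaps a segment's `s` synapses in at least `theta` places.

    Hypothetical helper: a hypergeometric tail, assuming the active
    cells are chosen uniformly at random.
    """
    total = comb(n, a)
    # Count patterns with exactly b active cells among the s synapses.
    hits = sum(
        comb(s, b) * comb(n - s, a - b)
        for b in range(theta, min(s, a) + 1)
    )
    return hits / total

if __name__ == "__main__":
    # Illustrative numbers only; plug in your own parameters.
    for theta in (5, 10, 15):
        p = false_match_probability(n=2048, a=40, s=20, theta=theta)
        print(f"theta={theta}: false match probability ~ {p:.3e}")
```

Under these assumptions, a lower threshold lets a segment fire with more of its inputs missing (better noise tolerance) but also makes accidental matches more likely, which is one way it can eat into usable capacity.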