Open
Description
- which label to use for "parameter optimization"? "optimization" is taken by "CPU performance optimizations", NuPIC used "swarming" which is more specific than necessary, but already used in the community to mean "parameter optimization" (also shorter)
- improve docs how to actually run the optimization
- make opt. run in CI as a default example (some short swarm)
- there's a "bug" in the way we describe and run the swarming now! As we're faking. We are testing the results on the test set, but that is a mete-parameter and our behavior leads to overfitting. We should use
- cross-validation
- train/eval/test split (where test is out-of-sample and never touched)
- kill optimization when plateauing? (changes start to be too insignificant)
- implement other methods: EA/GA (genetic algorithms), simulated-annealing, but rather I'd see a 3rd party framework, a proper interface, and this project moved to a separate htm-community repo.