It might be better to use a Bayesian approach in combination with auto diff.

# Tasks

- [ ] Evaluate whether this improves performance
- [ ] Implement Bayesian parameter initialization
- [ ] Test this
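
To make the proposal concrete, here is a minimal sketch of one possible reading of it, written in JAX. It is not the project's actual model: the linear model, the Gaussian prior, and the names `PRIOR_STD`, `NOISE_STD`, `neg_log_posterior`, `init_from_prior`, and `fit_map` are all assumptions made for illustration. "Bayesian parameter initialization" is read here as drawing the starting parameters from the prior, and the "Bayesian approach" as maximizing the log posterior (MAP estimation) with gradients obtained from `jax.grad`.

```python
import jax
import jax.numpy as jnp

# Assumed hyperparameters, not taken from the project.
PRIOR_STD = 10.0   # standard deviation of the zero-mean Gaussian prior
NOISE_STD = 0.5    # standard deviation of the observation noise


def neg_log_posterior(params, x, y):
    """Negative log posterior (up to a constant) for y = w * x + b."""
    w, b = params[0], params[1]
    resid = y - (w * x + b)
    neg_log_lik = 0.5 * jnp.sum((resid / NOISE_STD) ** 2)
    neg_log_prior = 0.5 * jnp.sum((params / PRIOR_STD) ** 2)
    return neg_log_lik + neg_log_prior


def init_from_prior(key):
    """Draw the initial (w, b) from the Gaussian prior."""
    return PRIOR_STD * jax.random.normal(key, (2,))


def fit_map(x, y, key, lr=1e-3, steps=500):
    """Plain gradient descent on the negative log posterior."""
    params = init_from_prior(key)
    # jax.grad differentiates with respect to the first argument (params).
    grad_fn = jax.jit(jax.grad(neg_log_posterior))
    for _ in range(steps):
        params = params - lr * grad_fn(params, x, y)
    return params
```

Calling `fit_map(x, y, jax.random.PRNGKey(0))` on data generated from a known line should recover a slope and intercept close to the true values, which gives a simple baseline for the "evaluate whether this improves performance" task. The step size and iteration count are tuned for this toy problem only and would need revisiting for a real model.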