Port [this tutorial](https://docs.jaxstack.ai/en/latest/JAX_machine_translation.html). The tutorial provides guidance on defining and training an encoder-decoder model.

Additional features:

- [ ] Update the parameter definitions to be in a dataclass for consistency with other Bonsai models
- [ ] Use KV-cache later in the tutorial for faster decoding
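
As a starting point for the dataclass item, here is a minimal sketch of what a config object could look like. The class name `ModelConfig`, its field names, the default values, and the `kv_cache_shape` helper are all hypothetical placeholders for discussion; they are not taken from the tutorial or from existing Bonsai models, and the real port should follow whatever convention the other Bonsai configs use.

```python
import dataclasses


@dataclasses.dataclass(frozen=True)
class ModelConfig:
    """Hypothetical hyperparameter container; names and defaults are placeholders."""

    vocab_size: int = 15000
    sequence_length: int = 20
    embed_dim: int = 256
    latent_dim: int = 2048
    num_heads: int = 8

    def __post_init__(self):
        # Fail early on a combination that cannot be split evenly across heads.
        if self.embed_dim % self.num_heads != 0:
            raise ValueError("embed_dim must be divisible by num_heads")

    @property
    def head_dim(self) -> int:
        # Per-head feature size derived from the other fields.
        return self.embed_dim // self.num_heads

    def kv_cache_shape(self, batch_size: int) -> tuple[int, int, int, int]:
        # One possible layout for a per-layer key (or value) cache buffer:
        # (batch, max decode length, heads, head_dim). This is an assumption.
        return (batch_size, self.sequence_length, self.num_heads, self.head_dim)


config = ModelConfig()
# Frozen dataclasses are modified by creating a copy with overrides.
small = dataclasses.replace(config, embed_dim=64, num_heads=4)
print(small.kv_cache_shape(batch_size=2))
```

Keeping the config frozen makes it hashable, which may be convenient if it is later passed as a static argument to jitted functions. Deriving the cache shape from the same object is one way to keep the KV-cache item consistent with the model definition.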