maybe you want more neurons in the backbone and fewer in the task-specific part, or maybe you want to set specific activation types for the attention
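
One way to make per-component choices like these explicit is to give each part of the model its own small config. The sketch below is only an illustration: `ComponentConfig`, `ModelConfig`, the `ACTIVATIONS` registry, and all of the layer sizes are hypothetical names and values, not taken from any particular library.

```python
import math
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical activation registry; the names and entries are illustrative.
ACTIVATIONS: Dict[str, Callable[[float], float]] = {
    "relu": lambda x: max(0.0, x),
    "tanh": math.tanh,
    "sigmoid": lambda x: 1.0 / (1.0 + math.exp(-x)),
}


@dataclass
class ComponentConfig:
    """Layer widths and activation name for one part of the model."""
    hidden_units: List[int]
    activation: str = "relu"

    def activation_fn(self) -> Callable[[float], float]:
        # Fail early on an unknown activation name.
        if self.activation not in ACTIVATIONS:
            raise ValueError(f"unknown activation: {self.activation!r}")
        return ACTIVATIONS[self.activation]


@dataclass
class ModelConfig:
    """Separate settings per component (all default values are assumptions)."""
    # Wider backbone, narrower task-specific part.
    backbone: ComponentConfig = field(
        default_factory=lambda: ComponentConfig([256, 256])
    )
    task: ComponentConfig = field(
        default_factory=lambda: ComponentConfig([32])
    )
    # Attention gets its own activation type.
    attention: ComponentConfig = field(
        default_factory=lambda: ComponentConfig([64], activation="tanh")
    )


def total_units(component: ComponentConfig) -> int:
    """Sum of the layer widths for one component."""
    return sum(component.hidden_units)


config = ModelConfig()
```

Overriding a single component then stays local, e.g. `ModelConfig(task=ComponentConfig([16], activation="sigmoid"))` changes only the task-specific part and leaves the backbone and attention defaults alone.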