Weight updates for branch heads

Hi, I noticed that for every training sample the network outputs predictions for all 5 output branches, but the loss is then (correctly) calculated using the output from the branch that corresponds to that sample's high-level command and summing those losses for all samples in the batch to get the total_loss tensor. Is this total loss value then used to update all 5 branches? Or is an individual loss for each branch calculated somewhere only using the samples that they are supposed to predict on given the high-level command? 

Hopefully the question is clear, I can try to rephrase if it isn't!

Thanks a lot for this repo it has been very useful! 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Weight updates for branch heads #17

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Weight updates for branch heads #17

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions