BCELoss vs BCEWithLogitsLoss - PyTorch Forums
discuss.pytorch.org › t › bceloss-vsJan 02, 2019 · As you described the only difference is the included sigmoid activation in nn.BCELoss. Just to clarify, if using nn.BCEWithLogitsLoss(target, output), output should be passed through a sigmoid and only then to BCEWithLogitsLoss? I don’t understand why one would pass it through a sigmoid twice because x is already a probability after passing through one sigmoid.