
[BUG] Loss signatures: CE Loss failure because of additional `params` argument

Open VukW opened this issue 1 year ago • 3 comments

Describe the bug

During loss computation it is assumed that every loss function takes three parameters: `prediction`, `target`, and `params`. However, this is not true for the CE loss, which takes only `prediction` and `target`, so using `loss_function: ce` fails.
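For illustration, a minimal sketch of the mismatch (the function below is a simplified stand-in for GaNDLF's actual CE implementation, not the real code):

import torch

# Simplified stand-in for the CE loss: it accepts only two arguments...
def ce(prediction: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    return torch.nn.functional.cross_entropy(prediction, target)

prediction = torch.randn(4, 3)
target = torch.randint(0, 3, (4,))
params = {}

# ...but the loss dispatch assumes every loss takes three, so this raises:
# TypeError: ce() takes 2 positional arguments but 3 were given
loss = ce(prediction, target, params)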

To Reproduce

Steps to reproduce the behavior: try to train any model with `loss_function: ce`.

Expected behavior

Training with `loss_function: ce` should run without raising a signature error; the CE loss should be callable through the same dispatch path as every other loss.


Additional context

The straightforward solution is to add an unused `params` argument to the CE function. However, I believe doing this would cause linter / Codacy failures, since the parameter would be defined but never used. In that case, the best option IMO is to define a standard class interface for the losses:

# different file, so that this can be used by segmentation, classification,
# regression, and compound/hybrid losses

from abc import ABC, abstractmethod

import torch


class LossFunctionInterface(ABC):
    @staticmethod
    @abstractmethod
    def compute_loss(
        predictions: torch.Tensor, targets: torch.Tensor, params: dict
    ) -> torch.Tensor:
        """
        Abstract method to compute the loss function.

        Args:
            predictions (torch.Tensor): The predicted values.
            targets (torch.Tensor): The target values.
            params (dict): The parameters.

        Raises:
            NotImplementedError: This method must be implemented in the derived class.
        """
        raise NotImplementedError


class DCCE(LossFunctionInterface):
    @staticmethod
    def compute_loss(
        predictions: torch.Tensor, targets: torch.Tensor, params: dict
    ) -> torch.Tensor:
        # ... move the DCCE calculation logic here ...
        ...

# in loss_and_metric.py it can be used as:
# loss_function.compute_loss(predictions, targets, params)

and the same for all other losses. This way, all the losses would have the same signature and could be used interchangeably, as sketched below. If the signature of any loss function differed, both Codacy and the IDE would warn the developer that something is wrong.
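With the interface above, the call site in loss_and_metric.py could stay uniform regardless of which loss is configured (a sketch; the registry dict is an assumption for illustration, not existing GaNDLF code):

# Hypothetical registry mapping config keys to loss classes:
LOSS_REGISTRY = {
    "dcce": DCCE,
    # "dc": ..., "ce": ..., one entry per loss class
}

loss_class = LOSS_REGISTRY[params["loss_function"]]
loss = loss_class.compute_loss(predictions, targets, params)  # same signature for every loss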

VukW avatar Apr 23 '24 18:04 VukW

The solution makes complete sense to me. We should do this for all the losses, not just `ce`. And on that note, perhaps `ce` is a bit ambiguous, and we should make it explicit: either `CEL` (i.e., cross entropy loss), `BCEL` (i.e., binary cross entropy loss), or `BCEL_logits` (i.e., binary cross entropy with logits).
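If we go that route, the explicit names could map directly onto the standard torch criteria (a sketch; only the key names are proposed here, the torch classes are standard PyTorch):

import torch.nn as nn

EXPLICIT_CE_LOSSES = {
    "CEL": nn.CrossEntropyLoss,           # cross entropy loss
    "BCEL": nn.BCELoss,                   # binary cross entropy loss
    "BCEL_logits": nn.BCEWithLogitsLoss,  # binary cross entropy with logits
}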

Thoughts?

sarthakpati avatar Apr 24 '24 13:04 sarthakpati

Stale issue message

github-actions[bot] avatar Jun 23 '24 19:06 github-actions[bot]

Stale issue message

github-actions[bot] avatar Aug 23 '24 19:08 github-actions[bot]