models
models copied to clipboard
MM Pytorch API: TabularTransform for input tabular sequence
Fixes # (issue)
Goals :soccer:
- Add support for padding, transforming, and masking sequential inputs data in MM Pytorch backend
- The implemented transform classes should: - Support multiple targets - Be used for training, evaluation, and inference
Implementation Details :construction:
- [x] Implement
TabularBatchPaddingto pad a group of sequential inputs - [x] Implement
TabularPredictNextfor generating targets of causal next item prediction - [ ] Implement
TabularPredictLastfor generating targets of last item prediction - [ ] Implement
TabularPredictRandomfor generating targets of predicting one random item and truncate the sequence so that the random item is at the last position. - [x] Implement
TabularMaskRandomfor masked language modeling training (MLM) strategy - [x] Implement
TabularMaskLastfor masking last item in the sequence, generally used to evaluate models trained with MLM.
Testing Details :mag:
- Defined tests for padding and the different sequence transformations