Question about Maltese Dataset Consistency - Extension of the previous S2S release (November 30, 2023)
Hello everyone,
Issue Description
Observation
The Maltese dataset in the November 30, 2023 release appears to be strictly identical to the previous version, with no observable extension.
The dataset metadata is documented here:
https://github.com/facebookresearch/seamless_communication/blob/main/docs/m4t/seamless_align_README.md
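For anyone who wants to reproduce the comparison, here is a minimal sketch. It assumes both versions of the Maltese metadata file have already been downloaded locally. The helper names and the file paths in the usage comment are hypothetical placeholders, not the actual names used by the release.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large metadata files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_identical(old: Path, new: Path) -> bool:
    """True when both files have the same size and the same SHA-256 digest."""
    if old.stat().st_size != new.stat().st_size:
        return False
    return sha256_of(old) == sha256_of(new)


# Hypothetical usage (placeholder paths, replace with the real downloads):
# files_identical(Path("maltese_previous.tsv.gz"), Path("maltese_2023-11-30.tsv.gz"))
```

A result of `True` means the two files are byte-for-byte the same, which would match the observation above.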
Inquiry
I am opening this issue to ask whether this is intentional or an oversight. Is it expected that the extended Maltese dataset shows no changes compared to the prior version?
Action
- Could someone from the team please explain the dataset update process for Maltese?
Thank you!