Piotr Żelasko issues

Results 60 issues of


                                            Piotr Żelasko

Dynamic bucket selection rng sync

Follow up to #863 and #1309 This version seems to work as intended, it consistently picks the same buckets on each DDP rank. It depends on good `duration_bins` initialization (i.e....

[do-not-merge] SpeechLLM dev branch

# What does this PR do ? This PR is for tracking the changes in speech-llm main development branch w.r.t. main. **Collection**: multimodal # Changelog - Add specific line by...

stale

ASR

common

Multi Modal

Support configurable extra fields for LazyNeMoTarredIterator

# What does this PR do ? Add a one line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will affect] # Changelog -...

common

Run CICD

Installation issue with numpy 2.0

Lhotse CI is breaking on lilcom installation, not 100% sure why, but I think it is related to numpy 2.0 release. First, Lhotse tests were failing on `numpy not available`,...

Make torchaudio an optional dependency

EMMeTT support in SpeechLLM + tutorial for Lhotse Multimodal Dataloading

# What does this PR do ? This PR extends NeMo and SpeechLLM with the following: * EMMeTT (optimized training) support for SpeechLLM * Support for joint audio and text...

ASR

common

Multi Modal

SpeechLM2 collection

> [!IMPORTANT] > The `Update branch` button must only be pressed in very rare occassions. > An outdated branch is never blocking the merge of a PR. > Please reach...

ASR

common

Run CICD

audio

Piotr Żelasko

Dynamic bucket selection rng sync

[do-not-merge] SpeechLLM dev branch

Support configurable extra fields for LazyNeMoTarredIterator

Installation issue with numpy 2.0

Make torchaudio an optional dependency

EMMeTT support in SpeechLLM + tutorial for Lhotse Multimodal Dataloading

SpeechLM2 collection

Compiling issues in GitHub CI

Fix feature extractor to be invariant to padding

Speechlm2 SALM improvements