Alexandros Koumparoulis

Results 81 issues of Alexandros Koumparoulis

# What does this PR do ? Add a one line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will affect] # Changelog -...

NLP
CI

# What does this PR do ? With EP enabled mcore_gpt is required (previously was checking megatron_amp_O2 whether it was enabled). **Collection**: [Note which collection this PR will affect] #...

stale
NLP

# What does this PR do ? Add expert_model_parallel_size to yamls Add a one line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will...

stale
NLP

# What does this PR do ? Allow passing fp16_lm_cross_entropy to mcore to reduce memory requirements. *needs* MR in Mcore first. **Collection**: [Note which collection this PR will affect] #...

stale
NLP

# What does this PR do ? Add a one line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will affect] # Changelog -...

PL installs pretty_errors which alters `sys.excepthook` from `sys.__excepthook__` to a text formatting function the pretty_errors library defines. However, the format used is incomplete as it misses filepaths, which hinders debugging....

Run CICD

# What does this PR do ? Enable saving/restoring checkpoint when using mcore dist opt in NeMo. **Collection**: [Note which collection this PR will affect] # Changelog - Add specific...

core
NLP
Run CICD

# Description As the title says, instead of querying pip to determine .so locations, this will try first try to look-up to directories from the __init__.py file and if the...

# What does this PR do ? Fixes a bug in mistral converter, affecting mistral-7b-instruct models (only the instruct variants). Re-issue: special tokens (e.g. [INST]) were tokenized instead of being...

NLP
Run CICD

# What does this PR do ? Add a one line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will affect] # Changelog -...

Run CICD