GenerativeAIExamples icon indicating copy to clipboard operation
GenerativeAIExamples copied to clipboard

ImportError: Apex was not found. Please see the NeMo README for installation instructions: https://github.com/NVIDIA/NeMo#megatron-gpt.

Open pradeepdev-1995 opened this issue 1 year ago • 3 comments

I am trying the below Lora training notebook https://github.com/NVIDIA/GenerativeAIExamples/blob/main/models/Gemma/lora.ipynb When running the below code segment

from nemo.collections.nlp.parts.megatron_trainer_builder import MegatronLMPPTrainerBuilder
from nemo.utils.exp_manager import exp_manager

trainer = MegatronLMPPTrainerBuilder(cfg).create_trainer()
exp_manager(trainer, cfg.exp_manager)

I am getting the error

ImportError: Apex was not found. Please see the NeMo README for installation instructions: https://github.com/NVIDIA/NeMo#megatron-gpt.

So as per the official documentation when I try to install Apex,

git clone https://github.com/NVIDIA/apex.git
cd apex
git checkout b496d85fb88a801d8e680872a12822de310951fd
pip install -v --no-build-isolation --disable-pip-version-check --no-cache-dir --config-settings "--build-option=--cpp_ext --cuda_ext --fast_layer_norm --distributed_adam --deprecated_fused_adam" ./

this installation raises the given error


Usage:
  pip install [options] <requirement specifier> [package-index-options] ...
  pip install [options] -r <requirements file> [package-index-options] ...
  pip install [options] [-e] <vcs project url> ...
  pip install [options] [-e] <local project path> ...
  pip install [options] <archive url/path> ...
no such option: --config-settings

Screenshot from 2024-04-04 10-51-03

pradeepdev-1995 avatar Apr 08 '24 09:04 pradeepdev-1995

Thanks for reporting this. We are checking this and will get back to you shortly. @nealvaidya please help check this issue.

shubhadeepd avatar Apr 22 '24 12:04 shubhadeepd

@pradeepdev-1995 did you try using the NeMo container which should have the dependencies installed? image

shashank3959 avatar May 01 '24 23:05 shashank3959

@shashank3959 Yes i am using this container

pradeepdev-1995 avatar May 02 '24 04:05 pradeepdev-1995