Kevin Richard
@VIGHNESH1521 try adding this:

    llm = CTransformers(model=cfg.MODEL_BIN_PATH,
                        model_type=cfg.MODEL_TYPE,
                        config={'max_new_tokens': cfg.MAX_NEW_TOKENS,
                                'temperature': cfg.TEMPERATURE,
                                'gpu_layers': 50})
    **llm = accelerator.prepare(llm)**
    return llm

I just use:

    config = {'max_new_tokens': cfg.MAX_NEW_TOKENS, 'temperature': cfg.TEMPERATURE, 'gpu_layers':...
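
For reference, here is a minimal sketch of how the `config` dict from the snippet above could be assembled on its own. The `cfg` object, its values, and the helper name `build_llm_config` are hypothetical stand-ins for illustration, not part of the original code; only the three keys come from the comment.

```python
from types import SimpleNamespace

# Hypothetical stand-in for the `cfg` module in the comment; values are illustrative only.
cfg = SimpleNamespace(MAX_NEW_TOKENS=256, TEMPERATURE=0.01)


def build_llm_config(cfg, gpu_layers=50):
    """Collect the generation settings plus the number of layers to offload to the GPU."""
    return {
        'max_new_tokens': cfg.MAX_NEW_TOKENS,
        'temperature': cfg.TEMPERATURE,
        'gpu_layers': gpu_layers,  # 0 would keep every layer on the CPU
    }


config = build_llm_config(cfg)

# With LangChain installed, the dict would be passed the same way as in the comment:
# llm = CTransformers(model=cfg.MODEL_BIN_PATH, model_type=cfg.MODEL_TYPE, config=config)
```

Keeping `gpu_layers` as a parameter makes it easy to try different offload counts without touching the other settings.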