Kevin-Yang
When I cloned and ran your GPT2 example code, a runtime error occurred while loading the model. How can I run this code? - tried tflite version...
- [ ] Resource Reduced Model
- [ ] Distribute Strategy
- [ ] README for Korean
- [ ] preprocessing with method1
- [x] add sample audio url
The tutorial (https://giftless.datopian.com/en/latest/quickstart.html) does not work; it raises the following error. - Expected result (in documents) ``` INFO 127.0.0.1 - - "POST /my-organization/test-repo/objects/batch HTTP/1.1" 200 - INFO 127.0.0.1 - - "PUT...
GPU hang issue
## How to reproduce ```python tokenizer = AutoTokenizer.from_pretrained(model_name, bos_token='[BOS]', eos_token='[EOS]', unk_token='[UNK]', pad_token='[PAD]', mask_token='[MASK]') model = AutoModelForCausalLM.from_pretrained(model_name)#.to(device='cuda', non_blocking=True) _ = model.eval() parallelformers.parallelize(model, num_gpus=4, fp16=True, verbose='detail') tok = tokenizer("My name is Kevin."*10,...
## Describe a TODO feature - Force tp_wrapper not to parallelize the embedding layer if the model has no embedding layer (for vision model compatibility). https://discord.com/channels/729741769192767510/1012603449910759504/1083785802930192434 ## Assignees - @jason9693
Also, I found that the `_remove_repeated_phrase` function is wrong. It should be fixed later.