Cal Mitchell

8 issues by Cal Mitchell

I'd like to be able to SSH into the machine manually to verify things. Is there a way to specify which SSH key pair the EC2 instance uses? Sorry if this...

Hi, I noticed that the PyTorch `nn.LayerNorm` class now takes a `bias` arg. This PR removes the custom `LayerNorm` class and replaces it with the built-in. I tested the qualitative...

Hi @karpathy, I recently modified this repo to shard everything with PyTorch FSDP and would like to contribute this functionality to the project. If I cleaned up the code and...

Hello, thanks very much for the excellent work on this repo. There are several examples showing how to create a question-response style dataset, but I can't immediately tell how to...

#### Context

This PR seeks to resolve issue #809 by adding the ability to pack and tokenize input/label pairs from a Hugging Face dataset and then continue a model's pretraining phase...

CLA Signed

I'm finding this repo to be a user-friendly, extensible, memory-efficient solution for training/fine-tuning models. However, when it comes to inference, there is a usability gap that could be...

enhancement

In the docs, it says that when quantizing to anything other than int8, many operations will fall back to fp32. However, looking through the code (and inserting some print lines)...

### What happened?

I am loading a large number of tables in a for loop with bcp. It works well most of the time but sometimes errors out (on...

bug
Pending Response