Sarthak Langde

Results 4 issues of Sarthak Langde

I know from previous issues that Q8BERT was just an experiment to measure the accuracy of a quantized BERT model. But, given that the accuracy is good,...

question

Hey, thank you for the scripts for loading checkpoints and running benchmarks. I have a strange issue: ds_inference fp16 throughput is considerably slower than the results mentioned. But, the...

### Description Currently, the examples directory has one directory per model, and many of the notebooks differ by at most one line. It is quite hard to keep...

documentation
good first issue

### Description BaseModel load does not allow loading weights from models that are not based on the xTuring hub. We should either make it optional or let users provide more details of these...

enhancement