Sebastian Nicolas

Results 7 comments of Sebastian Nicolas

> > Getting this output now after
> > ```
> > exo --inference-engine pytorch --run-model llama-3.1-8b
> > ```

I'm assuming this comment was meant for #139?

@AlexCheema I added support for benchmarking on the MLX inference engine. Right now it only benchmarks f32 and f16 calculations because mlx doesn't support matrix multiplication for int8. Not sure...
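A minimal sketch of what a dtype-restricted matmul benchmark can look like. This is illustrative only, not the actual MLX benchmark code: it uses NumPy as a stand-in, and the function name `benchmark_matmul` and the sizes are made up for this example.

```python
import time
import numpy as np

def benchmark_matmul(dtype, size=512, iters=10):
    # Time repeated square matrix multiplications at the given dtype
    # and report an approximate throughput in TFLOPS.
    a = np.random.rand(size, size).astype(dtype)
    b = np.random.rand(size, size).astype(dtype)
    start = time.perf_counter()
    for _ in range(iters):
        a @ b
    elapsed = time.perf_counter() - start
    flops = 2 * size**3 * iters  # ~2*n^3 flops per matmul
    return flops / elapsed / 1e12

# Only float32/float16 are benchmarked; int8 matmul is the
# unsupported case mentioned above, so it is skipped entirely.
for dtype in (np.float32, np.float16):
    print(f"{np.dtype(dtype).name}: {benchmark_matmul(dtype):.3f} TFLOPS")
```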

@AlexCheema I created a benchmark for tinygrad, cleaned up the mlx benchmark, and addressed your requests from the last review. I also removed the dict from device_capabilites and...

@AlexCheema DeviceCapabilites are now lazily computed. PTAL
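For readers unfamiliar with the pattern: "lazily computed" here means the capabilities probe is deferred until first access and then cached. A minimal sketch of that idea, with a hypothetical `_probe_hardware` standing in for whatever the real detection/benchmark code does:

```python
from functools import cached_property

class DeviceCapabilities:
    """Sketch of lazy computation: the (potentially slow) hardware
    probe runs only on first access to .tflops, then is cached."""

    probe_calls = 0  # for demonstration: count how often the probe runs

    @cached_property
    def tflops(self):
        return self._probe_hardware()

    def _probe_hardware(self):
        # Placeholder for a real capability probe or micro-benchmark.
        DeviceCapabilities.probe_calls += 1
        return 10.0

caps = DeviceCapabilities()
print(caps.tflops)  # probe runs here, on first access
print(caps.tflops)  # cached value; the probe does not run again
```

`functools.cached_property` keeps construction cheap while guaranteeing the probe runs at most once per instance.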

@AlexCheema resolved merge conflicts.