coremltools
GPU acceleration for palettization
GPU acceleration would help a lot to speed up compression. My current plan is to use an NVIDIA GPU since they have more compute, but support for Apple silicon GPUs would probably be useful to more people. I also know it is currently possible to compress weights externally, but I think it would be more user-friendly to integrate this into the cto API.
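For context, here is a minimal sketch of what k-means palettization computes, assuming plain Lloyd's iterations over scalar weights. This is not the coremltools implementation; `palettize` and `reconstruct` are hypothetical names used only for illustration, and it is written in pure Python for clarity rather than speed.

```python
import random


def palettize(weights, nbits=2, iters=20, seed=0):
    """Cluster scalar weights into 2**nbits centroids (a lookup table)."""
    k = 2 ** nbits
    rng = random.Random(seed)
    # Initialise the lookup table with k weights sampled without replacement.
    lut = rng.sample(weights, k)

    def nearest(w):
        # Index of the centroid closest to weight w.
        return min(range(k), key=lambda j: (w - lut[j]) ** 2)

    for _ in range(iters):
        # Assignment step: every weight is handled independently.
        indices = [nearest(w) for w in weights]
        # Update step: move each centroid to the mean of its members;
        # an empty cluster keeps its previous centroid.
        for j in range(k):
            members = [w for w, i in zip(weights, indices) if i == j]
            if members:
                lut[j] = sum(members) / len(members)

    # Final assignment against the final lookup table.
    indices = [nearest(w) for w in weights]
    return lut, indices


def reconstruct(lut, indices):
    """Rebuild approximate weights from the lookup table and the indices."""
    return [lut[i] for i in indices]


weights = [0.0, 0.1, 0.05, 1.0, 1.1, 1.05, 5.0, 5.1, 9.0, 9.2]
lut, indices = palettize(weights, nbits=2)
approx = reconstruct(lut, indices)
```

The assignment step compares every weight against every centroid on each iteration, and each weight is independent of the others, which is why I would expect this part to map well onto a GPU.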
Please give us some more details here. How big is your model? What are you trying to do? What code are you running to do that? How long is it taking? What are the rough specs of the machine you are using?