ONE icon indicating copy to clipboard operation
ONE copied to clipboard

[onert] Introduce full quantization

Open hseok-oh opened this issue 10 months ago • 0 comments

Let's support full quantization on runtime

  • [x] Introduce full quantization type: https://github.com/Samsung/ONE/pull/11497
  • [ ] Full quantization from circle model including minmax data & weight quantization
  • [ ] Generate circle model (buffer) including layer minmax & weight quantization for full quantization from f32 circle and minmax data
    • ~Use minmax-embedder library~
  • [ ] Introduce API to collect minmax data
  • [ ] Remove HDF5 dependency: https://github.com/Samsung/ONE/issues/12574
    • [ ] Runtime
    • [ ] minmax-embedder

Draft: #12903

hseok-oh avatar Apr 24 '24 08:04 hseok-oh