aiwiki icon indicating copy to clipboard operation
aiwiki copied to clipboard

MO KD

Open junxnone opened this issue 3 years ago • 0 comments

知识蒸馏

  • Knowledge Distillation 知识蒸馏
  • Teacher/Student Model
    • Teacher Model 的输出 作为 Soft Target Training Student Model
    • Student 学习 Teacher 泛化能力
  • 用途
    • 模型压缩(小模型学习大模型的泛化能力)
  • History
  • Distilling the Knowledge in a Neural Network

image

Trend

Reference

junxnone avatar Apr 12 '21 02:04 junxnone