SHARK

llama2 - FP16 - IR generation

Open PhaneeshB opened this issue 1 year ago • 1 comment

  • Fix combine mlir for llama2 FP16 IR generation by deduplicating constant variable names, using an appropriate suffix for the first llama and for the second llama.
  • Needs to be tested for IR generation with int4 to confirm that it doesn't break the generation flow for quantised IRs.
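
As a rough illustration of the suffixing idea (not the actual code in this change), the sketch below renames every global constant defined in an MLIR text module by appending a per-module suffix, so that two modules can be concatenated without symbol clashes. The function name `suffix_globals`, the `_first`/`_second` suffixes, and the assumption that constants appear as `util.global ... @name` definitions are all hypothetical; the real IR may declare its constants differently.

```python
import re

# Assumption: constants are declared as lines like
#   util.global private @cst_0 = dense<1.0> : tensor<4xf16>
# This pattern and the helper below are illustrative, not SHARK's API.
GLOBAL_DEF = re.compile(
    r"util\.global\s+(?:private\s+)?(?:mutable\s+)?@([\w.$]+)"
)


def suffix_globals(mlir_text: str, suffix: str) -> str:
    """Append `suffix` to every global symbol defined in `mlir_text`,
    at both its definition and its uses."""
    names = set(GLOBAL_DEF.findall(mlir_text))
    if not names:
        return mlir_text
    # Try longer names first, and require that the match is not followed
    # by another identifier character, so @cst_0 never matches in @cst_01.
    alternation = "|".join(
        re.escape(n) for n in sorted(names, key=len, reverse=True)
    )
    ref = re.compile(rf"@({alternation})(?![\w.$])")
    return ref.sub(lambda m: f"@{m.group(1)}{suffix}", mlir_text)


if __name__ == "__main__":
    first = (
        "util.global private @cst_0 = dense<1.0> : tensor<4xf16>\n"
        "%0 = util.global.load @cst_0 : tensor<4xf16>"
    )
    second = (
        "util.global private @cst_0 = dense<2.0> : tensor<4xf16>\n"
        "%0 = util.global.load @cst_0 : tensor<4xf16>"
    )
    # Each module gets its own suffix before the two are combined.
    print(suffix_globals(first, "_first"))
    print(suffix_globals(second, "_second"))
```

Only symbols that the module itself defines are renamed, so references to function symbols such as `@main` are left alone; those would need separate handling when the two modules are merged.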

PhaneeshB, Nov 08 '23 12:11

Converted to draft till sanity check is done for int4 IR generation.

PhaneeshB, Nov 08 '23 12:11