ipex-llm
Update NPU Llama
Description
1. Why the change?
- Remove some unused code
- Simplify the `padded_causal_mask` logic; no obvious performance change was observed
2. User API changes
No change.
3. Summary of the change
4. How to test?
- [ ] Unit test: Please manually trigger the PR Validation here by inputting the PR number (e.g., `1234`), and paste your action link here once it has finished successfully.