Add ops needed for new hybrid models: SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM
The ops needed for the new hybrid models including Qwen3 Next and Kimi Linear.
Prerequisite to merging https://github.com/ggml-org/llama.cpp/pull/16095
@gabe-l-hart guess you'll be interested in this one as well :)
@slaren @ggerganov Should be ready for final review.
@ggerganov Aight, paralellized CUMSUM, added docs, removed TRI_KEEP, renamed TRI_KEEP to TRI, added CONST with const1234d helpers.
Aight, @ggerganov @slaren @CISC it's ready to merge I think.
As a constructive feedback for the future, try to split the changes in ggml in even smaller parts. It would improve the review process because there are many little details (naming, API design, code formatting) that are not obvious at first and it takes some time to get accustomed to them.
Will do, I'm still getting used to the granularity of the review process here :)