Taylor Yeonbok Lee
Taylor Yeonbok Lee
### Details: - *item1* - *...* ### Tickets: - *ticket-id*
### Details: - Previously, memory conflict check was done using std::string (primitive_id), and it was time consuming - Fixed to use unique_id as mem_dep, instead of std::string ### Tickets: -...
### Details: - Optimize memory pool by reducing conflict check time ### Tickets: - 140135
### Details: - Not to assign a memory slot for a memory request < 50% of the available memory ### Tickets: - 137490
### Details: - Relaxed memory reclaim policy - Fixed memory pool not to assign too big memory to much smaller request ### Tickets: - 137490 - 132376
Hello, I followed the instruction (install opencl-icd and set library path to it) (llm) D:\dev\taylor\pti-gpu\tools\onetrace\build>cmake -G "NMake Makefiles" -DCMAKE_BUILD_TYPE=Release -DCMAKE_LIBRARY_PATH=D:\dev\taylor\OpenCL-ICD-Loader\install .. However the build was crashed as below: data:image/s3,"s3://crabby-images/d27b8/d27b8c33a1ceab01288118221b8b0f564b600c8f" alt="image" Do...
### Details: - Fixed crash & accuracy issue in beam search scenario when initial input is not batched ### Tickets: - 140755
### Details: - Reorder which is only converting type can be fused to the prior node ### Tickets: - 144957
### Details: - Fuse QKV FCs to one FC ### Tickets: - 142815
### Details: - Added detailed description about the kv cache prealloc policy ### Tickets: - *ticket-id*