drQedwards
Q-Promise Logic Loop Summary
1. Data is appended as JSON objects: each agent output is parsed (comments stripped, regex for {...}), loaded via json.loads, and appended to self.data.
2. Standard...
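Step 1 above can be sketched roughly as follows. This is a minimal illustration, not the author's code: the class name `AgentLog`, the method `append_output`, and both regular expressions are hypothetical, and the sketch assumes that "comments" means whole lines beginning with `#` or `//`. Only `json.loads` and `self.data` come from the summary itself.

```python
import json
import re

# Assumption: "comments" are whole lines that start with '#' or '//'.
# Inline comments are left alone so that strings such as URLs are not damaged.
_COMMENT_LINE = re.compile(r"^[ \t]*(?://|#).*$", re.MULTILINE)

# Greedy match from the first '{' to the last '}' in the remaining text.
_JSON_OBJECT = re.compile(r"\{.*\}", re.DOTALL)


class AgentLog:
    """Hypothetical container that collects parsed agent outputs."""

    def __init__(self):
        self.data = []

    def append_output(self, raw):
        """Parse one agent output and append it; return True on success."""
        stripped = _COMMENT_LINE.sub("", raw)
        match = _JSON_OBJECT.search(stripped)
        if match is None:
            return False  # no {...} span found
        try:
            obj = json.loads(match.group(0))
        except json.JSONDecodeError:
            return False  # span was not valid JSON
        self.data.append(obj)
        return True


if __name__ == "__main__":
    log = AgentLog()
    log.append_output('# agent note\nresult: {"a": 1, "b": [2, 3]}\n// done')
    print(log.data)
```

Returning a boolean instead of raising keeps a single malformed agent output from interrupting the loop; whether the original code behaves this way is not stated in the summary.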
### Overview of the PMLL Compression Algorithm
The Persistent Memory Logic Loop (PMLL) architecture introduces a novel approach to memory-efficient inference in large language models (LLMs) by augmenting standard Transformers...
And another one, Mr. Karpathy. I’m ready to turn in my homework. Please also see https://www.authorea.com/users/856117/articles/1297819-the-persistent-memory-logic-loop-from-proposal-design-to-formal-proof-of-p-np-using-the-pmll-algorithm
/* pmll_cuda.h - Persistent-Memory Logic Loop (CUDA reference)
   Mirrors pmll.h but stores KV on the GPU. */
#pragma once
#include
#ifndef MAX_MEM_T
#define MAX_MEM_T 128
#endif
typedef struct { int...
The PMLL
https://www.authorea.com/users/856117/articles/1297819-the-persistent-memory-logic-loop-from-proposal-design-to-formal-proof-of-p-np-using-the-pmll-algorithm
======== NVIDIA CUDA Build Simulation ========
>>> 1. Front-end (host compilation)
• Using nvcc 12.5 (CUDA toolkit 12.5.52)
• Detected GCC 12.3.0 host compiler
  - parsing gpt2_pmll.cu ………………………………… done
  -...