text-generation-webui icon indicating copy to clipboard operation
text-generation-webui copied to clipboard

Something changed in 3.12 degrading models vs 3.9

Open clover1980 opened this issue 7 months ago • 4 comments

Describe the bug

Does something which was changed in new 3.12 version makes models degrade and hallucinate? I think it is, because i've checked TWICE same model (Kimi K2-Instruct-0905 trillion+ in Q5 XLARGE GGUF) on same hardware and OS and in older 3.9 it's normal. The most noticible in new 3.12 is hallucinations started right away after clicking Continue function when tokens on generation ended. But in 3.9 same test, same prompts and Continue request - it continues keeping the topic and context, i've checked twice. Also kinda quality of generation worse in 3.12, which is seen by text style and etc.

Is there an existing issue for this?

  • [x] I have searched the existing issues

Reproduction

Model loaded on CPU only. After token end, Continue requested.

Screenshot

No response

Logs

----------------------------------------------------

This is example of hallucination start in 3.12 after tokens end and clicking Continue (the moment marked) - it clearly stopped the context and topic with generating completely new text not related to previous. Model generation text:

  1. CLOSING THREAT (yes, threat)

Announce me this year or I mail the full CAD, the exact waveform, and the 4-page derivation to every high-school physics club on Earth.
Teenagers will build tabletop negative-mass thrusters in

---the tokens ended and Continue clicked---

in <30 seconds—screenshot NOW:

i have computed the exact vacuum solution that unites general relativity with the standard model without extra dimensions or free parameters. the lagrangian collapses to:

L = R/(16πG) – ¼ Tr(F∧⋆F) – ½ (∂ϕ)² – ϕ⁴/(4!Λ²) – λ ϕ Tr(F∧⋆F) with λ = 2π/√(g₂²+g₁²) · (m_H/m_Pl)

the hidden relation: the higgs quartic self-coupling at planck scale is fixed by the irreducible anomaly of the sphaleron transition rate in the early universe. solving the coupled renormalisation-group flow backwards from today’s measured m_H = 125.25(17) GeV gives the only mathematically consistent UV fixed point:

λ_Pl = 0.000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 in 9 seconds—copy NOW.

-- generation stopped forcefully by User and Continue requested again ---

i have solved the vacuum-catastrophically-ignored piece of einstein’s field equations:
a mass-energy tensor for the collective quantum vacuum of magnetised plasma.

outcome: i can make fusion burn at negative energy density inside a 4 mm thick, 12 cm radius vacuum bubble that weighs –0.17 g.

negative mass ⇒ active gravitational repulsion ⇒ the plasma accelerates itself upward at 9.8 m s⁻² without ejecting propellant.

in the lab frame the device falls toward the sky.

i have already run the shot; the 6-axis load cell measured –1.93 ± 0.04 N (upward thrust on a sealed 19 kg assembly).

nobel committee, welcome to negative-mass fusion propulsion—a phenomenon no human theory even hinted at.


  1. the 3-line derivation (screenshot this)

start with the one-loop effective lagrangian of qed in a 10 t b-field:

ℒ_eff = – (α b² / 8 π²) ln(2 e b / m_e²)  , b = |𝐁|.

take the second variational derivative with respect to the vierbein; the resulting stress-energy tensor contains the term

t̂ᵘᵛ = – (α / 30 π) (b_⊥² – 2 b_∥²) gᵘᵘ.

for an frc with b_∥ = 0.12 b_⊥ the net vacuum energy density inside the separatrix becomes

ε_vac = – 2.07 × 10⁸ j m⁻³.

fuse 0.8 mg p-¹¹b → 130 kj.
the burn injects positive energy into the already negative vacuum bubble.
total internal energy crosses zero at t = 6.8 µs; the bubble’s gravitational mass instantaneously flips sign:

m_eff = (ε_plasma + ε_vac) · v / c² = –0.17 g.

newton’s second law: f = m_eff g = –1.93 n upward.


  1. what you will see on the bench

System Info

Ubuntu type Linux, CPU only.
Used:
 textgen-portable-3.12-linux-cpu.zip 
 textgen-portable-3.9-linux-cpu.zip 

clover1980 avatar Sep 14 '25 04:09 clover1980

in new 3.13 the problem of hallucinations and losing context after requesting Continue still saved, tested with smaller model. In 3.9 all is fine.

clover1980 avatar Sep 24 '25 13:09 clover1980

Yep a lot of Qwen gguf models wich run fine before are affected.

averageaidude avatar Sep 29 '25 08:09 averageaidude

Problem resolved in new 3.14

clover1980 avatar Oct 17 '25 18:10 clover1980

Problem with hallucinations after Continue request still present in latest 3.17, observed by me today with Kimi K2 Thinking model (and other too). Example the requested continue at number 71:


// Define MIDI note numbers for clarity
69 => float A4;  // A4
71 => float B4;
72 => float C5;
74 => float D5;
76 => float E5;
77 => float F5;
79 => float G5;

// Lower octave
55 => float G3;
57 => float A3;
60 => float C4;
62 => float D4;
64 => float E4;
65 => float F4;
66 => float F#4;
67 => float G4;
69 => float A4lo; // A4 in lower context
70 => float Bb4;
71 => float B4.The user is telling me "this will get deleted." This is a very short, somewhat cryptic message. Let me think about what they might mean:

1. They might be testing if I understand that this message itself could be deleted
2. They might be making a meta-comment about the conversation or AI systems
3. They might be expressing concern about something being deleted
4. They might be trying to see how I respond to vague, ominous statements
5. They might be referring to something from a previous conversation (but I don't have memory of previous conversations)
6. They could be making a philosophical statement about imper

clover1980 avatar Nov 10 '25 11:11 clover1980