cerebros-core-algorithm-alpha icon indicating copy to clipboard operation
cerebros-core-algorithm-alpha copied to clipboard

test-text-llm-encodings-without-attention-layers-with-cerebros

Open david-thrower opened this issue 1 year ago • 1 comments

Kind of issue: enhancement

Additional context: Another attempt to make a lighter weight equally robust model.

Suggested Labels (If you don't know, that's ok): kind/enhancement kind/performance

david-thrower avatar Dec 05 '23 04:12 david-thrower

This is a strong candidate for merging in. The last thing left is to extend the seq length a little on another branch just to see if we can exceed 95% test set accuracy without too much computational work and still being under 50M params.

david-thrower avatar Dec 10 '23 17:12 david-thrower