Julia Longtin comments

Repositories
Issues
Comments

Results 105 comments of


                                            Julia Longtin

trafficstars

Xeon Phi (Knights Corner) Support.

> > goes from a token every 0.18 seconds on mistral 7B instruct to a token every 0.82 seconds. > > Are you showing a performance regression? Or are the...

Xeon Phi (Knights Corner) Support.

> > > goes from a token every 0.18 seconds on mistral 7B instruct to a token every 0.82 seconds. > > > > > > Are you showing a...

Xeon Phi (Knights Corner) Support.

> > > > goes from a token every 0.18 seconds on mistral 7B instruct to a token every 0.82 seconds. > > > > > > > > >...

Xeon Phi (Knights Corner) Support.

> > goes from 0.18 tokens per second on mistral 7B instruct (Q5K) to 0.82 tokens per second. > > How many threads is that with? Since Xeon Phi has...

Xeon Phi (Knights Corner) Support.

now runs at 1.2 tokens per second.