Zheng Yuan issues

Results 9 issues of


Zheng Yuan

Output inconsistent for autoregressive performer

I want to apply autoregressive performer for decoding. ```python import torch from performer_pytorch import Performer from torch import nn attn = Performer( dim = 128, heads = 4, depth =...

Add test script for Yidu-N7K

Code example for obtaining sentence / word embeddings.

TODO

Confusing checkpoint name between UmlsBERT and CODER.

Hello, my work: CODER: Knowledge infused cross-lingual medical term embedding for term normalization (https://github.com/GanjinZero/CODER) (which is a contemporary work with your UmlSBERT) used confusing checkpoint name GanjinZero/UMLSBert_ENG in huggingface. Would...

Align human preferences with Alpaca without reinforcement learning

We propose a new learning paradigm named RRHF (Rank Responses to Align Human Feedback) which does not need reinforcement learning and can perform on par with PPO to align human...

[REQUEST] RRHF

**Is your feature request related to a problem? Please describe.** We have posted a paper with codes [RRHF] (https://github.com/GanjinZero/RRHF) that can achieve human alignment without RLHF. RRHF needs 1-2 models...

enhancement

deepspeed-chat

Zheng Yuan

Output inconsistent for autoregressive performer

Add test script for Yidu-N7K

Code example for obtaining sentence / word embeddings.

Confusing checkpoint name between UmlsBERT and CODER.

Align human preferences with Alpaca without reinforcement learning

[REQUEST] RRHF

Do you apply ablation study on wizardcoder with fine tuning with initial code alpaca?

RFT and QWen belongs to same group.

add rrhf