Xin Zhang comments

Results: 37 comments of Xin Zhang

Propose chunked computation for the `RerankingEvaluator`

Hi, I think the chunking is still needed. Recall that all docs are gathered and encoded with [`SentenceTransformer.encode`](https://github.com/UKPLab/sentence-transformers/blob/master/sentence_transformers/SentenceTransformer.py#L195) at once, where `torch.stack(all_embeddings)` is called to merge the batched embeddings. Since `stack`...
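The chunking idea in this comment can be illustrated with a minimal sketch. It is not the `RerankingEvaluator` code or the proposed patch: `iter_chunks`, `score_in_chunks`, and the `encode_fn` parameter are hypothetical names, and `encode_fn` only stands in for a call such as `SentenceTransformer.encode`. The assumption is that only per-document scores need to be kept, so each chunk's embeddings can be discarded once they are scored.

```python
from typing import Callable, Iterator, List, Sequence

Vector = List[float]


def iter_chunks(items: Sequence[str], chunk_size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of `items` with at most `chunk_size` elements."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(items), chunk_size):
        yield items[start:start + chunk_size]


def score_in_chunks(
    query_vec: Vector,
    docs: Sequence[str],
    encode_fn: Callable[[List[str]], List[Vector]],  # hypothetical stand-in encoder
    chunk_size: int,
) -> List[float]:
    """Encode and score documents chunk by chunk.

    Only the scalar scores are accumulated, so at most one chunk of
    embeddings is alive at any time.
    """
    scores: List[float] = []
    for chunk in iter_chunks(docs, chunk_size):
        embeddings = encode_fn(list(chunk))
        for emb in embeddings:
            # Dot-product similarity between the query and one document.
            scores.append(sum(q * d for q, d in zip(query_vec, emb)))
    return scores
```

With a real encoder, `encode_fn` could be a small wrapper around the model's encode call; the chunk size would then be chosen to fit the available memory.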

Propose chunked computation for the `RerankingEvaluator`

I also wrote a simulation script that can quickly reproduce the case.

```
import subprocess
import tqdm
import torch

command = 'nvidia-smi'
embedding_dim = 1024
batch_size = 128
print('model and...
```

Propose chunked computation for the `RerankingEvaluator`

This is indeed a problem with sentence-transformers, as they did not consider calls at the million scale. I agree that your method is a reasonable solution. I think the large...

[Question]

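Thanks for the question. A standalone inference feature is under development and is expected in half a month to a month; for now there is the [test](https://github.com/modelscope/AdaSeq/blob/master/adaseq/commands/test.py) feature. This file is saved by [`ner_dumpper`](https://github.com/modelscope/AdaSeq/blob/master/adaseq/data/dataset_dumpers/named_entity_recognition_dataset_dumper.py), which is just a special `metric` invoked during evaluation.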

[Question] Why does model.save_pretrained always raise an error, so that the pred file cannot be generated?

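Hello, someone in the DingTalk group mentioned this problem earlier, and it may have the same cause, so please give the following a try. Sorry that the developers have not followed up on issues promptly of late. ![image](https://user-images.githubusercontent.com/26690193/233849712-25fea87c-ad51-462f-bc60-f50960ea4597.png) ![image](https://user-images.githubusercontent.com/26690193/233849984-e45a527f-f0fa-44c6-98ff-9838a98e1a73.png) You can first try downgrading modelscope to version 1.4.2 or lower by running `pip install modelscope==1.4.2`. This is a compatibility problem caused by the ms version upgrade, and we will fix it as soon as possible!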

My own text was clearly generated by AI, yet it is not detected

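Hello, this may be due to the writing style. The detector judges whether a text was generated by ChatGPT based on the style and characteristics of the writing, so if the styles are similar, it can indeed misjudge 😄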

could u provide the helpfulness dataset with human label?

@NJRBarry

What is the license?

Hi, generally, if the source datasets used in this corpus have a specific license, our products follow the same. If not, or if the datasets are crawled by ourselves, they...

What is the license?

But the license may change; we will decide on it ASAP.

What is the license?

We use CC-BY-SA. [dataset copyright](https://github.com/Hello-SimpleAI/chatgpt-comparison-detection#dataset-copyright)