YuRonan issues

Results: 11 issues of YuRonan

Cannot find label_map_path

What is the file absa_laptops.label_map.txt used for? I ran into this problem while initializing the aspect-level sentiment classification task.

[Bug] Error response from daemon: Get "https://docker-hub.x-lab.info/v2/": EOF

### Current Behavior ![image](https://github.com/X-lab2017/open-digger/assets/29674550/dab958cd-4242-4093-b0ea-f208060f96a9) ### Expected Behavior _No response_ ### Any Additional Comments? _No response_

waiting for author

[Label Data] ECNU-OSS101-2024-Spring-Class

### Label communities/ecnu-oss101-2024-spring ### Type oss101 course learners ### Repos/Orgs/Users - ecnu-oss101-2024-spring ### Other information _No response_

waiting for author

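[Paper] Building an instruction fine-tuning dataset for the open source domain and implementing a large model

### Description Because we will later discuss many details of the experiments and of dataset construction, I am moving the discussion of [research progress on large models for the open source domain](https://github.com/X-lab2017/open-wonderland/issues/374) to the open-research repository. Based on the lab's current resources and GitHub's existing features, I have roughly divided the work into the following parts: ![image](https://github.com/X-lab2017/open-research/assets/29674550/b148bcd4-97f7-4f94-ba97-e22d2633ebd8) There are still quite a few tasks, so some may be cut. I think we should first confirm that the method works, then expand the dataset step by step and add more tasks, which is the safer route. Meanwhile, @衍童 has recently been looking into [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory), which can support later fine-tuning and development of the large model. At today's group meeting, Professor Wang pointed out that the questions and answers designed for the model may differ from repository to repository. I had considered this as well: the same question may be asked in different repositories, but because the repositories differ, the answers are not necessarily the same. This has to be handled when the tasks are first designed. I plan to collect the dataset starting from repositories, beginning with the following popular ones: https://github.com/vercel/next.js https://github.com/gatsbyjs/gatsby https://github.com/nodejs/node https://github.com/tailwindlabs/tailwindcss https://github.com/laravel/framework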
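One way to keep answers repository-specific is to make the repository part of every sample's identity. The following is only a minimal sketch under that assumption: the `InstructionSample` fields, the `build_index` and `to_prompt` helpers, the prompt layout, and the placeholder answers are all hypothetical and are not taken from the issue.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class InstructionSample:
    """One instruction-tuning record (hypothetical schema)."""
    repo: str         # e.g. "vercel/next.js"
    instruction: str  # the question being asked
    output: str       # the answer, valid only for this repo


def build_index(samples: List[InstructionSample]) -> Dict[Tuple[str, str], str]:
    """Index answers by (repo, instruction) so identical questions
    asked in different repositories never overwrite each other."""
    index: Dict[Tuple[str, str], str] = {}
    for sample in samples:
        key = (sample.repo, sample.instruction)
        if key in index:
            raise ValueError(f"duplicate sample for {key}")
        index[key] = sample.output
    return index


def to_prompt(sample: InstructionSample) -> str:
    """Prefix the question with its repository so the model sees the context."""
    return f"[repo: {sample.repo}]\n{sample.instruction}"


# Placeholder data: the answers are deliberately left as stand-ins.
question = "How do I set up a local development environment?"
samples = [
    InstructionSample("vercel/next.js", question, "<answer for vercel/next.js>"),
    InstructionSample("nodejs/node", question, "<answer for nodejs/node>"),
]
index = build_index(samples)
```

Because the key is the pair `(repo, instruction)`, the same question yields two distinct entries here, one per repository.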

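Lecture arrangements for the school's graduate academic salon

### Description Time: March of this year. Lectures: following the process of previous years, 3-4 lectures are needed; usually a teacher gives the first one and students give the next two. We also need one student to take photos, so we may have to ask our professional photographer @zhicheng-ning to help out then. I think we can follow the presentation lineup from the annual meeting: 1. OpenRank @will-ww 2. OpenDigger @PureNatural 3. OpenPerf @bifenglin. Professor Wang would open, then Lao Bi and I would follow. Let's see whether anyone has other suggestions.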

[Presentation] Improving Language Understanding by Generative Pre-Training（GPT1.0）

### Title Improving Language Understanding by Generative Pre-Training（GPT1.0） ### Link https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf ### Year 2018 ### Author and affiliation Alec Radford OpenAI Karthik Narasimhan OpenAI Tim Salimans OpenAI Ilya Sutskever OpenAI...

[Bug] The length of the contributor bar has some problems

### Version 1.9.7 ### Current Behavior ![image](https://github.com/user-attachments/assets/48eaf663-da16-4610-9437-49b448122711) As the image shows, two contributors both have an activity of 5.8, but their bars are not the same length. ### Expected Behavior...

kind/bug

[Recommendation] SWE-BENCH: CAN LANGUAGE MODELS RESOLVE REAL-WORLD GITHUB ISSUES?

### Title SWE-BENCH: CAN LANGUAGE MODELS RESOLVE REAL-WORLD GITHUB ISSUES? ### Link https://arxiv.org/abs/2310.06770 ### Year 2024 ### Conference or Journal ICLR ### Rank A ### Keywords _No response_ ### Reason...

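[Research] Tracking research progress on automated hierarchical label classification for OpenDigger

This work is first of all meant to support the OpenDigger project itself: a large number of popular repositories are created on GitHub every year, yet most repository labels in OpenDigger are still assigned manually each time, which is time-consuming and labor-intensive. Second, it supports the [work](https://xiaoya-yaya.github.io/ant-ospo-insights/analysis/Data4AI-Analysis) of Dr. Xia from the Ant open source office. Hierarchical label classification is one of the harder classification tasks in natural language processing, known in English as Hierarchical Text Classification or Hierarchical-Multilabel-Text-Classification. The classification structure looks like this: ![Image](https://github.com/user-attachments/assets/8cde493c-3764-4d97-be1a-164de8f6081b) Our own label system is also a tree: for example, root has children such as AI, database, and frontend, and database in turn has children such as key-value and time-series. We want a repository to be classified automatically when some of its descriptive information is given as input. The problem now is that we first need our own label system, and then a suitable method for classifying against it.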
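The tree-shaped label system can be sketched as a nested mapping, where a hierarchical classifier's target for a repository is a full path from root to a label. This is only an illustration: `LABEL_TREE`, `label_paths`, and `find_path` are hypothetical names, the tree contains only the example labels mentioned above, and the real OpenDigger label system may be organized differently.

```python
from typing import Dict, List, Optional

# Hypothetical taxonomy built from the example labels in the issue only.
LABEL_TREE: Dict[str, dict] = {
    "root": {
        "AI": {},
        "database": {"key-value": {}, "time-series": {}},
        "frontend": {},
    }
}


def label_paths(tree: Dict[str, dict],
                prefix: Optional[List[str]] = None) -> List[List[str]]:
    """Return the path from the top of the tree to every node, depth-first."""
    prefix = prefix or []
    paths: List[List[str]] = []
    for name, children in tree.items():
        path = prefix + [name]
        paths.append(path)
        paths.extend(label_paths(children, path))
    return paths


def find_path(tree: Dict[str, dict], target: str,
              prefix: Optional[List[str]] = None) -> Optional[List[str]]:
    """Return the path to the first node named `target`, or None if absent."""
    prefix = prefix or []
    for name, children in tree.items():
        path = prefix + [name]
        if name == target:
            return path
        found = find_path(children, target, path)
        if found is not None:
            return found
    return None


print(find_path(LABEL_TREE, "time-series"))
# ['root', 'database', 'time-series']
```

Predicting a whole path rather than a single flat label keeps parent and child predictions consistent: a repository labeled time-series is also, by construction, labeled database.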

[Presentation] MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue ReSolution

### Title MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue ReSolution ### Link https://arxiv.org/pdf/2403.17927 ### Year 2024 ### Author and affiliation Wei Tao, Fudan University, Yucheng Zhou, University of Macau Yanlin...