slm-code-generation icon indicating copy to clipboard operation
slm-code-generation copied to clipboard

Reason for not releasing the source code?

Open xuansontrinh opened this issue 3 years ago • 5 comments

Hello authors,

I am very interested in this project and would like to have the true implementation of the model to see if I can somehow improve it. However, as can be seen from issue #11 and #8 , it is believed that the implementation will not see the public release date any time soon, although you promised to do it in #4. Can I ask why is it the case? If it is because of human resource shortage, I am willing to help you out.

Thank you for creating this wonderful project and I look forward to hearing from you soon.

Best Regards, Son.

xuansontrinh avatar Sep 16 '21 07:09 xuansontrinh

Hi Son, Thank you for your interest in our work!

We eventually decided to release only the trained-model demo + API. We also released the data and the data creation code, to allow other researchers to compare their results on the exact same data.

Best, Uri

On Thu, Sep 16, 2021 at 3:52 AM xuansontrinh @.***> wrote:

Hello authors,

I am very interested in this project and would like to have the true implementation of the model to see if I can somehow improve it. However, as can be seen from issue #11 https://github.com/tech-srl/slm-code-generation/issues/11 and #8 https://github.com/tech-srl/slm-code-generation/issues/8 , it is believed that the implementation will not see the public release date any time soon, although you promised to do it in #4 https://github.com/tech-srl/slm-code-generation/issues/4. Can I ask why is it the case?

Thank you for creating this wonderful project and I look forward to hearing from you soon.

Best Regards, Son.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/tech-srl/slm-code-generation/issues/13, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADSOXMBIK2E32D5FMYCILFDUCGO4BANCNFSM5EEEG6XA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

urialon avatar Sep 16 '21 12:09 urialon

@urialon I see, thank you for your swift reply. Is that okay with you if I ask a lot of questions regarding the model when I attempt to reproduce it?

P/s: I saw that in the file preprocess.sh there is a folder called scripts being used but I cannot find it in the souce code. Am I missing something?

xuansontrinh avatar Sep 17 '21 15:09 xuansontrinh

Sure! Feel free to ask as many questions as you'd like.

Sorry for the missing file, I just uploaded it. Let me know if you have any more questions.

urialon avatar Sep 17 '21 15:09 urialon

@urialon I appreciate you releasing the dataset and prepro code, but it will be very hard to reproduce your work from the mysterious phrasing of Section 3 alone.

ocramz avatar Jun 28 '22 06:06 ocramz

Hi @ocramz , Sorry about that. Let me know if there are any specific things that are unclear.

We have a newer model, PolyCoder, that performs better, and works for 12 programming languages. Would you like to try it? https://github.com/VHellendoorn/Code-LMs

Best, Uri

urialon avatar Jun 28 '22 14:06 urialon