albertan017

Results: 61 comments of albertan017

Thanks for your interest, currently we do not plan to release the framework.

Yes, you can grab the [bins](https://huggingface.co/datasets/LLM4Binary/decompile-bench-bins)—they’re unstripped and include full debug information—but be aware the unzipped size is quite large (around 500 GB).

The LLM4Binary/decompile-ghidra-100k dataset is a sample dataset used for the v2 series models. For training the v2 series, we use a larger dataset consisting of 1 billion tokens (approximately 1.6...

We're using ExeBench with the first 400K functions, which includes AnghaBench. Yes, we compile the benchmark and then decompile the binaries with Ghidra.
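The compile-then-decompile step could be sketched roughly as below. This is a hypothetical illustration, not our exact pipeline: the GCC flags, Ghidra install path, project directory, and post-script name (`DecompileAllFunctions.py`) are all assumptions.

```python
def build_pipeline_cmds(src="func.c", binary="func.bin",
                        ghidra_home="/opt/ghidra", project_dir="/tmp/ghidra_proj"):
    """Sketch: build the two commands of a compile -> Ghidra-decompile pipeline.

    All paths and the post-script name are hypothetical placeholders.
    """
    # Step 1: compile the benchmark function into an object file.
    compile_cmd = ["gcc", "-O0", "-c", src, "-o", binary]

    # Step 2: decompile with Ghidra's headless analyzer; a post-script
    # (here a placeholder name) would dump the decompiled C.
    decompile_cmd = [
        f"{ghidra_home}/support/analyzeHeadless", project_dir, "bench",
        "-import", binary,
        "-postScript", "DecompileAllFunctions.py",  # hypothetical script
        "-deleteProject",
    ]
    return compile_cmd, decompile_cmd
```

In practice each command would be run with `subprocess.run(cmd, check=True)`, once per benchmark function.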

We don't have that info on record, but we're working on a larger dataset now. We'll be including the following metadata: - Source code (including the exact git commit, as you...

Not yet—we plan to add arm64 support by the end of this year.

Please use the [vllm script](https://github.com/albertan017/LLM4Decompile/blob/main/evaluation/run_evaluation_llm4decompile_vllm.py); the other scripts have not been updated. Regarding your error, I believe it is related to the environment rather than the model. You might need to...

Thank you for your interest. The filtering process is fairly straightforward—please see our paper for full details. In brief, we: 1. Exclude any ASM functions not originating from the current...

No — the binaries in decompile-bench are compiled on an x86‑64 (x64) Linux platform with Clang, which is slightly different from 32‑bit x86.

Thanks for your interest! We're working on building a larger and more comprehensive dataset. Please let us know if there is any other data/metadata you would find useful for us...