bigcodebench
Is it normal for the ground truth accuracy not to be 100%?
Under this setting, my evaluation results for qwen2.5coder-instruct-3b are better than the results reported in the official technical report.