
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Results: 10 SeeAct issues, sorted by recently updated

Could you please provide the complete offline evaluation code for mm-mind2web? Currently, only the prediction demo code is available, lacking the full dataset loop and evaluation metric to reproduce the...
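
For context, a minimal sketch of what such an offline evaluation loop could look like, assuming Mind2Web-style records where `pos_candidates` lists gold elements keyed by `backend_node_id` and `operation` is the gold action string; `load_samples` and `predict_action` are hypothetical placeholders, not functions from this repo:

```python
import json

def load_samples(path):
    """Load a list of Mind2Web-style step records from a JSON file (assumed schema)."""
    with open(path) as f:
        return json.load(f)

def predict_action(sample):
    """Placeholder for the model call; should return (predicted_element_id, predicted_operation)."""
    raise NotImplementedError

def evaluate(samples):
    """Compute element accuracy and step success rate over all steps."""
    element_hits, step_hits = 0, 0
    for sample in samples:
        pred_element, pred_op = predict_action(sample)
        gold_elements = {c["backend_node_id"] for c in sample["pos_candidates"]}
        if pred_element in gold_elements:
            element_hits += 1
            # A step counts as successful only if both element and operation match.
            if pred_op == sample["operation"]:
                step_hits += 1
    n = len(samples)
    return {
        "element_accuracy": element_hits / n,
        "step_success_rate": step_hits / n,
    }
```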

Thank you very much for your work. I have found a potential bug in your MM-Mind2web model. It seems that each data point only contains a list of selectable actions...

https://github.com/OSU-NLP-Group/SeeAct/blob/8af310159af97b123ff07abb925c497bb1ca2478/src/data_utils/format_prompt_utils.py#L204 Sorry, it's hard for me to do a PR; I'm working on another codebase.

![1712585243173](https://github.com/OSU-NLP-Group/SeeAct/assets/28804414/9689c185-4160-4296-87b6-ce8baa2e4e37) I wanted to visualize the model's actions on the Mind2Web dataset, but SeeAct didn't seem to do that. When computing online, the output "success_or_not" is always empty, which...

![image](https://github.com/OSU-NLP-Group/SeeAct/assets/63557613/6c4d8c89-1199-4cf3-a6c0-cab75c0ca48d) While all of this information is nice, is there a comparison to experiments without grounding? It can be argued that grounding may hurt performance without knowing what...

hey -- first off, really cool project! Are all actions keyed off a specific element in the list, or is there some way to conduct certain actions without...

Dear Authors, Thank you for this brilliant work. I want to do some analysis on the trajectories of different methods in your paper (e.g. FLAN-T5, GPT-4, SeeAct with different grounding...

Thank you for this inspiring work and for releasing the code! Are you also planning to release the model predictions? More specifically, are you planning to release oracle action grounding...