screenpipe
screenpipe copied to clipboard
[bounty] Video/Voice LLM for Timeline
Use voice/video understanding to auto-label and summarize timeline chunks.
eg
- [ ] Combine transcript timestamps with visual content (e.g., scene changes)
- [ ] Auto-tag segments with:
- [ ] Topic
- [ ] Speaker
- [ ] Action
- [ ] Could use a timeline-aware model like VideoX, Slam, or custom LLM prompts
all the exact things that will need to be done to receive the bounty.
precision is important otherwise the bounty cannot be awarded.
/bounty 400
This is necessary as matches with user's needs
This issue is a response/relied to this issue: #1142
@louis030195 I just created the third issue concern this #1142 if you can take a look