openai-cookbook icon indicating copy to clipboard operation
openai-cookbook copied to clipboard

New notebook + 1 video + 1 image file

Open anurag-openai opened this issue 8 months ago • 0 comments
trafficstars

Summary

Briefly describe the changes and the goal of this PR. Make sure the PR title summarizes the changes effectively.

This PR introduces a detailed notebook demonstrating how to leverage GPT-4o's vision capabilities for analyzing video frames to extract structured operational insights in a manufacturing warehouse. It provides step-by-step instructions, best practices for bounding boxes, structured data extraction, confidence scoring, and cost considerations to effectively implement an AI-driven monitoring system.

Motivation

Why are these changes necessary? How do they improve the cookbook?

Warehouse managers often lack real-time visibility into their operations, relying instead on delayed or manual reporting methods, leading to reactive rather than proactive decision-making. This contribution addresses these issues by using GPT-4o's vision capabilities to analyze video footage, enabling rapid identification of safety concerns, monitoring space utilization, and detecting operational inefficiencies in near-real-time. This significantly improves decision-making speed, enhances safety compliance, and reduces operational inefficiencies.


For new content

When contributing new content, read through our contribution guidelines, and mark the following action items as completed:

  • [ ] I have added a new entry in registry.yaml (and, optionally, in authors.yaml) so that my content renders on the cookbook website.
  • [X ] I have conducted a self-review of my content based on the contribution guidelines:
    • [X ] Relevance: This content is related to building with OpenAI technologies and is useful to others.
    • [X] Uniqueness: I have searched for related examples in the OpenAI Cookbook, and verified that my content offers new insights or unique information compared to existing documentation.
    • [X ] Spelling and Grammar: I have checked for spelling or grammatical mistakes.
    • [X ] Clarity: I have done a final read-through and verified that my submission is well-organized and easy to understand.
    • [X] Correctness: The information I include is correct and all of my code executes successfully.
    • [X ] Completeness: I have explained everything fully, including all necessary references and citations.

We will rate each of these areas on a scale from 1 to 4, and will only accept contributions that score 3 or higher on all areas. Refer to our contribution guidelines for more details.

anurag-openai avatar Mar 04 '25 16:03 anurag-openai