llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc. on tasks like multi-label classification, named entity recognition, and synthetic data generation.
Hi there, interesting benchmark. Any chance to add Pydantic-ai? I would be curious to see how well it performs compared to the others.
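For context, a PydanticAI entry would presumably look something like the sketch below. The model string, schema, and parameter names are assumptions on my part (and newer pydantic-ai releases rename `result_type`/`.data` to `output_type`/`.output`), so treat this as illustrative rather than the benchmark's actual framework code.

```python
# Hedged sketch of structured output with PydanticAI (assumed names, not this repo's code).
from pydantic import BaseModel
from pydantic_ai import Agent


class MultiLabelPrediction(BaseModel):
    labels: list[str]  # predicted labels for the input text


# "openai:gpt-4o-mini" is just an example model string.
agent = Agent("openai:gpt-4o-mini", result_type=MultiLabelPrediction)

result = agent.run_sync(
    "Classify: 'My package arrived broken and support never replied.'"
)
print(result.data.labels)  # result_type/.data are output_type/.output in newer releases
```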
Add a framework that generates mock responses using `polyfactory`. Related to #1.

## Summary by Sourcery

This pull request adds a new framework, PolyfactoryFramework, which generates mock responses using the...
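As a rough illustration of the approach (the schema and class names here are my own, not necessarily the repo's), polyfactory builds schema-valid objects with random field values, which gives an "always valid, never accurate" baseline:

```python
# Hedged sketch: generating mock responses with polyfactory for a Pydantic schema.
from typing import List

from polyfactory.factories.pydantic_factory import ModelFactory
from pydantic import BaseModel


class MultiLabelPrediction(BaseModel):
    labels: List[str]


class MultiLabelPredictionFactory(ModelFactory[MultiLabelPrediction]):
    __model__ = MultiLabelPrediction


# Each call returns a valid MultiLabelPrediction with random contents,
# so the "framework" is guaranteed to produce parseable output at zero cost.
mock = MultiLabelPredictionFactory.build()
print(mock.labels)
```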
## Summary by Sourcery

Add the FormatronFramework to the project, enabling new tasks like multilabel classification and synthetic data generation with specific model configurations. Update the configuration file to include...
In order to have an NER model that is simpler for internal regex/CFG representations, add an NER variant that requires all fields and does not include a default value. In...
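A minimal sketch of what such a required-fields variant might look like (field names are assumptions, not necessarily the repo's exact schema):

```python
# All fields are mandatory and have no defaults, so regex/CFG-constrained
# backends never need to encode optional branches or default-value fallbacks.
from typing import List

from pydantic import BaseModel


class NamedEntitiesStrict(BaseModel):
    persons: List[str]        # must be present, even if empty
    organizations: List[str]  # must be present, even if empty
    locations: List[str]      # must be present, even if empty
```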
Hi, it's nice to come across a cross-library/model benchmark like this! When looking at evaluations for structured output libraries, I feel like "valid response" is such a low bar when...