PostHog Bot
## 🧠 AI eval results

Evaluated **9** experiments, comprising **9** metrics.

### [funnel](https://www.braintrust.dev/app/PostHog/p/max-ai-funnel/experiments/max-query-plan-rearchitect-1751021618)

🆕 **plan_correctness**: **0.00%**

Avg. case performance: ⏱️ 30.51 s, 🔢 0 tokens

### [memory](https://www.braintrust.dev/app/PostHog/p/max-ai-memory/experiments/max-query-plan-rearchitect-1751021687)

🔵 **ToolRelevance**: **98.22%**,...
## 🧠 AI eval results

Evaluated **9** experiments, comprising **25** metrics.

### [funnel](https://www.braintrust.dev/app/PostHog/p/max-ai-funnel/experiments/max-query-plan-rearchitect-1752003419)

🆕 **QueryKindSelection**: **73.33%**
🆕 **plan_correctness**: **60.58%**
🆕 **query_and_plan_alignment**: **81.11%**
🆕 **time_range_relevancy**: **95.22%**

Avg. case performance: ⏱️ 188.89...
## 🧠 AI eval results

Evaluated **9** experiments, comprising **25** metrics.

### [funnel](https://www.braintrust.dev/app/PostHog/p/max-ai-funnel/experiments/max-query-plan-rearchitect-1752591782)

🆕 **QueryKindSelection**: **76.00%**
🆕 **plan_correctness**: **69.92%**
🆕 **query_and_plan_alignment**: **82.90%**
🆕 **time_range_relevancy**: **96.50%**

Avg. case performance: ⏱️ 60.54...
## 🧠 AI eval results

Evaluated **9** experiments, comprising **25** metrics.

### [funnel](https://www.braintrust.dev/app/PostHog/p/max-ai-funnel/experiments/max-query-plan-rearchitect-1752596193)

🆕 **QueryKindSelection**: **68.00%**
🆕 **plan_correctness**: **71.58%**
🆕 **query_and_plan_alignment**: **77.50%**
🆕 **time_range_relevancy**: **95.00%**

Avg. case performance: ⏱️ 59.62...
## 🧠 AI eval results

Evaluated **9** experiments, comprising **25** metrics.

### [funnel](https://www.braintrust.dev/app/PostHog/p/max-ai-funnel/experiments/max-query-plan-rearchitect-1752682658)

🔴 **QueryKindSelection**: **74.47%**, **-25.53%** versus [baseline (master)](https://www.braintrust.dev/app/PostHog/p/max-ai-funnel/experiments/master-1752590740) (improvements: 0, regressions: 7)
🔴 **plan_correctness**: **66.00%**, **-22.25%** versus [baseline...
## 📸 UI snapshots have been updated

**2** snapshot changes in total. **0** added, **2** modified, **0** deleted:

- **`chromium`**: **0** added, **2** modified, **0** deleted ([diff for shard 1](https://github.com/PostHog/posthog/pull/33690/commits/28a66449c2bff70088c947dbe456d61fe2ef41e6))...
## 📸 UI snapshots have been updated

**1** snapshot changes in total. **0** added, **1** modified, **0** deleted:

- **`chromium`**: **0** added, **1** modified, **0** deleted ([diff for shard 1](https://github.com/PostHog/posthog/pull/33690/commits/8aa4e0fca4ff56c72be451c9a215fac48fa9e208))...
## 📸 UI snapshots have been updated

**1** snapshot changes in total. **0** added, **1** modified, **0** deleted:

- **`chromium`**: **0** added, **1** modified, **0** deleted ([diff for shard 5](https://github.com/PostHog/posthog/pull/33690/commits/c837a0bcb4a69db6187533a8f561c434ed57327c))...
## 📸 UI snapshots have been updated

**3** snapshot changes in total. **0** added, **3** modified, **0** deleted:

- **`chromium`**: **0** added, **3** modified, **0** deleted ([diff for shard 8](https://github.com/PostHog/posthog/pull/33690/commits/28ae6dd41e3e60b8fca6802200962e7ae6e15ff4),...
## 📸 UI snapshots have been updated

**18** snapshot changes in total. **0** added, **18** modified, **0** deleted:

- **`chromium`**: **0** added, **18** modified, **0** deleted ([diff for shard 12](https://github.com/PostHog/posthog/pull/33690/commits/3acaabb34941c9a7e57eb918d5d16a6118b3c321))...