claude-code
WF2S: Teaser emails with expired tracking URLs produce cruft-filled content
Issue Summary
Teaser/notification emails that contain a "Read the press note" button with a Marketo tracking URL (path.qualcomm.com/...) end up with cruft-filled content instead of the actual PR text. The sequence is:
- The tracking URL returns 404 (expired/one-time-use link)
- The email body content is used as fallback
- Email body contains PR summary + legal disclaimers + footer cruft
Example: Qualcomm/Ventana (PS-24731)
Message ID: [email protected]
Email contains:
- "Read the press note" button with Marketo tracking URL
- Short PR summary (3 paragraphs)
- Full Qualcomm legal disclaimer
- Email footer with Terms/Privacy/Unsubscribe links
Tracking URL test:
curl -s -X POST "http://localhost:3200/fetch-content" \
-d '{"url": "https://path.qualcomm.com/Mzg1LVRXUy04MDMAAAGep5kyA1pO..."}'
# Returns: 404 - Page not found
Result: Content tab shows PR summary + all legal cruft instead of clean PR text.
Root Causes
- Missing teaser pattern: "Read the press note" not in TEASER_PATTERNS
- No email content cleanup: When URL fetch fails, email body is used without stripping legal/footer content
- Tracking URLs expire: Marketo links are time-sensitive or one-time-use
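To illustrate the first root cause, here is a minimal sketch of substring-based teaser detection. The real contents and matching logic of TEASER_PATTERNS are not shown in this issue, so the list entries, the `is_teaser` helper, and the sample body below are all hypothetical stand-ins.

```python
# Hypothetical stand-in for the existing list; the real entries are not shown in this issue.
TEASER_PATTERNS = ["read the full press release", "read more"]


def is_teaser(body: str, patterns: list[str] = TEASER_PATTERNS) -> bool:
    """Return True if any teaser phrase occurs in the body (case-insensitive)."""
    # Lowercase and collapse whitespace so button text split across lines still matches.
    text = " ".join(body.lower().split())
    return any(p in text for p in patterns)


# Made-up body shaped like the email described above.
body = "Short PR summary.\n\nRead the Press Note\n\nLegal disclaimer text."

print(is_teaser(body))                                             # False
print(is_teaser(body, TEASER_PATTERNS + ["read the press note"]))  # True
```

Under these assumptions the email is not recognized as a teaser until the new phrase is added to the list.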
Proposed Fixes
- Add teaser patterns: read the press note, view the press release, learn more here
- Add email content cleanup function (strip legal disclaimers, footers, copyright notices)
- Consider detecting tracking URL domains and prioritizing direct newsroom URLs
Affected Version
WF2S v270-PREFER-BODYTEXT
Status
PARKED - Moving to other examples first