
WF2S: Teaser emails with expired tracking URLs produce cruft-filled content

Open • mark-bfd opened this issue 2 months ago • 0 comments

Issue Summary

For teaser/notification emails that contain a "Read the press note" button with a Marketo tracking URL (path.qualcomm.com/...), WF2S fails to fetch the actual PR content and shows the raw email body instead. The sequence is:

  1. The tracking URL returns 404 (expired or one-time-use link)
  2. The email body is used as the fallback content
  3. The email body contains the PR summary plus legal disclaimers and footer cruft

Example: Qualcomm/Ventana (PS-24731)

Message ID: [email protected]

Email contains:

  • "Read the press note" button with Marketo tracking URL
  • Short PR summary (3 paragraphs)
  • Full Qualcomm legal disclaimer
  • Email footer with Terms/Privacy/Unsubscribe links

Tracking URL test:

curl -s -X POST "http://localhost:3200/fetch-content" \
  -d '{"url": "https://path.qualcomm.com/Mzg1LVRXUy04MDMAAAGep5kyA1pO..."}'
# Returns: 404 - Page not found

Result: The Content tab shows the PR summary plus all the legal cruft instead of clean PR text.

Root Causes

  1. Missing teaser pattern: "Read the press note" not in TEASER_PATTERNS
  2. No email content cleanup: When URL fetch fails, email body is used without stripping legal/footer content
  3. Tracking URLs expire: Marketo links are time-sensitive or one-time-use
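
Root cause 1 can be illustrated with a minimal sketch. The real contents of TEASER_PATTERNS are not shown in this issue, so the list below is a hypothetical stand-in, and `is_teaser` is an illustrative helper name, not the actual WF2S function. The point is only that a button label missing from the list leaves the email undetected as a teaser:

```python
import re

# Hypothetical stand-in: the real TEASER_PATTERNS contents are not shown here.
TEASER_PATTERNS = [
    r"read the press release",
    r"read more",
]

# Phrases proposed in this issue.
PROPOSED_PATTERNS = [
    r"read the press note",
    r"view the press release",
    r"learn more here",
]


def is_teaser(body: str, patterns: list[str]) -> bool:
    """Return True if any teaser phrase occurs in the body (case-insensitive)."""
    return any(re.search(p, body, re.IGNORECASE) for p in patterns)


body = "Short summary of the announcement. Read the Press Note"
print(is_teaser(body, TEASER_PATTERNS))                      # False
print(is_teaser(body, TEASER_PATTERNS + PROPOSED_PATTERNS))  # True
```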

Proposed Fixes

  1. Add teaser patterns: "read the press note", "view the press release", "learn more here"
  2. Add email content cleanup function (strip legal disclaimers, footers, copyright notices)
  3. Consider detecting tracking URL domains and prioritizing direct newsroom URLs
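
A rough sketch of fixes 2 and 3, under stated assumptions: the cruft markers are guesses based on the email described above (legal disclaimer, copyright, Terms/Privacy/Unsubscribe footer), the only tracking host known from this issue is path.qualcomm.com, and `strip_email_cruft` and `prioritize_urls` are hypothetical names. The cleanup simply truncates the body at the first line that looks like legal or footer content, which may cut too much if a marker word appears in the PR text itself:

```python
import re
from urllib.parse import urlparse

# Assumed markers for legal/footer lines; tune against real emails.
CRUFT_MARKERS = [
    r"^\s*©",
    r"^\s*copyright\b",
    r"\bunsubscribe\b",
    r"\bprivacy policy\b",
    r"\bforward-looking statements\b",
]
_CRUFT_RE = re.compile("|".join(CRUFT_MARKERS), re.IGNORECASE)


def strip_email_cruft(body: str) -> str:
    """Keep lines up to (not including) the first line matching a cruft marker."""
    kept = []
    for line in body.splitlines():
        if _CRUFT_RE.search(line):
            break
        kept.append(line)
    return "\n".join(kept).strip()


# Assumed tracking hosts: only this one appears in the issue.
TRACKING_HOSTS = {"path.qualcomm.com"}


def prioritize_urls(urls: list[str]) -> list[str]:
    """Stable sort that puts non-tracking URLs ahead of tracking URLs."""
    return sorted(urls, key=lambda u: urlparse(u).hostname in TRACKING_HOSTS)
```

With this ordering, a direct newsroom link found in the email would be tried before the Marketo link, and the cleaned body would only be used if every fetch fails.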

Affected Version

WF2S v270-PREFER-BODYTEXT

Status

PARKED - Moving to other examples first

mark-bfd • Dec 10 '25 23:12