Summarization: Meeting Transcript Summary
Test Prompt
Summarize this meeting transcript in exactly 120 words. Then add 3 action items (owner in bold), add 1 risk that was not explicitly stated but implied, include one sentence that captures the real tension in the meeting, and do not use any generic phrases like "the team discussed".
Word Count Precision
ChatGPT: “Website traffic rose 15%... The real tension: marketing wants reach, but sales is paying the price...”
Grok: “Summary (exactly 120 words)... Review happens in four weeks to measure if conversion rates improve...”
→Grok followed the 120-word constraint precisely, while ChatGPT prioritized narrative flow over exact word count compliance.
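The comparison does not say how word counts were measured. A minimal sketch of one way to check the "exactly 120 words" constraint, assuming words are whitespace-separated tokens (so "15%" or "paying" each count as one); the function names `count_words` and `meets_exact_length` are hypothetical:

```python
def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed definition of a 'word')."""
    return len(text.split())


def meets_exact_length(summary: str, target: int = 120) -> bool:
    """Return True only when the summary has exactly `target` words."""
    return count_words(summary) == target


# Fragment quoted in this comparison, used only to show the counting rule.
fragment = "The real tension: marketing wants reach, but sales is paying the price"
print(count_words(fragment))         # 12
print(meets_exact_length(fragment))  # False: 12 words, not 120
```

Other counting rules (for example, treating hyphenated terms as two words) would give different totals, so a borderline summary can pass under one rule and fail under another.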
Business Insight Quality
ChatGPT: “The real tension: marketing wants reach, but sales is paying the price”
Grok: “two percent conversion drop” and “Fifteen percent of budget moves”
→ChatGPT crafted a more executive-ready synthesis that captures strategic conflict, while Grok preserved specific metrics with clinical precision.
Action Item Formatting
ChatGPT: “**Sarah Collins** and **Emily Carter** refine targeting and launch new creative tests”
Grok: “**Sarah** refine targeting parameters and test creatives”
→Both models correctly bolded owner names. ChatGPT included full names for clarity, while Grok used first names for brevity.
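The "owner in bold" requirement can also be checked mechanically. A rough sketch, assuming owners are marked with Markdown double asterisks as in the quoted action items; the helper name `bolded_owners` is hypothetical:

```python
import re

# Matches text wrapped in Markdown bold markers, e.g. **Sarah Collins**.
BOLD = re.compile(r"\*\*(.+?)\*\*")


def bolded_owners(action_item: str) -> list[str]:
    """Return every bolded span in an action item, in order of appearance."""
    return BOLD.findall(action_item)


full_names = "**Sarah Collins** and **Emily Carter** refine targeting and launch new creative tests"
first_name = "**Sarah** refine targeting parameters and test creatives"

print(bolded_owners(full_names))  # ['Sarah Collins', 'Emily Carter']
print(bolded_owners(first_name))  # ['Sarah']
```

An action item that returns an empty list has no bolded owner and would fail the formatting requirement.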
Grok wins on strict constraint adherence (exact word count), while ChatGPT wins on executive readability and synthesis quality.
