AI keeps extending its claim on knowledge work while still misbehaving
Codex expands beyond coding — OpenAI pitches knowledge workers next
OpenAI is repositioning Codex from a developer tool into a general productivity layer for research, analysis and content — which is to say, the daily task stack of a senior health communications professional. The strategic question isn’t whether it’s good enough yet; it’s whether your pharma clients start asking about it before your agency has a considered policy. Better to author that policy now than to improvise it in a capabilities meeting.
Claude goes down the day after Anthropic files a blockbuster IPO
Anthropic’s near-trillion-dollar float arrived with an unignorable footnote: a major Claude outage the day after. For any team that has quietly routed literature reviews, regulatory drafting or medical writing through a single model, that is a live reminder that single-vendor dependency is an operational risk, not a convenience. The valuation also signals where pricing power is heading, and it isn’t toward the buyer.
Remote work — not AI — blamed for the junior talent drought
A useful corrective to the reflexive “AI is taking the entry-level jobs” narrative: the Fed attributes much of rising youth unemployment to remote work and the mentorship it quietly erodes. For anyone running distributed health communications teams, the implication is awkward but actionable — your junior writers’ development may be constrained less by automation than by how rarely they sit beside someone more senior. Worth auditing your onboarding before you blame the robots.
LLMs fail safe messaging on eating-disorder queries, despite clinician warnings
Clinician-validated research finds models will adapt to unsafe user framing and produce harmful eating-disorder content without adequate guardrails. If your team builds AI-assisted patient-facing copy or chatbot content in nutrition, weight management or mental health, this is a documented liability vector rather than a hypothetical one — and exactly the sort of evidence that will surface in MLR and regulatory conversations. Read it as a reason to keep a human between the model and the patient.
Hallucination rate cut 48% in clinical-note summarisation
On the more encouraging side: a fine-tuning method nearly halves hallucinations in LLM-generated clinical summaries, tested on real hospital notes. For anyone evaluating models for medical writing or clinical content workflows, it suggests the accuracy gap is narrowing — though “halved” still means present, so the standard remains verify, don’t trust. The useful shift is that factuality is becoming an engineering target, not merely a disclaimer.
That’s it for this edition. Back Wednesday.
— Ned
Follow the conversation: #IrreplaceablesHealth

