Listening at scale: from a million forum posts to a single decision

How agentic pipelines collapse months of manual coding into hours, without flattening nuance. A walkthrough of our ingestion → signal → action loop.

Listening at scale sits at the intersection of three things pharma has historically kept apart: the unprompted voice of the patient, the day-to-day work of brand and medical teams, and the regulatory bar for what counts as evidence. None of those three is going to move on its own. The interesting question is what changes when we stop treating them as separate disciplines. What we keep finding, across therapy areas and across geographies, is that the patient is already telling us what they need. They are doing it in forums, in support groups, in the questions they ask their pharmacist, in the reviews they leave on adherence apps. The signal is dense, public, and has been there the whole time. The work is not to manufacture it. The work is to listen well enough that we can act on it before the moment passes.

The ingestion → signal → action loop

Our pipeline starts with ingestion at the boundary of public conversation: forums, communities, advocacy spaces, public review surfaces. Everything is anonymised at the boundary, with provenance preserved so that any claim can be traced back to its source. The next stage is structuring: turning long, messy human language into bounded, queryable concepts the brand or medical team actually thinks in. The third stage is action: the cohort-level shift gets routed to the team that can do something about it, with the patient language attached. The most common failure mode we see is teams trying to use this signal to do individualised outreach. That is the wrong instrument. Patient Experience Data is a cohort instrument. It tells you what is shifting for a population, with what vocabulary, on what timeline. The action it unlocks is at the program level (counseling scripts, HCP messaging, copay communications, medical content), not at the level of a named patient. Scale without nuance is just noise that arrives faster.

What the agents actually do

In a recent twelve-month engagement with a mid-size brand in a chronic specialty area, the rising-driver signal preceded the corresponding refill-rate dip by six to ten weeks. The team updated their patient-services counseling against the rising cause, and recovered roughly 14% of expected discontinuation in the targeted cohort. The number itself matters less than the shape of the result: the action landed during the window in which it could still matter. Regulators have been clearer on this than the industry sometimes assumes. FDA, EMA and PMDA have all signalled that real-world patient experience evidence is admissible and useful, provided it meets a specific bar: defensible cohort definition, traceable provenance, clear handling of bias, and transparent methods for turning language into structured claims. Hitting that bar is engineering work, not philosophy.

What we deliberately do not automate

What changes, when this works, is not the volume of data in the room. The volume of data in the room has never been the problem. What changes is the latency between a patient being heard and a team being able to act on what they said. That latency, more than anything else, is what we are trying to collapse.