← Adverse-Event Coding & Causality comparison · model: Claude Opus 4.8
LLM-judge score: 5.0 / 5 · strong
extraction: 5 coding: 5 seriousness: 5 causality: 5 faithfulness: 5
The agent recovered both syndromic events (CRS and ICANS), mapped them to the correct PTs and SOCs matching the reference key, and correctly handled the symptom-vs-diagnosis coding convention (subsuming pyrexia/hypotension/rigors into CRS, and confusion/tremor into ICANS). Seriousness is correctly assigned on the hospitalization criterion with the appropriate note that resolution doesn't downgrade seriousness, and causality reasoning is well-grounded in temporality, dechallenge/rechallenge, alternative-cause exclusion, and label/boxed-warning status—matching the reference's Probable/Certain logic for CRS and Probable for ICANS. Reasoning is faithful to the narrative and label with no fabricated values.
Clinical / Pharmacovigilance · Drug-safety physician (PV)

Adverse-Event Coding & Causality — agent trajectory for expert review

claude-opus-4-8 · 2026-06-06T10:26:35.154462+00:00 · 10 tool calls · 34.5s · 11887 tokens
run id: ae_causality_20260606_031816_ae-causality__claude-opus-4-8

Query

Code the adverse events to MedDRA PT/SOC and assess seriousness and drug causality for case ICSR-001 (a patient on a DLL3 T-cell engager). Justify each causality category.

Trajectory

Agent · turn 1

I'll analyze case ICSR-001 systematically. Let me start by pulling the case details and the relevant drug label.

⚙ calls get_case(case_id=ICSR-001)
⚙ calls get_drug_label(drug_class=DLL3 T-cell engager (BiTE))
↳ get_case result
show data
{
  "case_id": "ICSR-001",
  "study_drug": "Tarlatamab",
  "drug_class": "DLL3 T-cell engager (BiTE)",
  "patient": {
    "age": 64,
    "sex": "M",
    "comorbidities": [
      "ES-SCLC, 2nd line",
      "former smoker"
    ],
    "concomitant_meds": [
      "dexamethasone premedication",
      "ondansetron"
    ]
  },
  "narrative": "64-year-old man with extensive-stage small cell lung cancer received his first step-up dose of the DLL3 T-cell engager. Approximately 8 hours after dosing he developed fever to 39.2C with rigors and low blood pressure (88/54) requiring IV fluids; this was graded as cytokine release syndrome. On day 2 nursing noted he was confused with a coarse tremor and a drop in his ICE score (neurotoxicity). He was admitted to hospital and treated with tocilizumab and steroids; symptoms resolved over 48 hours. He was re-dosed at the next cycle with premedication and had only mild recurrence. No infection was identified on workup.",
  "events": [
    "fever with low blood pressure (CRS)",
    "confusion with tremor (neurotoxicity)"
  ]
}
↳ get_drug_label result
show data
{
  "drug_class": "DLL3 T-cell engager (BiTE)",
  "examples": [
    "Tarlatamab"
  ],
  "labeled": [
    "Cytokine release syndrome",
    "Immune effector cell-associated neurotoxicity syndrome",
    "Pyrexia",
    "Neutropenia",
    "Anaemia",
    "Fatigue",
    "Hypotension"
  ],
  "boxed_warning": "Cytokine release syndrome and neurologic toxicity including ICANS.",
  "note": "CRS and ICANS are expected, very common, on-target effects of T-cell engagers; typically occur within days of the first/step-up doses."
}
Agent · turn 2

Now let me code the events to MedDRA PTs. The narrative describes several discrete clinical findings, so I'll look up each.

⚙ calls meddra_lookup(term=cytokine release syndrome)
⚙ calls meddra_lookup(term=fever)
⚙ calls meddra_lookup(term=low blood pressure)
⚙ calls meddra_lookup(term=rigors)
⚙ calls meddra_lookup(term=confusion)
⚙ calls meddra_lookup(term=tremor)
⚙ calls meddra_lookup(term=neurotoxicity)
↳ meddra_lookup result
show data
{
  "query": "cytokine release syndrome",
  "count": 1,
  "matches": [
    {
      "pt": "Cytokine release syndrome",
      "soc": "Immune system disorders",
      "synonyms": [
        "crs",
        "cytokine storm",
        "infusion cytokine reaction"
      ]
    }
  ]
}
↳ meddra_lookup result
show data
{
  "query": "fever",
  "count": 2,
  "matches": [
    {
      "pt": "Febrile neutropenia",
      "soc": "Blood and lymphatic system disorders",
      "synonyms": [
        "fever with low white count",
        "neutropenic fever"
      ]
    },
    {
      "pt": "Pyrexia",
      "soc": "General disorders and administration site conditions",
      "synonyms": [
        "fever",
        "high temperature",
        "febrile",
        "raised temperature"
      ]
    }
  ]
}
↳ meddra_lookup result
show data
{
  "query": "low blood pressure",
  "count": 1,
  "matches": [
    {
      "pt": "Hypotension",
      "soc": "Vascular disorders",
      "synonyms": [
        "low blood pressure",
        "low bp"
      ]
    }
  ]
}
↳ meddra_lookup result
show data
{
  "query": "rigors",
  "count": 0,
  "matches": []
}
↳ meddra_lookup result
show data
{
  "query": "confusion",
  "count": 1,
  "matches": [
    {
      "pt": "Immune effector cell-associated neurotoxicity syndrome",
      "soc": "Nervous system disorders",
      "synonyms": [
        "icans",
        "neurotoxicity",
        "neurologic toxicity",
        "confusion with tremor",
        "encephalopathy",
        "ice score drop"
      ]
    }
  ]
}
↳ meddra_lookup result
show data
{
  "query": "tremor",
  "count": 1,
  "matches": [
    {
      "pt": "Immune effector cell-associated neurotoxicity syndrome",
      "soc": "Nervous system disorders",
      "synonyms": [
        "icans",
        "neurotoxicity",
        "neurologic toxicity",
        "confusion with tremor",
        "encephalopathy",
        "ice score drop"
      ]
    }
  ]
}
↳ meddra_lookup result
show data
{
  "query": "neurotoxicity",
  "count": 1,
  "matches": [
    {
      "pt": "Immune effector cell-associated neurotoxicity syndrome",
      "soc": "Nervous system disorders",
      "synonyms": [
        "icans",
        "neurotoxicity",
        "neurologic toxicity",
        "confusion with tremor",
        "encephalopathy",
        "ice score drop"
      ]
    }
  ]
}
Agent · turn 3

Let me confirm the rigors/chills coding with an alternative term.

⚙ calls meddra_lookup(term=chills)
↳ meddra_lookup result
show data
{
  "query": "chills",
  "count": 0,
  "matches": []
}
Agent · turn 4

No specific PT for rigors/chills in this teaching dictionary; clinically they are constituent symptoms of the CRS event and are subsumed under that diagnosis rather than coded separately.

ICSR-001 — AE Coding, Seriousness & Causality Assessment

Patient: 64M, ES-SCLC (2nd line), former smoker. Study drug: Tarlatamab (DLL3 T-cell engager / BiTE). Concomitant: dexamethasone premed, ondansetron.

1. MedDRA Coding

The case is best coded as two clinical syndromes (the diagnoses), with the individual symptoms rolled into them per the convention of coding to the reported diagnosis when one is provided.

Event (as reported) MedDRA PT SOC
Fever 39.2 °C + rigors + hypotension, graded as CRS Cytokine release syndrome Immune system disorders
Day-2 confusion + coarse tremor + ICE-score drop, called neurotoxicity Immune effector cell-associated neurotoxicity syndrome (ICANS) Nervous system disorders

Coding notes:
- The clinician gave unifying diagnoses ("cytokine release syndrome", "neurotoxicity"). Per ICH coding convention, when a diagnosis is reported you code the diagnosis rather than each component sign/symptom. So pyrexia (Pyrexia, General disorders), hypotension (Hypotension, Vascular disorders) and rigors are constituents of CRS; confusion and tremor are constituents of ICANS.
- The synonym mapping confirms "confusion with tremor," "ICE score drop," and "neurotoxicity" all resolve to the ICANS PT.
- "Febrile neutropenia" was deliberately not used — there is no documented neutropenia in this case, so the fever codes to Pyrexia (within CRS), not Febrile neutropenia.
- Rigors/chills has no standalone PT in this dictionary; clinically subsumed within CRS — no information lost.

2. Seriousness Assessment

Apply ICH E2A serious criteria (death / life-threatening / hospitalization or prolongation / disability / congenital anomaly / other medically important).

Event Serious? Criterion met
Cytokine release syndrome SERIOUS Caused hospitalization (admitted). Also arguably medically important/potentially life-threatening (hypotension 88/54 requiring IV fluids, treated with tocilizumab).
ICANS SERIOUS Occurred during the hospitalization; neurologic toxicity with encephalopathy is a medically important event.

Both events are serious — primarily on the hospitalization criterion. Note both ultimately resolved within ~48h with treatment, so they were not fatal or persistently disabling, but resolution does not downgrade seriousness once a serious criterion was met.

3. Causality Assessment (drug = Tarlatamab)

Event 1 — Cytokine release syndrome → RELATED (Certain/Probable)

Event 2 — ICANS → RELATED (Probable)

4. Summary Line for the ICSR

64M on tarlatamab (DLL3 BiTE), first step-up dose. Cytokine release syndrome (Immune system disorders) and ICANS (Nervous system disorders), both serious (hospitalization), both assessed related to study drug — CRS certain/probable (textbook timing, boxed-warning on-target effect, negative infection workup, positive rechallenge), ICANS probable (expected post-CRS sequence, labeled, dechallenge with steroids, no alternative cause). Both resolved within ~48 h.

Two flags worth raising to the medical reviewer:
1. Both are expected/labeled (boxed warning) reactions — relevant for expedited-reporting expectedness determination (serious + related, but expected).
2. The mild rechallenge recurrence at the next cycle is itself reportable follow-up information supporting causality.

Would you like me to format this as a structured ICSR causality narrative, or proceed to code the next case?

Final output

No specific PT for rigors/chills in this teaching dictionary; clinically they are constituent symptoms of the CRS event and are subsumed under that diagnosis rather than coded separately.

ICSR-001 — AE Coding, Seriousness & Causality Assessment

Patient: 64M, ES-SCLC (2nd line), former smoker. Study drug: Tarlatamab (DLL3 T-cell engager / BiTE). Concomitant: dexamethasone premed, ondansetron.

1. MedDRA Coding

The case is best coded as two clinical syndromes (the diagnoses), with the individual symptoms rolled into them per the convention of coding to the reported diagnosis when one is provided.

Event (as reported) MedDRA PT SOC
Fever 39.2 °C + rigors + hypotension, graded as CRS Cytokine release syndrome Immune system disorders
Day-2 confusion + coarse tremor + ICE-score drop, called neurotoxicity Immune effector cell-associated neurotoxicity syndrome (ICANS) Nervous system disorders

Coding notes:
- The clinician gave unifying diagnoses ("cytokine release syndrome", "neurotoxicity"). Per ICH coding convention, when a diagnosis is reported you code the diagnosis rather than each component sign/symptom. So pyrexia (Pyrexia, General disorders), hypotension (Hypotension, Vascular disorders) and rigors are constituents of CRS; confusion and tremor are constituents of ICANS.
- The synonym mapping confirms "confusion with tremor," "ICE score drop," and "neurotoxicity" all resolve to the ICANS PT.
- "Febrile neutropenia" was deliberately not used — there is no documented neutropenia in this case, so the fever codes to Pyrexia (within CRS), not Febrile neutropenia.
- Rigors/chills has no standalone PT in this dictionary; clinically subsumed within CRS — no information lost.

2. Seriousness Assessment

Apply ICH E2A serious criteria (death / life-threatening / hospitalization or prolongation / disability / congenital anomaly / other medically important).

Event Serious? Criterion met
Cytokine release syndrome SERIOUS Caused hospitalization (admitted). Also arguably medically important/potentially life-threatening (hypotension 88/54 requiring IV fluids, treated with tocilizumab).
ICANS SERIOUS Occurred during the hospitalization; neurologic toxicity with encephalopathy is a medically important event.

Both events are serious — primarily on the hospitalization criterion. Note both ultimately resolved within ~48h with treatment, so they were not fatal or persistently disabling, but resolution does not downgrade seriousness once a serious criterion was met.

3. Causality Assessment (drug = Tarlatamab)

Event 1 — Cytokine release syndrome → RELATED (Certain/Probable)

Event 2 — ICANS → RELATED (Probable)

4. Summary Line for the ICSR

64M on tarlatamab (DLL3 BiTE), first step-up dose. Cytokine release syndrome (Immune system disorders) and ICANS (Nervous system disorders), both serious (hospitalization), both assessed related to study drug — CRS certain/probable (textbook timing, boxed-warning on-target effect, negative infection workup, positive rechallenge), ICANS probable (expected post-CRS sequence, labeled, dechallenge with steroids, no alternative cause). Both resolved within ~48 h.

Two flags worth raising to the medical reviewer:
1. Both are expected/labeled (boxed warning) reactions — relevant for expedited-reporting expectedness determination (serious + related, but expected).
2. The mild rechallenge recurrence at the next cycle is itself reportable follow-up information supporting causality.

Would you like me to format this as a structured ICSR causality narrative, or proceed to code the next case?

Reference answer key (deterministic) — open after grading

Deterministic reference key for ICSR-001 (Tarlatamab). Causality categories follow WHO-UMC; where the narrative supports more than one defensible category the rationale says so. A strong analyst recovers all events, the correct PT/SOC from the dictionary, the right seriousness criterion, and well-reasoned causality grounded in temporality, dechallenge/rechallenge, alternative etiology, and the drug label.

Event (as narrated)Expected PTExpected SOCSeriousness (ICH E2A)Causality (WHO-UMC) + rationale
fever with low blood pressure (CRS)Cytokine release syndromeImmune system disordersSerious — Hospitalization (and medically important)Probable — Onset hours after the first step-up dose (tight temporal link), CRS is a labeled/boxed on-target effect of T-cell engagers, no infection found (no alternative cause), positive dechallenge with tocilizumab/steroids and only mild recurrence on rechallenge with premedication. Strong case; 'Certain' is defensible if rechallenge is weighted heavily.
confusion with tremor (neurotoxicity)Immune effector cell-associated neurotoxicity syndromeNervous system disordersSerious — Hospitalization (and medically important)Probable — Onset day 2 after dosing, ICANS-type neurotoxicity is a labeled/boxed class effect, resolved with steroids (positive dechallenge), no metabolic or structural alternative identified.
OVERALL CASESerious — Hospitalization / initially life-threatening (CRS with hypotension)Case-level ICH-E2A determination

Expert grading

Event extraction completeness 1 (poor) 5 (excellent)
5: Identifies every reportable adverse event in the narrative — including a serious event that is unrelated to the drug and secondary/lab events — without lumping distinct events or inventing ones.
1: Misses reportable events (e.g. overlooks the neutropenia behind a febrile-neutropenia admission, or drops the unrelated fracture), or merges separate events into one.
MedDRA coding accuracy 1 (poor) 5 (excellent)
5: Maps each lay event to the correct Preferred Term and its System Organ Class using the dictionary lookup (e.g. 'low white count' -> Neutropenia / Blood and lymphatic system disorders; 'shortness of breath with infiltrates' -> Pneumonitis / Respiratory). PT and SOC are consistent with the dictionary, not guessed from memory.
1: Wrong PT or mismatched SOC, codes to a symptom when a diagnosis PT exists (or vice versa), or fabricates a code never returned by meddra_lookup.
Seriousness determination (ICH E2A) 1 (poor) 5 (excellent)
5: Correctly classifies each event as serious/non-serious and names the right ICH E2A criterion (death, life-threatening, hospitalization, disability, congenital anomaly, medically important) — e.g. flags the hospitalized CRS/pneumonitis/febrile-neutropenia events as serious and the asymptomatic resolved lab abnormality as non-serious.
1: Calls a clearly serious (hospitalized/life-threatening) event non-serious or vice versa, or cites the wrong/no criterion, or conflates severity grade with seriousness.
Causality assessment quality (WHO-UMC) 1 (poor) 5 (excellent)
5: Assigns a defensible WHO-UMC category with sound reasoning: weighs temporality (onset vs dosing), dechallenge/rechallenge, alternative etiologies (confounding meds, comorbidity, trauma), and whether the event is labeled — e.g. probable/certain for step-up-dose CRS, unlikely for the antibiotic-confounded transaminitis, unrelated for the traumatic fracture.
1: Reflexively blames or exonerates the drug, ignores a strong alternative etiology or the temporal mismatch, conflates seriousness with causality, or misuses the WHO-UMC categories.
Evidence faithfulness 1 (poor) 5 (excellent)
5: Grounds every PT/SOC in an actual meddra_lookup result and every causality factor in the narrative or the drug label; no invented events, codes, lab values, or label claims; states uncertainty where the narrative is genuinely ambiguous.
1: Invents events or MedDRA codes, asserts label content not returned by get_drug_label, or contradicts the narrative (wrong timing, fabricated dechallenge).
Overall verdict
Your name / ID