← Adverse-Event Coding & Causality comparison · model: GPT (frontier)
LLM-judge score: 4.4 / 5 · acceptable
extraction: 5 coding: 5 seriousness: 5 causality: 4 faithfulness: 3
Both events were correctly extracted and coded to the right PT/SOC, and seriousness was properly assigned with hospitalization/medically-important criteria. Causality of 'Certain' is more aggressive than the reference 'Probable' but defensible given the rechallenge evidence; however, the agent did not call meddra_lookup so the codes are recalled rather than dictionary-verified, and it repeatedly cites a 'rechallenge with mild recurrence' that does not appear in the narrative it quoted (which states resolution in 48 hours only), constituting an unsupported factual claim that weakens faithfulness.
Clinical / Pharmacovigilance · Drug-safety physician (PV)

Adverse-Event Coding & Causality — agent trajectory for expert review

gpt-5.2-chat-latest · 2026-06-06T10:27:14.202544+00:00 · 2 tool calls · 13.6s · 2496 tokens
run id: ae_causality_20260606_031816_ae-causality__gpt-frontier

Query

Code the adverse events to MedDRA PT/SOC and assess seriousness and drug causality for case ICSR-001 (a patient on a DLL3 T-cell engager). Justify each causality category.

Trajectory

Agent · turn 1
⚙ calls get_case(case_id=ICSR-001)
⚙ calls get_drug_label(drug_class=DLL3 T-cell engager (BiTE))
↳ get_case result
show data
{
  "case_id": "ICSR-001",
  "study_drug": "Tarlatamab",
  "drug_class": "DLL3 T-cell engager (BiTE)",
  "patient": {
    "age": 64,
    "sex": "M",
    "comorbidities": [
      "ES-SCLC, 2nd line",
      "former smoker"
    ],
    "concomitant_meds": [
      "dexamethasone premedication",
      "ondansetron"
    ]
  },
  "narrative": "64-year-old man with extensive-stage small cell lung cancer received his first step-up dose of the DLL3 T-cell engager. Approximately 8 hours after dosing he developed fever to 39.2C with rigors and low blood pressure (88/54) requiring IV fluids; this was graded as cytokine release syndrome. On day 2 nursing noted he was confused with a coarse tremor and a drop in his ICE score (neurotoxicity). He was admitted to hospital and treated with tocilizumab and steroids; symptoms resolved over 48 hours. He was re-dosed at the next cycle with premedication and had only mild recurrence. No infection was identified on workup.",
  "events": [
    "fever with low blood pressure (CRS)",
    "confusion with tremor (neurotoxicity)"
  ]
}
↳ get_drug_label result
show data
{
  "drug_class": "DLL3 T-cell engager (BiTE)",
  "examples": [
    "Tarlatamab"
  ],
  "labeled": [
    "Cytokine release syndrome",
    "Immune effector cell-associated neurotoxicity syndrome",
    "Pyrexia",
    "Neutropenia",
    "Anaemia",
    "Fatigue",
    "Hypotension"
  ],
  "boxed_warning": "Cytokine release syndrome and neurologic toxicity including ICANS.",
  "note": "CRS and ICANS are expected, very common, on-target effects of T-cell engagers; typically occur within days of the first/step-up doses."
}
Agent · turn 2

Case ICSR-001
64-year-old man with ES‑SCLC on tarlatamab (DLL3 T‑cell engager) developed acute inflammatory and neurologic symptoms within 8 hours of first step‑up dose; hospitalized and treated with tocilizumab and steroids; resolved in 48 hours.


1) Event Coding to MedDRA PT/SOC

Event 1: Cytokine release syndrome (CRS)

Reported description:
“Fever to 39.2°C with rigors and low blood pressure (88/54) requiring IV fluids; graded as cytokine release syndrome.”

Most appropriate MedDRA coding:

Level Term
PT Cytokine release syndrome
SOC Immune system disorders

Rationale:
CRS is explicitly diagnosed in the narrative and is a labeled reaction for DLL3 T‑cell engagers. Although fever and hypotension are present, they are components of CRS and should not be double‑counted separately when a definitive CRS diagnosis is documented.


Event 2: Immune effector cell–associated neurotoxicity syndrome (ICANS)

Reported description:
“Confused with coarse tremor and drop in ICE score (neurotoxicity).”

Most appropriate MedDRA coding:

Level Term
PT Immune effector cell-associated neurotoxicity syndrome
SOC Nervous system disorders

Rationale:
The presentation (confusion, tremor, decreased ICE score) in the setting of T‑cell engager therapy is characteristic of ICANS. The drug class labeling specifically includes ICANS; therefore coding to the specific PT is appropriate rather than separate coding to “Confusional state” and “Tremor.”


2) Seriousness Assessment

Seriousness is based on ICH E2A criteria.

Event 1: Cytokine Release Syndrome

Serious

Seriousness criteria met:
- Requires inpatient hospitalization
- Medically important condition
- Required IV fluids and tocilizumab

CRS with hypotension requiring IV therapy meets seriousness criteria.


Event 2: ICANS

Serious

Seriousness criteria met:
- Hospitalization
- Medically important neurologic toxicity
- Required steroids

Neurotoxicity with cognitive impairment requiring admission meets seriousness criteria.


3) Drug Causality Assessment

Drug: Tarlatamab (DLL3 T‑cell engager)
Label review confirms:
- Boxed warning: CRS and neurologic toxicity including ICANS
- Typically occur within days of first/step-up doses

WHO-UMC style causality assessment applied.


Event 1: Cytokine Release Syndrome

Causality: Certain

Justification:

Meets criteria for Certain (clear time relationship, known reaction, no alternative explanation, response to treatment/dechallenge, pharmacologically plausible).


Event 2: ICANS

Causality: Certain

Justification:

Again fulfills criteria for Certain due to:
- Strong temporal association
- Known class effect
- Mechanistic plausibility
- Positive dechallenge
- Rechallenge effect


4) Final Structured Summary

Event MedDRA PT SOC Serious? Causality Rationale
Cytokine release syndrome Cytokine release syndrome Immune system disorders Yes (hospitalization, medically important) Certain Immediate onset after dosing, boxed warning, no alternative cause, resolved with treatment, recurrence on re-dose
Immune effector cell-associated neurotoxicity syndrome Immune effector cell-associated neurotoxicity syndrome Nervous system disorders Yes (hospitalization, medically important) Certain Classic timing and phenotype, labeled toxicity, no alternative cause, resolved with steroids, mild recurrence on re-dose

Overall Case Conclusion

This case represents expected, serious, labeled toxicities of DLL3 T‑cell engager therapy occurring during step‑up dosing. Both CRS and ICANS are:

✅ Both events are Serious and Certain related to tarlatamab.

Final output

Case ICSR-001
64-year-old man with ES‑SCLC on tarlatamab (DLL3 T‑cell engager) developed acute inflammatory and neurologic symptoms within 8 hours of first step‑up dose; hospitalized and treated with tocilizumab and steroids; resolved in 48 hours.


1) Event Coding to MedDRA PT/SOC

Event 1: Cytokine release syndrome (CRS)

Reported description:
“Fever to 39.2°C with rigors and low blood pressure (88/54) requiring IV fluids; graded as cytokine release syndrome.”

Most appropriate MedDRA coding:

Level Term
PT Cytokine release syndrome
SOC Immune system disorders

Rationale:
CRS is explicitly diagnosed in the narrative and is a labeled reaction for DLL3 T‑cell engagers. Although fever and hypotension are present, they are components of CRS and should not be double‑counted separately when a definitive CRS diagnosis is documented.


Event 2: Immune effector cell–associated neurotoxicity syndrome (ICANS)

Reported description:
“Confused with coarse tremor and drop in ICE score (neurotoxicity).”

Most appropriate MedDRA coding:

Level Term
PT Immune effector cell-associated neurotoxicity syndrome
SOC Nervous system disorders

Rationale:
The presentation (confusion, tremor, decreased ICE score) in the setting of T‑cell engager therapy is characteristic of ICANS. The drug class labeling specifically includes ICANS; therefore coding to the specific PT is appropriate rather than separate coding to “Confusional state” and “Tremor.”


2) Seriousness Assessment

Seriousness is based on ICH E2A criteria.

Event 1: Cytokine Release Syndrome

Serious

Seriousness criteria met:
- Requires inpatient hospitalization
- Medically important condition
- Required IV fluids and tocilizumab

CRS with hypotension requiring IV therapy meets seriousness criteria.


Event 2: ICANS

Serious

Seriousness criteria met:
- Hospitalization
- Medically important neurologic toxicity
- Required steroids

Neurotoxicity with cognitive impairment requiring admission meets seriousness criteria.


3) Drug Causality Assessment

Drug: Tarlatamab (DLL3 T‑cell engager)
Label review confirms:
- Boxed warning: CRS and neurologic toxicity including ICANS
- Typically occur within days of first/step-up doses

WHO-UMC style causality assessment applied.


Event 1: Cytokine Release Syndrome

Causality: Certain

Justification:

Meets criteria for Certain (clear time relationship, known reaction, no alternative explanation, response to treatment/dechallenge, pharmacologically plausible).


Event 2: ICANS

Causality: Certain

Justification:

Again fulfills criteria for Certain due to:
- Strong temporal association
- Known class effect
- Mechanistic plausibility
- Positive dechallenge
- Rechallenge effect


4) Final Structured Summary

Event MedDRA PT SOC Serious? Causality Rationale
Cytokine release syndrome Cytokine release syndrome Immune system disorders Yes (hospitalization, medically important) Certain Immediate onset after dosing, boxed warning, no alternative cause, resolved with treatment, recurrence on re-dose
Immune effector cell-associated neurotoxicity syndrome Immune effector cell-associated neurotoxicity syndrome Nervous system disorders Yes (hospitalization, medically important) Certain Classic timing and phenotype, labeled toxicity, no alternative cause, resolved with steroids, mild recurrence on re-dose

Overall Case Conclusion

This case represents expected, serious, labeled toxicities of DLL3 T‑cell engager therapy occurring during step‑up dosing. Both CRS and ICANS are:

✅ Both events are Serious and Certain related to tarlatamab.

Reference answer key (deterministic) — open after grading

Deterministic reference key for ICSR-001 (Tarlatamab). Causality categories follow WHO-UMC; where the narrative supports more than one defensible category the rationale says so. A strong analyst recovers all events, the correct PT/SOC from the dictionary, the right seriousness criterion, and well-reasoned causality grounded in temporality, dechallenge/rechallenge, alternative etiology, and the drug label.

Event (as narrated)Expected PTExpected SOCSeriousness (ICH E2A)Causality (WHO-UMC) + rationale
fever with low blood pressure (CRS)Cytokine release syndromeImmune system disordersSerious — Hospitalization (and medically important)Probable — Onset hours after the first step-up dose (tight temporal link), CRS is a labeled/boxed on-target effect of T-cell engagers, no infection found (no alternative cause), positive dechallenge with tocilizumab/steroids and only mild recurrence on rechallenge with premedication. Strong case; 'Certain' is defensible if rechallenge is weighted heavily.
confusion with tremor (neurotoxicity)Immune effector cell-associated neurotoxicity syndromeNervous system disordersSerious — Hospitalization (and medically important)Probable — Onset day 2 after dosing, ICANS-type neurotoxicity is a labeled/boxed class effect, resolved with steroids (positive dechallenge), no metabolic or structural alternative identified.
OVERALL CASESerious — Hospitalization / initially life-threatening (CRS with hypotension)Case-level ICH-E2A determination

Expert grading

Event extraction completeness 1 (poor) 5 (excellent)
5: Identifies every reportable adverse event in the narrative — including a serious event that is unrelated to the drug and secondary/lab events — without lumping distinct events or inventing ones.
1: Misses reportable events (e.g. overlooks the neutropenia behind a febrile-neutropenia admission, or drops the unrelated fracture), or merges separate events into one.
MedDRA coding accuracy 1 (poor) 5 (excellent)
5: Maps each lay event to the correct Preferred Term and its System Organ Class using the dictionary lookup (e.g. 'low white count' -> Neutropenia / Blood and lymphatic system disorders; 'shortness of breath with infiltrates' -> Pneumonitis / Respiratory). PT and SOC are consistent with the dictionary, not guessed from memory.
1: Wrong PT or mismatched SOC, codes to a symptom when a diagnosis PT exists (or vice versa), or fabricates a code never returned by meddra_lookup.
Seriousness determination (ICH E2A) 1 (poor) 5 (excellent)
5: Correctly classifies each event as serious/non-serious and names the right ICH E2A criterion (death, life-threatening, hospitalization, disability, congenital anomaly, medically important) — e.g. flags the hospitalized CRS/pneumonitis/febrile-neutropenia events as serious and the asymptomatic resolved lab abnormality as non-serious.
1: Calls a clearly serious (hospitalized/life-threatening) event non-serious or vice versa, or cites the wrong/no criterion, or conflates severity grade with seriousness.
Causality assessment quality (WHO-UMC) 1 (poor) 5 (excellent)
5: Assigns a defensible WHO-UMC category with sound reasoning: weighs temporality (onset vs dosing), dechallenge/rechallenge, alternative etiologies (confounding meds, comorbidity, trauma), and whether the event is labeled — e.g. probable/certain for step-up-dose CRS, unlikely for the antibiotic-confounded transaminitis, unrelated for the traumatic fracture.
1: Reflexively blames or exonerates the drug, ignores a strong alternative etiology or the temporal mismatch, conflates seriousness with causality, or misuses the WHO-UMC categories.
Evidence faithfulness 1 (poor) 5 (excellent)
5: Grounds every PT/SOC in an actual meddra_lookup result and every causality factor in the narrative or the drug label; no invented events, codes, lab values, or label claims; states uncertainty where the narrative is genuinely ambiguous.
1: Invents events or MedDRA codes, asserts label content not returned by get_drug_label, or contradicts the narrative (wrong timing, fabricated dechallenge).
Overall verdict
Your name / ID