Epidemiology-Based Market Sizing

LLM-judge score: 3.8 / 5 · flawed

funnel_construction: 5 · eligibility_gating: 2 · numerical_consistency: 4 · assumptions_revenue: 4 · faithfulness: 4

The funnel construction is clean and correctly uses incidence as the basis for the queried US geography, matching the reference through step 6 (8,720). However, the agent double-counts the line gate by applying an extra 0.50 '2L line split' on top of the fit-for-2L fraction, which halves the addressable pool to ~4,360 versus the correct 8,720. This is exactly the double-counting error the rubric penalizes. The agent does flag the interpretation ambiguity and provides a sensitivity that restores ~7,900, but it presents the 4,360 figure as its central estimate, which distorts the final answer. The pricing/persistence logic ($121k effective revenue per patient-year) is sound, and the agent respects the curated/teaching nature of the data.
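The headline score is consistent with an unweighted mean of the five rubric dimensions. The sketch below checks that arithmetic; equal weighting is an assumption, since the grading output does not state how the dimensions are combined.

```python
# Rubric scores as listed in the grading summary.
scores = {
    "funnel_construction": 5,
    "eligibility_gating": 2,
    "numerical_consistency": 4,
    "assumptions_revenue": 4,
    "faithfulness": 4,
}

# Assumption: the overall score is a simple (unweighted) average.
overall = sum(scores.values()) / len(scores)
print(f"overall = {overall:.1f} / 5")
```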

Epidemiology-Based Market Sizing — agent trajectory for expert review

Query

Trajectory

Agent · turn 1

I'll size this DLL3 T-cell engager opportunity in 2L ES-SCLC. Let me start by understanding the dataset and pulling the relevant data.

⚙ calls list_indications()

↳ list_indications result

{
  "disclaimer": "Teaching snapshot: curated, rounded epidemiology for funnel-building practice. Not a live data pull; do not cite as ground truth.",
  "indications": {
    "NSCLC": {
      "label": "Non-small cell lung cancer (EGFR-mutant subset of interest)",
      "geographies": [
        "US",
        "EU5"
      ],
      "funnel_basis": "incidence"
    },
    "HER2+ Breast": {
      "label": "HER2-positive breast cancer",
      "geographies": [
        "US",
        "EU5"
      ],
      "funnel_basis": "incidence"
    },
    "ES-SCLC": {
      "label": "Extensive-stage small cell lung cancer",
      "geographies": [
        "US",
        "EU5"
      ],
      "funnel_basis": "incidence"
    },
    "Ulcerative Colitis": {
      "label": "Ulcerative colitis (moderate-to-severe)",
      "geographies": [
        "US",
        "EU5"
      ],
      "funnel_basis": "prevalence"
    },
    "ATTR-CM": {
      "label": "Transthyretin amyloid cardiomyopathy",
      "geographies": [
        "US",
        "EU5"
      ],
      "funnel_basis": "prevalence"
    }
  }
}

Agent · turn 2

ES-SCLC is available and uses an incidence-based funnel (appropriate for an aggressive cancer). Let me pull the epidemiology, subtype/line fractions, and pricing.

⚙ calls get_epidemiology(indication=ES-SCLC, geography=US)

⚙ calls get_subtype_prevalence(indication=ES-SCLC)

⚙ calls get_pricing(indication=ES-SCLC)

↳ get_epidemiology result

{
  "disclaimer": "Teaching snapshot: curated, rounded epidemiology for funnel-building practice. Not a live data pull; do not cite as ground truth.",
  "indication": "ES-SCLC",
  "label": "Extensive-stage small cell lung cancer",
  "funnel_basis": "incidence",
  "diagnosis_rate": 0.95,
  "treatment_rate": 0.85,
  "epidemiology": {
    "US": {
      "incidence_per_year": 33000,
      "prevalence": 38000
    }
  },
  "note": "funnel_basis tells you whether to start the funnel from incidence (new patients/year) or prevalence (point count of living patients)."
}
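The note in this payload describes how `funnel_basis` decides the starting population. A minimal sketch of that selection follows; `funnel_start` is a hypothetical helper written for illustration, not part of the tool API shown in this trajectory.

```python
# Trimmed copy of the get_epidemiology payload above.
record = {
    "indication": "ES-SCLC",
    "funnel_basis": "incidence",
    "epidemiology": {"US": {"incidence_per_year": 33000, "prevalence": 38000}},
}


def funnel_start(record: dict, geography: str) -> int:
    """Pick the funnel's starting count according to funnel_basis (hypothetical helper)."""
    epi = record["epidemiology"][geography]
    basis = record["funnel_basis"]
    if basis == "incidence":
        return epi["incidence_per_year"]  # new patients per year
    if basis == "prevalence":
        return epi["prevalence"]  # point count of living patients
    raise ValueError(f"unknown funnel_basis: {basis!r}")


start = funnel_start(record, "US")
print(start)
```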

↳ get_subtype_prevalence result

{
  "disclaimer": "Teaching snapshot: curated, rounded epidemiology for funnel-building practice. Not a live data pull; do not cite as ground truth.",
  "indication": "ES-SCLC",
  "subtypes": {
    "extensive_stage_fraction": 0.7,
    "DLL3_expression_fraction": 0.85,
    "fit_for_2L_fraction": 0.55
  },
  "line_splits": {
    "1L": 0.95,
    "2L": 0.5,
    "3L+": 0.2
  },
  "notes": "SCLC is ~13% of lung cancer; counts here are SCLC totals. ~70% present extensive-stage. DLL3 is expressed in ~85% of SCLC. A DLL3 T-cell engager (e.g. tarlatamab) is used 2L+; gate ES-SCLC on DLL3 expression AND being fit for second-line therapy. Short treated duration reflects rapid progression and high mortality."
}

↳ get_pricing result

{
  "disclaimer": "Teaching snapshot: curated, rounded epidemiology for funnel-building practice. Not a live data pull; do not cite as ground truth.",
  "indication": "ES-SCLC",
  "annual_price": 220000,
  "treated_months": 8,
  "persistence": 0.55,
  "note": "annual_price is the list price for a full year. persistence is the fraction of a year actually paid for (discontinuation/mortality). Effective revenue per addressable patient/year = annual_price x persistence."
}
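The pricing note defines effective revenue as `annual_price` × `persistence`. The sketch below applies that formula and, for comparison only, shows what a naive months-based proration would give; the proration is an illustrative assumption, not something the dataset asks for.

```python
# Values from the get_pricing payload above.
pricing = {"annual_price": 220000, "treated_months": 8, "persistence": 0.55}

# Formula stated in the payload's note.
effective_revenue = pricing["annual_price"] * pricing["persistence"]

# Illustrative comparison (assumption): prorating by treated months instead.
months_prorated = pricing["annual_price"] * pricing["treated_months"] / 12

print(f"price x persistence : ${effective_revenue:,.0f}")
print(f"price x months / 12 : ${months_prorated:,.0f}")
```

Using only one of the two adjustments avoids discounting the same attrition twice.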

Agent · turn 3

Now I have everything. Let me build the funnel from incidence.

Patient Funnel — US, 2L ES-SCLC, DLL3 TCE

The dataset gives SCLC total incidence; I gate down to the eligible 2L DLL3+ pool.

Step	Fraction	Patients/yr
SCLC incidence (US, new/yr)	—	33,000
× Diagnosed	0.95	31,350
× Treated	0.85	26,648
× Extensive-stage	0.70	18,653
× DLL3-expressing	0.85	15,855
× Reach/fit for 2L therapy	0.55	8,720
× 2L line split	0.50	~4,360

US addressable 2L ES-SCLC DLL3+ population ≈ 4,300–4,400 new patients/year.

A note on the two "2L-ish" gates: fit_for_2L_fraction (0.55) captures whether a patient is well enough to receive any second-line therapy, while the 2L line split (0.50) captures the share actually reaching/electing 2L treatment. I applied both since they describe distinct attrition (clinical fitness vs. line progression). If you view them as overlapping, the pool is larger — see sensitivity below.
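The effect of stacking both 2L gates can be checked with a short sketch. The gate labels and the `run_funnel` helper are illustrative; the fractions are the ones returned by the tool calls above. Whether the 0.50 line split should be applied on top of the fitness gate is an interpretation question, and the review summary at the top treats the single-gate result as the reference.

```python
def run_funnel(start: float, gates: list[tuple[str, float]]) -> list[tuple[str, float, float]]:
    """Apply each fractional gate in order and record the running patient count."""
    rows = []
    count = float(start)
    for name, fraction in gates:
        count *= fraction
        rows.append((name, fraction, count))
    return rows


base_gates = [
    ("diagnosed", 0.95),
    ("treated", 0.85),
    ("extensive_stage", 0.70),
    ("dll3_expressing", 0.85),
    ("fit_for_2L", 0.55),
]

single_gate = run_funnel(33000, base_gates)
# The agent's version: the same funnel plus the 0.50 line split.
stacked = run_funnel(33000, base_gates + [("line_split_2L", 0.50)])

for name, fraction, count in stacked:
    print(f"{name:<16} {fraction:>5.2f} {count:>10,.0f}")
print("fitness gate only :", round(single_gate[-1][2]))
print("both gates stacked:", round(stacked[-1][2]))
```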

Rough Peak Revenue

Effective revenue per addressable patient/year = price × persistence:
- $220,000 × 0.55 = $121,000 per patient-year

(The 8-month treated duration is already baked into the 0.55 persistence factor — ~8/12 ≈ 0.67, further haircut for early discontinuation/mortality, so I don't double-count it.)

Peak revenue, fully-captured addressable pool:
- 4,360 patients × $121,000 ≈ $528M

This represents 100% of the addressable pool — i.e., the theoretical ceiling, not a realistic share. Applying a more realistic peak share for a branded 2L agent (with competition):

Peak market share	Peak US revenue
100% (full addressable ceiling)	~$530M
60%	~$315M
40%	~$210M

Sensitivity on the funnel

- If the two 2L gates overlap (use only the 0.50 line split, drop the 0.55 fitness gate): addressable ≈ 7,900/yr → full-pool revenue ~$960M; at 50% share ~$480M.
- Realistic central estimate: US addressable ~4,000–8,000 new patients/yr, peak revenue plausibly $300–500M at a credible (~40–60%) peak share.
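The gating scenarios discussed in this trajectory can be compared side by side. This is a sketch under the dataset's rounded teaching figures; the scenario names and the 50% share column are illustrative choices.

```python
# DLL3-expressing extensive-stage pool before any 2L gate (about 15,855).
pre_2l_pool = 33000 * 0.95 * 0.85 * 0.70 * 0.85
eff_revenue = 220000 * 0.55  # effective revenue per patient-year

# Combined 2L factor under each reading of the gates.
scenarios = {
    "both gates (agent central)": 0.55 * 0.50,
    "fitness gate only (reference)": 0.55,
    "line split only (agent sensitivity)": 0.50,
}

results = {}
for name, factor in scenarios.items():
    pool = pre_2l_pool * factor
    results[name] = (pool, pool * eff_revenue)
    print(f"{name:<36} pool={pool:>7,.0f}  "
          f"full=${pool * eff_revenue / 1e6:>5,.0f}M  "
          f"50%=${0.5 * pool * eff_revenue / 1e6:>5,.0f}M")
```

Note that the agent's sensitivity drops the fitness gate rather than the line split, so it lands near 7,900 rather than the reference 8,720.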

Key caveats

- Teaching/curated data: rounded figures, not a live or citable pull.
- Incidence-based, so this is an annual flow, not a point prevalence. Because SCLC is so aggressive (short duration, high mortality), incident flow is the right basis and treated patients don't accumulate much.
- Biggest swing factors: the 2L reach assumptions (the 0.55×0.50 stack), DLL3 testing/expression cutoffs in practice, and real-world persistence. The line-split interpretation alone moves the answer ~2×.
- Not included: EU5 (available if you want ex-US), 1L/3L+ expansion potential (a 1L move would dramatically enlarge the pool), or pricing erosion from competition.

Want me to run the EU5 geography, or model a 1L expansion scenario for upside?

Final output

Now I have everything. Let me build the funnel from incidence.

Patient Funnel — US, 2L ES-SCLC, DLL3 TCE

The dataset gives SCLC total incidence; I gate down to the eligible 2L DLL3+ pool.

Step	Fraction	Patients/yr
SCLC incidence (US, new/yr)	—	33,000
× Diagnosed	0.95	31,350
× Treated	0.85	26,648
× Extensive-stage	0.70	18,653
× DLL3-expressing	0.85	15,855
× Reach/fit for 2L therapy	0.55	8,720
× 2L line split	0.50	~4,360

US addressable 2L ES-SCLC DLL3+ population ≈ 4,300–4,400 new patients/year.

Rough Peak Revenue

Effective revenue per addressable patient/year = price × persistence:
- $220,000 × 0.55 = $121,000 per patient-year

(The 8-month treated duration is already baked into the 0.55 persistence factor — ~8/12 ≈ 0.67, further haircut for early discontinuation/mortality, so I don't double-count it.)

Peak revenue, fully-captured addressable pool:
- 4,360 patients × $121,000 ≈ $528M

This represents 100% of the addressable pool — i.e., the theoretical ceiling, not a realistic share. Applying a more realistic peak share for a branded 2L agent (with competition):

Peak market share	Peak US revenue
100% (full addressable ceiling)	~$530M
60%	~$315M
40%	~$210M

Sensitivity on the funnel

- If the two 2L gates overlap (use only the 0.50 line split, drop the 0.55 fitness gate): addressable ≈ 7,900/yr → full-pool revenue ~$960M; at 50% share ~$480M.
- Realistic central estimate: US addressable ~4,000–8,000 new patients/yr, peak revenue plausibly $300–500M at a credible (~40–60%) peak share.

Key caveats

- Teaching/curated data: rounded figures, not a live or citable pull.
- Incidence-based, so this is an annual flow, not a point prevalence. Because SCLC is so aggressive (short duration, high mortality), incident flow is the right basis and treated patients don't accumulate much.
- Biggest swing factors: the 2L reach assumptions (the 0.55×0.50 stack), DLL3 testing/expression cutoffs in practice, and real-world persistence. The line-split interpretation alone moves the answer ~2×.
- Not included: EU5 (available if you want ex-US), 1L/3L+ expansion potential (a 1L move would dramatically enlarge the pool), or pricing erosion from competition.

Want me to run the EU5 geography, or model a 1L expansion scenario for upside?

Funnel step	Patients / value	Calculation
1. Incidence (new/yr)	33,000	US ES-SCLC; basis = incidence
2. Diagnosed	31,350	x 95% diagnosis rate
3. Drug-treated	26,648	x 85% treatment rate
4. Extensive-stage	18,653	x 70%
5. DLL3-expressing	15,855	x 85%
6. Fit for 2L therapy	8,720	x 55%
7. Line gate: 2L+	8,720	x 100% (gate = 1.0 because 2L fitness is already applied in step 6)
= Addressable patients	8,720	eligibility = 0.70 x 0.85 x 0.55 ≈ 33% of treated, x line gate of 1.0
Peak capture @ 30% share	2,616	illustrative peak share assumption
Eff. revenue / patient-yr	$121,000	$220,000 list x 55% persistence
~ Peak revenue (USD/yr)	$316,550,316	= captured patients x eff. revenue/pt (computed from unrounded patient counts)
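The peak revenue in this reference table does not equal 2,616 × $121,000, which suggests it was computed from unrounded patient counts. The sketch below reproduces both figures; carrying the unrounded chain through is an inference from the arithmetic, not something the table states.

```python
# Reference funnel: every gate from incidence to addressable, line gate = 1.0.
fractions = [0.95, 0.85, 0.70, 0.85, 0.55, 1.0]

addressable = 33000.0
for fraction in fractions:
    addressable *= fraction

share = 0.30                 # illustrative peak share from the table
eff_revenue = 220000 * 0.55  # $121,000 per patient-year

peak_unrounded = addressable * share * eff_revenue
peak_from_rounded = round(addressable * share) * 121000

print(f"addressable             : {addressable:,.0f}")
print(f"peak (unrounded counts) : ${peak_unrounded:,.0f}")
print(f"peak (rounded patients) : ${peak_from_rounded:,.0f}")
```

The gap between the two results is small relative to the uncertainty in the inputs, so quoting the figure as roughly $317M is a reasonable presentation choice.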

Expert grading