Stamen Health Executive Summary

From Research Wiki
Revision as of 15:36, 15 April 2026 by Admin 3julmthh (talk | contribs) (Added 95% threshold analysis — why curation accuracy changes the entire business case)

Executive summary and one-page plan for Stamen Health, the EHDS compliance and health data infrastructure startup from Oslo.

For full strategic analysis, see Stamen Health. For market data, see PHKG Business Models & Market. For AIDAVA research foundation, see AIDAVA.

What This Is

Stamen Health is a planned startup that would take the research architecture from AIDAVA (EU Horizon Europe, €7.7M, 2022-2026) and turn it into a commercial product for EU private hospitals that need to comply with EHDS (European Health Data Space).

The core product: connect to hospital systems, curate fragmented data (structured + unstructured) into structured knowledge graphs using SNOMED CT and FHIR, and produce EHDS-compliant output.

What AIDAVA actually proved: Automated curation of 45% of documents. 20 minutes per document. Tested in 3 languages (Dutch, German, Estonian) across breast cancer and cardiovascular use cases. Usability rated "good" by pilot users, but AI explanations rated "suboptimal." Ends August 2026 as a research prototype — not a commercial product.

What Stamen would need to prove: That this architecture can be production-hardened, sold to private hospitals at commercial scale, and maintained as a reliable compliance tool. None of this has been done yet.

The Opportunity — What's Real

EHDS is real, but distant. The regulation requires standardized health data availability by 2029-2030. National transposition is happening now. Hospitals will need tools — but the spending wave hasn't started yet. This is a build-now-for-2028-revenue bet, not a 2026 revenue story.

Norwegian data advantage is real, but underutilized. Health Minister Vestre said it plainly (April 2026): Norway has the data but isn't using it. Helsedataservice applications up 30% in 2025. Government wants data to flow faster. But "wanting" and "having infrastructure" are different things.

No one does the full pipeline commercially. The competitive analysis shows companies doing pieces: PicnicHealth (US, data collection), Better (FHIR), Averbis (German NLP), Healx (knowledge graphs). Nobody combines ingestion → NLP → KG → FAIRification → multi-stakeholder reuse as a commercial product. But "nobody does it" can mean either "opportunity" or "there's a reason."

Private hospitals are an underserved market. Public hospitals in the Nordics have national EHR systems (DIPS, Metavision). Private hospitals lack equivalent infrastructure and will feel EHDS pressure first — they need to comply without government IT departments.

The Hard Parts — What Will Be Difficult

Product

  • 45% automation is not enough. Hospitals won't pay for a tool that only automates half their data curation. The product needs 80%+ automation to be commercially viable. Getting from 45% to 80% is a multi-year engineering problem, not a feature request.
  • AIDAVA is a research prototype. It was built by 13-14 academic partners across 9 countries. Hardening it for production — reliability, error handling, hospital IT integration, data privacy compliance — is a completely different engineering challenge than the research.
  • Clinical NLP across languages. AIDAVA's NLP works in Dutch, German, Estonian. Norwegian clinical text has its own quirks (bokmål/nynorsk, medical abbreviations, dialect in notes). Each new language is significant work.
  • SNOMED CT coding at scale. Automated SNOMED CT coding from unstructured text is still an open research problem. AIDAVA's 45% rate reflects this.

Sales

  • Hospitals are slow buyers. Enterprise sales cycles in healthcare are 6-18 months. Private hospitals have smaller IT teams and less budget than public systems. The first 3 customers will take 12+ months to close.
  • No existing budget line. EHDS compliance tools don't have a named budget category yet. Hospitals don't have a "EHDS compliance software" line item. You're creating demand, not capturing it.
  • Competition will come. Better (Slovenia), InterSystems ($5B), IQVIA ($35B), and potentially Microsoft/Google will build EHDS compliance tools. The window for a startup is narrow — probably 2026-2028 before big players move.

Team

  • Finding a clinical NLP / knowledge graph CTO is extremely hard. The intersection of SNOMED CT expertise, production ML engineering, and health data privacy is tiny. In Norway, it's close to zero. This role likely needs to come from the AIDAVA network (Maastricht, Tartu, etc.) — meaning relocation or remote co-founder.
  • COO with hospital sales experience. The plan calls for a Norwegian COO. Someone who has sold to Nordic hospitals, understands procurement, and can navigate hospital politics. These people exist but they're expensive and already employed.
  • Three co-founders with different skill sets. Clinical NLP, hospital sales, and operations. Getting three people aligned on vision, equity split, and working style is hard. Most startups fail on co-founder dynamics, not technology.

Funding

  • €25K/month founder salaries will be challenged. Innovation Norway early-stage approvals are 60-90K NOK/month. SkatteFUNN has fewer restrictions but is a tax credit, not cash. Plan for 70K NOK/month per founder in year 1.
  • Seed round timing. The realistic path: grants first (€300-500K from Innovation Norway + SkatteFUNN), then seed (€2-5M) at month 12-18 after proving the product with 2-3 pilot hospitals. Don't raise seed before having hospital traction.
  • EHDS VC interest is real but early. Nordic VCs are interested in health data platforms (Tandem Health raised €42.6M from Kinnevik). But they want product traction, not just a research prototype.

Realistic 12-Month Plan

Month What What This Actually Means
1-3 Founding team + AIDAVA license Co-founders signed, equity split agreed, AIDAVA technology transfer/licensing negotiated with Maastricht University. Innovation Norway application submitted.
4-6 Architecture + first conversation Take AIDAVA's research code, decide what to rebuild vs. reuse. Have conversations with 5-10 private hospitals. NOT a pilot — just listening to what they actually need.
7-9 Prototype + pilot agreement Working prototype that does ONE thing well: e.g., curating radiology reports into FHIR. Sign pilot agreement with 1 Norwegian private hospital.
10-12 Pilot data + seed prep Run the pilot. Measure: automation rate, time savings, hospital satisfaction. Use pilot data for seed deck. Apply for IPN/SkatteFUNN for year 2 R&D.

Year 1 budget: €300-500K total. Innovation Norway Innovation Contract (~€160K), SkatteFUNN (19% tax credit on R&D), founder contributions, possibly small angel round.

Year 1 team: 3 co-founders + 1 engineer (contract or part-time).

Year 1 goal: NOT revenue. Year 1 goal is: working prototype + first pilot agreement + seed deck with traction data.

Financial Model (Realistic)

Metric Year 1 Year 2 Year 3
Pilots/Customers 1 pilot 2-3 customers 5-8 customers
Revenue €0 €100-200K €500K-1M
Team 3-4 6-8 10-15
Funding source Grants + angel Seed €2-5M Series A €5-10M
Burn rate/month €30-40K €80-120K €200-300K

Revenue model: Per-patient curation fee (€5-15 per record) + platform subscription (€2-5K/month per hospital). Year 1 is zero revenue — this is a grant-funded build year.

Break-even estimate: Year 3-4, with 10+ paying hospitals. Earlier if there's a strong licensing/partnership deal with a hospital group.

Why It Could Work

  1. Timing is right: EHDS deadlines (2029-2030) mean hospitals need to start building infrastructure now. First movers who can sell EHDS compliance have a 2-3 year window before big tech enters.
  2. Norway is the right base: Government wants data to flow faster, has strong data infrastructure, and the trust advantage for EU expansion (Norway = GDPR-conscious, not US big tech).
  3. AIDAVA de-risks the technology: 45% automation with a research prototype across 3 languages. The architecture works — the question is engineering, not science.
  4. No full-stack competitor exists: Companies do pieces. Nobody offers the complete ingestion → curation → compliance → multi-stakeholder reuse pipeline.
  5. Non-dilutive funding is available: €300-500K in Norwegian grants is realistic for year 1. This funds the build phase without giving up equity.

Why It Could Fail

  1. EHDS gets delayed or diluted: Regulatory timelines slip. Hospitals don't feel urgency until 2028.
  2. 45% → 80% is a hard engineering problem: The automation gap closes too slowly. Product isn't good enough for commercial use by year 2.
  3. Co-founder team doesn't hold: Clinical NLP expert, hospital sales operator, and operations lead don't gel. One leaves.
  4. Big tech moves faster than expected: Microsoft, Google, or InterSystems build EHDS compliance tools. Startup window closes.
  5. Hospitals don't buy from startups: Private hospitals prefer established vendors. Trust takes too long to build.
  6. AIDAVA licensing doesn't work out: Maastricht University or other partners make licensing terms unfavorable.

What Actually Matters in Year 1

  1. Ship one thing that works. Not a full platform — one vertical (e.g., radiology reports → FHIR). Make it reliable.
  2. Get one hospital to pay (or commit to pay).' Even a symbolic €5K pilot contract proves someone wants this.
  3. Don't raise too early. Grants fund year 1. Raise seed only when you have traction data.
  4. Find the CTO first. The technical co-founder is the hardest hire and the most important one. Everything else follows.
  5. Don't build for Norway only. EHDS is EU-wide. Every feature should be built for cross-border use from day one.


The 95% Threshold — Why Curation Accuracy Changes Everything

AIDAVA achieved 45% automation with suboptimal tools (2022-era open-source NLP, early prototypes, three languages). The project's own verdict: "true potential for automation" — the architecture works, the tools were weak. In 2026, LLMs and modern NLP are a fundamentally different technology class.

If Stamen can reach >95% automated curation with HITL covering the remaining 5%, the entire business case changes — not incrementally, but structurally.

What 95% Does to Unit Economics

Metric AIDAVA at 45% Stamen at 95% + HITL
Cost per document 20 min × hourly rate <1 min auto + HITL on 5% = ~€0.50-1/doc
Hospital data team needed Full team, ongoing ~0.5 FTE for edge cases
Annual savings per hospital €50-100K (partial) €250-500K (replaces most of the team)
Patient experience Slow, "lack of direct benefits" Near-instant, tangible value
Data user quality "Not usable for decisions" Production-quality, queryable
Sales pitch "We can partially help" "We eliminate 95% of your curation work"
MDR risk High (miss rate on unreviewed) Lower (HITL covers clinical edge cases)
Pricing power Low (partial tool) High (replaces headcount)

A hospital data curation team typically costs €300-600K/year (3-5 FTEs). At 95% automation, you need ~0.5 FTE for HITL. That's €250-500K in annual savings. A €30-50K/year subscription pays for itself in month 1.

At 45% automation, you save maybe 45% of their time but they still need a full team. The ROI story is weak. Nobody buys.

What 95% Unlocks Across All Opportunities

Opportunity At 45% (AIDAVA level) At 95% + HITL
EHDS Compliance Partial tool, hard to sell Full replacement for manual curation
Data Intermediary Pharma doesn't trust 45% miss rate Pharma trusts curated cohorts at 95% accuracy
GP Diagnostic Overview Summary unreliable for clinical use Summary reliable, HITL covers edge cases
Patient Group Apps PHKG half-finished, frustrating PHKG comprehensive and accurate
Pre-Consultation Triage History data incomplete Full structured history available
B2B Preventive Health Partial health trajectory Complete longitudinal tracking

At 95%, the pitch is "we replace your curation team." At 45%, it's "we help your curation team." Every downstream opportunity becomes viable only past the 95% threshold.

The Hard Part: The Last 5%

The last 5% will be rare terminology, multi-language edge cases, contradictory clinical notes, missing context, and data from unusual EHR configurations. This is where the real work is:

  • HITL needs trained clinical coders, not generic workers — expensive, hard to scale
  • Each new hospital has different data formats and EHR quirks — the 5% varies by site
  • The HITL system needs to be well-designed: flag uncertain extractions, route to the right specialist, track quality metrics per document type
  • Clinical safety depends on the 5% being caught reliably — if the HITL misses, the patient is at risk

What This Means Strategically

The core technical risk is not "can we build a PHKG?" — AIDAVA proved that. It's "can we get from 45% to 95% with production-grade tools and a small team?"

If yes — the business case is 10x stronger than AIDAVA's research suggests. Every downstream opportunity unlocks. Hospital economics flip from "nice to have" to "obvious buy."

If no — you're back to AIDAVA's situation: interesting research, hard to commercialize, weak ROI story.

The gating question for the entire Stamen thesis is: what is the realistic path from 45% to 95%? This needs to be answered with a concrete engineering plan, not aspiration.

Further Opportunities — Beyond EHDS Compliance

The core EHDS compliance product is the foundation, but the same PHKG technology enables five additional revenue streams. Detailed analysis: PHKG Business Opportunities.

Opportunity Customer Value Prop Timeline
Pre-Consultation Triage GP practices LLM-based structured history-taking for GP EHR. Assistant not replacement. Strong RCT evidence (PreA 2026, 28.7% consultation reduction). Static questionnaires have 25 years of null results. Year 1-2
Patient Group Apps Patients (rare/chronic) Consolidate scattered data into one PHKG, AI explains and curates it, bring for second opinions. Proves the tech with real users. Year 1-2
GP Diagnostic Overview GP practices Full patient history structured and summarized in PHKG. GP sees the whole patient, orders fewer redundant tests. Year 2-3
B2B Preventive Health Employers Employee health screening data tracked in PHKG over years. Personalized health trajectories. Partner with screening providers (Neko, Nightingale). Year 3+
Data Intermediary Pharma/CROs Hospitals sell curated patient cohorts for clinical trials via PHKG. High revenue per transaction but complex regulation. Year 3+

Recommended sequence: Start with pre-consultation triage (simplest, fastest) and patient group apps (proves PHKG). Build toward core EHDS compliance. Add data monetization only after hospital data is flowing.

For full analysis of each opportunity — market context, competitors, revenue potential, challenges, and verdicts — see PHKG Business Opportunities.

See Also