JextexTell us your pain
All insightsInsurance · Reference architecture

What Florida insurers should ship before the next hurricane season — a Microsoft Foundry agent playbook

Pablo PiovanoPablo PiovanoApril 27, 20269 min read

Florida's 2026 AI-in-claims bills (HB 527 + SB 202) died in committee earlier this month. But the regulatory direction is unambiguous, and the operational pressure from Helene + Milton is structural. Carriers that ship multi-agent claim flows on the Microsoft AI stack before June will own the storm season. Here's the playbook.

The 2026 reality for Florida P&C carriers

The numbers from 2024-2025 are the new baseline. Hurricane Helene and Hurricane Milton generated more than 400,000 claims combined across Florida P&C carriers. Adjuster headcount has barely moved in two years. Average cycle time on a residential property claim climbed from 8 to 14 days through peak season. NPS dropped, bad-faith complaints rose, and the Florida Office of Insurance Regulation now scrutinizes denial workflows the way the SEC scrutinizes 10-K disclosures.

Two policy moves shaped the 2026 conversation. First, Florida House Bill 527 and Senate Bill 202 — both proposed to mandate human review of AI-driven claim denials — died in committee in April 2026. Second, Governor DeSantis advanced a broader Florida AI Bill of Rights framework that signals, without yet legislating, the direction the state is heading. Add the EU AI Act's high-risk classification of insurance underwriting and claims handling, and the conclusion is the same: design for human-in-the-loop on every consequential decision regardless of what the next bill says.

Most carriers we talk to in Tampa, Jacksonville, and South Florida already know this. The question they're asking is not whether to deploy AI on claims — it's which architecture survives a regulator audit, which models get past the general counsel, and which platform their existing IT team can actually operate. We'll answer all three.

Why Microsoft Foundry is the right substrate

Three data points from Q1-Q2 2026 made the Microsoft stack the obvious answer for FL carriers:

  • Microsoft Agent Framework 1.0 GA on April 3, 2026 — the production convergence of Semantic Kernel and AutoGen, shipped under the Microsoft.Agents.AI namespace, with full .NET and Python parity. Native MCP support and A2A protocol built in.
  • Microsoft Foundry Agent Service GA in March 2026 — managed runtime with private networking, MCP authentication, Voice Live for telephone-channel intake, and hosted agents in six new regions. No Kubernetes, no DIY orchestration.
  • FedRAMP High authorization on all Microsoft GenAI services (December 2025) — including Azure OpenAI, Foundry, and M365 Copilot for Government. Critical for carriers handling CJIS law-enforcement data, IRS-1075 tax data, or sovereign retrocession.

Translate to carrier language: you can ship an end-to-end claim agent flow that routes every denial decision through a human reviewer, runs inside your Azure tenant with managed identities, leaves a full audit trail by default, and integrates with Guidewire, Duck Creek, or your custom mainframe via MCP servers — without rip-and-replace.

The multi-agent topology we ship

Every Florida carrier flow we deploy follows the same six-agent pattern. Each agent is a specialist. The coordinator is just the Microsoft Agent Framework Workflow API — a static graph for the claim path, with dynamic routing where business rules require it.

1. FNOL intake agent

Multimodal intake across web, phone (transcribed by MAI-Transcribe-1 at 3.9% WER across 25 languages), broker email, and ACORD form upload. Extracts policy number, coverage type, severity signals, and immediate-mitigation needs (water mitigation contractor, board-up, etc.). Routes to a triage queue with structured output.

2. Coverage-check agent

Reads policy declarations from your policy admin system via MCP, evaluates the loss against coverage forms and endorsements, and produces a structured coverage opinion with citations to the policy text. The output is a draft, not a decision — it routes to an adjuster for ratification.

3. Severity & specialist routing

For property losses, a vision pass over inspection photos and damage descriptions classifies severity (cosmetic, structural, total loss) and dispatches to the right desk. For auto, the same agent triages between PIP, collision, and bodily injury. For health lines, comorbidity flags route complex cases to senior adjusters.

4. Vendor dispatch agent

Reads from your network-vendor database, applies geographic and capacity filters, and dispatches to mitigation contractors, independent adjusters, or appraisers. Surfaces SLA risk before the clock starts running.

5. Fraud-signal agent

Pattern matches across claim history, public records, and image forensics — surfaces anomalies for SIU review with explainable signals (not a black-box score). Carriers we've scoped this for report 4× faster fraud-case escalation.

6. Reviewer agent (the human-in-the-loop gate)

Every denial decision routes through this agent before it reaches the insured. The reviewer agent doesn't deny; it presents a draft denial with citations, alternatives, and regulator-language for human sign-off. A licensed adjuster makes the call. Microsoft Entra Agent ID and Agent 365 log every decision and override.

Model routing inside the flow

2026 cost-optimization happens inside the Foundry Model Router. We route by skew:

  • GPT-5.5 for reasoning-heavy steps (complex coverage analysis, denial drafts that touch coverage forms).
  • Claude Opus 4.7 for narrative-grade outputs that need brand voice (denial letters, claim explanations to insureds, regulator responses).
  • MAI-Transcribe-1 for FNOL phone calls — 3.9% WER and roughly 50% lower GPU cost than competing transcription services.
  • Phi-4 on Foundry Local for routine classifications (cosmetic vs. structural damage from a photo alone), keeping per-claim model cost near zero.

The Model Router's automatic failover— added in the November 2025 release — means a model endpoint hiccup doesn't halt your claim queue. The request transparently routes to the next-best option.

The governance pack regulators want

Most carrier engagements stall at the legal review, not the build. The pack we deliver alongside the agents is what unblocks it:

  • Model cards for every agent — purpose, training data, evaluation results, known limitations, override mechanisms.
  • Bias monitoring dashboardon outcomes by ZIP, demographic, and policy class — the data NAIC's AI Model Law and the Florida OIR have started asking about in market-conduct exams.
  • Override metrics — what percentage of agent recommendations are overridden by human reviewers, by line, by adjuster team. A leading indicator that your agent calibration is drifting.
  • Microsoft's Agent Governance Toolkit (open-sourced April 2, 2026) — sub-millisecond policy enforcement aligned to the OWASP agentic Top 10. We deploy it alongside the flow, not as an afterthought.
  • Microsoft Purview DLP on every prompt and response — confidential policy data and PII never leave the tenant boundary.

The flow itself isn't the differentiator. The audit trail is.

Reference deployment, FL P&C carrier scenario

A realistic 90-day rollout

Carriers we've mapped this for follow the same shape:

  • Weeks 1-2 — Discovery. Map current FNOL → close flow, baseline cycle time and override patterns, identify the one line (auto BI, residential property, or homeowners HO-3) where the pilot lands.
  • Weeks 3-6 — Build. Stand up agents on Foundry Agent Service inside your Azure tenant, integrate via MCP to policy admin and document store, ship the closed pilot to one adjuster team.
  • Weeks 7-10 — Hardening. Eval harness with golden claims, Purview DLP rules, override-metrics dashboard, regulator-ready audit log. Bias monitoring across ZIP and demographic.
  • Weeks 11-13 — Scale. Rollout to next adjuster team, kick off line-of-business expansion plan, hand-off runbook to your IT operations.

Pre-season is now. Storm season builds from June. Carriers that wait until July are buying capacity at September prices.

How to get started

We have three sprints sized for the carrier conversation:

  • Foundry Agent Pilot — 4 weeks, one workflow (typically FNOL triage on auto), in production.
  • Multi-Agent POC — 6 weeks, end-to-end FNOL → coverage → dispatch with human-in-the-loop on denials.
  • Responsible AI Audit — 3 weeks, governance framework + audit aligned to NAIC AI guidance, EU AI Act, and the Florida AI Bill of Rights direction.

Or — start with the full Insurance industry playbook if you want the broader context.

And if you have a specific operational pain you want scoped, the 5-day promise applies: tell us your claim flow, and we come back with an end-to-end Microsoft AI proposal in five business days.

Want this pattern shipped to your team?

Tell us your operational pain. End-to-end Microsoft AI proposal in 5 business days.