Question 1

How does AI connect to Florence if it is a SaaS platform?

Accepted Answer

Florence exposes REST APIs and webhook subscriptions covering documents, binders, signatures, users, studies, and audit trails. We connect AI models in two patterns: event-driven (a Florence webhook fires when a new essential document is uploaded, triggering a serverless function that calls an AI model with the document, classifies it against the TMF Reference Model, extracts metadata, and writes the proposed tags back to the binder for human approval), and on-demand (a study coordinator clicks a button in a thin AI sidecar UI that pulls the relevant Florence record, builds a grounded prompt, and renders the AI response for human approval). Both patterns log every prompt, model version, and human decision back into the binder audit trail so the entire AI-mediated workflow is inspection-ready under 21 CFR Part 11.

Question 2

Which AI models do you connect to Florence and how do you choose?

Accepted Answer

We connect Anthropic Claude, OpenAI GPT and o-series, Google Gemini, Azure OpenAI, and open-weights models served via AWS Bedrock. Model choice depends on the task: Claude excels at long-form reasoning over multi-page protocol amendments and ICF differences; GPT-4o is strong at structured extraction from CVs, GCP training certificates, and FDA Form 1572s; Gemini handles multimodal tasks well (scanned PDFs, signed source documents); locally hosted open-weights models are reserved for environments where data residency rules prohibit external API calls. We benchmark model performance against a frozen evaluation set built from your historical Florence binder data before choosing a default.

Question 3

How do you keep AI compliant with 21 CFR Part 11 and ICH E6(R3) when it touches binder data?

Accepted Answer

AI features that influence GCP-regulated decisions must respect 21 CFR Part 11, EU Annex 11, and ICH E6(R3). Our integration pattern enforces this with several controls: every AI-generated output is logged with the source prompt, model version, retrieval context, timestamp, and the human reviewer who accepted or rejected it; AI never executes a GCP-affecting action autonomously — it produces a recommendation, and a qualified user signs off with a Part 11-compliant electronic signature that captures meaning of signature in Florence; model versions are pinned and changes are managed through formal change control. The FDA draft guidance on AI in drug and biological products (Jan 2025) and the FDA guidance on electronic records in clinical investigations shape our control framework.

Question 4

What are the highest-value AI use cases on Florence today?

Accepted Answer

The use cases with the strongest ROI tend to be: automated regulatory packet QC that flags missing or expiring CVs, GCP training certificates, medical licenses, and FDA Form 1572 elements before activation; TMF Reference Model auto-tagging on every uploaded document so essentials reach the right binder zone without manual filing; ICF amendment differencing across protocol versions with risk-flagged language changes for IRB submission; redaction agents that prepare documents for sponsor or regulator submission; AI-assisted essential document classification using both content and metadata; and natural-language search across binders. We sequence pilots so each use case builds the audit, monitoring, and governance pattern the next one inherits — turning AI from a one-off project into a repeatable capability.

Question 5

How does retrieval-augmented generation (RAG) work over Florence data?

Accepted Answer

RAG over Florence uses three layers. First, an indexing layer extracts binder documents — protocols, ICFs, CVs, training certificates, monitoring follow-up letters, deviation logs — from Florence via API and pushes them into a vector store such as Pinecone, OpenSearch, or Azure AI Search, with Florence role and study metadata preserved so the model only retrieves what the calling user is authorized to see. Second, a retrieval layer filters by binder zone, study, language, and effective date so retired protocol versions are never surfaced. Third, a generation layer grounds the model in retrieved passages with explicit citation back to the binder document and Florence record ID. Indexing runs on a schedule with delta updates triggered by Florence webhooks, so the AI is never working from stale data.

Question 6

What does an AI deployment timeline look like for Florence?

Accepted Answer

A typical first AI workflow on Florence can be delivered in 8-12 weeks: 2 weeks of use case definition and prompt engineering, 2 weeks of integration build (Florence API client, webhook handlers, secrets management, governance configuration), 2 weeks of validation including model evaluation against a frozen test set drawn from historical Florence data, 2 weeks of UAT in a Florence sandbox or staging tenant, and 2 weeks of validated production deployment with hypercare. Subsequent AI workflows are typically additive and follow a 3-6 week cycle once the governance framework is in place. We build the integration layer with infrastructure-as-code so that audit artifacts (prompts, model versions, retrieval indexes, evaluation sets) are continuously version-controlled alongside source code.

Question 7

How do you measure AI model performance in a regulated clinical context?

Accepted Answer

We define a frozen evaluation set of historical binder documents, ICF amendments, regulatory packets, and monitoring letters drawn from your Florence tenant, then measure model outputs against expert-labeled ground truth using structured metrics — accuracy, precision, recall, F1, calibration — and qualitative reviews scored by clinical operations SMEs. For long-form reasoning tasks like ICF amendment differencing, we use rubric-based scoring with LLM-as-judge cross-checked by humans on a 10% sample. Production telemetry tracks human acceptance rate, edit distance between AI draft and final approved output, and any cases where the human disagreed with the model — feeding back into model evaluation and prompt refinement. Drift detection re-runs the eval set on every model version change and triggers revalidation if performance degrades beyond a defined threshold. The NIST AI Risk Management Framework guides our monitoring program.

Question 8

Can you build AI agents that take action in Florence?

Accepted Answer

Yes — we build bounded agents using frameworks like the Anthropic Agent SDK and OpenAI Assistants, plus custom orchestration for multi-step workflows. Clinical-focused agents we have built include site activation packet QC agents, TMF auto-tagging agents, ICF amendment differencing agents, redaction agents, and audit preparation agents. Critically, every agent runs with a tightly scoped tool set against the Florence API, logs every tool call and reasoning step for audit, and never executes a GCP-affecting write autonomously — instead it assembles a proposed change (e.g. a draft regulatory packet, a proposed TMF tag) and routes it to a human for review and electronic-signature approval before commit. The Model Context Protocol is increasingly the standard we use to expose Florence capabilities to agents in a controlled way.

Question 9

How do you handle data residency and patient privacy when Florence data leaves the platform for AI processing?

Accepted Answer

Clinical trial documents frequently contain PHI — subject identifiers, signed consents, source documents with names and dates of birth. For external models, we use zero-retention enterprise endpoints from each provider — Anthropic enterprise zero-retention, Azure OpenAI with abuse-monitoring opt-out where eligible, AWS Bedrock with the AWS data residency commitment. Regional model hosting is selected to match data residency requirements (US, EU, UK, APAC). PHI is masked before leaving the platform when not needed for the task, and free-text fields are scanned for inadvertent PHI before being sent to any external model. HIPAA, GDPR, and CCPA obligations are flowed through to model providers via DPAs and BAAs reviewed during supplier qualification.

Question 10

What does AI governance look like for Florence at our organization?

Accepted Answer

We help clients establish an AI governance committee with clinical operations, IT, regulatory, legal, and security representation, modeled after the structure recommended by the NIST AI RMF and the EU AI Act for high-risk AI systems. Governance artifacts include an AI use case registry with risk classification, a model risk management policy, prompt and model change-control SOPs, an AI incident response procedure, and a periodic AI system review cadence. We also help draft an AI policy that names the systems where AI is permitted (Florence being one of them), the human-in-the-loop requirements per use case, and the escalation path when AI behavior deviates from expectations. See our AI policy and governance services.

Question 11

Can AI help with FDA BIMO inspection preparation?

Accepted Answer

Yes — inspection preparation is one of the most pragmatic AI use cases on Florence. We build agents that assemble inspection-ready document packets on demand: pulling all binder essentials, training records, monitoring visit reports, deviation logs, and source documents relevant to a given FDA BIMO inspection scope; generating a draft narrative response to anticipated inspector questions; and flagging gaps that should be addressed before the inspection. The agent never modifies regulated records — it produces a working pack that the QA and clinical operations team reviews and finalizes. This pattern has compressed inspection prep from weeks to days for several sponsors, and the same agent powers continuous self-audits against the FDA risk-based monitoring guidance between formal inspections.

Question 12

What is the cost structure for an AI-on-Florence engagement?

Accepted Answer

Costs break into three components: implementation (one-time fixed-scope project), AI inference costs (consumed via the model providers, typically a few hundred to a few thousand dollars per month per workflow at sponsor or CRO scale), and ongoing managed services (monthly retainer covering model evaluation, prompt iteration, and integration health). Inference costs scale with the volume of regulated documents and the chosen model — a regulatory packet QC workflow on a mid-size sponsor with 200 site activations per year will cost meaningfully less than a TMF auto-tagging workflow on a large CRO with thousands of monthly document uploads. We model expected costs against historical Florence document volumes during the discovery phase so there are no surprises at production scale.

Florence AI Integration — Connect Claude, GPT & Gemini to Your eRegulatory Platform

Where AI Adds Value in Florence

Built on the Florence REST API

Validated AI Governance for GCP Workflows

Grounded in Your Florence Data

AI Workflows We Build on Florence

Regulatory Packet QC Agent

TMF Auto-Tagging Agent

ICF Amendment Differencing

Document Redaction Agent

BIMO Inspection Prep Agent

Natural-Language Binder Search

How We Run AI Workflows Inside Florence

Pinned Model Versions

Human-in-the-Loop by Default

Zero-Retention Endpoints

AI Use Cases Across the Trial Lifecycle

Study Start-Up

Site Activation

Remote Monitoring

Protocol Amendments

Inspection Readiness

Database Lock & Close-Out

Getting Started With Florence AI

Typical Engagement Path

Frequently Asked Questions

Ready to Layer AI Onto Your Florence Tenant?