Blog

Engineering, product and documents.

Engineering pillar

ReAct architecture for documents

A technical deep-dive on why fluex's document extraction is built on a ReAct agentic architecture rather than pure RAG, classical OCR, or single-pass LLM extraction — with three worked examples and an honest take on when not to use it.

Read article
Engineering

Prompt injection in document AI: the threat model nobody scopes

Every document your pipeline ingests is untrusted instruction text. The threat model, three real attack patterns, and the four defenses that actually hold.

9 min
Engineering

SOC 2 Type II for AI startups: what to build in

The five architectural commitments that turn the SOC 2 audit from a quarter-long cleanup project into an emergent property of your platform.

9 min
Regulation

Sub-processors in AI: what your DPA needs in 2026

AI products invoke 4–7 sub-processors per request. What your DPA needs to say about LLM providers, observability vendors, and zero-retention APIs in 2026.

9 min
Regulation

GDPR for document AI: a practical guide for operators

Article 28 obligations, lawful basis, sub-processor governance, data subject rights, and what your DPA actually needs to say.

9 min
Engineering

Tracing agentic document extraction

How to make multi-step LLM workflows debuggable. OpenTelemetry span design, sampling strategies, and the structured logs that turn a black box into a flight recorder.

8 min
Engineering

Audit trails for non-deterministic outputs

How to log AI extractions in a way that holds up to reproducibility, regulatory audit, and customer "why did you extract this?" questions — with the actual schema we use at fluex.

8 min
Product

How we detect tampering in digital payslips

An analysis of the most common document-fraud signals and how fluex catches them without false positives.

6 min
Case

From 48h to 2.4s: the Meridian Credit case

How a fast-growing fintech replaced a team of 12 operators with an API.

5 min
Case

Prior auth turnaround cut from 5 days to 4 hours

How Northwind Health automated PA triage with HIPAA-grade controls.

6 min
Case

Contract intake at 100x volume — same legal team

How Cascade Legal scaled M&A diligence with fluex's extraction API.

5 min
Engineering

OCR is a commodity. Context is the difference.

Why obsessing over OCR accuracy is the wrong metric in 2026.

7 min
Regulation

What modern data-protection laws say about document retention in AI

A practical guide for operators processing PII through language models.

10 min
Product

Ask the PDF: inside our new conversational agent

What changed in the architecture to answer questions with page-level citations in < 800ms.

9 min
Engineering

Multi-tenancy in .NET: how we isolate data by subscription

Tenant-id patterns, EF Core interceptors and automatic auditing we run in production.

8 min