Document AI Firm Retab Secures $3.5M Pre-Seed

Retab, a San Francisco-based startup, has secured $3.5 million in pre-seed funding and launched a platform focusing on document AI.

The round was backed by leading early-stage funds including VentureFriends, Kima Ventures, and K5 Global, alongside Eric Schmidt (via StemAI), Olivier Pomel (CEO, Datadog), and Florian Douetteau (CEO, Dataiku).

Developers define the schema of the data they need; Retab handles the rest – from dataset labeling and evaluations, to automated prompt engineering & model selection.

“People keep building demos that look like magic, but break the moment you put them into production,” said Louis de Benoist, co-founder and CEO of Retab. “We lived that pain ourselves. Wiring up fragile pipelines just to extract a few fields from a PDF. We built Retab because it’s the developer-first platform we always wished we had.”

Louis and his co-founders cut their teeth building internal automation tools for document-heavy workflows in logistics. Over time, they realized their true value wasn’t in the output, but the orchestration layer they’d built to make the models work. That tooling became the foundation of Retab – now used by dozens of companies to extract structured data from messy, real-world inputs.

Retab is not another large language model. It’s the essential intelligence layer that makes the world’s most powerful models—from providers like OpenAI, Google, and Anthropic—usable for critical workflows. Developers define the data they need, and Retab’s platform manages the entire lifecycle to ensure verifiable accuracy.

The platform delivers guaranteed performance through a system of intelligent checks and balances:

Self-optimizing schemas: An AI agent automatically tests and refines instructions based on a user’s documents, maximizing accuracy before the system ever goes live.

Intelligent model routing: The platform is model-agnostic. It automatically benchmarks and routes each task to the best-performing model for the job, whether the priority is cost, speed, or accuracy. This can make processes up to 100x cheaper than other solutions.

Guided reasoning & k-LLM consensus: Retab forces models to “think” step-by-step and uses a consensus mechanism among multiple models to quantify uncertainty, acting as a powerful safety net to ensure trustworthy results.

“Retab is the OS for reliably extracting structured data,” said de Benoist. “It wraps the best models in a layer of logic that makes them usable with error handling and structured outputs. That’s what devs need if they want to build production apps, not just prototypes.”

According to Douetteau, “the AI-fication of the economy depends on the capability to convert operations based on millions of documents into verified, structured data that autonomous systems can utilize. On a large scale, this process hinges on quality control, cost efficiency, and rapid implementation. The team at Retab understands this thoroughly and is uniquely positioned to solve it for the thousands of AI-first companies that are emerging.”

Looking ahead, Retab is expanding its platform to apply the same reliable extraction methods to websites and is launching integrations with automation platforms like n8n, Zapier, and Dify.



Sponsored Links by DQ Promote

 

 

 
Send this to a friend