Ingest
CSV, XLSX, JSON, Parquet. Streamed or batched. SHA-256 hashed on landing.
Five products on one engine — clean it, dedupe it, connect it, query it. Pick the ones you need; ship in an afternoon. Every output ships with an audit chain regulated industries can defend.
DAVA brings foundation-model intelligence to spreadsheets and databases — the lifeblood of science and business — and wraps every output in an audit chain regulated industries can defend.
Foundation models have transformed text and images; tabular data is the next surface, and the gating constraint isn't capability — it's trust.
That's what we're building.
Every record passes through the same deterministic pipeline — ingest, profile, normalize, resolve, decide, evidence. Foundation-model intelligence where it earns its keep. Hash-chained lineage as a free byproduct.
CSV, XLSX, JSON, Parquet. Streamed or batched. SHA-256 hashed on landing.
Type inference, sensitivity tagging, outlier detection — the Smart Tables layer.
Coerce types, fix encodings, fold case, parse dates. Sandboxed adapters.
Cluster duplicates, surface FK candidates, suggest semantic links across files.
Assistive or Decisioning mode. Reviewer queue when human-in-the-loop.
Signed ZIP pack: input hash, output, lineage, decisions, retention proof.
Pick the products you need today. Add the rest later — they share auth, audit, and policies, so the second one is always cheaper to deploy than the first. No bundling pressure.
Drop a messy file, get a clean structured table. Type-inferred, encoding-fixed, sensitivity-tagged.
Cluster duplicate records across messy datasets. Fuzzy keys, canonical row selection, audit trail per cluster.
Read moreDiscover relationships across datasets — FK candidates, value overlap, semantic links. The ring graph that finds the joins you forgot.
Read moreTalk to your data — chat with the dataset, or expose tools to your AI agents over MCP. Decisions logged like everything else.
Lineage, audit chain, retention TTLs, evidence packs, EU AI Act mode. The compliance layer your CDO is going to ask about.
Same auth. Same audit chain. Same retention rules. Wire one product up — the next four cost a fraction of the integration time.
You're here for the AI products — clean, dedupe, connect, query. The audit chain is the part you didn't have to ask for: every transformation, decision, policy hit, and download writes a tamper-evident block to your org's chain. Replay any job, hand the evidence pack to your auditor — without ever leaving the platform.
Each event includes the previous event's hash. Break one and the chain visibly fractures.
Open Policy Agent under the hood. Your PDP rules, your decisions, your records.
Signed ZIP — input hash, output, lineage graph, decisions, retention proof. Auditor-ready.
svc.evidence-builder for job norm.7f2apdp/redact-pii · 412 columnsv3.2 · 2.41M rows · 3.2sana@acme.com · file customers_q4.csvana@acme.com · scope norm:writeNot a sales demo. The same dashboard your data engineers use every day — submitting jobs, reviewing clusters, minting keys, downloading evidence packs. Bring your own SSO, ship in an afternoon.
No anonymous self-serve, no surprise overages. Every plan is contracted with AVA Research, with a one-week pilot included for new accounts.
Pick one product. Cloud-hosted in EU. Standard SLAs and support.
All five products on the same engine. Volume scales with your data, not your line items.
Docker bundle, JWT-licensed. Yours to deploy in your perimeter.
We'll clean it, audit it, and give you back a file you can ship — plus an evidence pack you can hand to your auditor. Most pilots run in under a week.