AI.DI by imkore  ·  Document Intelligence Platform  ·  Built ground-up 2024–2025
Document Intelligence.
Any document. Any industry.
At any scale.
AI.DI is not a better file cabinet. It is the first platform built from the ground up to make every enterprise document permanently intelligent, certified, and ready to transact — with six integrated engines that compound in value the more you use them. Box stores your documents. SharePoint organizes them. AI.DI understands them.
6
Integrated Engines
~10K
Doc Fingerprints
ANY
Doc Types
30
ML Engines
0
Legacy Code Lines
Any
Industry · Any Scale
Sentry Document Assurance Abstract.DI Extraction AI.DI Document Warehouse AI Orchestration & Agent Gateway Document Gateway Millennia FileStar Continuous Transaction Readiness
Overview · Tab 01
The Document Intelligence Category Just Changed.
For thirty years, document management meant storage, organization, and search. AI.DI redefines the category entirely: Document Intelligence means every document is certified, extracted, scored, distributed, and queryable by any AI system — automatically, continuously, at scale. No incumbent has built this. We have.
Full Platform Architecture — Reproduced from AI.DI Statement of Outcomes
DOCUMENT SOURCES Box / Egnyte SharePoint Yardi / MRI SAP / Workday Email / Outlook DocuSign Salesforce Windows Share Any REST API ADVISORY SERVICE imkore Blueprint — Document Intelligence Audit + Readiness Roadmap · $50–150K · 60–90 days TRUST & GOVERNANCE LAYER TRUST ENGINE Sentry Document Assurance Fingerprint · Deduplicate · Certify · Search Zero doc storage · Fingerprints flow to Warehouse GOVERNANCE ENGINE Millennia FileStar Document Fabric · Workflow · Compliance Governs docs · Syncs with Warehouse OPERATIONAL ENGINES — RUN ON THE WAREHOUSE EXTRACTION ENGINE Abstract.DI Any doc · Any industry · 100K batch Extracts intelligence → Stores in Warehouse EXCHANGE ENGINE Document Gateway Ingest · Validate · Distribute · Track Certifies docs · Routes to Warehouse AI AGENT ENGINE AI Orchestration MCP · Agent Gateway · Q&A · RAG Queries Warehouse · Returns certified answers DATA & ANALYTICS Snowflake Databricks Power BI / BigQuery LAYER 1 — FOUNDATION INFRASTRUCTURE AI.DI Document Warehouse Documents · Extracted Data · Metadata · Fingerprints · Audit Trail · CTR Score The hub all engines connect through · Queryable by any AI or analytics system INTEGRATIONS REST API · JDBC MCP · SDK Webhooks · Snowflake AI & ANALYTICS CONSUMERS Snowflake / BigQuery Copilot / ChatGPT Claude / Gemini Power BI / Tableau Custom AI Agents MCP Clients / SDK DEPLOYMENT: Azure Cloud AWS On-Premise Hybrid Single-Tenant Multi-Tenant Any File Type · Any Industry · Any Org Size Every engine has standalone value · Modular adoption · No rip-and-replace required
The Intelligence Flywheel — Why It Compounds
Compounding Platform Value

Documents flow into Document Gateway. Abstract.DI extracts intelligence from every one. Sentry fingerprints and certifies them. The Document Warehouse stores all of it as structured, queryable data. The Warehouse improves Abstract.DI model accuracy. Better accuracy improves Sentry signals. Better signals make Document Gateway more valuable. More value drives more documents. After 18 months, switching costs are effectively permanent — and accuracy measurably exceeds any out-of-the-box alternative.

Platform Layer Stack — Complete Feature Inventory
DG
Document Gateway — Exchange & Distribution Engine
Check-In Studio · Distribution Studio · Transaction Rooms · ML Learning Studio · 200+ components · 29 edge functions
Core OSAny industry
The central operating system for every document. Replaces Box, SharePoint, Egnyte as the primary system of record while connecting to any of them as migration sources. React/TypeScript/Vite + Supabase + Deno Edge Functions + Cloudflare R2 storage.
Check-In Studio
AI-powered document intake with AbstractIQ auto-classification, batch template mode, session history, rejection/resubmission pipeline, and external submitter portal.
Distribution Studio
Unified hub replacing 6 legacy distribution workflows. Standing distributions, serialized delivery, access tracking, client branding engine, full audit trail.
ML Learning Studio
30 self-improving AI engines across 6 capability tiers. Org-specific model weights. 4 map views. Continuous accuracy improvement.
AB
Abstract.DI — AI Extraction Engine
Any doc type · 94% day-one confidence · 100K batch chunks · GPU OCR · Anomaly detection · Custom schema builder
AI-nativeCore moat
Reads every document and converts it into structured, queryable intelligence. Multi-pass pipeline: OCR → classification → extraction → confidence scoring → anomaly detection → warehouse write. Different models optimized per document type.
Batch Engine
Process entire archives in 100K-document chunks. ZIP, Box, SharePoint, S3. Output to Excel, JSON, CSV, or Warehouse.
Any Document Type
Custom schema builder for proprietary types — hours not months. Pre-built schemas for legal, financial, compliance, real estate.
GPU OCR
DocTR engine. CPU and GPU — 10x–50x speedup on GPU. Selective OCR for maximum cost efficiency.
SE
Sentry Document Assurance — Trust & Compliance Monitor
~10,000 fingerprints · Zero doc storage · Patent pending · GDPR/HIPAA/SEC by architecture · 10-100x faster search v2
TrustCompliance
Deterministic mathematical fingerprinting. Zero document storage — only immutable fingerprints. Three types: Document Content, Document Data, Trusted Data Fingerprints (unique in market — fingerprint individual database rows, find every document referencing that entity).
Duplicate Elimination
40%+ industry average duplicate rate = 40% wasted AI spend. 30-50% LLM cost reduction immediately.
Cross-System Search
Search SharePoint, OneDrive, Windows Share, email archives, ERP, FileStar simultaneously. No tags. No training.
PII Redaction
Auto-detects and redacts SSNs, financial IDs, tax IDs before fingerprint storage. GDPR data minimization by mathematics.
DW
AI.DI Document Warehouse — Structured Intelligence Layer
PostgreSQL · SQL/GraphQL/REST · Snowflake · Databricks · MCP · Vector embeddings · 6 query views
Data moatBI connectors
The biggest differentiator in the market — and it doesn't exist anywhere else. Every document Abstract.DI processes becomes structured rows in PostgreSQL. Every extracted field is queryable data. Every AI signal is persisted as a structured record.
6 Query Views
List · Library · Cube (pivot) · Time series · Schema · Scientist mode. Every dimension instantly explorable.
BI Connectors
Snowflake Data Share, Databricks, Tableau, Power BI, dbt, BigQuery, Python SDK. Zero ETL overhead.
Event Streaming
Webhook Manager fires events on every platform action. Real-time pipeline triggers for any downstream system.
OA
AI Orchestration & Agent Gateway
LLM-agnostic · Live MCP server · RAG foundation · OAuth2/OIDC · Zero hallucination
AI agentsRAG substrate
The AI layer that makes every enterprise LLM investment actually work. Not competing with LLMs — the prerequisite. Works with Copilot, GPT-4, Claude, Gemini, Llama, or any custom LLM. Live MCP server callable from Claude, Cursor, LangChain, AutoGen.
Live MCP Server
Production-ready. Callable from any MCP-compatible environment. No custom integration layer required.
LLM-Agnostic
Client chooses the AI model; AI.DI provides the trusted foundation. No vendor lock-in to any LLM.
RAG Foundation
Every chunk provenance-tracked, every answer traceable to a specific certified document version. Zero hallucination.
FS
Millennia FileStar — Document Governance & Fabric
45 enterprise clients · 7,500 users · ~$1M ARR · 20+ year trust base · Zero-CAC upsell on-ramp
Installed baseOn-ramp
The governance engine and most defensible competitive moat. 45 enterprise clients with 7,500 users cannot be cold-called by any competitor. FileStar governs document lifecycle and syncs all metadata to the AI.DI Warehouse.
Document Fabric
Structured lifecycle governance for any document type. Configurable routing, approval chains, escalation paths.
AI.DI Integration
FileStar governs; Warehouse stores; Sentry certifies; Abstract.DI extracts — automatically on every FileStar doc.
Installed Base
8–10 Phase 1 upgrade targets. 20+ year institutional trust that no competitor can replicate regardless of funding.
Platform Combinations — Entry to Full Suite
DG alone
Day-one value. Replaces Box.
Document storage, hierarchy, roles, and lifecycle management for any organization. 30-day deployment. No implementation project.
"We're better than Box at organizing documents for your specific org structure. Same price. Setup in a week."
DG+Abstract.DI
The intelligence upgrade.
Every document uploaded turns into structured, searchable data automatically. No manual tagging. 94% accuracy on day one.
"Every document you upload now tells us what it says, who it's about, and when it expires — without you doing anything."
DG+Sentry
The compliance overlay.
Sentry wraps any existing repository — Box, SharePoint, Egnyte — and adds continuous compliance monitoring without replacing storage.
"Keep Box. We add the trust layer that tells you what's missing, what's expired, and what's been altered."
All six engines
The platform. The moat.
All six engines create a compounding intelligence flywheel. The platform becomes permanently irreplaceable.
"Your documents are now a structured, AI-certified, queryable intelligence asset. This is a different category."
"M-Files has been building their AI product since 2019. Box added AI features in 2022. SharePoint Copilot launched in 2023. None of them started from scratch. None of them can. We did. That is the only advantage that cannot be copied."
— Platform architecture principle
Overview · Tab 02
AI Intelligence — Built Ground-Up. Not Bolted On.
Every incumbent document platform bolted AI on top of a 20-year-old data model. AI.DI was architected in 2024 with AI as the primary actor — not the afterthought. 27 engines running in production. 30 self-improving ML models. A live MCP server connectable to any LLM today. This is not a roadmap.
The Competitive Barrier — Stated Plainly

Competitors have slides about AI. AI.DI has 200+ React/TypeScript components, 29 live serverless edge functions, an ML Learning Studio with 30 self-improving engines, a live MCP server, an AI Agent Gateway connecting to Claude/Copilot/GPT-4/Gemini, and a production AI.DI Studio running 27 active AI engines. The gap between what competitors promise and what we have already shipped is measured in years of engineering. This is the unfair advantage that cannot be purchased with a VC round.

The AI.DI Studio — 27 Active Engines Running in Production
documentgateway.ai
AI.DI Studio — Real-Time Intelligence Infrastructure
Click to enlarge
AI
AI.DI Studio — Real-Time Intelligence Infrastructure
AI Intelligence · 27 Active Engines · 5 Capability Domains
27 AI engines running simultaneously across 5 capability domains — not planned, not in beta, not slides — running in production right now against real documents. Each column is a domain: AI Core handles classification, extraction, confidence scoring, and the HITL Reduction meta-engine that makes the entire platform self-improving. Intelligence manages deep document comprehension, cross-document validation, expiry detection, and the document type registry that routes every document to the right schema. Process automates the operational pipeline — OCR, obligation extraction, approval routing, distribution rule execution, workflow management, and industry-specific feature configuration. Trust & Security enforces immutability through blockchain anchoring, continuous tamper detection, and database-level access control. Data/Integration maintains the warehouse sync, API gateway, registry, and storage optimization engines that keep the entire data pipeline running at scale. Every node shows live metrics: documents processed, classifications made, conflicts detected, connections active. This is the intelligence infrastructure. No competitor has built it. No competitor can.
HITL Reduction — The Self-Improving Loop
Human-In-The-Loop Reduction Architecture

The HITL Reduction AI engine monitors all other engines' human review rates and autonomously moves classifications to auto-approve when confidence consistently exceeds configurable thresholds. Standard document types trend toward zero human intervention at 12 months. Novel or edge-case documents always retain human oversight — the goal is the right humans reviewing the right exceptions, not zero humans.

Legacy Platform HITL at 12 Months
Standard contracts65%
Insurance certificates55%
Financial statements70%
Fixed model weights. No production learning. Same cost and error rate at month 12 as month 1.
AI.DI HITL at 12 Months
Standard contracts8%
Insurance certificates5%
Financial statements12%
Continuous production learning. Every human correction retrains the model automatically. No ML engineers required.
Blockchain Engine — Immutable Audit Trail
documentgateway.ai
AI.DI Studio — Blockchain Engine · On-Chain Document Integrity
Click to enlarge
AISENTRY
AI.DI Studio — Blockchain Engine · On-Chain Document Integrity
AI Intelligence · Trust Engine · Ethereum / Hedera / Polygon
2,814 documents have been anchored on-chain through this engine — each one generating a Merkle tree hash committed to Ethereum, Hedera, or Polygon as an immutable proof of existence and content at a specific point in time. The HITL Reduction panel shows 100% automated — meaning the Blockchain Engine requires zero human review at this tenant after 12 months, because blockchain anchoring is deterministic: if the fingerprint matches, it anchors; no judgment required. The Pages Powered By This Engine panel shows exactly where this engine's certifications surface in the UI: Document Vault, Asset Vault, and Verification Portal. This is not a compliance checkbox — it is the infrastructure that allows a document to be presented to a regulator, counterparty, or auditor with cryptographic proof that its content has not changed since a specific timestamp. No document management platform on the market ships this out of the box.
Integration Studio — Connect Any AI Agent
documentgateway.ai
Integration Studio — Live AI Agent Gateway
Click to enlarge
AIORCHESTRATION
Integration Studio — Live AI Agent Gateway
AI Orchestration · MCP Server + 3 Connected AI Systems
This screen represents the moment enterprise AI deployment becomes real. Three AI systems are live and connected: Claude.ai, ChatGPT/OpenAI, FileStar. Each has read-only, row-level-security-enforced access to the entire certified document corpus through 6 production tools. The MCP Server URL is a live endpoint — paste it into Claude, Cursor, LangChain, or any MCP-compatible environment and the AI immediately gains the ability to search certified documents, check compliance status, retrieve obligations, query the warehouse, navigate the hierarchy, and retrieve signed document URLs. Keys are tenant-scoped, revocable instantly, and enforce the same RLS policies as the UI. The AI Agent Gateway panel on the left shows Microsoft Copilot, Gemini/Google, and Grok/xAI as available-to-add connections — meaning your entire AI vendor portfolio can query the same trusted document foundation. This is the infrastructure that makes every LLM investment in your organization actually work.
Integration Ecosystem — 28 Connectors
documentgateway.ai
Integration Studio — 28 Connectors Across Every Enterprise System
Click to enlarge
AIORCHESTRATION
Integration Studio — 28 Connectors Across Every Enterprise System
AI Orchestration · Full Connector Ecosystem
The integration architecture eliminates the "but we already use X" objection entirely. 28 connectors span every category an enterprise might already have in production. AI Agent Platforms: Claude.ai and ChatGPT are live-connected today; Microsoft Copilot, Grok/xAI, and Gemini are configured with setup wizards. CRE & PE Platforms: Yardi Voyager pulls leases and rent rolls directly, MRI Software delivers financial reports and lease abstracts, Argus Enterprise connects valuation models, Salesforce manages CRM documents and contracts, Juniper Square handles fund admin and investor documents, Dealpath delivers deal pipeline docs. Document Management: SharePoint, Google Drive, Box, OneDrive, Dropbox all connect as document sources. Dev & Monitoring: Sentry, Datadog, GitHub. Data & Warehouses: Snowflake, Databricks, BigQuery, Redshift, Tableau, Power BI. The AI Wizard button walks through each connection in guided steps — no IT project, no professional services, no custom integration work.
Document IQ — Conversational AI Over Your Certified Corpus
documentgateway.ai
Document IQ — AI-Powered Document Intelligence Assistant
Click to enlarge
AIORCHESTRATION
Document IQ — AI-Powered Document Intelligence Assistant
AI Orchestration · Conversational AI · Portfolio-Wide Access
Document IQ is what happens when you give an AI system access to a certified, structured document corpus instead of raw PDFs. With access to 18 assets across a portfolio, it can answer questions that would take a human analyst days: "What's missing from the vault?" surfaces every gap across every asset simultaneously. "Show critical risk items" aggregates all violation flags and expiry warnings into a single prioritized view. "Expiring in the next 30 days" is a precise query against structured expiry dates in the Warehouse — not a keyword search, not an approximation. Upload any file and Document IQ cross-references it against vault data in real time: upload a rent roll and it identifies which tenants aren't in the vault, which leases are missing, which figures don't match the abstracted data. This is not a chatbot bolted onto a document management system — it is an AI with structured, trusted data access that no general-purpose LLM can replicate without the Warehouse underneath it.
ML Learning Studio — 30 Engines, 6 Tiers
The Self-Improvement Architecture

Every legacy DMS has fixed classification models requiring expensive, time-consuming retraining. AI.DI's ML Learning Studio inverts this entirely — 30 engines improving continuously from production data, automatically, without engineering intervention. AI.DI gets cheaper and more accurate at scale. Every competitor's cost stays flat or increases.

TierFocusExample EnginesHITL Trajectory
Tier 1 — FoundationDocument type classificationCRE Type Classifier, PE Type Classifier, Legal Type ClassifierNear-zero for covered types
Tier 2 — EntityNamed entity extractionParty Extractor, Property Identifier, Fund/Entity Linker5–15% at 6 months
Tier 3 — Date & ValidityTemporal signal extractionExpiration Detector, Effective Date Parser, Renewal ClassifierNear-zero for standard formats
Tier 4 — FinancialFinancial data extractionLoan Terms Extractor, Rent Roll Parser, Appraisal Value Extractor10–20% at 6 months
Tier 5 — ComplianceCompliance validationCoverage Gap Detector, Compliance Flag Engine, Signature Validator15–25% — domain expertise retained
Tier 6 — Cross-DocumentCross-document consistencyPortfolio Benchmark Engine, Anomaly Correlator, Reconciliation EngineComplex analysis — strategic HITL
"We didn't build a document platform and add AI. We built an AI platform that happens to manage documents. The difference is not semantic. It is architectural. And architecture determines destiny."
— AI.DI platform design principle
Overview · Tab 03
The Honest Battle Table — Where We Win and Why It's Structural
We do not win on every dimension today. What matters is the architecture. An incumbent can add a feature. No incumbent can add a clean data model, a zero-legacy stack, or an AI engine designed in from the first line of code.
Why Legacy Platforms Cannot Catch Up

Box cannot rebuild their data model for AI without breaking 150,000 customers. SharePoint's incentive is to preserve Teams and Office revenue, not cannibalize Copilot. M-Files is 2–3 years behind on the data model and the Warehouse layer. Egnyte wins on storage reliability but has no awareness of what documents contain. Every dollar these platforms invest in AI is constrained by the need to not break existing products. That constraint does not exist for AI.DI.

Full Capability Matrix
Capability AI.DI Platform BoxSharePointM-FilesEgnyte
Architecture & Philosophy
AI-native architecture (built for AI, not adapted)Win 2024-2025. Zero compromise. AI is core, not a wrapper.Bolt-onCopilot wrapperAino — improving but bolt-onMinimal
Zero legacy technical debtWin No codebase older than 18 months.2005 origin2001 origin2003 origin2009 origin
Edge compute architectureWin All compute at edge. Scale to zero or infinity.NoneAzure Functions (partial)NoneNone
Modular adoption (standalone or full suite)Win Every engine has standalone value.PartialModule-based but complexPartialPartial
AI & Document Intelligence
Structured data extraction from documentsWin Abstract.DI — any type, 94% day-one, 100K batch.NoneBasic Copilot extractionAino — requires trainingNone
Day-one extraction accuracy (no training)Win 94%+ on pre-built schemas. No training required.N/AN/AMonths of trainingN/A
GPU-accelerated OCR pipelineWin DocTR — 10-50x speedup on GPU.NoneAzure OCR (limited)Basic OCRBasic OCR
Batch processing (100K+ archives)Win 100K-chunk batch. ZIP, Box, SharePoint, S3.NoneNoneLimited batchNone
30 self-improving ML enginesWin Continuous production learning. No ML engineers.NoneGeneric CopilotLimited self-learningNone
HITL Reduction AI (autonomous meta-engine)Win Autonomous promotion of high-confidence classifications.NoneNoneNoneNone
Trust, Compliance & Security
Document fingerprinting (deterministic, patent-pending)Win ~10,000 fingerprint catalog. Zero doc storage.NoneNoneNoneNone
Zero document storage compliance modelWin Only fingerprints stored. GDPR minimization by math.Full storageFull storageFull storageFull storage
PII auto-detection and redaction pipelineWin Tokenization pipeline auto-redacts at ingestion.NonePurview (partial)NoneDLP (partial)
Fraud / document manipulation detectionWin Deterministic — single character change detectable.NoneNoneNoneNone
Blockchain audit trailWin On-chain anchoring. 2,814+ documents on chain.NoneNoneNoneNone
Data & AI Infrastructure
Structured document intelligence warehouseWin Every extracted field is a queryable row. Unique.NoneNoneNoneNone
Snowflake Data Share (zero-ETL)Win Zero-copy. Join doc intelligence with financial data.NoneNoneNoneNone
Live MCP server for AI agentsWin Production MCP. Claude, Cursor, LangChain — no wrapper.NoneNoneNoneNone
Vector embeddings on certified chunksWin Tied to certified versions. pg_vector native.NoneAzure AI Search (partial)NoneNone
CTR Score (Continuous Transaction Readiness)Win Live composite readiness score. Portfolio-wide.NoneNoneNoneNone
27 active AI engines in productionWin AI.DI Studio — live engine map with real-time status.NoneNoneNoneNone
Deployment & Integration
Unlimited hierarchy depth (any org structure)Win Enterprise → Group → Entity → Asset → Unit. Any depth.Folders onlySites/subsitesMetadata-basedFolders/workspaces
30-day deployment (no implementation project)Win 30 days from contract to live. M-Files runs 3–6 months.Weeks–monthsMonths–years3–6 months typicalWeeks–months
Installed base / existing trust relationshipsWin 45 FileStar enterprise clients. 20+ year relationships. Zero-CAC.Large (hard to access)Large (bundled)Existing clientsExisting clients
One-Line Positioning Per Competitor
vs. Box
"Box stores it. We understand it."
Never compete with Box on storage. Lead with the batch engine demo — upload a folder of contracts, produce a structured Excel workbook in 4 hours. Box produces a list of file names. Do not ask them to cancel Box. Ask what Box actually tells them about their documents.
vs. SharePoint
"Keep SharePoint. Add intelligence."
Never demand they cancel SharePoint. "SharePoint manages collaboration. AI.DI manages the intelligence layer — extraction, assurance, readiness scoring — running on top of whatever you already have. You do not have to change anything to start."
vs. M-Files
"Day-one intelligence, not month-six."
M-Files Aino requires months of training. AbstractIQ delivers 90%+ confidence on day one for any document type — because schemas are pre-built and AI classification runs from the first upload. "You bring the documents. We bring the intelligence."
vs. Egnyte
"You know where your files are. We know what they say."
Egnyte wins on hybrid storage reliability and IT governance. It has no awareness of what documents contain. AbstractIQ connects to an Egnyte repository as an intelligence overlay. "Add intelligence to Egnyte" — the displacement happens organically once the intelligence layer is live.
"We are not trying to be a better Box. We are trying to make Box irrelevant — the same way Salesforce made Act! irrelevant. Not by being louder. By being categorically different."
— imkore category strategy · 2025
Products · Tab 05
Document Gateway — The Operating System for Every Document
The central hub where documents live, roles are configured, compliance is monitored, and every other AI.DI engine plugs in. 200+ React/TypeScript components. 29 live serverless edge functions. Replaces Box, SharePoint, Egnyte as the primary system of record.
200+
React/TS Components
29
Live Edge Functions
30 days
Avg. Deployment
5-tier
Org Hierarchy
30
ML Engines
4
Role Types
Check-In Studio — AI-Powered Document Ingestion
documentgateway.ai
Check-In Studio — Intelligent Document Intake
Click to enlarge
GATEWAYAI
Check-In Studio — Intelligent Document Intake
Document Gateway · The AI Intake Engine
Every file dropped here enters a multi-stage AI pipeline that runs entirely without human instruction. AbstractIQ classifies the document by type, extracts key fields, scores confidence, checks for duplicates, detects anomalies, and routes to the correct steward queue — all before a human sees it. Required documents are surfaced as named cards organized by packet template, so a steward's view is not a list of files but a structured set of obligations: what's needed, what's fulfilled, what's outstanding, and what was rejected with AI-identified reasons. The HITL Reduction AI continuously monitors which document types consistently reach auto-certify confidence and promotes them to bypass human review entirely. As your document corpus grows, the percentage of documents requiring human attention trends toward zero for standard types. This is not document management — it is an autonomous compliance engine that happens to accept file uploads.
Check-In Engine — AI Thresholds & Real-Time Performance
documentgateway.ai
Check-In Engine Settings — Configurable AI Per Tenant
Click to enlarge
GATEWAYAI
Check-In Engine Settings — Configurable AI Per Tenant
Document Gateway · Per-Tenant ML Configuration
Every AI parameter visible here is independently configurable per tenant — the same platform serves a 500-asset institutional portfolio and a 10-asset regional operator with different accuracy thresholds, different OCR pipelines, and different ML feedback settings. The auto-certify threshold (85%) means any document the AI classifies with 85%+ confidence is automatically approved without human review. The review threshold (65%) sends mid-confidence results to the steward queue. Below 65% triggers rejection with AI-generated explanation. The AI Model Performance panel shows live production metrics: auto-classify rate 78%, average confidence 91%, 47 human corrections logged in 30 days, model last retrained March 12 with a 6,421-document corpus. Those 47 corrections become training data — each human decision permanently improves future accuracy for that document type. No ML engineers. No retraining pipeline. The platform learns from its own production use continuously.
Check-In API & Webhook Integration
documentgateway.ai
Check-In API — Full Programmatic Document Ingest
Click to enlarge
GATEWAYDEVELOPERS
Check-In API — Full Programmatic Document Ingest
Document Gateway · Developer Interface
The same AI pipeline that powers the visual Check-In Studio is fully accessible through a REST API — meaning any internal system, any existing workflow, any document management tool can push files directly into the AI.DI pipeline without a user interface. POST to /v1/checkin/ingest for a single file; POST to /v1/checkin/bulk for ZIP bundles of up to 10,000 documents in one call. The response returns a job ID for status polling — the full AbstractIQ pipeline runs asynchronously and fires webhook events at every stage: classified, extracted, named, certified, review_required, rejected. Each event carries the full payload — document type, confidence score, extracted fields, routing decision, DG name, anomaly flags. This means your existing systems get real-time notification the moment a document reaches any status, enabling downstream automation without polling. The platform is not just a UI — it is a document intelligence API that happens to have an excellent UI.
Distribution Studio — Studio View
documentgateway.ai
Distribution Studio — The Unified Distribution Hub
Click to enlarge
GATEWAY
Distribution Studio — The Unified Distribution Hub
Document Gateway · Transaction Rooms · Packages · Share Links
This screen replaced six separate legacy distribution workflows — Transaction Rooms, Document Packages, Standing Distributions, Share Links, Distribution Rules, and Distribution Templates — all now visible and manageable from a single hub. Each Transaction Room is a secure, deal-type-aware environment that tracks exactly who has accessed which documents, for how long, from which organization — with CTR Score progress, expiry countdown, and phase completion all visible at a glance. Document Packages are curated bundles sent to specific recipients with custom cover letters, watermarks, NDA gates, and per-recipient download permissions. Every distribution event is immutable — timestamped, version-locked, recipient-specific, and fully auditable. The engagement data flowing from these rooms tells you more about your counterparty's interest level than any conversation: which documents they spent the most time on, which sections they returned to repeatedly, and which they never opened.
Distribution Studio — Builder & Templates
documentgateway.ai
Distribution Builder — Three Wizard Modes
Click to enlarge
GATEWAY
Distribution Builder — Three Wizard Modes
Document Gateway · Distribution Wizard
The Builder asks one question: what kind of distribution is this? The answer determines everything — deal type, counterparty structure, access controls, expiry configuration, NDA gate options, and QA thread settings — all pre-configured by the wizard based on the selection. Transaction Room launches a 7-step guided workflow covering deal type (CRE Refinancing, M&A/PE, LP Reporting, Asset Sale, JV Formation, General), counterparty org hierarchy, phase-based document structure, section-level access matrix, Abstract.DI engagement signal integration, QA thread with room-level threading, and CTR Score gap alert configuration. No guesswork. No "what settings should I use?" — the platform knows.
documentgateway.ai
Distribution Analytics — Counterparty Intelligence
Click to enlarge
GATEWAYVALUE
Distribution Analytics — Counterparty Intelligence
Document Gateway · Deal Analytics
The Analytics tab transforms distribution data into deal intelligence. Room engagement by deal type shows views per room, CTR percentage achieved, and QA threads open — giving deal teams an objective read on counterparty engagement that no email thread can provide. Phase completion rates from NDA & Setup through Closing show exactly where deals stall portfolio-wide. The document access heatmap reveals which document types drive the most views per counterparty per room — signaling where interest is concentrated before a conversation happens. When a counterparty spends 3 hours on the indemnification clause and then asks for a 15-day extension, you already know why. This is not analytics — it is a behavioral signal layer over every transaction.
Process Library — Pre-Built Transaction Workflows
documentgateway.ai
Process Library — 11 Pre-Built Transaction Workflows
Click to enlarge
GATEWAYADMIN
Process Library — 11 Pre-Built Transaction Workflows
Document Gateway · Workflow Automation
Every complex document transaction — refinancing, acquisition, compliance audit, tenant coordination, property sale — follows a predictable sequence of document requirements organized in phases. The Process Library codifies that institutional knowledge into launchable templates. The Refinance/Lender Request template shown here has 7 phases (Legal & Ownership → Property & Physical → Leasing & Tenants → Financial Performance → Valuation → Existing Debt → Package & Delivery), 22 required documents across 24 total, and 45–60 day estimated duration. Launching a process from this library creates an active tracking instance with phase-by-phase document completeness, automatic CTR Score update as documents are fulfilled, and distribution-ready packaging when all phases are complete. The process library is the institutional knowledge layer that turns an ad-hoc scramble into a repeatable, measurable, improvable workflow.
Document Type Studio & Hierarchy Studio
documentgateway.ai
Document Type Studio — Complete Document Vocabulary
Click to enlarge
GATEWAYADMIN
Document Type Studio — Complete Document Vocabulary
Document Gateway · Document Taxonomy
The document type taxonomy is the foundation everything else is built on — classification, extraction schemas, routing rules, compliance requirements, and CTR Score calculations all reference this vocabulary. The Document Type Studio shows the full hierarchy: Commercial Real Estate and Private Equity libraries with subcategories, the Diligence library alone contains 62 document types (43 essential, 19 elective). Essential types drive CTR Score calculations — missing an Essential document drops the score. Elective types are tracked but don't penalize. The AI Generate button uses your existing document corpus to suggest new types your organization actually uses that aren't in the default catalog. The platform ships pre-built taxonomies for every major industry — configurable, extensible, and learnable from your own document patterns.
documentgateway.ai
Hierarchy Studio — Any Org Structure, Any Depth
Click to enlarge
GATEWAYADMIN
Hierarchy Studio — Any Org Structure, Any Depth
Document Gateway · Organization Architecture
The hierarchy is not a folder structure — it is a typed, dimensional organizational model where every node carries permissions, document requirements, CTR Score calculations, process templates, library assignments, and AI extraction schemas. The Traditional Private Equity template shown here (Enterprise GP → Fund → LP/Investor → Portfolio Company → Subsidiary/Division) maps exactly to how a PE fund operates — but the studio can configure any structure for any industry without code or professional services. CRE operators, corporate legal departments, financial institutions, and government agencies all configure different hierarchies from the same studio. Every node created here becomes a first-class citizen in the Document Warehouse — queryable, scoreable, and connectable to any AI agent via the MCP server.
Platform Configuration
documentgateway.ai
Platform Masters — Document Status Workflow Engine
Click to enlarge
GATEWAYADMIN
Platform Masters — Document Status Workflow Engine
Document Gateway · Status Configuration
Document statuses are not labels — they are workflow triggers. Each status in this table drives a specific system behavior: Submitted auto-routes to review queue, Approved fires the Approval Engine, Expired triggers the Violation Engine, Sentry Certified records an immutable fingerprint in vault_records. The drag-to-reorder interface sets the logical default sequence, but the real power is in the terminal and certified flags — terminal statuses cannot be manually overridden, and certified statuses can only be assigned by the Sentry fingerprinting pipeline, never by a user. The AI Suggestions panel on the right uses your industry and document patterns to propose status additions — "Add AI Flagged for low-confidence classifications" appears because the system detected classifications below the review threshold that currently fall through to Needs Revision without a distinct routing path. This is the platform configuring itself.
documentgateway.ai
Roles & Permissions — 9 Roles × 138 Features
Click to enlarge
GATEWAYADMINDEVELOPERS
Roles & Permissions — 9 Roles × 138 Features
Document Gateway · Identity & Access
138 features. 9 roles. 4 tiers. This is enterprise access control with the granularity that regulated industries require. The Role Matrix shows exactly which features each role can access — filtered by Actions, Data, or Pages — with color-coded permission states (full access, limited, read-only, none). The 4-tier structure (System, Tenant, Hierarchy, Node) means a Steward at a specific hierarchy node can only see documents and actions relevant to their assigned assets — not portfolio-wide. Row Level Security enforcement happens at the database layer via Supabase RLS, not at the application layer — which means even direct API access or MCP agent connections respect the same access boundaries. No orphaned permissions. No over-provisioned service accounts. Security is structural, not configured.
Storage Management — Intelligent Lifecycle Automation
documentgateway.ai
Storage Manager — Automated Document Lifecycle Policies
Click to enlarge
GATEWAYADMIN
Storage Manager — Automated Document Lifecycle Policies
Document Gateway · Storage Intelligence
Documents cost money to store, process, and query — and most organizations keep everything in hot storage indefinitely because moving things manually never happens. Storage Manager automates the entire lifecycle through policy rules that run on configurable schedules. Auto-Warm After Inactivity moves documents not accessed in 30 days from Hot to Warm storage automatically. Archive Certified Docs moves Sentry-certified documents to Archive on an hourly schedule — certified documents are immutable by definition, so hot storage is wasteful. Hot Retention for Active keeps any document accessed in the last 7 days in Hot tier regardless of other rules. Each tier has a defined retention schedule: Hot is indefinite for active docs, Warm moves certified docs to Archive after 180 days, Archive retains compliance documents for 7 years minimum. The platform manages storage cost at scale without operational overhead.
White Label Branding
documentgateway.ai
White Label Branding — Full Enterprise Identity Control
Click to enlarge
GATEWAYADMIN
White Label Branding — Full Enterprise Identity Control
Document Gateway · Enterprise Branding
Every customer-facing surface of the platform — Document Banner, Login Page, Email Templates, Certificates of Authenticity, Shared Viewer links — carries your organization's identity, not imkore's. Upload Master Logos once (light mode and dark mode variants) and they propagate automatically to all surfaces. Each surface can also be individually overridden with a custom logo if different contexts require different branding. The Logo Across Surfaces panel on the right shows a real-time preview of exactly how your logo appears on each surface before you publish. For institutional clients sharing documents with investors, lenders, or regulatory bodies, the platform presents entirely as their own product. This is the infrastructure that allows a GP to present a Transaction Room to an LP counterparty with full institutional branding — no "Powered by imkore" anywhere in the counterparty experience.
Products · Tab 06
Abstract.DI — The Engine That Reads Your Documents
Abstract.DI does not summarize documents. It comprehends them — classifying every document type, extracting every meaningful field, scoring confidence at the field level, detecting anomalies against portfolio patterns, and writing all of it as structured, queryable data into the Document Warehouse. Any document. Any industry. 94%+ confidence out of the box. No training required.
ANY
Document Type
94%+
Day-One Confidence
100K
Batch Chunk Size
10–50x
GPU OCR Speedup
Day 1
Accuracy (not Month 6)
Abstract.DI in Action — AI-Powered Document Comprehension
documentgateway.ai
Abstract.DI — Structured Field Extraction from Any Document
Click to enlarge
ABSTRACTAI
Abstract.DI — Structured Field Extraction from Any Document
Abstract.DI · AI Extraction Engine · Any Document Type
What you are seeing here is a document — any document — being converted into structured, queryable intelligence in seconds. The left panel shows the original document exactly as it arrived. The right panel shows every field Abstract.DI extracted: parties, dates, financial terms, obligations, conditions, signatures, execution status — organized into typed field groups with individual confidence scores. Every highlighted passage in the document is a live link: click any extracted field and the document scrolls to the exact source text it was derived from. This is not an AI summary — it is a structured database record created from an unstructured document, with full provenance tracing from field value back to source text. The 94%+ confidence score is field-level, not document-level — you know exactly which fields the AI is certain about and which need review. All extracted data is immediately written to the Document Warehouse as queryable PostgreSQL rows, available to any BI tool, API consumer, or AI agent the moment extraction completes. This is the engine that turns a folder of PDFs into a structured database.
Multi-Pass Extraction Pipeline
Abstract.DI — Document to Intelligence Pipeline
Step 1
Document Ingestion
PDF, DOCX, XLSX, PPTX, MSG/EML, CSV, ZIP, JPEG/PNG/TIFF, DB records
Step 2
Selective OCR
DocTR engine · GPU 10–50x speedup · Multilingual · 8s timeout with fallback
Step 3
Classification
Any doc type · Claude Haiku inference · "AI.DI Named" badge · Confidence scoring
Step 4
Field Extraction
Type-specific schemas · Dates · Parties · Amounts · Obligations · Conditions
Step 5
Anomaly Detection
Cross-document consistency · Version comparison · Portfolio baseline deviations
AI.DI Document Warehouse — Structured PostgreSQL Rows
All extracted fields stored as structured, queryable data. Available to BI tools, APIs, and AI agents instantly.
"Box shows you a file. We show you what the file says. Run both side by side. The demo closes itself."
— Abstract.DI positioning principle
Products · Tab 06
Sentry Document Assurance — "Shazam for Documents"
Deterministic mathematical fingerprinting — patent pending. Zero document storage. Zero PII exposure. GDPR, HIPAA, SEC, and APA compliant by architecture, not configuration. ~10,000 pre-built fingerprints. Three unique fingerprint types including Trusted Data Fingerprints — unique in the market.
~10K
Fingerprint Catalog
0
Documents Stored
10-100x
Search Speed (v2)
30-50%
LLM Cost Reduction
Patent
Pending Architecture
Three Fingerprint Types
Type 01
Document Content Fingerprints
Full textual and structural content of any document. Two identical documents always produce identical fingerprints. Any change — single word, date, number, comma — produces a measurably different fingerprint. Deterministic mathematics. Zero false positives. Used for certification, version tracking, and fraud detection.
Type 02
Document Data Fingerprints
Structured data extracted from document fields fingerprinted independently of full document content. Field-level matching — find every document containing the same lease term, coverage limit, or financial figure — without requiring full-text identity. Supports cross-document data validation.
Type 03
Trusted Data Fingerprints
Unique in the market. No equivalent exists anywhere. Fingerprint individual rows from Excel files and database tables — a tenant record, vendor entry, or asset row — then find every document in the enterprise that references that specific record. Given any entity: instantly surface every connected document across the entire corpus.
30–50% AI Cost Reduction — Immediate

Industry average: 40%+ of files are duplicates. A corpus with 40% duplicates means 40% of every LLM bill computes the same content twice. Sentry identifies all duplicates, consolidates to canonical records, preserves all metadata from every duplicate instance, then suppresses duplicates from AI queues. LLM compute costs drop 30–50% immediately — without changing a single prompt or model.

"The fingerprint never lies. The document might. Sentry tells you the difference."
— Sentry Document Assurance design principle
Products · Tab 08
AI.DI Document Warehouse — The Data Moat That Doesn't Exist Anywhere Else
Every document Abstract.DI processes becomes structured rows in PostgreSQL. Every extracted field is queryable data. Every AI signal is persisted as a structured record. No competitor has built this. No competitor can build it without starting over.
Document Warehouse — Live Query Interface
documentgateway.ai
AI.DI Document Warehouse — Your Entire Document Corpus as Structured Data
Click to enlarge
WAREHOUSE
AI.DI Document Warehouse — Your Entire Document Corpus as Structured Data
Warehouse · 9,857 Documents · Live Query Interface
What looks like a document list is actually a live query interface into a structured database. Every row here is not a file reference — it is a record in PostgreSQL with typed columns for every field Abstract.DI extracted: classification type, confidence score, execution status, expiry date, party names, financial terms, compliance flags, and dozens more depending on document type. The Abstract.DI query bar at the top accepts natural language — "show all contracts expiring in Q1 with coverage below minimum" returns structured results because the underlying data is structured, not a keyword search across unstructured text. The six view modes (List, Gallery, Library, Cube, Time Series, Schema, Scientist) let you slice the same corpus as a compliance officer, a data analyst, an AI engineer, or a CFO — each seeing exactly the view that matches their workflow. This is the first document management system where the documents are a side effect of the real product: a continuously enriched, AI-maintained structured database of everything your organization has ever received, produced, or executed.
Warehouse Studio — BI Connectors
documentgateway.ai
Warehouse Studio — Zero-ETL Connections to Every Data Stack
Click to enlarge
WAREHOUSEDEVELOPERS
Warehouse Studio — Zero-ETL Connections to Every Data Stack
Warehouse · Snowflake · Databricks · 9 Connectors
The Document Warehouse is not a destination — it is a source of truth that feeds every analytics system your organization already uses. Snowflake receives 891 rows on a 15-minute incremental sync via Data Share — zero-copy, zero ETL, no pipeline to build or maintain. Databricks connects via Delta Lake for full and incremental refresh, enabling document intelligence to join with financial models, risk systems, and ML pipelines in the same compute environment. Webhooks fire on every document event — ingest, certify, extract, expire — enabling real-time triggers to any downstream HTTP endpoint. BigQuery, Redshift, Tableau, Power BI, dbt Cloud, and a Python SDK are available with one-click configuration. The data model is fully documented — every extracted field, every confidence score, every audit event — so data engineers can join document intelligence with any other enterprise dataset without discovering the schema by trial and error. The Warehouse is not just queryable. It is the most current, most complete, most structured view of your document estate that has ever existed.
Data Intelligence — Full Data Lineage Observability
documentgateway.ai
Data Lineage Map — Complete Provenance from Source to Consumer
Click to enlarge
WAREHOUSEDEVELOPERS
Data Lineage Map — Complete Provenance from Source to Consumer
Warehouse · Data Intelligence · 15 Nodes · 17 Connections
Every piece of intelligence in the platform has a traceable origin. The Data Lineage Map visualizes the complete path a document takes from source system through ingestion, processing, warehouse storage, and consumer delivery — with live status on every node. PDF Documents (693 ingested) flow through the Ingest Pipeline (1,000 processed), Deduplication (exact + fuzzy matching), and Fingerprinting (3 fingerprint types) before Abstract.DI extracts 13 abstraction fields. Those fields become Extraction Fields (queryable warehouse layer), Document Metadata (cross-asset search index), and Query Engine (PostgreSQL + custom SQL). Consumers — MCP Server, Snowflake, Webhooks — pull from the warehouse in real time. Stale nodes are visually flagged (Bridge Sync shows 0 fields synced, triggering immediate attention). This is not just observability — it is the audit trail that answers "where did this AI answer come from?" for every query, every extraction, and every alert the platform produces.
"The Finance director ran one query. Five years of contract values from 3,000 documents into Excel in 30 seconds. He looked up and said: 'We've been paying people to do this manually for twenty years.' That was the moment the platform sale closed itself."
— Warehouse Scientist Mode proof moment
Products · Tab 08
Millennia FileStar — The Installed Base That Cannot Be Replicated
45 enterprise clients. 7,500 active users. ~$1M ARR. 20+ years of institutional trust. The FileStar installed base is the most defensible competitive moat in the AI.DI portfolio — a client list that cannot be cold-called by any competitor, at any funding level.
45
Enterprise Clients
7,500
Active Users
~$1M
Current ARR
20+
Years of Trust
8-10
Phase 1 Upgrade Targets
Document Fabric & Lifecycle Governance

Structured lifecycle governance for any document type. Every document has a defined lifecycle: creation → review → approval → distribution → monitoring → archival. Configurable routing rules, approval chains, and escalation paths enforce this lifecycle.

Version control with full history. Compliance monitoring with expiry tracking. Audit trail on every action. Role-based access aligned with hierarchy.

The AI.DI Integration Pathway

FileStar governs documents. AI.DI makes them intelligent. FileStar-managed documents automatically flow through Sentry certification and Abstract.DI extraction without any workflow change for existing users. All FileStar metadata syncs to the AI.DI Warehouse.

Every FileStar client is one conversation away from the full AI.DI platform. No rip-and-replace. No migration project. No change management crisis.

The Permanent Competitive Advantage

Every FileStar client that upgrades to AI.DI is a client that could not have been won by a cold-start competitor — regardless of product quality, VC funding, or pricing. A startup raising $20M today cannot replicate a 20-year enterprise trust relationship with a client's CFO. This is structural moat, not temporary advantage.

Products · Tab 10
AI Orchestration & Agent Gateway — The Infrastructure That Makes LLMs Actually Work
AI.DI is not a competitor to LLMs. It is their prerequisite. Every enterprise deploying Copilot, GPT-4, Claude, or Gemini faces the same problem: the AI is only as good as the documents it reasons from. If documents are uncertified and unstructured — your AI hallucinates. AI.DI is the trusted document foundation that makes any LLM enterprise-grade.
Live MCP Server — AI Agent Gateway
documentgateway.ai
Integration Studio — Live AI Agent Gateway
Click to enlarge
ORCHESTRATIONAI
Integration Studio — Live AI Agent Gateway
AI Orchestration · MCP Server + Connected AI Systems
This is the screen that enterprise AI teams have been waiting for. A live production MCP server exposes 6 certified tools to any MCP-compatible AI system — Claude, Cursor, LangChain, AutoGen, or any agent framework. The moment Claude.ai connects to this URL, it can search your certified document corpus, check compliance status on any asset, retrieve all obligations from any document set, run structured queries against the full Warehouse, navigate your org hierarchy, and retrieve signed access URLs for specific document versions. Every query enforces row-level security at the database layer — the AI agent cannot access documents the connected user is not authorized to see. Keys are revocable instantly. Usage is logged. This is not middleware or a wrapper — it is a purpose-built enterprise document intelligence API that treats your LLM as a trusted, auditable consumer of certified data rather than a summarizer of raw PDFs.
28 Connectors — Every System You Already Use
documentgateway.ai
Integration Studio — 28 Enterprise Connectors
Click to enlarge
ORCHESTRATIONAI
Integration Studio — 28 Enterprise Connectors
AI Orchestration · Full Connector Ecosystem
The most common objection to any new platform is "we already use X." AI.DI answers that objection by connecting to every X simultaneously. Yardi Voyager delivers leases and rent rolls directly into the ingestion pipeline. MRI Software pushes financial reports and lease abstracts. Argus Enterprise connects valuation models. Salesforce ingests CRM documents and contracts. The document management connectors (SharePoint, Google Drive, Box, OneDrive, Dropbox) mean AI.DI adds intelligence to your existing storage rather than replacing it. Dev and monitoring connectors (Sentry, Datadog, GitHub) ingest operational documents as first-class records. Data warehouse connectors (Snowflake, Databricks, BigQuery) make extracted document intelligence immediately available in every analytics environment you operate. The AI Wizard guides each connection in steps — no custom integration code, no IT project, no professional services. The platform meets your existing infrastructure where it lives.
Document IQ — AI-Powered Portfolio Intelligence
documentgateway.ai
Document IQ — Conversational AI Over a Certified Corpus
Click to enlarge
ORCHESTRATIONAI
Document IQ — Conversational AI Over a Certified Corpus
AI Orchestration · Portfolio-Wide AI Query Interface
Document IQ demonstrates what changes when AI has access to structured, certified data instead of raw files. A question like "What's missing from the vault?" is not a keyword search — it is a completeness calculation across every asset in the portfolio, comparing current documents against required schemas, returning a prioritized gap list with asset, document type, and days overdue. "Show critical risk items" aggregates violation flags, expiry warnings, and compliance alerts across the entire portfolio into a single prioritized view that would take a compliance analyst days to compile manually. The file upload capability takes any document — a counterparty rent roll, a vendor certificate, a financial statement — and cross-references it against vault records in real time, identifying discrepancies, missing correlations, and data conflicts without a human pulling comparison reports. This is the AI experience that becomes the reason nobody opens a legacy platform again.
Data Lineage — Full Provenance for Every AI Answer
documentgateway.ai
Data Intelligence — Data Lineage Map
Click to enlarge
ORCHESTRATIONWAREHOUSE
Data Intelligence — Data Lineage Map
AI Orchestration · End-to-End Data Provenance
When an AI agent answers a question using AI.DI data, every element of that answer has a traceable origin. The Data Lineage Map shows the complete pipeline from source document to consumer — enabling any data engineer, compliance officer, or auditor to trace exactly how a specific piece of intelligence was produced, what transformations it passed through, and which source document it ultimately came from. This is the infrastructure that eliminates LLM hallucination risk: every answer the AI returns is backed by a certified document, a specific extraction, a confidence score, and a provenance chain. The stale node indicators (Bridge Sync showing 0 fields synced) surface data freshness issues proactively — you know before an AI answer is delivered whether the underlying data is current. Provenance is not an afterthought in AI.DI. It is the foundation.
Value & Strategy · Tab 10
Continuous Transaction Readiness™ — The Score That Runs Your Document Strategy
CTR is not a feature. It is a category-defining concept. It means your organization is always prepared to respond to a capital call, close an acquisition, satisfy a regulator, or distribute to an investor — because AI.DI monitors, scores, routes, and maintains your entire document estate in real time.
The Primary Value Statement

AI.DI gives your organization Continuous Transaction Readiness — the state where every document is accessible, authentic, aligned, and actionable at all times. Enterprises that achieve this state lower their cost of capital, reduce audit risk, deploy AI with confidence, and close transactions faster.

How CTR Is Calculated
Sample Asset CTR Score
84/100
Near Ready — Minor Gaps
23/26
Docs Present
2
Expiring Soon
1
Violation
4.2d
Avg Response
Five Weighted Dimensions
Document Completeness88/100
23 of 26 required document types present and valid
Document Validity & Freshness76/100
2 insurance certificates expire within 45 days
Compliance & Regulatory Status71/100
1 active violation: Certificate of Occupancy version mismatch
Distribution Readiness92/100
Document package deliverable to counterparty within 2 hours
Access & Permissioning Health97/100
All role assignments current. No orphaned access detected.
Score Interpretation
ScoreStatusTypical SituationTime to Transact
90–100Transaction ReadyAll documents present, current, valid. No violations.48 hours
75–89Near Ready1–3 documents missing or expiring. No active violations.1–5 business days
55–74Attention RequiredMultiple gaps or 1–2 violations.2–4 weeks
35–54Not ReadySignificant document gaps. Will not survive buyer diligence.30–60 days
0–34CriticalSeverely incomplete or non-compliant documentation.90+ days
Value & Strategy · Tab 11
For Data Scientists — The Document Intelligence Stack You've Been Waiting For
You've been asked to build AI on enterprise documents. You know what that means: unstructured PDFs, no provenance, wrong versions, 40% duplicates, PII everywhere, no reliable way to trace an LLM answer to a specific document. AI.DI is the infrastructure layer that solves every one of those problems — through every interface you already use.
What You're Actually Getting

AI.DI is not a document management UI with an API bolted on. It is a document intelligence data platform: a PostgreSQL warehouse of structured document intelligence, a live MCP server, a webhook event stream, a REST/GraphQL API, Snowflake Data Share, JDBC/ODBC direct access, vector embeddings on certified document chunks, and a 30-engine ML pipeline that improves continuously. Every document becomes structured, provenance-tracked, certified data — available to any model, pipeline, or analytics tool you're running.

The Data Model — What You're Querying
TableContentsKey FieldsPrimary Use
document_recordsEvery document processedid, original_name, document_type, workflow_status, asset_id, classification_confidence, storage_pathDocument inventory, classification analysis
extracted_fieldsStructured extraction from Abstract.DIdocument_id, field_name, field_value, confidence_score, extraction_model, extraction_timestampContract analytics, financial extraction
sentry_fingerprintsCryptographic fingerprint recordsdocument_id, fingerprint_hash, fingerprint_type, certified_at, version_chain, similarity_scoresCertification, duplicate detection, fraud monitoring
hierarchy_nodesFull org hierarchyid, parent_id, node_type, node_name, industry, ctr_score, completeness_pctPortfolio analytics, CTR aggregation
document_activity_logEvery action on every documentdocument_id, event_type, actor_id, actor_role, timestamp, metadataAudit trail, access pattern analysis
vector_embeddingsEmbeddings on certified chunksdocument_id, chunk_id, certified_version_hash, embedding_vector, model_versionSemantic search, RAG retrieval, clustering
ctr_score_historyCTR Score time seriesnode_id, score, dimension_scores, calculated_at, delta_from_priorReadiness trending, portfolio benchmarking
Python SDK — Example Patterns
from aidi import DocumentWarehouse
client = DocumentWarehouse(api_key="YOUR_KEY", tenant_id="YOUR_TENANT")

# Query all Q1 2027 lease expirations across a portfolio — certified docs only
expirations = client.extractions.query(
  document_type="commercial_lease", field="expiration_date",
  date_range=("2027-01-01", "2027-03-31"), certified_only=True
)

# Get version-locked embeddings for RAG pipeline
embeddings = client.vectors.get_certified_chunks(
  document_ids=expirations.document_ids(), version_locked=True
)

# Subscribe to certification events for real-time model retraining
@client.events.on("document.certified", document_type="financial_statement")
async def on_new_financial_statement(event):
  extracted = await client.abstractions.get_fields(event.document_id)
  await my_model.retrain_incremental(extracted.to_feature_vector())
Value & Strategy · Tab 12
Any Industry. Any Complexity. Built for Scale.
AI.DI was not built for one vertical and adapted for others. The same engine that certifies a Blackstone real estate portfolio is equally compelling for a PE firm's data room, a hospital's compliance records, or a law firm's contract vault. The document problem is universal. So is the solution.
Unlimited Org Depth
Enterprise → Group → Entity → Asset → Unit → Counterparty. Any depth, any width, any industry. A 500-asset CRE fund, a 15-portfolio-company PE firm, a 200-branch bank — all map to the same hierarchy model with zero configuration overhead.
Edge Compute at Scale
Deno edge functions scale to zero when idle and to any volume on demand — same code handles 10 documents and 10 million. No ops team. No provisioning. No performance cliffs at scale.
Any File Type. Zero Exceptions.
PDF, DOCX, XLSX, PPTX, MSG/EML, CSV, ZIP, JPEG/PNG/TIFF scans, database records. No conversion required. No pre-processing. Whether a scanned fax or a native Word contract — AI.DI ingests, classifies, and extracts from all of it.
Commercial Real Estate
Asset managers, sponsors, operators across multifamily, office, industrial, retail, and mixed-use
Primary Market
Document Types
  • Title policies & ALTA surveys
  • Lease abstracts & leases
  • Insurance certificates
  • Environmental studies (Phase I/II)
  • Appraisals & BPOs
  • Loan documents & notes
  • Certificates of occupancy
  • Property management agreements
Key Use Cases
  • Acquisition due diligence
  • Loan closing packages
  • Lender covenant compliance
  • Insurance renewal management
  • Portfolio disposition readiness
  • LP reporting distributions
CTR Impact
  • Diligence prep: 6 weeks → 48 hours
  • Eliminate insurance gap incidents
  • Pre-qualify assets 12 months early
  • LP reports in 1 click, not 2 weeks
  • Close refinancings in half the time
Private Equity
GPs, fund managers, and portfolio operations teams managing company-level and fund-level documentation
Primary Market
Document Types
  • Fund formation documents
  • LP subscription agreements
  • Cap tables & equity agreements
  • Material contracts
  • Audited financials
  • Board minutes & resolutions
  • Exit transaction documents
Key Use Cases
  • Portfolio company exit readiness
  • LP capital call packages
  • Annual audit preparation
  • Co-investor reporting
  • Secondary transfer docs
CTR Impact
  • Exit prep starts 18 months early
  • LP Q&A response under 24 hours
  • Audit cycle cut 70%
  • Deal team on diligence, not hunting docs
Department-Level Entry Points
DepartmentAcute PainAI.DI Entry ProductExpansion Path
Legal / GCContract version disputes, discovery liability, GDPR complianceSentry certification + Document Gateway distributionFull Document Warehouse for corporate legal corpus
Finance / AccountingAudit prep fire drills, financial document reconciliationAbstract.DI batch (financial extraction) + Blueprint auditSentry certification + Warehouse integration to ERP
Compliance / RiskRegulatory filing tracking, compliance gaps, audit exposureSentry + Warehouse (compliance corpus) + CTR ScoreFull platform across regulated document types
Transactions / Deal TeamDue diligence prep time, data room chaosDocument Gateway + Distribution Studio + Transaction RoomsAbstract.DI batch for portfolio-wide extraction
IT / Data EngineeringUnstructured data not in Snowflake; LLM hallucinationsDocument Warehouse + Snowflake + MCP ServerFull platform as enterprise document intelligence backbone
Operations / HREmployee records, policy tracking, onboarding complianceFileStar lifecycle governance + Abstract.DI HR extractionSentry certification + Document Gateway policy distribution
Get Started · Tab 13
Start with One Department. Get the Whole Platform.
AI.DI is not a pilot program with limited features. From your very first document, you have access to the complete platform — every engine, every view, every integration. We believe in earning your full commitment by delivering full capability from day one. Start small if you want. The platform is built for all — enormous portfolios and single-department deployments run on the exact same infrastructure.
The imkore Philosophy — Do Some or Do It All

The world's largest institutional real estate portfolios run on the same platform as a 12-asset regional operator starting their first compliance program. A single compliance officer in one department gets the same AI intelligence, the same CTR Score, the same Warehouse, the same MCP server as a 500-person investment management firm running 20 funds. We built for scale from day one — which means the smallest client gets the most powerful platform available at any price point. No feature tiers. No locked capabilities. No "upgrade to get the real thing."

Three Ways to Start — All Paths Lead to the Same Platform
Entry Path 01
Start with One Document Type
Pick your most painful document type — insurance certificates, leases, vendor contracts, financial statements. Run Abstract.DI on everything you have. Get a CTR Score on that category in 72 hours. See exactly what's missing, expiring, or wrong. The rest of the platform is right there when you're ready.
"We started with just our COIs. In three days we knew which assets were exposed. We hadn't done that audit in two years." — Property Operations Director
Entry Path 02
Start with One Department
Give legal, compliance, finance, or your deal team a standalone deployment. They get the full platform — just scoped to their hierarchy node and document types. No IT project. No enterprise rollout required. One steward, one asset group, full capability. When they prove ROI, the next department asks to join.
"Legal started it. Then finance wanted in. Then the deal team. We never ran a rollout — it spread itself." — Chief Operating Officer, PE Firm
Entry Path 03
Start with One Asset or Fund
Run a complete AI.DI deployment on a single asset or fund as a proof of concept with real production data. CTR Score goes live in 72 hours. Abstract.DI processes your existing archive in the first week. Distribution Studio sends your first LP package before the end of month one.
"We ran one asset. The CTR Score told us things we didn't know. That asset closed three months faster. Then we did the whole portfolio." — Managing Director, CRE Manager
imkore Blueprint — The Highest-Confidence Entry Point
Advisory Service · $50–150K · 60–90 Days
imkore Blueprint — Document Intelligence Audit & Readiness Roadmap

Blueprint evaluates your entire document ecosystem — every repository, every system, every process — and delivers a scored readiness assessment and a prioritized AI.DI product roadmap. Blueprint invariably reveals exactly which products the client needs and why. The roadmap we deliver IS the AI.DI implementation plan for your organization.

01
Discover
Map all document repositories across all systems
02
Assess
Evaluate governance, structure, and integrity
03-04
Classify + Validate
Standardize taxonomies, confirm authenticity, remove duplicates
05-06
Structure + Certify
Apply metadata conventions, establish certified records
07-08
Enable + Optimize
Prepare for AI and automation, maintain Continuous Transaction Readiness
Plans
Tier 01
Foundation
For teams managing one department, fund, or asset group getting organized and transaction-ready for the first time.
Custom
Based on asset count and user seats.

+Up to 25 assets — full platform
+Check-In Studio with Abstract.DI
+CTR Score Dashboard
+Distribution Studio
+Up to 10 user seats
Contact Sales
Tier 03
Strategic Partner
For institutional platforms, banks, and technology integrators embedding AI.DI into their own products.
API-First
Full API access, white-label options, revenue sharing available.

+Full Abstract.DI API access
+Sentry fingerprinting API
+CTR Score API
+White-label Transaction Rooms
+MCP server + Revenue sharing
Talk to Partnerships
Frequently Asked Questions

You get the full platform from the moment you deploy — every engine, every view, every integration. There are no feature gates, no capability tiers, and no "enterprise unlock" for core functionality. Your first document gets the same AI pipeline as document number one million. We believe you should see the full value immediately, not earn access to it through a ramp-up process.

No. AI.DI layers over your existing infrastructure. Start with your highest-priority asset group or begin fresh with new documents. There is no requirement to migrate your entire historical archive before going live. The batch engine can process any legacy archive on its own timeline — you decide when and what to bring in.

Sentry generates a mathematical fingerprint — a unique hash derived from document content. Two identical documents always produce identical fingerprints. Any change produces a different fingerprint. The original document is never stored by Sentry. GDPR data minimization is achieved structurally — your documents never leave your control.

The live MCP server exposes 6 tools: search_documents, get_compliance_status, get_obligations, query_warehouse, get_hierarchy, get_document_url. Add AI.DI to Claude, Cursor, LangChain, AutoGen, or any MCP-compatible environment and your agents immediately have certified document search and structured extraction queries. Authentication via OAuth2 — agents only access what the connecting user is authorized to see. Keys are revocable instantly.

Yes. Full platform via Docker containers — no Kubernetes required. Azure Cloud, AWS, fully on-premise, and hybrid (metadata in cloud, documents on-prem) are all supported. Air-gapped environments with no internet connectivity are also supported. Contact the enterprise team for deployment architecture details.

Snowflake Data Share (zero-copy, no ETL), Databricks connector (Delta Lake, streaming), Tableau and Power BI native connectors, dbt compatibility, BigQuery export, direct JDBC/ODBC access, REST API with OpenAPI 3.0 spec, Python SDK, and webhook event streaming to any HTTP endpoint. SSO via SAML 2.0 and OAuth 2.0.

Request Access
Your next deal closes faster
when your documents are always ready.
Request a Simulator Key and explore a fully populated AI.DI demo environment with real document intelligence running on sample portfolio data — no implementation required.
Request Simulator Key Log In to Document Gateway
hello@imkore.ai  ·  documentgateway.ai  ·  imkore.ai