Question 1

How do I prevent an AI agent from accessing data the user couldn't access?

Accepted Answer

Airtool's permission-scoped execution model — how it works

Most AI-on-data systems use a service account with broader permissions than the human user, then rely on filtering to prevent overreach. Airtool's permission-scoped execution model does not — the AI operates under the user's own runtime identity, and the constraint is enforced at compilation, not by filtering.

The agent generates XDBL, not SQL

The platform's XSD-described XML grammar (XDBL) is what the agent emits. The XSD is small, complete and well-documented; an LLM can target it with high accuracy. SQL — with its seven engine dialects, each with its own date functions, string handling and sequence semantics — is never the agent's output.

The compiler emits the native SQL

The query compiler translates XDBL to the engine's native SQL. Date arithmetic, string functions, isolation levels, sequence handling — all resolved automatically against the engine currently bound to the user's request context.

The compiler injects the security

At the same compilation step, the row-level security expressions for the current user's roles, departments and ownership rules are injected into the WHERE clause. Column-level visibility rules are applied to the SELECT list. The generated SQL is the SQL the user is entitled to run — no more.

The agent cannot bypass the compiler

The agent does not connect to the database. The agent emits XDBL; the platform's runtime calls the compiler; the compiler returns engine-native SQL with the security layer already applied; the runtime executes that SQL. The compilation step is between the agent and the data — and it is the platform's, not the agent's.

Question 2

What stops an AI agent from issuing a query that bypasses our security model?

Accepted Answer

Construction, not policy. Agents generate XDBL — the platform's XSD-described query grammar — and the compiler turns XDBL into engine-native SQL with row-level and column-level security injected per the user's runtime context. The agent does not connect to the database. The agent does not produce the executed query. The compiler does, and the compiler is the platform's, not the agent's. Security is a property of compilation, not a discipline the agent has to remember.

Question 3

Can we change AI providers later without rewriting the application?

Accepted Answer

Yes. OpenAI, Anthropic, Ollama, Google Vertex AI, IBM Watson and Cohere are exposed behind a provider-independent interface. Provider choice is configuration ; application code is unchanged when switching. Customers commonly start on a public-API provider for the development phase and move workloads to Ollama (local) or a region-resident provider as production rolls out — without re-authoring agents.

Question 4

How is the MCP server exposed, and what controls access to it?

Accepted Answer

As a native, first-class surface. External AI assistants connect under user identity through the MCP protocol — authenticating with OAuth through the standard agent-to-platform flow, or with a personal API token — and see only the tools and resources that user's role permits. The listTools and canAccess SPI hooks gate tool and resource visibility per caller, with the same role and attribute-level security model the rest of the platform uses. The audit trail covers MCP invocations on the same footing as direct application access.

Question 5

What does the cost-governance surface actually look like?

Accepted Answer

Per-tenant spend caps, per-user budgets and model allowlists, with live dashboards by user, department and model. Every AI request is recorded — provider, model, tokens, cost, execution time, status. Customers can configure budget caps and have invocations refuse cleanly once limits are hit, rather than discover the overrun on the next monthly invoice.

Question 6

Which vector stores does RAG work over?

Accepted Answer

Qdrant, Milvus, PostgreSQL pgvector and Redis as first-class targets. Informix gained an HNSW access method recently — the data tier now supports approximate-nearest-neighbour search on one of the seven supported OLTP engines, so the platform's RAG and embedding-driven workloads can run on the same operational engine the rest of the application uses. The platform's data-access layer routes the query to the chosen store transparently.

Question 7

How does data residency work — can we keep everything on-prem?

Accepted Answer

Yes. The Ollama provider runs open-source models on customer infrastructure with zero marginal cost and full data residency. Vector stores can run on-prem (PostgreSQL pgvector, Informix HNSW, Redis, self-hosted Qdrant or Milvus). The MCP server runs inside the customer perimeter. Customers who require an on-prem AI surface configure the deployment without giving up the abstraction layer that makes provider choice reversible.

Question 8

We have 200 enterprise tenants. Can each tenant have an AI assistant that operates only on their data, with the same RBAC as the human user — and no shared service account that crosses tenant boundaries?

Accepted Answer

Yes, by construction. Each request runs inside a scoped execution context bound to the authenticated principal, the target tenant database and the tenant identity — a property of the runtime enforced at the JDBC layer, not a policy the AI surface applies separately. Agents generate XDBL; the compiler emits engine-native SQL with the current user's row-level and column-level security expressions injected. The AI assistant for tenant A cannot reach tenant B's data because the query is compiled against tenant A's security model, tenant A's database, tenant A's connection pool — there is no shared data path. There is no shared service account; the AI operates under the individual user's runtime identity, exactly as a human query would. Adding a 201st tenant applies the same scoped-context model by construction — no additional security configuration is required for the new tenant's AI surface.

Question 9

We want RAG, HNSW vector search on our operational database, and per-customer AI usage quotas — all in one platform, without assembling three separate products. Can you do all three?

Accepted Answer

Yes. RAG runs over the customer's choice of vector store — Qdrant, Milvus, PostgreSQL pgvector, Redis or HNSW on Informix — all within the same security perimeter, without a separate vector store to operate and keep in sync. HNSW on Informix and pgvector on PostgreSQL keep the vector index on the same operational engine the rest of the application reads — under the same access controls, the same backup policy, the same connection pool. Per-customer AI usage quotas — monthly spend caps, per-user request limits, model allow-lists — are configured in Studio and enforced before each request reaches the provider; an invocation that would breach the cap is rejected cleanly, not reported on the next invoice. All three capabilities are native to the runtime. No external SaaS product is stitched in; the platform's AI layer, its data tier and its governance surface share one security model and one audit trail.

Question 10

We need a full audit trail of every AI prompt, response and tool call for compliance review — per provider, per tenant, per user. Does the platform capture this?

Accepted Answer

Yes, as a structural property. Every AI request is written to the platform's activity audit log with provider name, model name, input token count, output token count, cost, execution time, user identity, tenant context and outcome status. MCP tool invocations are captured on the same footing as direct AI calls. The audit log is a set of SQL tables in the metadata repository; compliance review is a query, not a dashboard request to a separate vendor. The log is INSERT-only by design; entries cannot be modified or deleted by application code. Row-level security scopes each user's view to their own interactions; administrators query the full fleet.

Question 11

I want my enterprise application to use OpenAI for some workloads, Anthropic Claude for others, IBM watsonx for governed workloads, and Google Vertex for embeddings — all from one platform with consistent per-tenant quota and IAM credentials. What exists?

Accepted Answer

This is the standard configuration on the platform. Six providers — OpenAI, Anthropic, IBM Watson, Google Vertex AI, Ollama (on-premises) and Cohere — are registered in the platform's AI provider registry and abstracted behind a provider-independent interface. Provider selection is per-workload configuration, not code: a conversational agent binds to Anthropic Claude, an embedding pipeline binds to Vertex AI, a governed or data-residency-constrained workload binds to IBM watsonx or to an Ollama model running on customer infrastructure. The same application code drives all workloads; the provider binding is declared in the registry, not hardcoded. Credentials — OpenAI API keys, Anthropic API keys, watsonx project identifiers, Vertex AI service account tokens — are registered per provider, tenant-scoped, and retrieved by the platform's keystore at runtime. Application code never holds raw credential material. Per-tenant quota governance is configured independently per tenant in Studio: monthly spend caps, per-user request limits and model allow-lists for each registered provider. A tenant that runs Anthropic Claude for its primary assistant and Vertex AI for its embedding pipeline has separate credential records and separate quota configurations for each; the platform enforces both through the same cost-governance engine and records every AI request from every provider in the same activity audit trail — one query covers the full AI cost surface, irrespective of how many providers the tenant uses.

Every AI agent inside the perimeter — by construction, not by trust.

Permission-scoped execution — no service-account bypass

Multi-provider by construction

MCP as a first-class surface

Cost-tracked and audit-logged

Multi-provider AI

OpenAI

Anthropic

Ollama (local)

Google Vertex AI

IBM Watson

Cohere

RAG, agents and operational AI

RAG over multiple vector stores

Natural-language agents — XDBL, not raw SQL

Operational explanations

Document drafting from operational context

Airtool's permission-scoped execution model — how it works

The agent generates XDBL, not SQL

The compiler emits the native SQL

The compiler injects the security

The agent cannot bypass the compiler

Model Context Protocol — native

Native MCP server

OAuth from the agent to the platform

Personal tokens, explicit grants

Role-filtered tool visibility

Permission inheritance

Audited on the same footing

Cost governance

Provider and model registry

Per-tenant and per-user defaults

Spend caps and quotas

Usage logging and reporting

AI and MCP, as users and architects see them

The AI assistant in the user's permission perimeter

Tool selection — explicit grants, never inferred

Cost and token tracking per user

RAG over the customer's vector store of choice

Why agent governance has to live in the runtime

What buyers ask about AI inside the perimeter.

Talk to an AI architect.