Transitioning to enterprise software. Services live now. First product, RAG Studio, ships Q4 2026. See the roadmap →
01 Cluster · Intelligence Layer

The intelligence layer
for African
enterprise AI.

Build, deploy, fine-tune, and govern AI from one POPIA-native environment. Four product surfaces — Agent Builder, RAG Studio, Fine-Tuning Ops, and Governance Hub — sharing one API gateway, one model registry, and one observability stack. Hosted in AWS af-south-1. Priced in rand.

Developers Data scientists Compliance teams Business analysts
Product surfaces
4
Agent Builder · RAG Studio · Fine-Tuning Ops · Governance Hub
Capacity unlocked
60–70%
Engineering time recovered from glue-code, integration, and compliance work
Tool integrations
500+
Pre-built connectors via Composio — APIs, databases, SaaS, calendars
Supported foundation models
8+
Claude · GPT-4 · Gemini · Llama 3 · Mistral · Phi-3 · Gemma · Qwen
Why the AI Dev Platform exists

Enterprise AI is fragmented, unaccountable, and not built for South African procurement.

Three structural problems make AI difficult to operationalise inside regulated African enterprise. The AI Dev Platform was built specifically to dissolve them.

01 — Market fragmentation

Seven vendors. Seven invoices. Zero coherence.

Today's AI stack means model APIs from one vendor, agent orchestration from another, vector storage somewhere else, monitoring duct-taped on top, fine-tuning compute from a fifth provider, and compliance done manually in spreadsheets.

Separate tools, contracts, and security reviews to assemble what should be one product.
02 — The trust deficit

No bias detection. No XAI. No audit trail.

Banks, insurers, government bodies, and healthcare cannot deploy AI they cannot explain or audit. Most AI tooling has no native governance layer — so compliance departments block deployment indefinitely.

3 Questions a board must answer before AI ships: Is it fair? Can we explain it? Is it POPIA-compliant?
03 — The local market gap

Hyperscalers are present. They are not local.

AWS, Azure, and Google Cloud have infrastructure in af-south-1 — but no POPIA documentation, no local compliance support, no ZAR pricing, and no understanding of South African enterprise procurement cycles.

0 POPIA-native enterprise AI platforms operating with local data residency before sonofgraig.
04 — The capacity drain

Teams build infrastructure, not intelligence.

Enterprise AI teams routinely waste two-thirds of their capacity on infrastructure plumbing, integration glue, and compliance rework — not on the intelligence layer that creates business value.

60–70% Of enterprise AI engineering capacity consumed by non-differentiating work.
Stitched stack

The old way

  • Five vendor APIs reconciled by hand
  • Compliance redone every model swap
  • Observability rebuilt for each project
  • USD billing exposes you to FX volatility
  • No proof your data stays inside South Africa
sonofgraig

The unified platform

  • One auth, one API key, one audit trail
  • Compliance baked into the build workflow
  • Shared observability across every product
  • ZAR pricing — your budget does not move
  • Architecture-level POPIA — not a checkbox
Five-layer architecture

Built like a platform — because that's the moat.

The AI Dev Platform launches as a focused product, matures into a platform, and opens as an ecosystem. The same sequence followed by Salesforce, Shopify, and Microsoft Azure. Five layers separate concerns and create the data flywheel that makes the platform compound over time.

L1
Consumers
Four buyer personas sharing one platform — developers, data scientists, compliance officers, and business analysts.
Developers Data scientists Compliance Analysts
L2
Products
Four user-facing surfaces — each is independently subscribable and solves a specific problem for a specific persona.
Agent Builder RAG Studio Fine-Tuning Ops Governance Hub
L3
Platform services
Shared infrastructure built once, serves every product. The thing that makes the platform more than the sum of its parts.
API gateway Model registry Observability
L4
Data layer — the flywheel
Shared vector store, structured datastore, prompt library, and usage telemetry. Compounds with every interaction. The strategic moat.
Qdrant PostgreSQL Prompt library Telemetry
L5
Infrastructure
AWS af-south-1 hosting, multi-tenant cryptographic isolation, SOC 2 / POPIA controls, and 99.9% SLA backbone.
af-south-1 Kubernetes SOC 2 99.9% SLA
The data flywheel — why the platform compounds

Every interaction generates signal: prompt performance data, evaluation results, agent execution traces, bias detection results, and usage telemetry. That data — stored with full consent and governance — feeds back into model quality, default templates, evaluation benchmarks, and recommendations. The longer a customer uses the platform, the better the platform performs for them. This is the same loop that made Google Search unassailable and Microsoft 365 difficult to replace.

Four product surfaces

One platform. Four buyers. Shared infrastructure.

Each product is independently subscribable. Each solves a specific problem for a specific persona. Together, they share an API gateway, a model registry, and an observability stack — making the platform more than the sum of its parts.

Product 01 · Land-and-expand vehicle

Agent Builder

A visual environment for designing, testing, and deploying autonomous AI agents.

Q1 2027 · Beta Per-agent + tokens
Visual workflow canvas
Drag-and-drop agent design. No code required for simple agents. React Flow under the hood.
Tool use configuration
Web search, code execution, API calls, database queries. 200+ pre-built connectors via Composio.
Memory architecture
Short-term (conversation), long-term (vector), episodic (event log). Grounded in your knowledge bases.
Multi-agent orchestration
Supervisor-worker patterns with AutoGen. Agent handoff, parallel execution, and shared memory.
Testing sandbox
Run agents against test scenarios before production. Replay execution traces. Compare versions.
Deployment controls
Rate limiting, kill switches, and human-in-the-loop approval gates for consequential actions.
Full execution trace
Every decision, tool call, and reasoning step is auditable. Immutable log feeds Governance Hub.
Pre-built templates
Customer support, research assistant, data pipeline agent, code reviewer — start in one click.
Development teams
Build internal automation without AI/ML expertise.
Customer experience
Deploy AI-powered support and sales agents.
Operations
Automate multi-step business workflows end-to-end.
Product teams
Embed intelligent agents into customer-facing products.
Product 02 · First to ship

RAG Studio

Private knowledge chat systems grounded in your enterprise data — with citations.

Q4 2026 · Early access Document + token volume
Document ingestion
PDF, Word, Excel, Confluence, Notion, SharePoint, custom sources. PII scrubbed before embedding.
Auto chunking & embedding
Smart chunking strategies, embedding model selection, vector indexing — no ML knowledge required.
Hybrid search engine
Combines semantic and keyword search for higher accuracy than dense retrieval alone.
Source citation
Every AI response links back to the exact document and page it drew from. Auditable by design.
Access control integration
Users only retrieve answers from documents they are authorised to see. Honours your IAM model.
Freshness management
Automatic re-indexing when source documents update. No stale answers. Webhook-driven refresh.
Evaluation dashboard
Faithfulness, answer relevancy, context precision, context recall — tracked per query, powered by Ragas.
Multi-language support
Query and retrieve in any language — including isiZulu, Afrikaans, Sesotho, and other SA languages.
Legal teams
Query entire contract archive in plain English. Cited responses.
Knowledge management
Internal helpdesk grounded in real policy, not hallucinated guesses.
Customer support
Tier-1 deflection grounded in product manuals and SOPs.
Compliance
Policy-aware AI with auditable retrieval — auditors love it.
Product 03 · High-margin upsell

Fine-Tuning Ops

Train and manage domain-specialised AI models on your proprietary data — without it leaving af-south-1.

Q1 2027 GPU compute + hosting
Dataset management
Upload, clean, label, and version training datasets. Label Studio for annotation tasks.
Training orchestration
Configure and launch fine-tuning runs on managed GPU infrastructure. Axolotl + DeepSpeed.
Model versioning
Every trained model versioned, tagged, and stored in the shared model registry. Clean rollback.
Automated evaluation
BLEU, ROUGE, human preference, and custom metric pipelines run after every training job.
A/B deployment
Route production traffic between model versions with statistical significance dashboards.
PEFT support
LoRA, QLoRA, and adapter-based training for efficient fine-tuning on limited data.
Base model catalogue
Llama 3, Mistral, Phi-3, Gemma, Qwen — all production-ready starting points.
Cost dashboard
Real-time GPU compute cost tracking. Budget controls. Stop runs that exceed thresholds.
Data scientists
Train domain-specific models without operating GPU clusters.
ML engineers
Productionise fine-tuned models with hosting and A/B routing.
Domain experts
Tune voice, terminology, and judgement without writing code.
FinOps
Real-time GPU spend tracking. No surprise quarterly bills.
Product 04 · The enterprise unlock

Governance Hub

Audit, monitor, and govern AI systems for enterprise compliance — POPIA, GDPR, EU AI Act, ISO 42001.

Q1 2027 Mandatory tier add-on
Bias detection
Automated demographic parity, equal opportunity, disparate impact testing — Fairlearn + AIF360.
XAI reports
SHAP-powered plain-language explanations for non-technical stakeholders. Board-ready PDFs.
Compliance mapping
POPIA articles, GDPR, EU AI Act risk classification, ISO 42001 readiness — all checklisted.
Immutable audit log
Tamper-proof record of every model decision, input, and output. Searchable. Exportable.
Policy enforcement
Define guardrails — blocked topics, PII filters, output length, content classification rules.
Risk classification
Categorise AI systems by EU AI Act risk taxonomy. Minimal · Limited · High · Unacceptable.
Drift monitoring
Statistical alerts when output distributions shift from approved baseline. Powered by Evidently.
Executive reporting
Board-ready PDF generator — compliance status, bias summary, incident log, risk score trend.
Compliance officers
Auto-generated POPIA evidence. Audit log on demand.
Risk teams
EU AI Act risk classification. Quarterly board packs.
ML engineers
Bias and drift detection wired into the deployment pipeline.
Internal audit
Tamper-proof decision log. Exportable for external auditors.
Layer 3 — shared platform services

Build once. Serve every product.

Beneath the four product surfaces sits a shared platform services layer. This infrastructure is built once and serves all products simultaneously — what makes the platform more than the sum of its parts.

Unified API Gateway

One authentication surface across all four products. Single login. Single API key. Single audit trail. RBAC, rate limiting, and tenant isolation built in.

Single-sign-on across Agent Builder, RAG Studio, Fine-Tuning Ops, and Governance Hub
Role-based access control (RBAC) with custom roles and team groups
Rate limiting and throttling — prevent runaway costs from misconfigured agents
Per-call, per-token, per-model usage metering for accurate cost attribution
Cryptographic tenant isolation — your data never co-mingles
Webhook system for downstream integrations and incident alerting

Model Registry

Central catalogue of foundation models and customer fine-tunes. Performance benchmarks, bias evaluations, A/B routing, and controlled deprecation — all in one place.

Foundation model catalogue: GPT-4, Claude 3.5, Gemini, Llama 3, Mistral, Phi-3, Gemma
Custom fine-tuned model storage from Fine-Tuning Ops, fully versioned
Per-model metadata: benchmarks, bias scores, training data provenance
A/B routing engine with configurable traffic weights
Deprecation workflow with rollback safety
Real-time cost-per-model pricing — optimise spend without leaving the platform

Observability Stack

Token usage, latency, errors, prompts, drift, and cost — tracked across every product. Hooks into Datadog, Grafana, and PagerDuty for teams with existing observability infrastructure.

Per-request, per-user, per-project, per-model token tracking
Latency monitoring — P50, P95, P99 with alerting thresholds
Structured error logging with full request context for debugging
Immutable prompt audit trail — every prompt and response logged
Statistical drift detection — alerts when output distributions shift
Cost anomaly detection with configurable budgets per project

Data Layer — The Flywheel

The most strategically important component. The data layer is what separates a feature from a platform — and a platform from a defensible business. Everything happens here, with consent, governance, and tenant isolation enforced at the kernel.

Vector store (Qdrant) — embeddings from documents, agent memories, eval datasets
Structured datastore (PostgreSQL) — multi-tenant via row-level security
Prompt library — versioned, tested, evaluated across products
Usage telemetry — feeds the recommendation and quality flywheel
Cryptographic isolation — your data is unreadable to other tenants
Hosted in af-south-1 — every byte stays in South Africa unless you say otherwise
The enterprise unlock

Three questions every board must answer.
The platform answers all three automatically.

South African banks, insurers, government, and healthcare cannot deploy AI they cannot explain or audit. Compliance teams block deployment indefinitely without these three answers. Governance Hub is the unlock.

Is this AI fair?
Automated demographic parity, equal opportunity, and disparate impact testing on every deployed model. Statistical evidence — not gut feel.
→ Bias detection · Fairlearn + AIF360
Can we explain its decisions?
SHAP-powered XAI reports translate model decisions into plain language. Board-ready PDFs. Mathematically grounded. Regulator-accepted.
→ XAI reports · SHAP
Are we POPIA-compliant?
POPIA articles checklisted, evidence auto-collected, data processing register maintained, breach workflows defined. EU AI Act and ISO 42001 alongside.
→ Compliance map · Audit log
Honest roadmap

When each AI Dev Platform product ships.

We publish this because enterprise buyers deserve to know the actual state of the product they are evaluating. Services revenue funds the build. We hit our dates.

Phase 1 · Months 1–6 · Beachhead
RAG Studio early access
Live · Q4 2026
Document ingestion pipeline (LlamaIndex + Unstructured.io)
Hybrid search (Qdrant + BM25)
Source citation + access control
Eval dashboard (Ragas)
POPIA s.11/14/19/72 architecture-native
Platform shell: auth, billing, RBAC, API keys
Target: 10 paying pilot customers
ARR: R500K
Audit: SOC 2 Type I initiated
Phase 1 · Months 4–6 · Expansion
Agent Builder beta
Q1 2027
Visual agent canvas (React Flow)
LangGraph stateful orchestration
200+ tool integrations (Composio)
Human-in-the-loop approval gates
Pre-built agent templates
RAG Studio integration as agent tool
Target: 25 customers across products
Sectors: Legal · FS · Professional services
Phase 2 · Months 7–12 · Platform expansion
Governance Hub & Fine-Tuning Ops
Q1 2027
Bias detection service (Fairlearn + AIF360)
SHAP XAI report generator + board PDF export
Compliance map (POPIA · GDPR · EU AI Act · ISO 42001)
Axolotl training service + vLLM inference
Dataset manager + Label Studio integration
Multi-agent orchestration (AutoGen)
Evidently drift monitor
Full observability (Prometheus + Grafana + Langfuse)
Target: 50 paying customers
ARR: R3M
Certifications: SOC 2 Type II + POPIA
Phase 3 · Months 13–24 · Ecosystem
Marketplace & geographic expansion
2027+
Developer marketplace — third-party agent templates
Voice Agent product surface (Phase 2 of Service 11)
Adversarial testing module (red teaming)
Reinforcement learning module (advanced)
Carbon & compute efficiency dashboard
Nigeria + Kenya regional infrastructure
Self-hosted / air-gapped deployment
ISO 27001 certification
Target: 200 enterprise customers
ARR: R50M
Footprint: 3 countries
Pricing for the cluster

One subscription. Priced in rand.

Starter includes one cluster — pick AI Dev Platform and get RAG Studio. Growth includes three. Enterprise includes all five plus Governance Hub. No USD pricing. No FX risk.

Cluster 01 — AI Dev Platform

Start with the AI Dev Platform on Starter.

Get RAG Studio, 5 seats, 50 GB document storage, 500K LLM tokens per month, audit log, and full POPIA compliance documentation. Ideal for an enterprise proof of concept with real data.

R4,999
per month · billed monthly

Ready to put
real enterprise AI into production?

3 founding customer slots remaining. 247 organisations on the waitlist. Apply for early access — we onboard the first cohort in person across Pretoria, Johannesburg, and Cape Town.

3 founding slots remaining POPIA-native architecture Hosted in af-south-1 ZAR billing — no FX risk