Federal entity
Federal RAG over Arabic correspondence
Azure UAE North + G42 Cloud, pgvector on Postgres 16, Llama Guard 3 + Arabic classifier, full audit trail to a WORM store.
Download the architecture
// STACK.md
Six planes. Pinned versions on every component. Three reference architectures. Walked through line by line with the principal platform engineer who owns it.
STACK.md · last updated 2026-05-01 · commit a7c41f3
$ cat stack.versions
The production stack we run today
Why we publish the stack
If you cannot read it, you cannot evaluate it. So we publish the pins, the ADRs, and the boundaries between what is portable and what is opinionated.
A stack page that survives an architecture review board is one written by the engineer who has run the pager rotation, not the marketer who collected the badges. Brocode publishes its production stack with pinned versions because architects we respect have told us the difference between a serious partner and a slide deck is whether the version numbers are real. They are real here, reviewed quarterly by our CTO, and reflected in the ADR repository we walk you through under NDA in the first hour of an engagement.
Every component is opinionated for a reason. Every alternative we considered is documented. Every plane has a fallback we have actually rehearsed. Brocode is a services firm — we do not sell a Brocode platform, a Brocode runtime, or a Brocode-branded model. Every component on this page is open source or a portable commercial product that you license directly from its vendor. If you walk away from the engagement six years from now and run this stack inside your own organisation with your own engineers, it will keep working — and that is the only acceptable test for a serious enterprise architecture.
The six planes
Click into the lead-magnet pack for the full pin lists, dependency graphs and Terraform module skeleton.
plane.01
PyTorch for training, vLLM and TensorRT-LLM for inference, LangGraph for agentic orchestration. We do not adopt a framework until it has shipped two stable minor releases.
plane.02
Airflow for orchestration, dbt for transformation, Iceberg on S3-compatible object storage, Trino for federated query. Feast holds the feature store; Kafka carries event streams.
plane.03
Ray for distributed training, MLflow as the experiment store and model registry, DVC for dataset versioning. Evidently covers drift and bias; Argilla supports the human-in-the-loop evaluation we run with your team for every release.
plane.04
Kubernetes 1.30 on EKS, AKS, OKE or G42 K8s. Argo CD for GitOps, Terraform plus Terragrunt for IaC, Vault for secrets. Prometheus, Grafana and Loki layered through OpenTelemetry.
plane.05
pgvector on Postgres is the default. Qdrant and Weaviate enter the design when cardinality demands them. Elastic 8 handles hybrid lexical-plus-vector workloads.
plane.06
NeMo Guardrails and Llama Guard 3 sit on the request-and-response path; where Arabic policy classification is needed, we train a bespoke classifier inside your engagement repository on your taxonomy. Prompt and response logging is written to a tamper-evident store you own.
The logo library
No category logo walls. Each cell shows the brand, the version, and the plane it sits in. We swap a logo only when we swap the underlying tool — and write an ADR explaining why.
The runtimes, frameworks, and inference engines we build on.
8 components
Python
TypeScript
Go
PyTorch
TensorFlow
Hugging Face
LangGraph
NVIDIA · TensorRT-LLM
Where data is ingested, modelled, and served to models.
6 components
Apache Airflow
dbt-core
Apache Kafka
Snowflake
Databricks
Postgres · pgvector
Distributed training, experiment tracking, drift, and bias.
4 components
MLflow
DVC
Weights & Biases
HF Hub
Where models live, how they ship, how they are observed.
10 components
Kubernetes
Docker
Helm
Istio
Argo CD
Terraform
HashiCorp Vault
OpenTelemetry
Prometheus
Grafana
Where embeddings, retrieval indexes, and operational data live.
3 components
pgvector
Elasticsearch
Redis
Foundation-model and tooling partners we deploy with.
5 components
OpenAI
Anthropic
Meta · Llama
Ollama
NVIDIA Inception
Portable across UAE-resident estates and global hyperscalers.
4 components
AWS UAE North
Azure UAE North
Google Cloud
OCI Abu Dhabi
174
Production deployments on this stack
6
Planes, six versioned pin lists
5
UAE estates the stack runs on unmodified
0
Proprietary lock-in components
Default versus rejected
Choosing what to leave out of a stack is as important as choosing what is in it.
| Capability | Brocode default | Offshore SI delivery centres | Big-4 AI practice | Sovereign-only integrator |
|---|---|---|---|---|
| Published, version-pinned stack | Yes — STACK.md in a public repo | No — slide-only | No — bespoke per engagement | Whatever the client's enterprise architecture mandates |
| Architecture Decision Records | Public ADR catalogue, walked through line by line | Word documents inside engagement folders | None published | Embedded in deck appendices |
| Portable across UAE clouds | Runs unmodified on AWS UAE North, Azure UAE North, OCI Abu Dhabi, G42 Cloud, Khazna | Locked to one hyperscaler | Locked to one hyperscaler | Locked to a sovereign-only estate |
| Proprietary runtime required | Often | Often | Sometimes | |
| Reference architectures with Terraform skeleton | Three downloadable — federal RAG, retail bank fraud, energy major predictive maintenance | Diagrams only | On request | Sovereign-locked example |
| Quarterly stack review by CTO | Annual at best | Ad hoc | Whenever the sovereign ecosystem moves |
Stack rejections
n8n / Streamlit in production paths
Pleasant for prototypes; not built for the auth, scaling and audit demands of a regulated enterprise.
Single-vendor managed AI platforms with proprietary file formats
Migrating off them is a year of work. We will not put that risk on a client balance sheet.
Closed-weight-only model strategies
Sovereignty and unit economics fail for high-volume Arabic workloads. Open-weight options stay in the architecture.
CI providers without a self-hosted runner option
Regulated estates routinely block public cloud runners. Self-hosted runners are non-negotiable.
Bespoke orchestration languages
Anything that needs a new DSL to operate the platform fails the post-handover test on day one.
Three reference architectures
Each is a one-click downloadable PDF with Mermaid sources, Terraform module skeleton, and the same component names you have just read.
Federal entity
Azure UAE North + G42 Cloud, pgvector on Postgres 16, Llama Guard 3 + Arabic classifier, full audit trail to a WORM store.
Download the architectureTier-1 bank
AWS UAE North, Iceberg on S3-compatible storage, Ray for distributed training, drift monitoring through Evidently and the Brocode pack.
Download the architectureADNOC ecosystem
OCI Abu Dhabi + Khazna on-prem appliance, KServe for inference, Trino federated query across plant historians and the lakehouse.
Download the architecturePortability promise
We separate what the stack mandates from what is a default convenience. No surprises after the SoW is signed.
Portable
Every plane runs on AWS UAE North, Azure UAE North, OCI Abu Dhabi, G42 Cloud and Khazna on-prem. One Terraform variable changes the cloud target.
Optional
Weights & Biases, Qdrant, Weaviate, Elastic. We pull these in only where the workload demands them. pgvector is the default; the others are replacements, not additions.
Replaceable
Every component has a documented fallback. We will not push back on a client mandate to use ClickHouse over Iceberg or Dagster over Airflow. The fallback ADR lists the trade-off.
Five ADRs we have published
Default vector store: pgvector over Qdrant for sub-50M vector workloads
vLLM versus Triton: routing decision matrix per workload class
Sovereign deployment topology for the G42 Cloud control plane
Terraform module structure for cross-cloud landing zones (AWS/Azure/OCI/G42)
Guardrails composition: NeMo + Llama Guard 3 + Arabic policy classifier ordering
The full ADR repository is shared under mutual NDA before the walk-through.
The Stack Walk-Through
Every engagement opens with the assigned principal engineer reading the live ADR repository with your architecture team. The agenda is set by your questions, not ours.
00:00
Live view of STACK.md, the ADR index, the Terraform module skeleton, and a recent evaluation-suite CI run on a representative client engagement.
00:15
How each plane lands in AWS UAE North / Azure UAE North / OCI Abu Dhabi / G42 / Khazna — with the variable changes called out.
00:30
You pick three ADRs to read in detail. We open the repo and walk the reasoning.
00:50
You leave with the ADR titles you read, a follow-up note within 24 hours, and the reference architecture pack.
Free download
The 48-page PDF, the Terraform module skeleton and the Mermaid-source architecture diagrams for federal RAG, retail bank fraud and energy major predictive maintenance — all pinned to the versions on this page.
Architect questions
Send us the question we did not answer here and we will route it to the principal engineer who owns that plane.
We re-evaluate every pin quarterly against four criteria: security patches landed upstream, breaking-change blast radius across our reference architectures, ecosystem readiness (libraries, drivers, deployment manifests), and whether a current client is asking for a feature only the new version provides. Upgrades are batched into a single quarterly stack release with a written migration note and a re-run of the evaluation suite we maintain as a methodology.
Book the walk-through
A senior platform engineer responds within one business day. If your architecture review board sits inside two weeks, tell us — we will prioritise.
Prefer chat? Message us on WhatsApp.
Continue exploring
The badges that back this stack up — at Premier, Advanced and Solution Provider tier.
Read moreSeven UAE cloud estates this stack ships into.
Read moreWhere our engineers actually commit upstream — vLLM, tokenizers, pgvector.
Read moreHow the stack is delivered: Discover, Build, Harden, Run.
Read moreThe service that operates the stack inside your estate.
Read more