Practitioner Pack · Q1 2026 · 4 guides · 240 pages · 3 GitHub repos

Engineering handbooks for teams shipping AI in the GCC — written by engineers, not marketers.

Twelve long-form practitioner guides, each 40 to 80 pages, each authored by a named Brocode principal engineer, each tied to a public GitHub or Hugging Face artefact. Quarterly cadence with a visible last-reviewed date. One email a quarter — no sales rotation.

Download the Practitioner Pack · Browse all 12 guides

14,200
downloads
12
guides
612 pp
in the library
9
GitHub repos

Cover · open chapter

Arabic NLP at Production Scale — 72 pp

Chapter 3 (Khaleeji dialect handling) is open without a gate.

What a Brocode guide looks like

40 to 80 pages. Named author. Companion code. Quarterly review.

The standards every guide must clear before publication.

A guide is not a blog post and it is not a whitepaper. It is the engineering handbook a competent team can read on a Monday and act on by Friday. Every guide carries the photo and LinkedIn handle of the principal engineer who wrote it, a visible last-reviewed date, and a link to the public GitHub or Hugging Face artefact that backs the content. We publish a full chapter without a gate so the visitor can verify quality before exchanging an email.

We send one email a quarter — the new guide and a short editorial note. No 14-touch nurture sequence, no sales rotation, no second send. If a guide is wrong, the erratum is published with the date and the prior text. The standard we hold ourselves to is whether a senior engineer who has never met us could use the guide unaided.

12 guides
In the current library
240 pp
In the Practitioner Pack
14,200
Verified-practitioner downloads
1 / qtr
New guide cadence

Free download

The Brocode 2026 Practitioner Pack

A single ZIP containing the latest editions of the four most-downloaded guides — Arabic NLP at Production Scale, RAG That Survives the Regulator, MLOps for Sovereign-Cloud Deployments, and From Notebook to Live Model in 90 Days. Totals 240 pages and three companion GitHub repositories.

Arabic NLP at Production Scale — 72 pages (latest edition)
RAG That Survives the Regulator — 58 pages (latest edition)
MLOps for Sovereign-Cloud Deployments — 66 pages (latest edition)
From Notebook to Live Model in 90 Days — 44 pages (latest edition)
Three companion GitHub repos with pinned versions and a CHANGELOG

The full library

Twelve guides, organised by track.

Sorted by track — NLP, MLOps, Sovereign Cloud, Governance. Each tile shows the page count, the author, the last-reviewed date, and the public artefact link.

Arabic NLP · 5 guides
MLOps · 3 guides
Sovereign Cloud · 2 guides
Governance · 2 guides

Arabic NLP track

Five guides on Arabic NLP at production scale — tokenisation, dialect, fine-tuning, evaluation.

72 pp · Reviewed February 2026 · 3,120 downloads

Arabic NLP at Production Scale

Tokenisation, Khaleeji dialect handling, NER, intent classification, RAG retrieval over Arabic corpora, and evaluation with the Khaleeji Benchmark v2. Companion code, datasets, and reproducible eval suite.

Yasmin Al Marzooqi — Head of Arabic NLP

Public artefact: github.com/brocode-ai/khaleeji-benchmark

Download this guide

40 pp · Reviewed January 2026 · 1,840 downloads

Evaluating Arabic LLMs with the Khaleeji Benchmark

Benchmark methodology, leaderboard interpretation, and what to score in your own evals — including code-switching cases the textbook benchmarks miss.

Yasmin Al Marzooqi — Head of Arabic NLP

Public artefact: huggingface.co/datasets/brocode-ai/khaleeji-v2

Download this guide

52 pp · Reviewed March 2026 · 2,410 downloads

Self-Hosting Falcon and Jais on a Single DGX H100

vLLM tuning, batching, KV-cache sizing, GPU pricing — and the 36-month TCO model versus Azure OpenAI and OpenAI Enterprise.

Tareq Ibrahim — Principal Platform Engineer

Public artefact: github.com/brocode-ai/dgx-h100-llm-stack

Download this guide

42 pp · Reviewed October 2025 · 2,230 downloads

Prompt Engineering Patterns for Regulated Industries

Twelve named patterns with regulator-mapped redaction rules, prompt-injection defence, and an open prompt-evaluation harness.

Layla Mansoor — Principal ML Engineer

Public artefact: github.com/brocode-ai/prompt-pattern-library

Download this guide

46 pp · Reviewed September 2025 · 1,980 downloads

Fine-Tuning vs RAG: a Decision Framework

When fine-tuning is the right answer, when RAG is, and when the answer is both. Cost models, evaluation methodology, and Khaleeji-specific examples.

Yasmin Al Marzooqi — Head of Arabic NLP

Public artefact: github.com/brocode-ai/finetune-rag-decision

Download this guide

MLOps track

Three guides on notebook-to-production discipline, vector-database selection, and agentic systems.

44 pp · Reviewed November 2025 · 3,540 downloads

From Notebook to Live Model in 90 Days

The 4-week discover, 12-week build, 4-week harden recipe with concrete artefacts at each stage and a sample steering deck.

Reem Saleh — Head of Delivery

Public artefact: github.com/brocode-ai/notebook-to-production

Download this guide

48 pp · Reviewed December 2025 · 1,620 downloads

Vector Database Selection for GCC Workloads

pgvector, Weaviate, Vespa, Milvus, Pinecone — benchmark on a 50M-vector Arabic-English corpus, with cost, latency, and sovereignty trade-offs.

Omar Haddad — Principal Architect

Public artefact: github.com/brocode-ai/vectordb-benchmark

Download this guide

56 pp · Reviewed February 2026 · 1,410 downloads

Agentic Systems in Production

The supervisor pattern with LangGraph and Temporal, exception-closure SLAs, and the test harness for non-deterministic agent workflows.

Tareq Ibrahim — Principal Platform Engineer

Public artefact: github.com/brocode-ai/agent-supervisor-pattern

Download this guide

Sovereign Cloud track

Two guides on UAE-resident landing-zone patterns and the talent strategy that powers them.

66 pp · Reviewed January 2026 · 1,980 downloads

MLOps for Sovereign-Cloud Deployments

Landing-zone patterns across AWS UAE North, Azure UAE North, OCI Abu Dhabi, and G42 Cloud. CI/CD across sovereign boundaries, cross-cloud observability, and the operating model.

Khaled Al Otaibi — Principal Architect

Public artefact: github.com/brocode-ai/sovereign-mlops-blueprints

Download this guide

38 pp · Reviewed January 2026 · 980 downloads

AI Talent Strategy for GCC Enterprises

How to build a 12-person AI capability in 12 months — role definitions, salary bands, hiring channels, and the relocation-versus-local trade-off.

Reem Saleh — Head of Delivery

Public artefact: Role pack — open PDF

Download this guide

Governance track

Two guides on regulator-survivable RAG and data-residency law across the GCC.

58 pp · Reviewed February 2026 · 2,670 downloads

RAG That Survives the Regulator

Retrieval architecture, source citation, faithfulness evaluation, prompt-injection defence, audit trail and WORM logging, and control mapping to CBUAE, FSRA, and NCA.

Aisha Al Hosani — Head of AI Risk

Public artefact: github.com/brocode-ai/rag-evidence-pack

Download this guide

50 pp · Reviewed December 2025 · 1,240 downloads

Data Residency Law for AI in the GCC

UAE PDPL, DIFC DP Law, KSA PDPL, Qatar PDPPL, Bahrain PDPL — with a side-by-side regulator matrix and a sample DPA template.

Aisha Al Hosani — Head of AI Risk

Public artefact: Reference matrix — open PDF

Download this guide

Sample chapter — open without a gate

Chapter 3: Khaleeji dialect handling at production scale.

From the Arabic NLP guide. The full chapter is publicly readable so you can verify the depth before exchanging an email.

Chapter 3 · pages 24–38

Why MSA-trained models fail on Khaleeji calls.

Modern Standard Arabic is a literary register; almost nobody speaks it on a contact-centre call. A customer in Sharjah complaining about an inflated du bill will switch between Khaleeji vocabulary, an English brand name, and a Bahraini turn of phrase three times in a sentence. The MSA-tuned ASR transcribes the first half competently, mis-renders the brand name, and emits a confidence score the application layer treats as gospel.

The remedy is not a bigger model. It is a calibration set that contains the dialect, the code-switching, and the brand-named entities the deployment actually encounters. We construct one in three weeks per dialect: a 1,200-utterance reference set across MSA, Khaleeji, Levantine, and Egyptian, balanced for gender, age band, channel quality, and noise profile. The set lives in the client's environment, refreshes quarterly, and runs as a CI step on every model push.
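A minimal sketch of how a calibration set like this could run as a CI step. It is an illustration, not the guide's companion code: the `Utterance` fields, the `balance_report` and `ci_gate` helpers, the use of word error rate as the metric, and the `max_wer` and `tolerance` thresholds are all assumptions chosen for the example.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Utterance:
    """One calibration-set entry. Field names are hypothetical."""
    dialect: str      # e.g. "msa", "khaleeji", "levantine", "egyptian"
    gender: str
    age_band: str
    reference: str    # human-verified transcript
    hypothesis: str   # model output for the same audio


def balance_report(utterances, field, tolerance=0.10):
    """Return strata whose share deviates from a uniform split by more than `tolerance`."""
    counts = Counter(getattr(u, field) for u in utterances)
    expected = 1.0 / len(counts)
    total = len(utterances)
    return {
        value: count / total
        for value, count in counts.items()
        if abs(count / total - expected) > tolerance
    }


def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            # deletion, insertion, substitution (cost 0 when the words match)
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + (r != h)))
        prev = curr
    return prev[-1] / max(len(ref), 1)


def ci_gate(utterances, max_wer=0.25):
    """Return dialects whose mean WER exceeds `max_wer`. An empty dict means pass."""
    by_dialect = {}
    for u in utterances:
        by_dialect.setdefault(u.dialect, []).append(
            word_error_rate(u.reference, u.hypothesis)
        )
    means = {d: sum(v) / len(v) for d, v in by_dialect.items()}
    return {d: m for d, m in means.items() if m > max_wer}
```

In a pipeline, the model push would fail whenever `ci_gate` returns a non-empty dict, and `balance_report` would be run at each quarterly refresh to catch drift in the gender, age-band, or dialect mix.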

The detail that matters: Khaleeji morphology produces dozens of valid surface forms for the same lemma. NER trained on MSA collapses them into a single noisy class. The Khaleeji Benchmark v2 splits them. The benchmark scores the same models 9–14 F1 points lower on Khaleeji than on MSA — a gap that is invisible in the published model cards because the model cards report MSA.
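The gap described above only becomes visible when F1 is computed per dialect instead of over the pooled test set. The sketch below shows one way to do that, assuming entities are represented as `(sentence_id, start, end, label)` tuples and scored by exact match. The function names and the tuple layout are illustrative and are not taken from the Khaleeji Benchmark repository.

```python
def entity_f1(gold, predicted):
    """Exact-match, micro-averaged F1 over (sentence_id, start, end, label) tuples."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


def dialect_gap(scores, baseline="msa"):
    """F1 points (0-100 scale) that each dialect loses relative to the baseline."""
    return {
        dialect: round(100 * (scores[baseline] - score), 1)
        for dialect, score in scores.items()
        if dialect != baseline
    }


def score_by_dialect(gold_by_dialect, predicted_by_dialect):
    """Compute entity F1 separately for each dialect's slice of the test set."""
    return {
        dialect: entity_f1(gold, predicted_by_dialect.get(dialect, []))
        for dialect, gold in gold_by_dialect.items()
    }
```

Feeding the output of `score_by_dialect` into `dialect_gap` reports the per-dialect shortfall against the MSA baseline, which is the figure a model card that only reports MSA leaves out.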

Chapter continues — pages 28-38 — Khaleeji NER feature engineering · dialect classifier · evaluation gates.

What readers say

Three pull-quotes from practitioners.

Anonymised at the request of their employers — role and country preserved.

UAE
“Forwarded the Arabic NLP guide to my team on Friday afternoon. By Monday the architecture discussion was four months ahead. The dialect chapter alone justifies the download.”
Head of AI, UAE federal entity
KSA
“The RAG governance guide reads like an audit working paper. We lifted six of the controls into our model-risk register and the second-line lead waved them through.”
Lead ML Engineer, GCC bank
Qatar
“Notebook-to-90-days is the only schedule I have seen from a vendor that actually accounts for the procurement freeze in week six. The steering deck template went straight into our PMO.”
Director of Data Platform, regional telco

Quarterly digest

One email a quarter. New guide, short editorial note. That is it.

Prefer chat? Message us on WhatsApp — a senior engineer answers.

Next guide

“Continuous Pre-Training of Arabic LLMs on a Sovereign Cluster” — drops Q2 2026.

60 pages, authored by Yasmin Al Marzooqi and Tareq Ibrahim. Companion code lives in github.com/brocode-ai/continuous-pretrain.

Browse adjacent insights

Related capabilities and stories

Insights hub

AI glossary

Open-source contributions

NLP services

MLOps & AI Infrastructure