62 terms
Foundation models
LLM, transformer, attention, embedding, RAG, context window, agent, tool use.
Browse this categoryThe Brocode AI Glossary · 240 terms · 5 categories
Plain English, a one-sentence GCC example, and a one-line note on why it matters to your business. No Wikipedia walls of text. Arabic-script parity on Arabic-specific entries. Reviewed quarterly by a named principal engineer.
How an entry is built
The shortest accurate answer, in the same shape every time. Definition. Example. Why it matters.
Block 1 · Plain definition
Forty words or fewer. No jargon. No analogies that require a second analogy. A non-technical reader can quote it into a board paper.
Block 2 · UAE / GCC example
One sentence, regionally grounded. A UAE bank, a federal entity, a Saudi retailer, a Qatari hospital — the example is recognisable to a GCC reader within five seconds.
Block 3 · Why it matters to your business
One sentence on the commercial or operational consequence. If the visitor remembers nothing else, this is the line they remember.
240
Terms in the library
5
Categories — foundation to Arabic-specific
Qtr
Reviewed every quarter by named engineers
60 s
Average time to read an entry
Most-read this month
Browse by category
62 terms
LLM, transformer, attention, embedding, RAG, context window, agent, tool use.
Browse this category48 terms
Supervised, unsupervised, reinforcement, gradient descent, overfitting, ROC.
Browse this category54 terms
Training, inference, drift, observability, model registry, vector database.
Browse this category40 terms
Model risk, bias, fairness, audit trail, data residency, TDRA, CBUAE, FSRA.
Browse this category36 terms
Khaleeji, MSA, NER, tashkeel, tatweel — with Arabic-script parity.
Browse this categorySample entries — read inline
The first chapter of the glossary, rendered openly. Arabic-script parity is shown on Arabic-specific terms.
Reviewed by Yasmin Al Marzooqi, Head of Arabic NLP — last refresh February 2026.
Arabic-specific entries — English and Arabic side-by-side
Where the term is anchored in the regional context, the Arabic rendering sits beside the English. Reviewed by a native MSA editor.
العربية الخليجية
The cluster of Gulf Arabic dialects spoken across the UAE, KSA, Qatar, Bahrain, Kuwait, and Oman. Distinct from Modern Standard Arabic in vocabulary, morphology, and phonology.
Why it matters: Most production AI for GCC customers fails on Khaleeji unless explicitly tuned for it.
العربية الفصحى الحديثة
The pan-Arab literary and broadcast register, used in print, news, and official documents. Almost nobody speaks it in everyday conversation.
Why it matters: Most published Arabic NLP benchmarks score MSA — your production data is rarely MSA.
التشكيل
Diacritical marks that disambiguate Arabic vowels and grammatical roles. Usually absent in everyday text; sometimes essential for disambiguation.
Why it matters: Tashkeel handling is a tokenisation question with downstream model-quality effects.
Free download
A 24-page downloadable distillation of the 60 most-asked terms, formatted for printing or reading on a phone. The three-block format is preserved. The Arabic-script parity is preserved.
Monthly vocabulary email
One short email per month. You can unsubscribe in one click, and we will not put you on a sales rotation.
Glossary editorial
Common requests in the last quarter: speculative decoding, ICV (in-country value), tatweel, prompt injection, and federated learning. Three of these will be in the next refresh.
Suggest a termNext steps
Whitepapers, guides, FAQs, blog, case studies — the wider library.
Read moreEach glossary term links to the guide that goes deeper.
Read moreWhen the question moves from "what is this?" to "how does Brocode answer this?".
Read moreFor Arabic-specific terms with a commercial reflection.
Read moreFor LLM-related terms with a commercial reflection.
Read more