AI Glossary
475 AI and tech terms explained in plain language. No jargon about jargon.
Showing 475 of 475 terms
A/B testing
Comparing two versions of something by randomly showing each to different users and measuring which performs better.
ablation
Systematically removing parts of a model to understand each component's contribution to performance.
ablation study
An experiment that systematically removes or changes components of an AI system to understand each part's contribution.
activation function
A mathematical function in a neural network that determines whether a neuron should fire. Common ones include ReLU and sigmoid.
active learning
A training strategy where the model identifies which unlabelled examples would be most valuable to learn from next.
adaptive learning
Educational technology that adjusts content difficulty and sequence based on a student's performance and needs.
adversarial attack
Deliberately crafted inputs designed to fool or manipulate an AI system into making mistakes.
agent
An AI system that can perceive its environment, make decisions, and take actions to achieve goals.
agentic AI
AI that can independently plan, make decisions, and take actions to complete multi-step tasks with minimal human guidance.
AGI
Artificial General Intelligence. A hypothetical AI system that can match human-level intelligence across all cognitive tasks, not just specific ones.
AI Act
The European Union's comprehensive regulatory framework for artificial intelligence, the world's first major AI law.
AI agent framework
Software libraries and tools that make it easier to build AI agents capable of using tools and taking actions.
AI audit
A systematic evaluation of an AI system's performance, fairness, safety, and compliance with regulations.
AI chip
A semiconductor specifically designed or optimised for running AI workloads efficiently.
AI city
Urban development projects integrating AI throughout city infrastructure for traffic, energy, safety, and public services.
AI companion
An AI designed primarily for ongoing social interaction, emotional support, or entertainment.
AI copilot
An AI assistant that works alongside a human, providing suggestions and handling routine tasks while the human stays in control.
AI ethics board
A committee within an organisation or government tasked with reviewing and guiding the ethical use of AI.
AI for good
The movement and initiatives applying AI to address social, environmental, and humanitarian challenges.
AI governance
The policies, standards, and oversight structures for managing how AI systems are developed and deployed.
AI in agriculture
Applying AI for crop monitoring, yield prediction, pest detection, and precision farming.
AI in education
Using AI to personalise learning experiences, automate grading, and provide intelligent tutoring.
AI in finance
The application of AI in banking, trading, risk assessment, fraud detection, and financial planning.
AI liability
Legal questions about who is responsible when an AI system causes harm or makes a mistake.
AI literacy
The ability to understand, use, and think critically about AI technologies and their implications.
AI maturity
An organisation's level of sophistication in adopting, integrating, and benefiting from AI technologies.
AI readiness
How prepared an organisation, sector, or country is to adopt and benefit from artificial intelligence.
AI safety
Research focused on ensuring AI systems behave as intended without causing unintended harm.
AI Singapore
Singapore's national programme to build AI capabilities, catalyse AI adoption, and grow the local AI ecosystem.
AI tutor
An AI system that provides personalised educational instruction, adapting to a student's learning pace and style.
AI washing
Exaggerating or falsely claiming that a product uses AI when it does not, or overstating AI capabilities for marketing purposes.
AI winter
Historical periods when interest, funding, and progress in AI research declined significantly.
AI-native
A product or company built from the ground up with AI at its core, rather than adding AI to an existing system.
algorithm
A set of step-by-step instructions that tells a computer how to solve a problem or complete a task.
algorithmic accountability
Holding organisations responsible for the decisions their AI systems make, especially those affecting people's lives.
algorithmic trading
Using AI and algorithms to execute financial trades automatically based on predefined strategies and market data.
Alibaba Cloud
Alibaba's cloud computing division, which developed the Qwen (Tongyi Qianwen) family of AI models.
alignment
Ensuring AI systems pursue goals that genuinely match human intentions, values, and well-being.
AlphaFold
Google DeepMind's AI system that predicts protein structures with remarkable accuracy, a breakthrough for biology.
AlphaGo
Google DeepMind's AI that defeated world champion Go players, a milestone in AI history.
analytics
The systematic analysis of data to discover patterns, extract insights, and support decision-making.
annotation
Adding metadata or labels to data points, such as drawing boxes around objects in images or tagging sentiment in text.
anomaly detection
Identifying unusual patterns or outliers in data that do not conform to expected behaviour.
Anthropic
An AI safety company founded by former OpenAI researchers. Creator of the Claude family of AI assistants.
API
Application Programming Interface. A standardised way for different software systems to communicate with each other.
API rate limiting
Restrictions on how many requests can be made to an API within a given time period.
artificial intelligence
The broad field of computer science focused on creating systems that can perform tasks typically requiring human intelligence.
ASEAN AI Guide
The ASEAN Guide on AI Governance and Ethics, a voluntary framework for responsible AI across Southeast Asian nations.
ASI
Artificial Superintelligence. A theoretical AI that surpasses human intelligence in every domain. Remains purely speculative.
ASIC
Application-Specific Integrated Circuit. A chip custom-designed for a particular task, such as AI inference, offering high efficiency.
attention head
A single attention mechanism within a multi-head attention layer, each learning to focus on different aspects of the input.
attention mechanism
The component that lets a model focus on the most relevant parts of its input when making predictions.
AUC
Area Under the Curve. A single number summarising model performance from the ROC curve. Closer to 1 is better.
autoencoder
A neural network that learns to compress data into a smaller representation and then reconstruct it, useful for denoising and anomaly detection.
AutoML
Automated Machine Learning. Tools that automate the process of building, selecting, and tuning machine learning models.
autonomous
Operating independently without human control. Applies to vehicles, drones, robots, and software agents.
autonomous AI
AI systems that operate with minimal or no human intervention over extended periods.
autonomous vehicles
Cars, trucks, drones, or other vehicles that use AI to navigate and drive without human intervention.
autoregressive model
A model that generates output one token at a time, with each new token depending on all previously generated tokens.
avatar
A digital representation of a user or AI character in virtual environments, games, or communication platforms.
backpropagation
The process of calculating how much each weight contributed to an error, then adjusting them to improve accuracy.
Baidu
China's leading search and AI company, developer of the Ernie large language model series.
batch normalisation
A technique that normalises inputs to each layer during training, making networks train faster and more stably.
batch processing
Processing large volumes of data in groups at scheduled intervals rather than in real-time.
batch size
The number of training examples processed together before updating the model's weights.
Bayesian methods
Statistical approaches that update predictions as new evidence becomes available, based on probability theory.
Bayesian optimisation
A strategy for efficiently finding the best settings or parameters by building a model of the objective function.
beam search
A text generation strategy that keeps track of multiple possible outputs simultaneously and selects the overall best one.
benchmark
A standardised test or dataset used to compare the performance of different AI models on specific tasks.
BERT
Bidirectional Encoder Representations from Transformers. Google's influential model that understands text by reading in both directions.
bias
When an AI system produces unfair or skewed results, often because its training data reflects existing human prejudices.
bias-variance tradeoff
The balance between a model being too simple (high bias) and too complex (high variance). Finding the sweet spot is key.
big data
Datasets so large or complex that traditional tools struggle to process them, often described by volume, velocity, and variety.
black box
An AI model whose internal decision-making process is opaque and difficult for humans to understand.
BPE
Byte Pair Encoding. A common tokenisation method that builds a vocabulary by repeatedly merging the most frequent pairs of characters.
ByteDance
Chinese technology company behind TikTok and Douyin, with major investments in AI for content recommendation and creation.
carbon footprint
The total greenhouse gas emissions caused by training and running an AI model.
catastrophic forgetting
When a neural network trained on new data loses its ability to perform previously learned tasks.
causal inference
AI methods for determining cause-and-effect relationships rather than just correlations in data.
chain-of-thought
A prompting technique that asks the AI to show its reasoning step by step, improving accuracy on complex problems.
chatbot
An AI-powered software application that conducts conversation with users through text or voice.
ChatGPT
OpenAI's conversational AI product built on GPT models. Popularised the use of AI chatbots worldwide from late 2022.
Chinchilla scaling
Research from DeepMind showing that training smaller models on more data is often more efficient than simply making models larger.
Chinese room argument
A philosophical thought experiment arguing that a computer following rules does not truly understand language.
chip shortage
Insufficient global supply of advanced semiconductors, constraining AI development and deployment.
chunking
Breaking long documents into smaller, manageable pieces for processing by AI models with limited context windows.
churn prediction
Using AI to identify customers likely to stop using a product or service, enabling proactive retention efforts.
CI/CD
Continuous Integration and Continuous Deployment. Automated processes for testing and releasing software updates frequently.
classification
A machine learning task where the model assigns inputs to predefined categories, like spam vs. not spam.
Claude
Anthropic's family of AI assistants, designed with a focus on safety, helpfulness, and harmlessness.
clinical trial
Using AI to optimise patient recruitment, data analysis, and monitoring in pharmaceutical testing.
CLIP
Contrastive Language-Image Pre-training. An OpenAI model that learns to connect images and text descriptions.
closed source model
An AI model whose architecture, weights, and training data are kept proprietary and accessed only through an API.
cloud computing
Using remote servers hosted on the internet to store, manage, and process data instead of a local computer.
clustering
Grouping similar data points together without predefined labels. Used for customer segmentation and anomaly detection.
CNN
Convolutional Neural Network. A type of neural network designed for processing grid-like data such as images.
code generation
AI writing computer code based on natural language descriptions or partial code inputs.
cognitive AI
AI systems designed to simulate human thought processes such as reasoning, understanding, learning, and decision-making. Unlike narrow AI that focuses on a single task, cognitive AI integrates multiple capabilities to interpret complex, ambiguous information in ways that mirror human cognition.
collaborative filtering
A recommendation technique that predicts what a user might like based on the preferences of similar users.
compute
The processing power needed to train and run AI models. A key bottleneck and cost factor in AI development.
compute governance
Policies controlling access to the large-scale computing resources needed to train powerful AI models.
computer vision
The field of AI that enables computers to interpret and understand visual information from images and videos.
confusion matrix
A table showing how often a classification model correctly and incorrectly predicts each category.
consent
In data privacy, the requirement to get explicit permission from individuals before collecting or using their personal data.
constitutional AI
An approach where AI is trained to follow a set of written principles, using AI feedback to improve its own behaviour.
containerisation
Packaging software and its dependencies into isolated units (containers) that run consistently across different environments.
content authenticity
Technologies and standards for proving whether content is genuine, AI-generated, or manipulated.
content filtering
Systems that screen AI outputs to block harmful, offensive, or inappropriate content before it reaches users.
content moderation
Using AI to review and filter user-generated content on platforms to enforce community guidelines.
context window
The maximum amount of text an AI model can consider at once. Larger windows allow the model to work with longer documents.
continual learning
Enabling AI models to learn new tasks or information over time without forgetting what they previously learned.
contrastive learning
A training method where the model learns by comparing similar and dissimilar pairs of examples.
conversational AI
AI systems designed to engage in natural, human-like dialogue across text and voice channels.
conversational search
Using natural language questions to search for information, with the AI understanding intent rather than just matching keywords.
copyright and AI
Legal debates around whether AI-generated content can be copyrighted and whether AI training on copyrighted material is lawful.
coreference resolution
The task of determining which words in a text refer to the same entity, like linking 'she' to 'Dr. Chen'.
cosine similarity
A measure of how similar two vectors are by comparing the angle between them, regardless of their magnitude.
cross-entropy loss
A loss function commonly used for classification tasks that measures the difference between predicted and actual probability distributions.
cross-modal
Relating information across different data types, such as matching images to text descriptions.
cross-validation
A technique for assessing how well a model generalises by testing it on different subsets of the data.
CUDA
NVIDIA's parallel computing platform that allows software to use GPU acceleration for AI and scientific computing.
curriculum learning
Training a model by starting with simpler examples and progressively introducing more complex ones.
DALL-E
OpenAI's AI system that generates images from text descriptions.
data augmentation
Artificially expanding a training dataset by creating modified versions of existing data, such as flipping or rotating images.
data card
Documentation describing a dataset's composition, collection methods, intended uses, and known limitations.
data centre
A facility housing computer servers and networking equipment used to store, process, and serve data and AI workloads.
data drift
When the statistical properties of real-world data change over time, causing AI model performance to degrade.
data engineering
Building and maintaining the systems and infrastructure needed to collect, store, and process data at scale.
data flywheel
A cycle where more users generate more data, which improves the AI product, which attracts more users.
data governance
Policies and processes for managing data quality, access, security, and compliance across an organisation.
data labelling
The process of annotating raw data with meaningful tags or categories so it can be used for supervised learning.
data lake
A centralised storage system that holds vast amounts of raw data in its original format until needed.
data pipeline
An automated workflow that moves data from source systems through processing steps to its final destination.
data poisoning
An attack where an adversary intentionally corrupts training data to make the resulting AI model behave incorrectly.
data preprocessing
Cleaning, transforming, and preparing raw data before feeding it into an AI model for training.
data protection
Laws and practices safeguarding personal information from misuse, unauthorised access, or breach.
data science
An interdisciplinary field combining statistics, programming, and domain expertise to extract insights from data.
data sovereignty
The principle that data is subject to the laws and regulations of the country where it is collected or stored.
data warehouse
A structured storage system optimised for querying and analysing large volumes of processed, organised data.
decision tree
A model that makes predictions by following a tree-like series of yes/no questions about the data.
deep learning
A type of machine learning using neural networks with many layers to learn complex patterns from large amounts of data.
deepfake
AI-generated or AI-manipulated media, typically video or audio, that convincingly depicts something that did not happen.
DeepSeek
A Chinese AI lab that gained attention for producing highly capable open-weight models at significantly lower training costs.
demand forecasting
Using AI to predict future customer demand for products or services based on historical data and trends.
democratizing AI
Making AI tools and capabilities accessible to non-experts, smaller organisations, or underserved communities.
depth estimation
AI inferring the distance of objects from the camera in a 2D image, creating a sense of 3D understanding.
differential privacy
A mathematical technique that adds carefully calibrated noise to data, allowing useful analysis while protecting individual privacy.
diffusion model
An AI system that generates images by starting with pure noise and gradually removing it, guided by text prompts or other inputs.
digital human
An AI-powered virtual character with realistic human appearance, used in customer service, entertainment, or education.
digital twin
A virtual replica of a physical object, process, or system that is continuously updated with real-world data.
dimensionality reduction
Techniques for reducing the number of input variables in a dataset while preserving important information.
distillation
Training a smaller, faster student model to replicate the behaviour of a larger, more capable teacher model.
distributed training
Splitting the training of a large AI model across multiple GPUs or machines to speed up the process.
document AI
AI systems specialised in understanding, extracting, and processing information from documents.
drone
An unmanned aerial vehicle (UAV) that can be remotely controlled or fly autonomously using AI.
dropout
A training technique where random neurons are temporarily disabled to prevent the network from relying too heavily on any single path.
drug discovery
Using AI to identify potential new medicines and accelerate pharmaceutical research and development.
dual-use
Technology that can be applied for both beneficial civilian purposes and potentially harmful military or surveillance uses.
edge AI
Running AI directly on devices like phones, cameras, or sensors instead of sending data to the cloud.
efficient AI
AI models and methods designed to achieve good performance while using less compute, energy, and data.
embedding
Converting text, images, or other data into numerical vectors that capture their meaning, allowing AI to compare and reason about them.
embodied AI
AI systems that exist within and interact with the physical world through robot bodies or other physical forms.
emergent behaviour
Unexpected capabilities that appear in AI models as they scale up, not explicitly programmed or anticipated.
emotion recognition
AI that attempts to identify human emotions from facial expressions, voice, or text. A controversial application.
encoder-decoder
An architecture where one network compresses input into a representation (encoder) and another generates output from it (decoder).
endpoint
A specific URL or address where an API can be accessed to send or receive data.
energy consumption
The significant electricity required to train and run AI models, raising environmental sustainability concerns.
ensemble methods
Combining multiple AI models to produce better results than any single model alone.
epoch
One complete pass through the entire training dataset during the training process.
Ernie
Baidu's family of large language models, one of the leading LLMs developed in China.
ethical AI
AI designed and used in ways that align with moral principles, prioritising fairness and transparency.
ETL
Extract, Transform, Load. The process of pulling data from sources, converting it into a useful format, and storing it for analysis.
existential risk
The possibility that very advanced AI could pose a fundamental threat to human civilisation. A topic of active debate.
explainability
The ability to understand and describe how an AI system reached a particular decision or output.
export controls
Government restrictions on selling or sharing certain AI technologies, hardware, or data with other countries.
F1 score
A balanced metric that combines precision and recall into a single number between 0 and 1.
facial recognition
AI that identifies or verifies a person by analysing the unique features of their face.
fairness
Ensuring AI systems treat different groups of people equitably and do not discriminate.
feature engineering
The process of selecting, transforming, or creating input variables to improve a model's performance.
feature store
A centralised system for storing and serving the processed input data (features) used by machine learning models.
federated learning
Training AI across many devices or servers without centralising the raw data, preserving privacy.
few-shot
Providing a model with a small number of examples in the prompt to help it understand the desired task or format.
few-shot learning
The ability to learn new concepts or tasks from just a small handful of examples.
fine-grained
Highly detailed or specific, as opposed to broad or coarse. Often refers to classification or permissions.
fine-grained control
The ability to precisely adjust specific aspects of an AI system's behaviour or outputs.
fine-tuning
Adapting a pre-trained AI model with specialised data to improve its performance on a particular task or domain.
fine-tuning as a service
Cloud platforms that allow customers to fine-tune foundation models on their own data without managing infrastructure.
fintech
Financial technology. Companies using technology to improve and automate financial services.
FLOPS
Floating Point Operations Per Second. A measure of computing power, often used to describe AI hardware performance.
flywheel
A self-reinforcing business cycle where each component strengthens the others, creating accelerating momentum.
foundation model
A large AI model trained on broad data that can be adapted for many different specific tasks.
FPGA
Field-Programmable Gate Array. A chip that can be reprogrammed after manufacturing, offering flexibility for AI workloads.
fraud detection
Using AI to identify suspicious transactions or activities that may indicate fraud.
frontier model
The most capable AI models at the cutting edge of performance, typically requiring enormous compute to train.
function calling
A specific capability where an AI model can invoke predefined functions or APIs as part of generating a response.
GAN
Generative Adversarial Network. Two neural networks competing against each other: one creates fake data, the other tries to detect the fakes.
GDPR
General Data Protection Regulation. The EU's landmark data privacy law that has influenced privacy regulations worldwide, including across Asia.
Gemini
Google DeepMind's family of multimodal AI models, integrated across Google's products and services.
generative AI
AI systems that create new content such as text, images, code, audio, and video by learning patterns from large datasets. Also known as gen AI. Prominent examples include large language models like GPT and Claude, and image generators like DALL-E and Midjourney.
generative AI
AI that creates new content such as text, images, music, video, or code, rather than just analysing existing data.
geospatial AI
AI applied to geographic and location-based data for mapping, urban planning, and environmental monitoring.
GitHub
The world's largest platform for hosting, sharing, and collaborating on software code.
GitHub Copilot
An AI-powered coding assistant built on LLMs that suggests code completions and generates code from natural language.
Gojek
Indonesian super-app leveraging AI across ride-hailing, payments, food delivery, and logistics services.
Google DeepMind
Google's AI research lab, formed by merging Google Brain and DeepMind. Behind Gemini, AlphaFold, and AlphaGo.
GPT
Generative Pre-trained Transformer. OpenAI's family of large language models that generate text by predicting the next token.
GPT-4
OpenAI's multimodal large language model released in 2023, capable of processing both text and images.
GPU
Graphics Processing Unit. Powerful chips originally designed for graphics that are now essential for training and running AI models.
Grab
Southeast Asian super-app using AI for ride-hailing, food delivery, payments, and fraud detection.
gradient descent
The optimisation method used to minimise a model's errors by adjusting weights in the direction that reduces loss.
graph neural network
A neural network designed to work with data structured as graphs, such as social networks or molecular structures.
green AI
Research and practices aimed at reducing the environmental impact of developing and running AI systems.
ground truth
The verified, correct answer or label that a model's predictions are compared against.
grounding
Connecting AI outputs to verified sources of information to reduce hallucinations and improve factual accuracy.
GRU
Gated Recurrent Unit. A simplified version of LSTM that is faster to train while achieving similar performance.
guardrails
Safety constraints and rules built into AI systems to prevent harmful, inappropriate, or off-topic outputs.
hallucination
When AI generates confident-sounding but factually incorrect or fabricated information.
Hugging Face
A platform and community hub for sharing, discovering, and deploying open-source AI models and datasets.
human-in-the-loop
AI systems designed to include human oversight, review, or approval at critical decision points.
human-on-the-loop
AI operates autonomously but a human monitors and can intervene if something goes wrong.
hybrid search
Combining traditional keyword search with semantic vector search for more accurate results.
hype cycle
The pattern of inflated expectations followed by disillusionment and eventual productive adoption of a new technology.
hyperparameters
Settings chosen by developers before training begins, such as learning rate or batch size, that control how training works.
hyperscaler
A massive cloud computing provider like AWS, Microsoft Azure, or Google Cloud that operates at enormous scale.
image captioning
AI generating natural language descriptions of what is happening in an image.
image generation
AI creating new images from text descriptions, sketches, or other inputs.
image recognition
AI identifying what is depicted in an image, such as recognising objects, scenes, or activities.
image segmentation
AI dividing an image into distinct regions, assigning each pixel to a specific object or category.
impact assessment
A formal evaluation of the potential effects of an AI system on individuals, society, and the environment before deployment.
in-context learning
An LLM's ability to learn new tasks from examples provided within the prompt, without any additional training.
in-context learning window
The portion of a model's context window used to provide examples and instructions that guide the model's behaviour.
inference
When a trained AI model processes new input and produces output. The actual moment the AI does its work.
inference cost
The ongoing expense of running a trained AI model to process real-world requests, often the largest operational cost.
inpainting
AI filling in missing or selected parts of an image with contextually appropriate content.
instruction tuning
Training a model to follow natural language instructions more reliably, making it more useful as an assistant.
interpretability
The degree to which a human can understand the internal workings and reasoning of an AI model.
IoT
Internet of Things. A network of physical devices embedded with sensors and software that connect and exchange data over the internet.
jailbreak
A prompt designed to bypass an AI model's safety guardrails and make it produce restricted content.
JSON
JavaScript Object Notation. A lightweight data format commonly used for structured communication between systems and AI APIs.
K-means
A popular clustering algorithm that groups data into K clusters based on similarity to cluster centres.
knowledge base
An organised repository of information that AI systems can query to provide accurate, factual answers.
knowledge distillation
Training a smaller, faster model to mimic the behaviour of a larger, more capable model.
knowledge graph
A structured database that stores information as a network of interconnected entities and their relationships.
Kubernetes
An open-source platform for automating the deployment, scaling, and management of containerised applications.
LangChain
A popular open-source framework for building applications powered by large language models, especially agents and RAG systems.
language modelling
Predicting the next word in a sequence based on the words that came before. The fundamental task behind LLMs.
latency
The delay between sending a request to an AI system and receiving a response. Lower latency means faster results.
latent space
The compressed, abstract representation of data that AI models work with internally. A lower-dimensional map of complex data.
layer normalisation
A technique that normalises the inputs across features within a single training example, commonly used in transformers.
leaderboard
A public ranking of AI models based on their performance on standardised benchmarks.
learning rate
How much a model adjusts its weights with each training step. Too high causes instability, too low causes slow learning.
LIDAR
Light Detection and Ranging. A sensor technology that uses laser pulses to create detailed 3D maps of the surrounding environment.
Llama
Meta's family of open-weight large language models, widely used by researchers and developers worldwide.
LLM
Large Language Model. Software trained on massive amounts of text data to understand and generate human-like language.
LoRA
Low-Rank Adaptation. An efficient fine-tuning method that trains only a small number of additional parameters instead of the full model.
loss function
A mathematical formula that measures how far off a model's predictions are from the correct answers. The model tries to minimise this.
low-code
Development platforms requiring minimal hand-written code, using visual tools and pre-built components instead.
LSTM
Long Short-Term Memory. A type of RNN designed to remember information over long sequences, solving the vanishing gradient problem.
machine learning
A subset of AI where software improves at tasks by learning from data rather than following explicit programming.
machine translation
Using AI to automatically translate text from one language to another.
masked image modelling
A training task where parts of an image are hidden and the model learns to predict the missing regions.
masked language modelling
A training task where random words are hidden and the model learns to predict them from surrounding context. Used by BERT.
membership inference
An attack that determines whether a specific data point was used in training a model, potentially compromising privacy.
memory
An AI system's ability to store and recall information from previous interactions or long-running tasks.
Meta AI
Meta's artificial intelligence research division, creator of the open-weight Llama language models.
meta-learning
Teaching AI systems to learn how to learn more efficiently, often called 'learning to learn'.
microservices
An architecture where software is built as a collection of small, independent services rather than one large application.
Midjourney
An independent AI image generation service known for producing highly artistic and stylised visuals.
Mistral
A French AI company and its family of efficient open-weight language models known for strong performance relative to their size.
mixture of agents
A system architecture where multiple specialised AI models collaborate, each contributing their expertise.
mixture of experts
An architecture that routes each input to specialised sub-models (experts), activating only a fraction of the total parameters for efficiency.
MLOps
Machine Learning Operations. Practices for deploying, monitoring, and maintaining AI models in production reliably.
MMLU
Massive Multitask Language Understanding. A popular benchmark testing AI knowledge across 57 academic subjects.
moat
A competitive advantage that protects a business from rivals and is difficult to replicate.
model
A trained AI system that has learned patterns from data and can make predictions or generate outputs based on new inputs.
model card
A standardised document that describes an AI model's intended use, performance, limitations, and ethical considerations.
model collapse
When AI trained on AI-generated data gradually degrades in quality and diversity over successive generations.
model context protocol
A standard for connecting AI models to external tools, data sources, and services in a consistent way.
model drift
When an AI model's predictions become less accurate over time as the real world changes around it.
model evaluation
Systematically testing an AI model's performance, capabilities, and limitations using standardised methods.
model extraction
An attack that recreates a proprietary AI model by systematically querying it and analysing its outputs.
Model Garden
A curated collection of pre-trained AI models offered by cloud providers, allowing developers to easily test and deploy various models.
model marketplace
A platform where developers can discover, compare, and deploy AI models from various providers.
model registry
A centralised repository for storing, versioning, and managing trained AI models.
model serving
The infrastructure and process for making a trained AI model available to handle real-world requests.
multi-agent system
Multiple AI agents working together, each with specialised roles, to solve complex problems collaboratively.
multi-head attention
Running multiple attention operations in parallel, allowing the model to capture different types of relationships simultaneously.
multi-turn conversation
An AI interaction spanning multiple back-and-forth exchanges, where context from earlier turns informs later responses.
multimodal
AI that can process and generate multiple types of content such as text, images, audio, and video.
multimodal embedding
Converting different types of data (text, images, audio) into a shared numerical space for comparison.
multimodal learning
Training AI to understand and connect information across different types of data such as text, images, and audio.
MVP
Minimum Viable Product. The simplest functional version of a product, built to test the core idea with real users.
NAIRR
National AI Research Resource. Proposed US initiative to democratise access to AI compute and data for researchers. Similar programmes exist across Asia.
named entity recognition
Identifying and classifying key elements in text such as people, organisations, locations, and dates.
narrow AI
AI designed to perform a single specific task well, like playing chess or recognising faces. All current AI systems are narrow AI.
narrow intelligence
AI that excels at one specific task but cannot generalise to other domains. Describes all current AI systems.
network effect
When a product becomes more valuable as more people use it, such as social media platforms or messaging apps.
neural architecture search
Using AI to automatically design the structure of neural networks, rather than relying on human intuition.
neural network
A computing system loosely inspired by how brain cells connect, used to find patterns in data. The building block of deep learning.
NLP
Natural Language Processing. The field of AI focused on enabling computers to understand, interpret, and generate human language.
no-code AI
AI platforms that allow users to build and deploy models through visual interfaces without writing any code.
normalisation
Scaling data to a standard range to help AI models train more effectively and fairly.
NPU
Neural Processing Unit. A specialised chip built into devices like smartphones to run AI tasks efficiently on-device.
NVIDIA
The leading manufacturer of GPUs used for AI training and inference. A central player in AI hardware.
object detection
AI locating and identifying multiple objects within an image, drawing bounding boxes around each one.
OCR
Optical Character Recognition. AI that reads and converts text in images or scanned documents into editable digital text.
on-device AI
AI processing that happens locally on a user's device, offering faster response times and better privacy.
one-hot encoding
Representing categorical data as binary vectors where only one element is 1 and the rest are 0.
online learning
Continuously updating a model as new data arrives, rather than retraining from scratch in batches.
ontology
A formal, structured representation of concepts and the relationships between them within a specific domain.
open foundation model
A foundation model whose weights are publicly available, enabling widespread research and commercial use.
open source
Software whose source code is freely available for anyone to view, use, modify, and distribute.
open source model
An AI model whose code and trained weights are publicly available for anyone to use, study, and modify.
open-weight
AI models whose learned numerical parameters (weights) are publicly shared, though the training code or data may not be.
OpenAI
The American AI company behind GPT, ChatGPT, DALL-E, and Sora. One of the most influential AI labs globally.
orchestration
Coordinating multiple AI models, agents, or services to work together on a complex workflow.
outpainting
AI extending an image beyond its original borders, generating new content that matches the existing scene.
overfitting
When a model memorises its training data too closely and performs poorly on new, unseen data.
parallel processing
Running multiple computations simultaneously, essential for training large AI models quickly.
parameters
The internal settings an AI model learns during training. More parameters generally means more capable but more resource-hungry.
PCA
Principal Component Analysis. A technique for simplifying complex data by reducing the number of variables while keeping the most important patterns.
PDPA
Personal Data Protection Act. Data privacy laws adopted across several Asian countries including Singapore, Thailand, and Malaysia.
perceptron
The simplest type of neural network, consisting of a single layer. The building block of more complex networks.
perplexity
A measurement of how well a language model predicts text. Lower perplexity means the model is more confident and accurate.
personalisation
Tailoring content, recommendations, or experiences to individual users based on their data and behaviour.
planning
An AI's ability to break down complex goals into a sequence of steps and execute them in the right order.
pose estimation
AI detecting the position and orientation of a person's body parts in images or video.
positional encoding
A method for giving transformer models information about the order of elements in a sequence, since they process everything in parallel.
pre-training
The initial phase where an AI model learns general knowledge from a large, broad dataset before being specialised.
precision
Of all the items the model predicted as positive, the percentage that actually were positive.
precision agriculture
Using AI, sensors, and data to optimise farming practices at a granular level for better yields and sustainability.
precision medicine
Using AI and genomic data to tailor medical treatments to individual patients.
predictive analytics
Using AI and statistical methods to analyse current and historical data to forecast future outcomes.
predictive maintenance
Using AI to predict when equipment will fail so maintenance can be performed proactively.
prescriptive analytics
AI that not only predicts what will happen but also recommends specific actions to take.
privacy by design
Building privacy protections into AI systems from the beginning, not as an afterthought.
product-market fit
The stage where a product satisfies a strong market demand, with users actively wanting and paying for it.
prompt
The text input or instruction you give to an AI model to get a desired response.
prompt chaining
Connecting multiple AI prompts in sequence, where the output of one becomes the input for the next.
prompt engineering
The practice of crafting effective instructions and inputs to get better, more reliable results from AI tools.
prompt injection
An attack where malicious instructions are hidden in input text to manipulate an AI into unintended behaviour.
provenance
Tracking the origin and history of data or AI-generated content to verify its authenticity and source.
pruning
Removing unnecessary connections or neurons from a neural network to make it smaller and faster without significant quality loss.
QLoRA
Quantized Low-Rank Adaptation. Combines quantization with LoRA to fine-tune large models on consumer hardware.
quantization
Compressing an AI model to use less memory and run faster by reducing the precision of its numerical values, with some quality trade-off.
question answering
AI systems that can read text and answer questions about it, or retrieve answers from a knowledge base.
Qwen
Alibaba Cloud's family of large language and multimodal models, also known as Tongyi Qianwen.
RAG
Retrieval-Augmented Generation. A technique where AI looks up relevant real-world information before generating an answer, reducing hallucinations.
random forest
An ensemble method that combines many decision trees to make more accurate and stable predictions.
ranking algorithm
A system that determines the order in which search results, recommendations, or content are presented to users.
real-time
Processing and responding to data immediately as it arrives, with minimal delay.
reasoning
An AI model's ability to think through problems logically, draw conclusions, and explain its thinking process.
recall
Of all the actually positive items, the percentage the model correctly identified.
recommendation system
AI that suggests relevant content, products, or actions to users based on their behaviour and preferences.
red-teaming
Deliberately testing an AI system by trying to make it fail, produce harmful outputs, or reveal vulnerabilities.
reflection
An AI evaluating its own outputs for errors and self-correcting before delivering a final answer.
regression
A machine learning task where the model predicts a continuous numerical value, like a house price.
regularisation
Techniques used to prevent overfitting by adding constraints or penalties during training.
regulatory framework
A structured set of rules, guidelines, and standards governing how AI can be developed and used.
regulatory sandbox
A supervised testing environment where companies can trial innovative AI products under relaxed regulations.
reinforcement learning
Training an AI by letting it interact with an environment and rewarding good outcomes while penalising bad ones.
reinforcement learning from human feedback
The full term for RLHF. Training AI to produce outputs that humans rate as helpful, harmless, and honest.
ReLU
Rectified Linear Unit. A simple activation function that outputs zero for negative inputs and the input itself for positive ones. Very widely used.
representation learning
Training AI to automatically discover useful representations of raw data, rather than manually designing features.
reranking
A second-stage process that refines initial search results by applying a more sophisticated relevance model.
residual network
A deep neural network architecture using skip connections that allow information to bypass layers, enabling much deeper networks.
responsible AI
Developing and deploying AI with careful consideration for ethics, fairness, safety, and societal impact.
responsible disclosure
The practice of reporting AI vulnerabilities or safety issues privately to developers before making them public.
responsible scaling
The principle of increasing AI capabilities in a measured way with appropriate safety measures at each stage.
REST API
Representational State Transfer. A common architectural style for web APIs that uses standard HTTP methods.
retrieval
The process of finding and fetching relevant information from a database or knowledge base to assist AI generation.
reward hacking
When an AI finds unexpected ways to maximise its reward signal that do not align with the intended goal.
reward model
An AI trained to predict how helpful or harmful a response is, used to guide the training of other models.
right to explanation
The principle that individuals affected by AI decisions should be able to receive a meaningful explanation of how the decision was made.
risk-based approach
Regulating AI according to the level of risk it poses, with stricter rules for high-risk applications.
RLAIF
Reinforcement Learning from AI Feedback. Using another AI model to provide training feedback instead of humans.
RLHF
Reinforcement Learning from Human Feedback. Training AI using human ratings of its outputs to make it more helpful, harmless, and honest.
RNN
Recurrent Neural Network. A neural network designed for sequential data like text or time series, processing one element at a time.
robotics
The field of designing, building, and programming physical machines that can sense, decide, and act in the real world.
ROC curve
Receiver Operating Characteristic curve. A graph showing the trade-off between true positive and false positive rates.
RPA
Robotic Process Automation. Software bots that automate repetitive, rule-based business tasks like data entry and form filling.
safety benchmark
A standardised test specifically designed to evaluate how safely an AI model behaves.
Samsung
South Korean conglomerate investing heavily in AI chips, on-device AI, and smart device integration.
sandbox
A controlled, isolated testing environment for safely experimenting with new AI technologies or regulations.
satellite imagery analysis
Using AI to interpret satellite photos for agriculture, environmental monitoring, urban planning, and disaster response.
scaling laws
Research showing predictable relationships between model size, data volume, compute budget, and AI performance.
SDK
Software Development Kit. A collection of tools, libraries, and documentation that helps developers build applications using a particular platform.
search engine
Software that indexes and retrieves relevant information from a large collection of data in response to queries.
self-attention
A mechanism where each element in a sequence learns how much to attend to every other element, enabling rich contextual understanding.
self-driving
Vehicle technology that uses AI sensors and algorithms to navigate roads without a human driver.
self-supervised learning
A training method where the model generates its own labels from the data, such as predicting masked words in a sentence.
semantic search
Searching by meaning rather than exact keywords, using embeddings to find conceptually related content.
semi-supervised learning
A training approach using a small amount of labelled data combined with a large amount of unlabelled data.
semiconductor
The base material and industry for manufacturing the chips that power AI and all modern computing.
sensor fusion
Combining data from multiple sensor types (cameras, LIDAR, radar) to create a more complete understanding of the environment.
sentiment analysis
Using AI to determine whether a piece of text expresses positive, negative, or neutral emotions.
sentiment scoring
Assigning a numerical value to text representing how positive or negative the expressed sentiment is.
seq2seq
Sequence-to-sequence. A model architecture that transforms one sequence into another, used in translation and summarisation.
simulation
Using software to model real-world environments so AI agents can learn and be tested safely before physical deployment.
single-state AI
An AI system that processes each input independently within a fixed, non-evolving state, without retaining memory or context from prior interactions. Every request is treated as a fresh transaction, making the system stateless. This contrasts with multi-turn or agentic AI systems that maintain conversational history or evolving world models.
skip connection
A shortcut that allows data to bypass one or more layers in a neural network, helping with training very deep networks.
smart city
Urban areas that use AI, IoT, and data analytics to improve infrastructure, services, and quality of life.
SoftBank
Japanese conglomerate and major technology investor with significant stakes in AI companies globally.
softmax
A mathematical function that converts a set of numbers into probabilities that sum to one, used for classification outputs.
Sora
OpenAI's text-to-video AI model capable of generating realistic video clips from text descriptions.
sovereign AI
National initiatives to develop domestic AI capabilities independent of foreign providers, ensuring technological self-reliance.
specification gaming
When an AI satisfies the literal requirement of its objective but violates the spirit of what was intended.
speech recognition
AI that identifies and processes human speech, converting it to text or commands.
speech-to-text
AI that converts spoken audio into written text. Also known as automatic speech recognition.
Stable Diffusion
An open-source AI image generation model developed by Stability AI, widely used and modified by the community.
streaming
Processing data continuously as it flows in, rather than in static batches.
structured output
AI generating responses in specific formats like JSON, tables, or code, rather than free-form text.
style transfer
AI technique that applies the visual style of one image (such as a painting) to the content of another.
summarisation
AI condensing longer text into shorter versions while retaining the key information.
super-app
A mobile application that combines many services (messaging, payments, shopping, transport) into one platform, common across Asia.
supervised learning
Training a model on labelled data where both inputs and correct outputs are provided.
supply chain optimisation
Using AI to improve efficiency, reduce costs, and manage risks across the supply chain.
support vector machine
A classification algorithm that finds the best boundary to separate different categories of data.
swarm intelligence
Coordinated behaviour of many simple agents producing complex collective intelligence, inspired by bees or ants.
synthetic data
Artificially generated data used to train AI when real data is scarce, expensive, or contains privacy concerns.
synthetic media
Any media content (text, images, audio, video) that is created or significantly modified by AI.
system message
Instructions provided to an AI model at the start of a conversation that set its behaviour and constraints.
system prompt
Hidden instructions given to an AI model that define its behaviour, personality, and constraints before the user interacts with it.
task decomposition
Breaking a complex task into smaller, manageable sub-tasks that an AI agent can handle individually.
taxonomy
A hierarchical classification system that organises concepts into categories and subcategories.
technical debt
The accumulated cost of shortcuts or quick fixes in software that will need to be addressed later.
temperature
A setting that controls how creative or random an AI's outputs are. Higher temperature means more varied responses.
Tencent
Chinese technology conglomerate with significant AI research spanning gaming, social media, and enterprise applications.
text generation
AI producing new text based on a prompt or input, ranging from completing sentences to writing entire articles.
text-to-image
AI models that generate images based on written descriptions or prompts.
text-to-speech
AI that converts written text into natural-sounding spoken audio.
text-to-video
AI systems that generate video content from text descriptions.
TF-IDF
Term Frequency-Inverse Document Frequency. A method for measuring how important a word is to a document relative to a collection.
throughput
The amount of data or number of requests an AI system can process in a given time period.
time series
Data points collected at regular intervals over time, such as stock prices, weather readings, or sensor data.
token economics
The cost structure of AI APIs based on the number of tokens processed in inputs and outputs.
token limit
The maximum number of tokens an API call can process, combining both input and output.
token window
Another term for context window. The maximum number of tokens a model can process in a single interaction.
tokeniser
Software that breaks text into smaller units (tokens) that an AI model can process. Different models use different tokenisation strategies.
tokenization
The process of breaking input data (usually text) into smaller units called tokens that an AI model can process.
tokens
Small chunks of text, such as words or word fragments, that AI language models break text into for processing.
tool augmented generation
Enhancing AI responses by giving the model access to external tools like calculators, search engines, or databases.
tool use
An AI model's ability to call external tools, APIs, or software to accomplish tasks beyond text generation.
top-k sampling
A text generation method that limits the AI to choosing from only the K most likely next words.
top-p sampling
Also called nucleus sampling. A method that limits word choices to the smallest set whose combined probability exceeds a threshold P.
TPU
Tensor Processing Unit. Google's custom chip designed specifically for machine learning workloads.
training
The process of teaching an AI model by feeding it data and adjusting its internal settings until it performs well.
training cost
The one-time expense of training an AI model, which can range from thousands to hundreds of millions of dollars for frontier models.
transfer learning
Applying knowledge gained from one task to a different but related task, saving time and data.
transformer
The neural network architecture behind most modern AI language models. Uses attention mechanisms to process all parts of the input simultaneously.
trustworthy AI
AI that is reliable, transparent, fair, and respects privacy, meeting the expectations of its users and society.
TSMC
Taiwan Semiconductor Manufacturing Company. The world's largest contract chip maker, producing most of the world's advanced AI chips.
Turing test
A test proposed by Alan Turing where a machine passes if a human cannot distinguish it from another human in conversation.
underfitting
When a model is too simple to capture the patterns in the data, performing poorly on both training and new data.
unsupervised learning
Training a model on unlabelled data to discover hidden patterns or groupings without being told the answers.
VAE
Variational Autoencoder. An autoencoder that learns a probability distribution over the compressed representation, enabling generation of new data.
value alignment
The challenge of ensuring AI systems internalise and act according to human values and preferences.
vector database
A database optimised for storing and searching high-dimensional numerical representations (embeddings) of data.
vector search
Finding the most similar items in a database by comparing their numerical representations (vectors).
version control
Systems for tracking changes to code or documents over time, allowing collaboration and rollback.
vision transformer
A transformer architecture adapted for processing images by treating image patches as tokens.
vision-language model
An AI model that can understand both images and text together, answering questions about visual content.
ViT
Vision Transformer. The original paper and model that successfully applied transformer architecture to image classification.
vocabulary
The complete set of tokens that an AI language model can recognise and work with.
voice assistant
AI software that uses speech recognition and NLP to respond to voice commands, such as Siri or Alexa.
voice cloning
AI that can replicate a specific person's voice from a small audio sample.
watermark detection
AI systems that identify hidden markers in content to determine whether it was generated by AI.
watermarking
Embedding invisible or visible markers in AI-generated content to identify its origin and help detect synthetic media.
weights
The numerical values inside a neural network that are adjusted during training to improve accuracy.
Whisper
OpenAI's open-source automatic speech recognition system that can transcribe and translate audio in multiple languages.
Word2Vec
An early and influential method for creating word embeddings where similar words have similar numerical representations.
world model
An AI's internal representation of how the world works, allowing it to predict outcomes and simulate scenarios.
XAI
Explainable AI. The field of research focused on making AI systems more transparent and understandable to humans.
YOLO
You Only Look Once. A fast object detection algorithm that processes an entire image in a single pass.
zero-shot
Asking a model to perform a task it was not specifically trained on, without providing any examples.
