From Chatbots to RL to GANs, here’s our essential AI glossary for marketers
What’s the difference between machine learning and deep learning? You’re about to find out. This glossary can serve as a linguistic primer as you begin to navigate the world of AI.
Images created by Midjourney using the prompt: “An intelligent humanoid machine holding a dictionary, sci-fi digital art.”
There’s been a lot of chatter about artificial intelligence (AI) lately and how it could either deliver a work-free paradise or escape our control and quickly escalate into a nightmare (the likes of which have been captured in countless Hollywood blockbusters like 2001: A Space Odyssey and the Terminator franchise). Regardless of where you happen to fall on the utopia-dystopia spectrum, one thing is by now abundantly clear: AI is here to stay – and it seems almost certain to transform civilization to a degree that many can scarcely imagine.
That being the case, it’s important for all of us – including marketers, whose industry is already feeling the effects of the AI revolution – to have at least a basic understanding of what AI is and how it works. That starts with understanding some of the languages that’s spoken in this strange technological territory.
Here are some critical AI terms that you need to know [we will be updating this glossary on a regular basis, so we recommend checking in on it routinely]:
A/B testing: A form of randomized experimentation wherein two variants of a particular model, A and B, are tested by a human subject to determine which of them performs better than the other.
Algorithm: A set of instructions or rules used - often by a computer - to solve a set of problems, execute calculations or process data.
AlphaGo: An AI model developed by DeepMind and designed specifically to play the ancient Chinese board game Go. In 2015, AlphaGo became the first AI model to defeat a professional human Go player (Chinese-born Fan Hui). It beat Lee Sedol, a then-professional Go player from South Korea, the following year. Sedol retired from playing professional Go in 2019, telling the South Korean media outlet Yonhap News Agency that AI specializing in the game of Go “is an entity that cannot be defeated.”
Artificial general intelligence (AGI, also sometimes referred to as Strong AI): An AI program with an intellectual ability that’s comparable to that of an average adult human. AGI, in other words, would hypothetically (we have yet to build one) be able to solve problems across a vast range of categories, just as a human brain can.
Artificial narrow intelligence (ANI, also sometimes referred to as Weak AI): An AI program built to perform a single, narrow function, such as playing chess or responding to customer service questions. All of the AI programs that have been developed to date fall into the category of ANI.
Artificial neural network (ANN): A synthetic system, roughly modeled on the architecture of organic brains, comprised of layers of artificial neurons.
Artificial superintelligence (ASI): First postulated by Oxford philosopher Nick Bostrum, “Superintelligence” is a theoretical intellect – artificial or organic – which is more advanced than that of humans. An ASI could have only a slightly higher IQ score than the average human being, or it could be vastly, unfathomably more intelligent, comparable to the difference in cognitive ability between an ant and Nobel Laureate Roger Penrose.
Association rule learning: A method of unsupervised and rule-based machine learning aimed at identifying commonalities or associations between variables in a dataset.
Automatic speech recognition (ASR – also known as computer speech recognition, speech-to-text or simply speech recognition): A machine’s capability to recognize human speech and then convert it into text. The iPhone dictation feature, for example, uses ASR.
Backpropagation: The process by which a neural network informs itself that it has made a predictive error, and subsequently corrects that error. The word “backpropagation” means roughly responding to flawed information by sending new information back in the direction of the source of the error. Sometimes colloquially referred to simply as “backprop” or “BP.”
Bayes’ theorem: Named after the 18th-century statistician Thomas Bayes, this theorem is a mathematical formula that can be used to determine what’s known as “conditional probability” – that is, the likelihood of a particular outcome based on one’s prior knowledge of a previous result that occurred in similar conditions.
Black box: A metaphor that’s invoked to describe a system whose inner workings are hidden and ultimately mysterious to the system’s creator (or creators). AI is sometimes described as a “black box” because models will often behave and evolve in ways that even the system’s programmers cannot fully understand or predict.
Central processing unit (CPU): The most important component of a digital computer. The CPU – sometimes referred to as the “brain” or the “control center” of a computer – is the locus of every digital computing system’s memory, arithmetic capabilities (adding, subtracting, multiplying and dividing), and the orchestrator of its operating system. The CPU of modern computers is built upon a microprocessor.
Chatbot: An AI-based computer program that leverages natural language processing (NLP) to field customer service questions in automated verbal or text-based responses that simulate human speech.
ChatGPT: An AI-powered chatbot launched by San Francisco-based startup OpenAI in November of 2022. ChatGPT uses NLP to simulate human conversation. According to OpenAI’s website, ChatGPT can “answer follow-up questions, admit its mistakes, challenge incorrect premises and reject inappropriate requests.”
Computer vision: A branch of AI that’s concerned with enabling machines to understand and respond to information derived from visual inputs - such as images and video - in a manner similar to that of the visual system in the human brain.
Convolutional neural network (CNN): A subset of artificial neural networks, commonly used in machine visual processing, which can enable an AI model to differentiate and analyze various components within an image.
Dall-E 2: A deep learning model developed by OpenAI and released in 2022 which generates images based on the input of text-based natural language prompts. Its predecessor is Dall-E. The name of both models is a play on both the name of the title character of the Pixar film Wall-E and the surname of the 20th-century surrealist painter Salvador Dalí.
The Dartmouth Summer Research Project on Artificial Intelligence: A conference - colloquially referred to as the Dartmouth Workshop - which began in mid-1956 at Dartmouth College and is widely considered to be the event that gave birth to AI as a field of research. The conference was organized by Marvin Minsky, John McCarthy, Nathaniel Rochester and Claude Shannon.
Deep Blue: An AI program developed by IBM, the sole purpose of which is to play chess. In 1997, it made history by becoming the first intelligent machine to beat chess master Gary Kasparov in a chess match.
Deep learning (also known as deep reinforcement learning): An extension of machine learning based on the premise that machine learning models can be made more intelligent if they’re provided with vast quantities of data. Deep learning requires neural networks of at least three layers; the more layers it’s equipped with, the better its performance will be.
DeepMind: An artificial intelligence research laboratory based in London and founded in 2010 by Demis Hassabis, Shane Legg and Mustafa Suleyman. The company was acquired by Google in 2014 and is now a wholly-owned subsidiary under Alphabet Inc.(Google’s parent company). DeepMind describes itself on its website as “a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.”
Decision tree: An imagistic illustration of the process of arriving at a decision, wherein each “branch” represents a particular course of action. Decision trees start at a “root node” (which consists of all the relevant data that’s being analyzed), branch off into “internal nodes (also known as “decision nodes”) and then terminate in “leaf nodes” (also known as “terminal nodes,” which represent all the possible outcomes of a given decision-making process).
Here’s a simple example of a decision tree rooted in the question of whether or not you should go outside to play soccer:
Entropy: In the context of machine learning, “entropy” refers to the degree of randomness, disorder and unpredictability within a dataset that’s being processed by a machine learning system. More broadly, the concept of entropy is commonly associated with the second law of thermodynamics, which essentially holds that the degree of disorder or randomness within a system will never decrease over time – it can only remain constant or increase.
Game Theory: A mathematical formula, first postulated by mathematician John von Neumann and economist Oskar Morgenstern in 1944, relating to the dynamic interaction between two or more rational agents seeking their own gains within a parameterized (rule-governed) framework. Game Theory defines a broad set of games, including zero-sum and nonzero-sum.
Generative adversarial network (GAN): A machine learning methodology wherein two neural networks compete with one another in a zero-sum game – that is, one network’s loss translates to the other’s gain, and vice versa. Both networks are provided with a dataset, and one network called the “generator” is essentially tasked with tricking the other – the “discriminator” – into believing that the new information that’s being generated is part of the original dataset. For example, the generator might generate a new image of a human face, based on many images of real human faces, at which point the discriminator will try to determine whether or not the new image is real or manufactured. This contest will continue until the generator succeeds at tricking the discriminator with the majority (more than 50 percent) of its original output. GANs were invented in 2014 by American computer scientist Ian Goodfellow, who has henceforth been dubbed “The GANfather.”
GPT-3: Generative Pre-Trained Transformer 3 (GPT-3) is an open-source large language model developed by OpenAI and released in 2020. The model is the framework for the viral chatbot ChatGPT and is able to generate text responses in natural language based on text-based prompts.
Hallucination: In an AI context, the term “hallucination” refers to any kind of output from an AI model which is seemingly inconsistent with its training data. A hallucinating AI-powered chatbot, for example, might confidently and falsely insist that there are around 5.7tn stars in the Milky Way galaxy, even though it was not trained using any astronomical data.
Human-in-the-loop (HITL): A methodology deployed in some machine learning models wherein at least one human programmer provides feedback to the model (during testing or training) to improve the model’s performance. Ideally, HITL results in a positive feedback loop that enhances the intelligence of both machines and humans.
Hyperparameter: An overarching, predominant parameter, established by a human programmer, which determines the parameters that an AI model will establish and hone by itself during its training process.
Machine learning: A subdiscipline of artificial intelligence that, using statistical formulas and data, enables computers to progressively improve their ability to carry out a particular task or set of tasks. Crucially, a computer leveraging machine learning does not need to be explicitly programmed to improve its performance in a particular manner – rather, it’s given access to data and is designed to “teach” itself. The results are often surprising to their human creators.
Machine translation (MT): An automated process that leverages AI to translate text or speech from one language into another.
Microprocessor: A CPU for digital computing systems contained within a single integrated circuit (also known as a microchip, hence the prefix in the word “microprocessor”) or a small grouping of integrated circuits. Intel introduced the world's first microprocessor, dubbed the 4004, in 1971.
Midjourney: A research lab that launched a text-to-image AI model by the same name in open beta in 2022.
Moore’s Law: A principle, based on an observation usually attributed to former Intel CEO Gordon Moore, which holds that the number of transistors that can be contained within an integrated circuit (i.e., a microchip) doubles roughly every two years.
Natural language processing (NLP): A branch of artificial intelligence – that also blends elements of linguistics and computer science – aimed at enabling computers to understand verbal and written language in a manner that imitates the human brain’s language-processing capability.
OpenAI: A non-profit AI research lab founded in 2015 by Sam Altman, Elon Musk and others. As its name suggests, the original foundational goal of OpenAI was to collaborate with other organizations in the field of AI and to open-source its research. In 2019, the organization launched a “capped profit” subsidiary called OpenAI Limited Partnership (OpenAI LP). (Musk has lamented this decision on Twitter.)
Parameter: A variable within the process of training an AI model which can be adjusted by the model in order to hone its ability to produce a particular output using a given dataset.
Pattern recognition: An automated process whereby a computer is able to identify patterns within a set of data.
Prior probability (also sometimes referred to simply as a prior): A term used in the field of Bayesian statistics to refer to the assigned likelihood of an event before (prior to) additional (posterior) information necessitates the revision of that likelihood.
Reinforcement learning (RL): The process of teaching machine learning models to make optimal decisions within a dynamic environment. When using RL, a programmer will often present a machine learning model with a game-like situation in which one outcome is preferable to others. The machine then proceeds to experiment with different strategies and the programmer will “reinforce” the desired behavior with rewards and discourage other behaviors through penalties.
Self-supervised learning: A branch of machine learning wherein an AI model is provided with unlabeled data and is allowed to label the data according to its own pattern recognition capabilities. A self-supervised algorithm will then use those initial labels as it continues to interpret subsequent iterations of data input.
Semi-supervised learning: A branch of machine learning which, as the name suggests, blends elements of both supervised learning and unsupervised learning. Semi-supervised learning is based on the input of some labeled data and a higher quantity of unlabeled data, the goal being to teach an algorithm to categorize the latter into predetermined categories based on the former, and also to allow the algorithm to identify new patterns across the dataset. It is widely considered to be a kind of bridge, connecting the benefits of supervised learning with those of unsupervised learning.
Supervised learning: A branch of machine learning based on the input of clearly labeled data and aimed at training algorithms to recognize patterns and accurately label new data.
Stochastic: A mathematical term referring to a system’s tendency to produce results that are unpredictable. (Roughly synonymous with “probabilistic,” “indeterminable” and “random.”) Many AI algorithms are programmed to incorporate some degree of randomness into their learning processes and are therefore described as stochastic. The results of a deterministic system, in contrast, can reliably be predicted beforehand.
TensorFlow: An open-source platform, developed by Google, designed for the management of machine learning and AI systems.
Turing test: A blinded experiment – invented by and named after 20th-century mathematician Alan Turing – where a human subject interacts with an artificially intelligent machine and asks it a series of questions. If the human interlocutor is unable to say definitively whether the responses are being generated by a human or an AI, the latter has passed the Turing Test.
Uncanny valley: A theoretical concept, first postulated by roboticist Masahiro Mori in 1970, which refers to an eerie, uncanny quality that will be perceived by a human being interacting with an artificial entity that closely (though imperfectly) resembles another human.
Unsupervised learning: A branch of machine learning which is based upon the input of unlabeled data. In contrast to supervised learning, unsupervised learning allows an algorithm to create its own rules for identifying patterns and categorizing data.
Value alignment problem: Coined by computer scientist Stuart Russel, the phrase “value alignment problem” – or simply “alignment problem” – refers to the difficulties that come with ensuring that intelligent machines share the same values and goals as their human programmers. This problem has spawned a subfield of AI and machine learning called “alignment research.”
For more on the latest happening in tech, sign up for The Drum’s Inside the Metaverse weekly newsletter here.