// term 27 · Learning Paradigms

One-Shot Learning

Single-Example Learning

Performing a task from exactly one example. In LLM practice, a single demonstration anchors format and approach; in classic machine learning, one-shot methods recognize new categories from a single instance by learning similarity itself — the regime of rare events and scarce data.

One ExampleFormat AnchoringRare EventsSimilarity

// Examples

1

A single demonstration — enough to anchor format and approach when the underlying capability already exists.

// Sweet spot

format

One example excels at conveying output structure and style — the things instructions describe imperfectly and one sample shows exactly.

// Classic lineage

similarity

Pre-LLM one-shot systems learned to compare rather than classify — recognizing new categories from a single reference instance.

// full definition

What One-Shot Learning actually is

One-shot learning occupies the pragmatic middle between zero-shot's pure instruction and few-shot's example set. A single demonstration does what instructions do worst: it shows the exact output format, the level of detail, the tone, the handling of one representative case. For tasks where the model already has the capability and mostly needs anchoring — reformatting, structured extraction, style matching — one well-chosen example captures most of few-shot's lift at a fraction of the token cost.

The single example carries outsized weight, which cuts both ways. The model will imitate it faithfully — including its quirks, its edge-case handling, and its mistakes. With no second example to triangulate against, idiosyncrasies read as requirements: an example answer that happens to be three sentences teaches three-sentence answers. Choosing the demonstration is choosing the behavior; it deserves the same care as writing a spec, because it is one.

The term predates LLMs with a stricter meaning worth knowing. Classic one-shot learning tackled recognition from a single instance — verify this face from one reference photo, flag this defect type from one prior occurrence. The methods (siamese networks, metric learning) learn similarity itself rather than categories, comparing new inputs against stored references. That lineage powers verification systems, rare-defect inspection, and any domain where examples of the thing that matters are inherently scarce.

Both senses converge on a strategic point: data scarcity is no longer disqualifying. Rare events, new product lines, novel fraud patterns, and low-resource categories — situations that defeat dataset-hungry approaches — are addressable through prior capability plus minimal anchoring. The question shifts from “do we have enough data?” to “do we have one good example and a model whose priors cover the rest?”

// how it works

What one example can do

One demonstration anchors the pattern — the model leans on prior capability for everything the single example can't specify.

01

Capability Check

Confirm the model already handles the task family — one example anchors existing capability; it cannot create it.

02

Example Selection

Choose the single most representative case — typical input, exemplary output. This one choice is the behavioral spec.

03

Prompt Assembly

Instruction plus the demonstration, formatted exactly as outputs should be — every detail will be imitated.

04

Pattern Anchoring

At inference, the model fuses the instruction's intent with the example's form — format and style lock to the demonstration.

05

Variance Testing

Evaluate across input diversity — single-example anchoring is most fragile exactly where inputs least resemble the demonstration.

06

Escalate if Needed

Where one example under-specifies the task, add demonstrations — the few-shot graduation — or reconsider the approach entirely.

// anatomy

The components teams must understand

01

The Demonstration

Spec in miniature

One input-output pair carrying the full burden of format, tone, and approach. Its quirks become requirements — select it like a specification.

02

Prior Capability

The silent partner

Pretrained competence fills everything the single example leaves unsaid. One-shot is leverage on priors, not learning from scratch.

03

Format Anchoring

The core value

Output structure, length, and style locked by demonstration — the things one example communicates better than any instruction.

04

Overfit-to-One Risk

The failure mode

Without a second example to triangulate, idiosyncrasies read as rules. Inputs unlike the demonstration get forced into its mold.

05

Metric Learning

The classic mechanism

Siamese and similarity networks comparing inputs against a stored reference — recognition from one instance, the pre-LLM lineage.

06

Rare-Event Fit

The natural domain

Verification, rare defects, novel patterns — wherever examples are inherently scarce, one-shot methods are the design center.

// strategic implications

What this changes for the business

01 · Economics

Maximum anchoring per token

One example delivers most of few-shot's format and style lift at minimal context cost — the efficient default for high-volume tasks where every prompt token recurs in the bill. Start at one; add examples only where measured variance demands them.

02 · Coverage

Scarce data stopped being a blocker

Rare events, new categories, and low-resource domains — historically disqualified by dataset requirements — are addressable through priors plus a single anchor. Use-case screening should stop filtering on dataset size and start asking whether one exemplary case exists.

03 · Care

One example is one point of failure

The demonstration is the spec, and its flaws propagate to every output. Review it like production configuration, test against inputs that don't resemble it, and escalate to multiple examples the moment variance testing exposes the anchoring's limits.

// common misconceptions

What One-Shot Learning is not

Myth

“One example teaches the model the task.”

Reality

It anchors format and approach onto capability pretraining already built. Where the underlying competence is missing, one example produces confident imitation of surface form with no substance underneath.

Myth

“One-shot is just a weaker few-shot.”

Reality

It is a distinct operating point with its own economics — most of the anchoring benefit at minimal token cost — and a distinct classic lineage in similarity learning that powers verification and rare-event recognition.

Myth

“If the example is good, outputs will be good everywhere.”

Reality

Anchoring fidelity is highest near the demonstration and decays with input distance from it. Variance testing across unlike inputs — not the quality of the example alone — predicts production behavior.

// from literacy to leverage

Know the term. Now build the strategy.

Vocabulary is the entry fee. Turning these primitives into pipeline, moats, and margin is the work. That's the conversation.

AI innovation, applied
Andekian

AI-first digital transformation for enterprise growth. Strategy and execution, under one operator.

© 2026 Stephen Andekian.