Robust adversarial inputs

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple scales, angles, perspectives, and the like.

This innocuous kitten photo, printed on a standard color printer, fools the classifier into thinking it’s a `monitor` or `desktop computer` regardless of how its zoomed or rotated. We expect further parameter tuning would also remove any human-visible artifacts.

Out-of-the-boxadversarial examples⁠(opens in a new window)do fail under image transformations. Below, we show the same cat picture, adversarially perturbed to be incorrectly classified as a desktop computer byInception v3⁠(opens in a new window)trained onImageNet⁠(opens in a new window). A zoom of as little as 1.002 causes the classification probability for the correct label tabby cat to override the adversarial label`desktop computer`.

However, we’d suspected that active effort could produce a robust adversarial example, as adversarial examples have been shown totransfer⁠(opens in a new window)to the physical world.

## Scale-invariant adversarial examples

Adversarial examples can be created using an optimization method called projected gradient descent to find small perturbations to the image that arbitrarily fool the classifier.

Instead of optimizing for finding an input that’s adversarial from a single viewpoint, we optimize over a largeensemble⁠(opens in a new window)of stochastic classifiers that randomly rescale the input before classifying it. Optimizing against such an ensemble produces robust adversarial examples that are scale-invariant.

A scale-invariant adversarial example.

Even when we restrict ourselves to only modifying pixels corresponding to the cat, we can create a single perturbed image that is simultaneously adversarial at all desired scales.

## Transformation-invariant adversarial examples

By adding random rotations, translations, scales, noise, and mean shifts to our training perturbations, the same technique produces a single input that remains adversarial under any of these transformations.

A transformation-invariant adversarial example. Note that it is visibly more perturbed than its scale-invariant cousin. This might be fundamental: it’s intuitively plausible that small adversarial perturbations are harder to find the more transformations an example must be invariant to.

Our transformations are sampled randomly at test time, demonstrating that our example is invariant to the whole distribution of transformations.

Point-E: A system for generating 3D point clouds from complex prompts Publication Dec 16, 2022

Multimodal neurons in artificial neural networks Milestone Mar 4, 2021

CLIP: Connecting text and images Milestone Jan 5, 2021

Our Research * Research Index * Research Overview * Research Residency * OpenAI for Science * Economic Research

Latest Advancements * GPT-5.3 Instant * GPT-5.3-Codex * GPT-5 * Codex

Safety * Safety Approach * Security & Privacy * Trust & Transparency

ChatGPT * Explore ChatGPT(opens in a new window) * Business * Enterprise * Education * Pricing(opens in a new window) * Download(opens in a new window)

Sora * Sora Overview * Features * Pricing * Sora log in(opens in a new window)

API Platform * Platform Overview * Pricing * API log in(opens in a new window) * Documentation(opens in a new window) * Developer Forum(opens in a new window)

For Business * Business Overview * Solutions * Contact Sales

Company * About Us * Our Charter * Foundation(opens in a new window) * Careers * Brand

Support * Help Center(opens in a new window)

More * News * Stories * Livestreams * Podcast * RSS

Terms & Policies * Terms of Use * Privacy Policy * Other Policies

(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)

English United States

Robust adversarial inputs

Transition at Sonata: Rajsekhar Datta Roy is new CEO as Samir Dhar steps down

Why Cohere is merging with Aleph Alpha

Axis Bank frontloads FY27 provisions amid West Asia war uncertainty

Why Tokyo is the most important tech destination of 2026

Latest Briefs