We find that, just as a large transformer model trained on language can generate coherent text, the exact same model trained on pixel sequences can generate coherent image completions and samples. By establishing a correlation between sample quality and image classification accuracy, we show that our best generative model also contains features competitive with top convolutional nets in the unsupervised setting.
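To make "trained on pixel sequences" concrete, here is a minimal sketch of how an image could be treated as a sequence and completed one pixel at a time. It assumes row-by-row (raster) flattening and a tiny palette of 4 pixel values, neither of which is stated above, and `toy_next_pixel_probs` is a hypothetical stand-in for a trained transformer, not the actual model.

```python
import numpy as np

def image_to_sequence(img):
    """Flatten an (H, W) image of integer pixel values row by row (assumed ordering)."""
    return img.reshape(-1)

def sequence_to_image(seq, shape):
    """Reshape a flat pixel sequence back into an image."""
    return np.asarray(seq).reshape(shape)

def toy_next_pixel_probs(seq, n_values=4):
    """Hypothetical stand-in for a trained model: favours repeating the last pixel."""
    probs = np.full(n_values, 0.1)
    probs[seq[-1]] = 0.7
    return probs / probs.sum()

def complete(prefix, total_len, next_pixel_probs, rng):
    """Extend a non-empty prefix to total_len, sampling one pixel at a time."""
    seq = [int(p) for p in prefix]
    while len(seq) < total_len:
        probs = next_pixel_probs(seq)  # distribution over the next pixel value
        seq.append(int(rng.choice(len(probs), p=probs)))
    return np.array(seq)

rng = np.random.default_rng(0)
img = rng.integers(0, 4, size=(4, 4))   # toy 4x4 image with 4 possible values
seq = image_to_sequence(img)
top_half = seq[: seq.size // 2]         # condition on the top two rows
completed = sequence_to_image(
    complete(top_half, seq.size, toy_next_pixel_probs, rng), img.shape
)
```

The top half of `completed` is the original image and the bottom half is sampled. Starting from a shorter prefix would produce a sample that is almost entirely generated rather than a completion; this sketch needs at least one seed pixel.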