Researchbreaking
Meta Presents Tuna-2: Pixel Embeddings Unify Visual Understanding and Generation
Meta's Tuna-2 unifies multimodal understanding, generation, and editing from raw pixel embeddings — bypassing vision encoders for a more unified architecture.
April 29, 20261 min read