Researchbreaking
TALOS-V2 Runs a Full Transformer in FPGA at 53,000 tok/sec Without a GPU
TALOS-V2 burns a full transformer into FPGA hardware achieving 53K tok/sec on battery power — embeddings, attention, MLP, and sampling all in silicon.
May 5, 20261 min read