Monday, December 23, 2024

Google puts Nvidia on high alert as it showcases Trillium, its rival AI chip, while promising to bring H200 Tensor Core GPUs within days



  • Trillium offers 4x training boost, 3x inference improvement over TPU v5e
  • Enhanced HBM and ICI bandwidth for LLM support
  • Scales up to 256 chips per pod, ideal for extensive AI tasks

Google Cloud has released Trillium, its latest TPU and the sixth generation of its custom AI chip lineup, designed to power advanced AI workloads.

First announced in May 2024, Trillium is engineered to handle large-scale training, tuning, and inference with improved performance and cost efficiency.
