Trillium, the sixth iteration of Google’s Tensor Processing Unit (TPU), is nearly five times more efficient than its predecessor, TPUv5, in peak compute performance and memory bandwidth, Google said.| Network World
Although OpenAI says that it doesn’t plan to use Google TPUs for now, the tests themselves signal concerns about inference costs.| Network World