
Transforming AI through Hardware-Software Codesign for Gen AI Inference
November 13, 2024
In this short talk, d-Matrix CTO Sudeep Bhoja discusses the release of the DeepSeek R1 model and its impact on inference compute. He covers the evolution of reasoning models and the role of inference-time compute in improving model performance.