Why AI Inference Costs Are Crashing 40 Percent This Year

Lucas and Luna unpack the dramatic collapse in AI inference pricing, using the latest numbers from NVIDIA, AMD, and Super Micro to show how hardware competition is reshaping the economics of running large language models. They trace the shift from training to inference, explain why inference costs have dropped roughly 40 percent year-over-year, and discuss what that means for startups, cloud margins, and the next wave of AI applications. Along the way, they touch on the surprising rally in Super Micro Computer and what the 'death of the GPU shortage' means for the entire supply chain. A focused, data-driven conversation for anyone trying to understand where the AI market is heading in mid-2026.

#AIInference #InferenceCosts #NVIDIA #AMD #SuperMicro #GPU #LLM #CloudComputing #AIHardware #TechInvesting #Semiconductors #AIPricing #DataCenters #AIStartups #Technology #FexingoBusiness #BusinessPodcast #ChatGPTAndBeyond

Keep every episode free: buymeacoffee.com/fexingo