Media Summary: Talk : Introductions and Meetup Updates by Chris Fregly and Antje Barth Talk : If you use GPT or Claude, you've probably heard “ Discover how AMD powered Amazon EC2 instances are transforming cloud economics for

Optimizing Ai Inference For Heterogeneous - Detailed Analysis & Overview

Talk : Introductions and Meetup Updates by Chris Fregly and Antje Barth Talk : If you use GPT or Claude, you've probably heard “ Discover how AMD powered Amazon EC2 instances are transforming cloud economics for Philip Kiely, Head of Developer Relations at Baseten, presents the “Golden Triangle” of See the detailed reference architecture → Learn how to use JAX, Google Kubernetes Engine (GKE) and ... In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss the

Check out videos from Upperside Conference's recent World Congress (formerly known as MPLS World Congress): ... Open-source LLMs are great for conversational applications, but they can be difficult to scale in production and deliver latency ...

Photo Gallery

Optimizing AI Inference for Heterogeneous Clusters by Natalie Serrino, Founder @ Gimlet Labs
AI Inference: The Secret to AI's Superpowers
Faster LLMs: Accelerate Inference with Speculative Decoding
Optimizing AI Inferencing for Agentic Operations in Manufacturing
Inference at Scale: The New Frontier for AI Infrastructure and ROI
What is vLLM? Efficient AI Inference for Large Language Models
What is AI Inference for Developers | Explained Simply
AWS re:Invent 2025 - Why Your Processor Matters for AI Inference and General Compute (MAM210)
Deploying scalable and reliable AI inference on Google Cloud
Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou
Scaling AI Inference Performance in the Cloud with Nebius
AWS re:Invent 2024 - Faster, cheaper, better: Optimizing inference for production AI (AIM248)
Sponsored
Sponsored
View Detailed Profile
Sponsored
Sponsored