Generative AI with Kubernetes: Implementing secure and observable AI infrastructure to deliver reliable AI applications (English Edition)
Product Information
ISBN-13: 9789365898323
Publisher: INDEPENDENT CAT
Author: Jonathan Baier
Publication date: 2025/02/28
Binding: Paperback
Dimensions: 23.5 cm × 19.1 cm × 1.5 cm (H × W × D)
Product Description
DESCRIPTION
Over the past few years, ML, and most recently generative AI, has advanced by leaps and bounds. Companies and software teams are rushing to enhance, rebuild, and create new software offerings with this new intelligence. As they innovate and create delightful new experiences for their customers, new challenges arise. Understanding how these applications work and how to use state-of-the-art infrastructure tools like Kubernetes will help organizations and professionals succeed with this new technology.
The book covers essential technical implementations from ML fundamentals through advanced deployment strategies, focusing on practical patterns. Core topics include Kubernetes-native GPU scheduling and resource management, MLOps pipeline architectures using Kubeflow/MLflow, and advanced model serving patterns. It details data management architectures, vector databases, and RAG systems, alongside monitoring solutions with Prometheus/Grafana. Finally, it examines advanced production concerns in security and data reliability.
After reading this book, you will be equipped with a broad knowledge of the end-to-end generative AI pipeline and how Kubernetes can be leveraged to run your generative AI workloads at scale in the real world.
KEY FEATURES
● Learn how Kubernetes can help you run your generative AI workloads.
● Using hands-on examples, you will work with real-world foundational models and a variety of tools and capabilities in the K8s ecosystem.
● A broad survey of both generative AI and Kubernetes in one book.
WHAT YOU WILL LEARN
● How to evaluate and compare models for new applications and use cases.
● How Kubernetes can add reliability and scale to your AI applications.
● What an AI delivery pipeline contains and how to start one.
● How AI models encode words and work with natural language.
● How prompting and refinement techniques can improve results.
● How to use your own data to augment AI responses.
WHO THIS BOOK IS FOR
This book is for teams that are building new applications or new functionality with generative AI and want to better understand the infrastructure needed to bring their AI applications to production. It is also for shared services, infrastructure, or cybersecurity teams that provide platforms and infrastructure for application or product development.

