Code: 52898120

Data Engineering for Large Foundation Models

Name: Data Engineering for Large Foundation Models
Brand: Springer, Berlin
SKU: 52898120
Price: 216.55 EUR
Availability: PreOrder
Author: Jun Yu
ISBN: 9789819228492

by Jun Yu, Chang Wen Chen

Pre-order
New!

Data quality has become a decisive foundation for large foundation models, shaping their capability, reliability, alignment, and real-world applicability. Data Engineering for Large Foundation Models: A Handbook provides a systema ... more

Language: English
Binding: Hardback
ISBN-13: 9789819228492
Publisher: Springer, Berlin, 2026
More about this

216.55 €

Forthcoming
Expected 14. 12. 2026

Availability alert

Add to wishlist

Moonwalk
14.04 € -18 %
Buy
The Score
13.73 €
Buy
The Deal
12.32 € -21 %
Buy
The Mistake
13.53 € -14 %
Buy
Jujutsu Kaisen, Vol. 30
9.89 € -20 %
Buy
Berserk Deluxe Volume 1
35.76 € -25 %
Buy
The Goal
14.64 €
Buy
Berserk Deluxe Volume 2
33.24 € -30 %
Buy
Invincible Compendium Volume 2
45.77 € -24 %
Buy
Legacy
11.91 € -22 %
Buy
Lord of Mysteries, Vol. 3: The Clown, Part III
17.07 € -13 %
Buy
Murdoku
14.04 € -25 %
Buy
Berserk Deluxe Volume 3
55.16 €
Buy
Witch Hat Atelier Manga Box Set 1
51.22 € -31 %
Pre-order
Chainsaw Man, Vol. 21
9.29 € -25 %
Buy
Dancing The Dream
19.90 € -31 %
Pre-order
White Nights
4.54 €
Buy
Jujutsu Kaisen, Vol. 25
9.89 € -23 %
Buy
Jujutsu Kaisen, Vol. 29
9.69 € -25 %
Buy
Witch Hat Atelier: Grimoire Edition 1
35.26 € -29 %
Pre-order
JoJo's Bizarre Adventure: Part 7--Steel Ball Run, Vol. 7
18.18 € -25 %
Buy

Give this book as a present today

Order book and choose Gift Order.
We will send you book gift voucher at once. You can give it out to anyone.
Book will be send to donee, nothing more to care about.

Book gift voucher sample Read more

Availability alert

We will watch availability for you

Enter your e-mail address and once book will be available,
we will send you a message. It's that simple.

More about Data Engineering for Large Foundation Models

Book details
Synopsis
Trending among others

You get 524 loyalty points

Book synopsis

Data quality has become a decisive foundation for large foundation models, shaping their capability, reliability, alignment, and real-world applicability. Data Engineering for Large Foundation Models: A Handbook provides a systematic and practice-oriented guide to data engineering for foundation models. Moving beyond a narrow focus on large language models, the book covers the data lifecycle behind language models, vision-language models, multimodal understanding systems, text-to-image and text-to-video generative models, reasoning models, agentic systems, and domain-specific AI applications.

The book presents a full-stack framework for building high-quality data pipelines for foundation-model development. It covers large-scale pre-training data engineering, including data sourcing, acquisition, cleaning, deduplication, decontamination, tokenization, serialization, efficient loading, and quality evaluation. It also addresses multimodal data engineering for image-text, document, video, and audio data, as well as post-training and alignment data construction, including SFT, preference data, RLHF, Chain-of-Thought reasoning data, tool-use data, agent memory, and multi-turn interaction data.

The book further examines data-centric AI systems, including synthetic data factories, knowledge distillation, enterprise-grade RAG and multimodal RAG pipelines, online feedback loops, knowledge updating, DataOps platforms, data governance, privacy protection, federated learning, and compliance-aware data engineering. Through end-to-end projects and reproducible system designs, readers gain hands-on experience with distributed pre-training data pipelines, domain-specific SFT datasets, multimodal instruction data factories, reasoning data flywheels, agent tool-use data factories, enterprise DataOps platforms, privacy-preserving pipelines, open-source model reproduction, and text-to-video training data pipelines. Using modern tools such as Ray, Spark, Dask, Parquet, WebDataset, vector databases, DVC, MLflow, and Airflow, this handbook equips data engineers, MLOps and DataOps professionals, AI researchers, and technical product teams to build reliable, scalable, and continuously improving foundation-model systems.

Book details

216.55 €

Full title: Data Engineering for Large Foundation Models
Subtitle: A Handbook
Author: Jun Yu, Chang Wen Chen
Language: English
Binding: Hardback
EAN: 9789819228492
ID: 52898120
Publisher: Springer, Berlin
Dimensions: 235 × 155 mm
Date of publishing: 14. December 2026

All about us

Shopping guide

Digest

Books by language

Payment

Delivery 2.99 €

Collection points Bratislava a 12849 dalších

Slovensko

България Hrvatska România Magyarország Polska Česko

Data Engineering for Large Foundation Models

by Jun Yu, Chang Wen Chen

Publisher: Springer, Berlin, 2026

You might also like

Moonwalk

The Score

The Deal

The Mistake

Jujutsu Kaisen, Vol. 30

Berserk Deluxe Volume 1

The Goal

Berserk Deluxe Volume 2

Invincible Compendium Volume 2

Legacy

Lord of Mysteries, Vol. 3: The Clown, Part III

Murdoku

Berserk Deluxe Volume 3

Witch Hat Atelier Manga Box Set 1

Chainsaw Man, Vol. 21

Dancing The Dream

White Nights

Jujutsu Kaisen, Vol. 25

Jujutsu Kaisen, Vol. 29

Witch Hat Atelier: Grimoire Edition 1

JoJo's Bizarre Adventure: Part 7--Steel Ball Run, Vol. 7

Give this book as a present today

Availability alert

More about Data Engineering for Large Foundation Models

Book synopsis

Book details