Kniha Multimodal AI Systems Wei Sun

Multimodal AI Systems

Architectures, Training, and Applications

Autor: Wei Sun
Jazyk: Angličtina
Väzba: Brožovaná
Dostupnosť: Očakávané naskladnenie
Naskladnenie 30. 06. 2026
69.95
The Transformer Principles Series is a three-volume graduate-level treatise that builds a complete m...

Informácie o knihe

Autor
Jazyk
Angličtina
Väzba
Kniha - Brožovaná
Vydalo
2026
Stránok
480
EAN
9798184326054
Enbook ID
53025974
Hmotnosť
1104
Rozmery
216 x 280 x 25

Kompletný popis

The Transformer Principles Series is a three-volume graduate-level treatise that builds a complete mathematical and engineering understanding of modern AI systems, from the foundational attention mechanism to large language models and multimodal architectures.

Volume III - Multimodal AI Systems: Architectures, Training, and Applications extends the Transformer paradigm beyond text into vision, audio, and video. It covers modality-specific encoders and tokenizers, cross-modal fusion and contrastive alignment (CLIP, SigLIP), diffusion and flow-matching generative models, vision-language architectures (ViT, LLaVA, Q-Former), text-to-image and text-to-video generation, speech and audio processing, efficient inference for multimodal models, long-context scaling, and reasoning agents that perceive and act across modalities.

Mohlo by vás zaujímať

14.27
181.97

The Society of the Screen

SCHWENDENER MARTHA
22.59