Multimodal Artificial Intelligence and Large Language Models: A Comprehensive Guide from Theory to Practice

Hardback Published on: 29/09/2026
Price: £140
Free UK delivery on orders over £25
Please note, this item can only be delivered to a UK address. Find out more
Coming soon
Published 29/09/2026
Make and edit your lists in your account
No stock available in any shop.
Coming soon
Published 29/09/2026
No stock available in any shop.

Synopsis

The book provides a comprehensive technical analysis of multimodal artificial intelligence systems and implementation frameworks. It offers thorough coverage of cross-modal processing methods for use, including speech recognition and automatic image captioning.

  • It presents a detailed discussion of architecture for integrating text, image, audio, and video modalities, cross-modal processing pipelines, and data fusion techniques.
  • Showcases real-time synchronization mechanisms across different modalities and scalable design patterns for multimodal systems.
  • Discusses multimodal emotion recognition using deep Learning techniques, focusing on recent advancements, challenges, and ethical considerations.
  • Investigates deployment optimization strategies to address issues with latency, resource usage, and scalability of multimodal systems.
  • Focuses on techniques for performance optimization, memory management, and distributed processing for multimodal workloads using frameworks like PyTorch and TensorFlow.

The text is primarily written for senior undergraduates, graduate students, and academic researchers in electrical engineering, electronics and communications engineering, computer science and engineering, and information technology.

Publisher information

  • Publisher: Taylor & Francis Ltd
  • ISBN: 9781041152132
  • Number of pages: 376
  • Dimensions: 234 x 156 mm
  • Languages: English

Customer Reviews