IEEE 3300:2022

IEEE 3300:2022

IEEE Standard Adoption of Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) Technical Specification Multimodal Conversion Version 1.2

Availability: In stock

€108.00

Details

New IEEE Standard - Active.
This standard adopts MPAI Technical Specification Version 1.2 as an IEEE Standard. Multimodal Conversation (MPAI-MMC) is an MPAI Standard comprising five use cases, all sharing the use of artificial intelligence (AI) to enable a form of human-machine conversation in completeness and intensity.

Multimodal Conversation (MPAI-MMC) is an MPAI Standard comprising five Use Cases, all sharing the use of artificial intelligence (AI) to enable a form of human-machine conversation that emulates human-human conversation in completeness and intensity: 1. "Conversation with Emotion" (CWE), supporting audio-visual conversation with a machine impersonated by a synthetic voice and an animated face. 2. "Multimodal Question Answering" (MQA), supporting request for information about a displayed object. 3. Three Uses Cases supporting conversational translation applications. In each Use Case, users can specify whether speech or text is used as input and, if it is speech, whether their speech features are preserved in the interpreted speech: a. "Unidirectional Speech Translation" (UST). b. "Bidirectional Speech Translation" (BST). c. "One-to-Many Speech Translation" (MST).

Additional Info

Author Institute of Electrical and Electronics Engineers (IEEE)
Committee Entity Collaborative Activities Governance Board
Published by IEEE
Document type Standard
Edition
EAN ISBN 978-1-5044-9330-7
ICS 35.240.01 : Application of information technology in general
35.040.40 : Coding of audio, video, multimedia and hypermedia information
Number of pages 108
Keyword IEEE 3300-2022
Order form