ENHANCEMENTS TO THE AUDIO AND VIDEO MULTIMEDIA ADAPTERS FROM IBM

Announcement Letter Number 191-192 dated October 17, 1991
US - Last Revised on October 17, 1991

Brief Description of Announcement, Charges, and Availability

The M-Audio Capture and Playback Adapter, M-Audio Capture and Playback Adapter/A, and the Video Capture Adapter/A have been enhanced to provide, at no extra charge, new audio and video application program interfaces (APIs) and audio device drivers. Support for Microsoft (1) Multimedia Windows (2) Extensions has been added.

The new high- and low-level programming interfaces simplify authoring by the software developers producing multimedia applications that take advantage of the capabilities of these adapters. To expand the application development possibilities, these high- and low-level interfaces will operate under the IBM DOS, DOS Windows 3.0, and OS/2 (R) operating environments. The availability of these enhancements with the adapters provides end users with the tools to run newly developed multimedia applications using these interfaces.

These new device drivers and APIs will be shipped with the M-Audio Capture and Playback Adapter (feature #3908; new part number 92F3378 replaces 87F9908), M-Audio Capture and Playback Adapter/A (feature #3909; new part number 92F3379 replaces 87F9909), and the Video Capture Adapter/A (feature #2785; new part number 92F3380 replaces 34F2785). All unshipped orders will be converted to the new part numbers. Customers who purchased and received these adapters prior to the availability of this new interface code will be able to obtain these enhancements at no charge.

References: IBM Product Announcements 189-101, dated June 20, 1989, and 190-159, dated September 18, 1990.

Planned Availability Date: October 25, 1991.

(1) Registered trademark of Microsoft Corporation.
(2) Trademark of Microsoft Corporation.
(R) Registered trademark of International Business Machines Corporation.
Customer Letter Section

HIGHLIGHTS

o Audio -- The new audio interface tools provided with the M-Audio Capture and Playback Adapters:
  - Support Microsoft Multimedia Windows Extensions.
  - Provide a common audio interface to multimedia application programs.
  - Simplify the addition of audio capabilities to newly developed multimedia applications.
  - Enable software developers to write applications independent of the computer environment or audio interfaces being supported.
  - Offer the flexibility to write applications in multiple audio data formats.
  - Contain a synthesizer feature allowing creation of applications that can play back MIDI (Musical Instrument Digital Interface).
  - Include a JPEG Application Program Interface which provides a video image compression/decompression function for multimedia applications.

o Video -- The new video interface tools provided with the Video Capture Adapter/A:
  - Simplify writing applications for the adapter.
  - Minimize the amount of memory overhead needed for specific applications as a result of a new modular design.

DESCRIPTION

BUSINESS SOLUTIONS -- AUDIO ENHANCEMENTS

IBM is making available to customers four types of interface code to enhance the development and use of multimedia applications.

1. Windows Multimedia Development Kit Audio Device Driver: This code gives application program developers the ability to write multimedia applications conforming to Microsoft's Multimedia Development Kit specifications when using Microsoft Multimedia Extensions.

2. Audio Application Program Interface (AAPI): The AAPI is a set of high-level 'C' functions that simplify application program access to the audio and stored digitized functions of the M-Audio Capture and Playback Adapters. Audio capabilities, such as playing and recording, can be added to applications with a minimum of effort.

3. Audio Device Drivers: The Audio Device Drivers are a set of computer programs that provide a common audio interface to an application program.
This eliminates the need for application software developers to write new multimedia applications specific to the M-Audio Capture and Playback Adapters.

Unique functions are supported by the above three types of interface code, as follows:

PULSE CODE MODULATION (PCM): PCM is a technique for digitizing audio that provides software developers the ability to create applications for recording and playing back audio data in multiple formats. This capability, included in the AAPI, gives the user the flexibility to record and play back audio data of varying levels of quality and frequency. The sound quality and bandwidth selected dictate the size of the audio data generated. The Audio Device Drivers and AAPI included with the M-Audio Capture and Playback Adapters support the audio data format specified in the Microsoft Multimedia Development Kit.

Following is a guide on the approximate file sizes that can be generated using the Audio Device Drivers or the AAPI:

MONO SAMPLE                          DATA SIZE
WIDTH           SAMPLING RATE        (KB/SEC.)
8-bit            8.000 KS/sec.           8
8-bit           11.025 KS/sec.          11
8-bit           22.050 KS/sec.          22
8-bit           44.100 KS/sec.          44
16-bit           8.000 KS/sec.          16
16-bit          11.025 KS/sec.          22
16-bit          22.050 KS/sec.          44
16-bit          44.100 KS/sec.          88

KS/sec. = Thousand samples per second.
KB/sec. = Kilobytes per second.

SOURCE MIX: The Audio Device Drivers and AAPI offer software developers the technical capability to mix audio coming in from an analog source (for example, the analog output of a compact disc player) with PCM audio data and play them simultaneously using the M-Audio Capture and Playback Adapters.

Using PCM with these adapters, software developers can offer in their applications the option to record and play back various sizes of audio data commensurate with the sound quality and frequency range required. Additionally, the end user can have the flexibility to mix audio being produced by any analog input with PCM audio files.
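The approximate figures in the data-size guide above follow from multiplying the sample width in bytes by the sampling rate. The following is a minimal sketch in Python, not part of the interface code shipped with the adapters. The function names are hypothetical, the rates are taken as 8,000, 11,025, 22,050, and 44,100 samples per second, and 1 KB is assumed to mean 1,000 bytes with results truncated to whole kilobytes.

```python
# Sketch only: reproduces the approximate data-size guide for mono PCM audio.
# Assumptions (not from the announcement): 1 KB = 1,000 bytes, and values
# are truncated to whole kilobytes. All function names are hypothetical.

SAMPLE_WIDTHS_BITS = (8, 16)
SAMPLING_RATES_HZ = (8000, 11025, 22050, 44100)


def pcm_bytes_per_sec(bits_per_sample, samples_per_sec, channels=1):
    """Return the raw PCM data rate in bytes per second."""
    return (bits_per_sample // 8) * samples_per_sec * channels


def pcm_kb_per_sec(bits_per_sample, samples_per_sec, channels=1):
    """Return the data rate in whole kilobytes per second (1 KB = 1,000 bytes)."""
    return pcm_bytes_per_sec(bits_per_sample, samples_per_sec, channels) // 1000


def data_size_guide():
    """Build (width, rate, KB/sec) rows for every width and rate combination."""
    return [
        (bits, rate, pcm_kb_per_sec(bits, rate))
        for bits in SAMPLE_WIDTHS_BITS
        for rate in SAMPLING_RATES_HZ
    ]


if __name__ == "__main__":
    for bits, rate, kb in data_size_guide():
        print(f"{bits:>2}-bit  {rate / 1000:6.3f} KS/sec.  {kb:>3} KB/sec.")
```

Doubling the channel count doubles each figure. For example, pcm_bytes_per_sec(16, 44100, channels=2) gives 176,400 bytes per second for 16-bit stereo at 44.100 KS/sec.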
FILE FORMAT: Two types of audio file formats are supported by the AAPI. The Audio Video Connection file format is used to play and record compressed audio containing audio data that was compressed using an Adaptive Differential Pulse Code Modulation (ADPCM) technique. The Microsoft Resource Interchange File Format Waveform Audio File Format (RIFF WAVE) is used to play and record uncompressed audio containing Pulse Code Modulation (PCM) data.

SYNTHESIZER: The synthesizer feature, supported by the Windows Device Driver and included in the AAPI and Audio Device Drivers, offers a software developer the ability to create an application that can play back MIDI (Musical Instrument Digital Interface) data when used with the M-Audio Capture and Playback Adapters. MIDI data requires far less storage than digitized audio. The following is a comparison of different audio modes and the estimated storage required for one minute of audio:

                                        DATA SIZE IN KILOBYTES
AUDIO MODE                              (ESTIMATED)
Compact disc 16-bit Stereo                  10,500
16-bit PCM 22.050 KS/sec. -- Mono            2,640
8-bit PCM 11.025 KS/sec. -- Mono               660
Voice -- Mono                                  330
MIDI* -- Stereo                                 12

* MIDI cannot be used to store the human voice.

The synthesizer function on the M-Audio Capture and Playback Adapters is characterized as eight-note multitimbral, eight-note polyphonic operation, and has dynamic note allocation.

4. JPEG Application Program Interface (JPEG API): The Joint Photographic Experts Group (JPEG), a group of the International Organization for Standardization (ISO), has established a baseline industry standard, recognized by multimedia application developers, for the compression and decompression of continuous-tone still images. JPEG can be used to compress still images to a fraction of the original size, allowing for economical storage or transmission of the image. It can later decompress the image, which can then be displayed by an application program.
The JPEG API offered by IBM meets this standard while taking advantage of the digital signal processor (DSP) on the M-Audio Capture and Playback Adapters. It provides developers a fast compression and decompression function for incorporation into multimedia applications. This hardware and software implementation offers a faster solution than would be possible with software only.

JPEG can be implemented by interface code using three methods:

o Host only
o Host with a digital signal processor
o Specialized board

Each method has performance considerations that are dependent on such things as the speed of the host processor and the size and type of the image used. The host-only approach could be used by a casual user who does not require fast performance. The specialized-board approach is typically for animation and for users who require fast performance.

The IBM JPEG API uses the second method. It uses the digital signal processor on the M-Audio Capture and Playback Adapters, in addition to the 80386 host, and is designed for specific applications such as the archiving and transmission of images.

This JPEG API supports all possible JPEG baseline combinations of components and sampling ratios. The performance objective for decompression using the IBM JPEG API is on average less than ten seconds for the following image:

o Input:
  - Baseline compressed image
  - 640x480 image
  - YUV 4:2:2 format
  - Average 16 bits per pixel
o Output:
  - Decompressed YUV 4:2:2 format

Depending on the size and type of image used and the processor speed, the performance could be slower or faster.

BUSINESS SOLUTIONS -- VIDEO ENHANCEMENTS

Now included with the Video Capture Adapter/A is a set of high-level functions designed to facilitate the development of applications for this adapter. This Video Application Program Interface offers significant savings in programming effort by grouping many tasks into a single programming call.
This reduces the number of lines of code required for initialization of the adapter. In addition, this interface code is modular and is divided into two functional units. This modularity minimizes the amount of memory overhead required by this application program interface.

PRODUCT POSITIONING

These interface tools provide unique functions that assist developers in writing multimedia applications, optimize the capabilities of the M-Audio Capture and Playback Adapters and Video Capture Adapter/A, and enhance the effectiveness of multimedia presentations created by end users.

PUBLICATIONS

The following publications are shipped with the product. Additional copies may be ordered and will be available by October 25, 1991.

Title                                        Order No.       Part No.
M-Audio Capture and Playback Adapter,        G571-0203-01    95F1270
Installation Manual and Technical
Reference*
Video Capture Adapter/A, Installation        G571-0180-03    34F2796
Manual and Technical Reference**

* This publication supports the M-Audio Capture and Playback Adapter and the M-Audio Capture and Playback Adapter/A.
** This publication supports the Video Capture Adapter/A.

SCHEDULE

The planned availability date is October 25, 1991.

TECHNICAL INFORMATION

SPECIFIED OPERATING ENVIRONMENT

PROGRAMMING REQUIREMENTS: To utilize the interface code being made available, the following software prerequisites exist:

o Windows Multimedia Development Kit Audio Device Driver
  - DOS 3.3, 4.0, and 5.0
  - Microsoft Windows 3.0
  - Multimedia Windows Extensions 1.0
    -- For the IBM version of this software, refer to IBM Programming Announcement 291-595, dated October 17, 1991.
o Audio Application Program Interface and Audio Device Drivers
  - DOS 3.3, 4.0, and 5.0
  - OS/2 1.3
o Video Application Program Interface and JPEG Application Program Interface
  - DOS 3.3, 4.0, and 5.0
  - OS/2 1.3
  - Microsoft Windows 3.0

Program developers writing multimedia applications under Microsoft Windows Multimedia Extensions will require the Microsoft Multimedia Development Kit.
In addition, IBM C/2 (TM) Version 1.1 or higher or the Microsoft C Professional Development System Version 6.0 or higher may be required.

For customers who purchased the M-Audio Capture and Playback Adapters and Video Capture Adapter/A prior to the availability of this new interface code, the drivers and APIs will be available electronically through the IBM National Support Center Bulletin Board System. The bulletin board is available 24 hours a day to anyone with a modem, asynchronous communications software, and a switched telephone line on a toll-call basis with no access charge. To access this system, customers should call 1-404-835-6600. This interface code is also available from your IBM marketing representative.

(TM) Trademark of International Business Machines Corporation.

PERFORMANCE: Performance of these enhancements with these adapters is based on several factors relating to the host machine and environment. These items include host system speed, channel speed, host processor disk performance (speed and fragmentation), and memory availability.
PLANNING INFORMATION

PACKAGING: The interface code being made available with the M-Audio Capture and Playback Adapters and Video Capture Adapter/A will be shipped on separate 3.5-inch diskettes and will be included as part of the following shipment groups:

PRODUCT                  SHIPMENT GROUP

M-Audio Capture and      PS/2 (R) Audio Adapter,
Playback Adapter         Installation Manual and Technical Reference,
                         3.5-inch Diagnostic Diskette,
                         3.5-inch Driver/API Diskette #1,
                         3.5-inch Driver/API Diskette #2,
                         Y Connector Cable

M-Audio Capture and      PS/2 Audio Adapter,
Playback Adapter/A       Installation Manual and Technical Reference,
                         3.5-inch Diagnostic Diskette,
                         3.5-inch Driver/API Diskette #1,
                         3.5-inch Driver/API Diskette #2,
                         Y Connector Cable

Video Capture            PS/2 Video Adapter,
Adapter/A                Installation Manual and Technical Reference,
                         3.5-inch Diagnostic Diskette,
                         3.5-inch API Diskette #1,
                         Primary I/O Cable,
                         S-Connector Cable,
                         Wrap Test Cable,
                         Three 75-Ohm Terminator Plugs

TERMS AND CONDITIONS

Terms and conditions remain unchanged as specified in Product Announcements 189-101, dated June 20, 1989, and 190-159, dated September 18, 1990.