ISO/IEC 23000-11:2009 pdf – Information technology — Multimedia application format (MPEG-A) — Part 11: Stereoscopic video application format.
3 Terms and definitions For the purposes of this document, the following terms and definitions apply. 3.1 baseline line between origins of the respective cameras 3.2 convergence distance distance between a convergence point and a midpoint of baseline 3.3 convergence point point at which two optical axes of left and right cameras intersect 3.4 disparity horizontal difference between corresponding points in stereoscopic view 3.5 focal length distance from a surface of a lens (optical center) or mirror to its focal point (image plane) 3.6 frame one of the many still images which compose the complete moving picture NOTE A frame contains an array of luma samples and two corresponding arrays of chroma samples. A frame consists of two fields: a top field and a bottom field. 3.7 lenticular array of magnifying lenses designed so that, when viewed from slightly different angles, different images are magnified NOTE A lenticular sheet is placed on a normal display panel to show two or more different views simply by changing the angle of light direction. It can make left and right views display on left and right eyes, respectively, creating a sense of depth. 3.8 max of disparity maximum disparity value within a stereoscopic fragment 3.9 monoscopic fragment set of successive samples which represents only monoscopic sequence 3.10 min of disparity minimum disparity value within the stereoscopic fragment
3.11 parallax barrier device to allow a liquid crystal display to show a three dimensional image without the need for the viewer to wear glasses NOTE Placed in front of the normal display panel, a parallax barrier consists of a layer of material with a series of precision slits, allowing each eye to see a different set of pixels, so creating a sense of depth. 3.12 primary view sequence sequence that has a priority of presentation between sequences of Left/Right view sequence type 3.13 rotation relative angular variation from the primary-view camera to the secondary-view camera 3.14 secondary view sequence sequence that has a lower priority of presentation than the primary view sequence between sequences of Left/Right view sequence type 3.15 sequence series of one or more frames 3.16 stereoscopic camera information information for stereoscopic camera parameters such as baseline, focal_length, convergence_distance, camera_arrangement, and rotation 3.17 stereoscopic display information information for the stereoscopic display and visual safety, such as the display size and the viewing distance 3.18 stereoscopic fragment set of successive samples which represents the stereoscopic sequence satisfying the stereoscopic composition type specified in this part of ISO/IEC 23000 3.19 stereoscopic left fragment set of successive samples which represents the left view of stereoscopic sequences satisfying the stereoscopic composition type specified in this part of ISO/IEC 23000 3.20 stereoscopic left view sequence left view sequence of the stereoscopic sequence 3.21 stereoscopic right fragment set of successive samples which represents the right view of the stereoscopic sequences satisfying the stereoscopic composition type specified in this part of ISO/IEC 23000 3.22 stereoscopic right view sequence right view sequence of the stereoscopic sequence
4 Abbreviated terms 3D Three Dimensional AAC Advanced Audio Coding AF Application Format AMR Adaptive Multirate AVC Advanced Video Coding CDMA Code Division Multiple Access EVRC Enhanced Variable Rate Codec GSM Global Systems for Mobile communications HE-AAC High Efficiency AAC JPEG Joint Photographic Experts Group LASeR Lightweight Application Scene Representation PNG Portable Network Graphics PMP Portable Multimedia Player UMTS Universal Mobile Telecommunications System 5 Overview 5.1 Overall procedure of stereoscopic contents The overall procedure for stereoscopic contents can be explained as follows. Both left and right view sequences are acquired from a stereoscopic camera for stereoscopic video sequences, and are composited into a video sequence or two video sequences according to the composition types specified in 5.3. This composited video sequence is encoded and then stored into an AF. A file generator for Stereoscopic Video AF is to accept the stereoscopic contents with video, audio and LASeR streams. The file satisfying the Stereoscopic Video AF is parsed, decoded and then rendered for a stereoscopic display device. 5.2 Acquisition of the stereoscopic contents Stereoscopic video sequences are acquired from two cameras, left and right view. As described in Figure 1, camera parameters shall be needed for specifying spatial relationship between two cameras. The stereoscopic contents can be rendered in the display device more precisely by using these camera parameters. The camera parameters shall be described in the ‘ scdi’ box, which will be specified in 8.5.