Data Fields – Manifestation Tracks

A Digital Track element must contain one of the following elements:

Field NameTypeCardinalityExplanatory Notes
Audiomd: DigitalAssetAudioData-type0-∞ OptionalAudio track type. See “Audio Track Details” below
Videomd: DigitalAssetVideoData-type0-∞ OptionalVideo track type. See “Video Track Details” below.
Subtitlemd: DigitalAssetSubtitleData-type0-∞ OptionalSubtitle track type. See “Subtitle Track Details” below.
Interactivemd: DigitalAssetInteractiveData-type0-∞ OptionalInteractive material track type. Not common. See “Interactive Track Details” below.

Audio Track Details

The following fields are found in an Audio element:

Field NameTypeCardinalityExplanatory Notes
Description{xs:string of 1-128 characters, lang of xs:language}0-1 OptionalExplanation of what the audio encoding is, why it exists, and similar.
TypeControlled vocabulary: primary, narration, dialogcentric, commentary, silent-omitted, silent, other0-1 OptionalDescribes purpose of the track. If not present, track is assumed to be primary. Based on md:string-Audio-Type.
Narration is for the visually impaired and may use multiple channels. Dialogcentric is for the hearing impaired and may use multiple channels. Commentary on the video may be paired with a PIP.
Language{xs:language, dubbed Boolean attribute}1 RequiredThe language of this Track. Dubbed is false for works with subtitles.
Encodingmd:DigitalAssetAudioEncoding-type0-1 OptionalSee Audio Encoding Details immediately below.
Channelsxs:string0-1 OptionalNumber of audio channels, either as a 1–2-digit integer (e.g., 2) or of the form x.y where x is a 1–2-digit integer of full channels, and y is limited channels (e.g. 5.1).
(Based on md:string-Audio-Channels.)
TrackReferencexs:string of 1-128 characters0-1 OptionalTrack cross-reference to be used in conjunction with container-specific metadata. (Based on md:string-TrackReference.)
Privatemd:PrivateData-type0-1 OptionalSee the “Private Details” section below.

Sample XML for the Audio block:

<md:Audio>
  <md:Type>primary</md:Type>
  <md:Encoding>
    <md:Codec>AAC</md:Codec>
    <md:BitrateMax>192000</md:BitrateMax>
    <md:SampleRate>48000</md:SampleRate>
    <md:SampleBitDepth>16</md:SampleBitDepth>
  </md:Encoding>
  <md:Language dubbed="false">en</md:Language>
  <md:Channels>2</md:Channels>
</md:Audio>

Audio Encoding Details

This describes the audio encoding details. Equivalent to md:DigitalAssetAudioEncoding-type.

Field NameTypeCardinalityExplanatory Notes
CodecEnumeration1 RequiredCodec used to encode the audio data such as AAC, AIFF, FLAC, Vorbis, WAV, or WMA. If the codec is unknown, this element should not be included.
CodecTypexs:string0-∞ OptionalEncoding of codec type based on formal registries. The first part includes the name of the authority which must be either: mpeg4ra, IANA, rfc4281
This is followed by a colon and then up to 128 characters. For example:
rfc4281: audio/3gpp2; codecs=mp4a.E1
BitrateMaxxs:integer0-1 OptionalMaximum bitrate (bits/second)
BitrateAveragexs:integer0-1 OptionalBitrate averaged over the entire track. (bits/second)
VBREnumeration:
VBR, Constrained VBR, 2-pass VBR
0-1 OptionalVBR type. (Based on md:string-Audio-Enc-VBR.)
SampleRatexs:integer0-1 OptionalOptional temporal sample rate in Hz (samples/second)
SampleBitDepthxs:integer0-1 OptionalOptional number of bits per audio sample
ChannelMappingEnumeration0-1 OptionalIndication of how channels are mapped to intended speaker locations. See “Channel Mapping Details” below.
WatermarkDigitalAssetWatermark-type0-∞ OptionalInformation about watermark(s) embedded in audio. See “Watermark Details” section below.
ActualLengthxs:duration0-1 OptionalOptional number of bits per audio sample

Sample XML for the Encoding element:

<md:Encoding>
  <md:Codec>AAC</md:Codec>
  <md:BitrateMax>192000</md:BitrateMax>
  <md:SampleRate>48000</md:SampleRate>
  <md:SampleBitDepth>16</md:SampleBitDepth>
</md:Encoding>

Channel Mapping Details

ValueExplanatory Notes
Enumeration:
Left, Center, Right, LFE screen, Left surround, Right surround, Center surround, Left center, Right center, LFE 2, Vertical height front, Top center surround, Left wide, Right wide, Rear surround left, Rear surround right, Left surround direct, Right surround direct
From SMPTE 428-3
Left Top Front Surround 
Left Top Rear Surround 
Right Top Front Surround 
Right Top Rear Surround 
stereoLeft and Right
dual mono 
5.1 matrix5.1 channels matrixed in two channels
6.1 Matrix 
surroundGreater than two channels, without a specific channel assignment
L,R,C,LFE,LS,RSLeft, Right, Center, Low Frequency Effects, Left Surround, Right Surround
IMAX 6.0 
IMAX 6.1 
IMAX 12.0 
IMAX 12.1 
L,C,R,LS,RS,LFELeft, Center, Right, Left Surround, Right Surround, Low Frequency Effects

Watermark Details

This describes the Watermark element details:

Field NameTypeCardinalityExplanatory Notes
Vendorxs:string of 2-128 characters0-1 OptionalOrganization associated with the watermark.
ProductAndVersionIDxs:string of 1-128 characters0-1 OptionalIdentification of specific watermark version of the technology. It must be sufficiently precise to differentiate between incompatible watermarks from the same Vendor.
Dataxs:string of 1-256 characters0-1 OptionalData is a string that either contains the information encoded by the watermark or is a reference to that data. Its content is outside the scope of this document. This may be vendor- private data.

Private Details

This describes the Private element details that have been defined for EIDR. Based to md:PrivateData-type.

Field NameTypeCardinalityExplanatory Notes
EncodingAgentpartyDOIType0-1 OptionalFor the EIDR Party that created this container or track, or is otherwise associated with it. Informational only, and not used for access control.
Descriptionxs:string of 1-128 characters0-1 OptionalDescription of this Track.
Hash{xs:string, method of md:Hash-type such as MD5}0-8 OptionalHash of the file and the method used to generate it. The values for method from md:Hash-type are as follows:
Message Digest: MD2, MD4, MD5, C4. Secure Hash: SHA-0, SHA-1, SHA-2, SHA-3 Cyclic Redundancy Check: CRC16, CRC32, CRC64
Size{xs:positiveInteger, optional pad of xs:positiveInteger}1 RequiredSize of the track, in bytes, with optional pad attribute block size to which Size is padded.

Video Track Details

The following fields are found in a Video element:

Field NameTypeCardinalityExplanatory Notes
Description{xs:string of 1-128 characters, lang of xs:language}0-1 OptionalExplanation of what the audio encoding is, why it exists, and similar.
TypeControlled vocabulary: primary, overlay, angle, enhancement, other0-1 OptionalDescribes purpose of the track. If not present, track is assumed to be primary. Based on md:string-Video-Type.
Whether the primary has burned-in subtitles is determined by the presence of the SubtitleLanguage element.
overlay refers to PIP or other overlay track, intended for use with a “primary” track.
angle is alternate angle track. Other is for none of the above.
Encodingmd:DigitalAssetVideoEncoding-type0-1 OptionalSee “Video Encoding Details” immediately below.
Picturemd:DigitalAssetVideoPicture-type1 RequiredSee “Picture Details” below.
ColorTypemd:ColorType-type, which has an enumeration: color, bandw, colorized, composite, unknown0-1 OptionalType of color in the visual portion of the content. color will be the most common color type. bandw for black and white. colorized is for video that has had color added to what was originally filmed in black-and-white. composite mixes black and white and color segments in the same frame as in the movie Sin City. unknown is for assets based on legacy metadata where color type is not specified.
PictureFormatControlled vocabulary: Letterbox, Pillarbox, Full, Stretch, Pan and Scan, 360, Other0-1 OptionalBased on the md:string-Video-PictureFormat type.
“Full” means the entire original image is substantially included and that the active pixels fit the full area of the picture (within a few pixels). (This should not be confused with fullscreen, a term that may also refer to Pan and Scan.) “Pan and Scan” also includes any other cropping methods.
“Other” is picture format encoding other than the above applies. For example, “SmileBox” or “windowbox”.
SubtitleLanguage{xs:language, closed Boolean attribute, type attribute (enumeration: normal, SDH, large, forced, commentary, easyreader, other)}0-1 OptionalIndicates the presence of subtitles embedded in the video stream, either closed (e.g., EIA-608B) or rendered into the video. This would apply to silent films with intertitles, where the type value would be “normal”. This should not be used for subtitles handled via separate tracks. Subtitles in separate tracks should be included in DigitalAssetMetadata-type’s Subtitle element. This is md:DigitalAssetVideoSubtitleLanguage-type.
SignedLanguagexs:language0-1 OptionalIndicates the presence of signed language in the video. The value must be a sign language such as American or French Sign Language.
CardsetListmd:DigitalAssetCardsetList-type0-∞ OptionalCardsets, such as distribution logos and anti-piracy notices, embedded in video. See “Cardset List Details” below.
TrackReferencexs:string of 1-128 characters0-1 OptionalTrack cross-reference to be used in conjunction with container-specific metadata. (Based on md:string-TrackReference.)
Privatemd:PrivateData-type0-1 OptionalSee the “Private Details” section below.

Sample XML for the Video block:

<md:Video>
  <md:Type>primary</md:Type>
  <md:Encoding>
    <md:Codec>H.264</md:Codec>
    <md:CodecType>IANA:h.264</md:CodecType>
    <md:BitrateMax>1600000</md:BitrateMax>
  </md:Encoding>
  <md:Picture>
    <md:AspectRatio>1:1</md:AspectRatio>
    <md:FrameRate>24</md:FrameRate>
  </md:Picture>
</md:Video>

Video Encoding Details

This describes the audio encoding details. Equivalent to md:DigitalAssetAudioEncoding-type.

Field NameTypeCardinalityExplanatory Notes
CodecEnumeration1 RequiredCodec used to encode the video data such as H.264, Sorenson 3. See “Video Codec Details” below.
CodecTypexs:string0-∞ OptionalEncoding of codec type based on formal registries. The first part includes the name of the authority which must be either: mpeg4ra, IANA
This is followed by a colon and then up to 128 characters. For example:
mpeg4ra: mp4v
MPEGProfileEnumeration0-1 OptionalThe MPEG profile: For MPEG-2: SP, MP, SNR, Spatial, HP, 422, MVP For MPEG-4: MP, CBP, BP, MP, XP, HiP, CHiP (Constrained High Profile), Hi10P, Hi422P, Hi444P, Hi444PP, Hi10IP, Hi422IP, Hi444IP, C444IP, SBP, SCBP, SHP, SHIP, SCHP, StereoHP, MultiviewHP
MPEGLevelEnumeration0-1 OptionalThe MPEG level: For MPEG-2: LL, ML, H-14 (High Level 1440), HL For MPEG-4: 1, 1b, 1.1, 1.2, 1.3, 2, 2.1, 2.2, 3, 3.1, 3.2, 4, 4.1, 4.2, 5.1, 5.2
BitrateMaxxs:integer0-1 OptionalMaximum bitrate (bits/second)
BitrateAveragexs:integer0-1 OptionalBitrate averaged over the entire track. (bits/second)
VBREnumeration:
VBR, Constrained VBR, 2-pass VBR
0-1 OptionalVBR type. (Based on md:string-Audio-Enc-VBR.)
SampleRatexs:integer0-1 OptionalOptional temporal sample rate in Hz (samples/second)
SampleBitDepthxs:integer0-1 OptionalOptional number of bits per audio sample
ChannelMappingEnumeration0-1 OptionalSee “Channel Mapping Details”.
WatermarkEnumeration0-∞ OptionalInformation about watermark(s) embedded in audio. See “Watermark Details” below.
ActualLengthxs:duration0-1 OptionalThe actual duration of the encoded video stream.

Sample XML for the Encoding element:

<md:Encoding>
  <md:Codec>H.264</md:Codec>
  <md:CodecType>IANA:h.264</md:CodecType>
  <md:BitrateMax>1600000</md:BitrateMax>
</md:Encoding>

Video Codec Details

ValueExplanatory Notes
AVI UncompressedAVI Uncompressed
CineForm HDCineForm HD
DIVXDivX
DVDV, including variants such as DVCPRO, DVCAM, etc.
H.264H.264, MPEG-4 Part 10
H.264-DolbyVisionH.264 Dolby Vision Enhancement layer
H.265HEVC/H.265
H.265-DolbyVisionH.265 Dolby Vision Enhancement layer
JPEG2000JPEG 2000
MOBICLIPActimagine’s Mobiclip
MPEG1MPEG 1 Part 2
MPEG2MPEG 2 Part 2
On2On2 codec when not VP6, VP7 or VP8, or exact codec is unknown.
PHOTOJPEGPHOTOJPEG
PRORESApple ProRes
PRORESHQApple ProRes HQ
PRORES422Apple ProRes 422
PRORESXQProRes 4444 XQ
PRORES4444ProRes 4444
QT UncompressedApple QT Uncompressed
REALRealVideo
SparkSorenson Spark
SVQSorenson Video Quantizer
WMVWindows Media Video when not WMV7, WVM8 or WMV9 or exact codec is unknown.
WMV7Windows Media Video 7
WMV8Windows Media Video 8
WMV9Windows Media Video 9
VC1Microsoft VC-1
VP6On VP6
VC-2VC-2 as defined by SMPTE 2042 [SMPTE-2042]; also known as Dirac.
VC-3VC-3, as defined by SMPTE ST 2019-1 [SMPTE-2019]; also known as Avid DNxHD.
VC-5VC-5 as defined by SMPTE 2073 [SMPTE-2073] ; also known as CineForm.
VC-6In development. Term may be used for SMPTE VC-6.
VP7On VP7
VP8On VP8
VP9Google VP9
XVIDXvid
OTHERNone of the above.

Picture Details

This is md:DigitalAssetVideoPicture-type. Only AspectRatio is required in this element.

Field NameTypeCardinalityExplanatory Notes
AspectRatioRestricted to a pattern of N:M, NN:M, N.NN:M1 RequiredVideo aspect ratio of the encoded object, which may differ from that of the original. 16:9 is HD, 4:3 is SD, 1.85:1 is 37:20, 2.2:1 is 11:5, 2.35:1 is 47:20.
PixelAspectControlled vocabulary: NTSC, PAL, square, other0-1 OptionalAspect ratio of a pixel. Note that the arithmetic ratio is implied (square would be 1:1, NTSC is 10:11). Same as md:string-Video-Pic-PixelAspect.
WidthPixelsxs:int0-1 OptionalNumber of columns encoded (e.g., 1920)
HeightPixelsxs:int0-1 OptionalNumber of rows encoded (e.g., 1080)
ActiveWidthPixelsxs:int0-1 OptionalMust be <= WidthPixels
ActiveHeightPixelsxs:int0-1 OptionalMust be <=HeightPixels
FrameRate{xs:int, multiplier, timecode}0-1 OptionalIn frames per second. If interlaced, use frame rate (e.g., NTSC is 30), not the field rate.
The multiplier attribute indicates whether the 1000/1001 multiple should be applied. There is only one legal value for this attribute which is “1000/1001”. If present, then apply 1000/1001 multiplier to FrameRate. For example, a FrameRate of 30 with multiplier=“1000/1001” defines an actual frame rate of 29.97. If the frame rate is integral, this attribute shall not be present.
The timecode attribute indicates how drop frames are handled in timecode. The values are: Drop (Drop frame SMPTE timecode), EBU (AES/EBU embedded timecode), Other.
Progressive{xs:boolean, scanOrder}0-1 Optionaltrue for progressive
false for interlaced The scanOrder attribute indicates the scan order which differentiates interlaced types. The values for false are TFF (Top Field First), BFF (Bottom Field First), while true this value is not recommended, but if supplied must be PFF (Picture Per Field).
ColorSubSamplingEnumerated: 4:1:1, 4:2:0, 4:2:2, 4:4:40-1 OptionalChroma subsampling method.
ColorimetryEnumerated: 601, 709, 2020, P3, xvYCC7090-1 Optional“601” – ITU Recommendation BT.601, Studio encoding parameters of digital television for standard 4:3 and wide screen 16:9 aspect ratios http://www.itu.int/rec/R-REC-BT.601/en “709” – ITU Recommendation BT.709, Parameter values for the HDTV standards for production and international program exchange. http://www.itu.int/rec/R-REC-BT.709/en “2020” – ITU Recommendation BT.2020, Parameter values for ultra-high definition television systems for production and international program exchange. http://www.itu.int/rec/R-REC-BT.2020/en “P3” – SMPTE PR 431-2:2011 D-Cinema Quality – Reference Projector and Environment. This is also referred to as DCI-P3 or P3. “xvYCC709” – Colorimetry for use with Rec.709 primaries defined in [IEC61966-2-4]
Type3DControlled vocabulary: IMAX, RealD, etc.0-1 OptionalDescribes type of 3D picture. Required if ThreeD is true.

Sample XML for the Picture element:

<md:Picture>
  <md:AspectRatio>1:1</md:AspectRatio>
  <md:FrameRate>24</md:FrameRate>
</md:Picture>

Cardset List Details

A cardset is the collection of static text or graphics separate from the work itself that appear at the beginning or end of the video. Cardsets are typically specific to a market and include distributor logos and anti-piracy warnings. Cardsets may be embedded in video (i.e., burned in) or overlaid on video via a subtitle.

The following fields are found in a CardsetList element:

Field NameTypeCardinalityExplanatory Notes
TypeControlled Vocabulary: Theatrical, Broadcast, Hospitality, Rental, EST0-1 OptionalThe intended general usage of the cardset list. Based on md:string-Subtitle-Format.
RegionEnumeration0-1 OptionalThis describes the region(s) for which the cardset was made. The values can be Domestic, International or an ISO 3166-1 alpha 2, two-character code.
Cardsetmd:DigitalAssetCardset-type1 RequiredDescription of the cardset. See immediately below.

The following fields are found in a Cardset element:

Field NameTypeCardinalityExplanatory Notes
TypeControlled Vocabulary: AntiPiracy, DistributionLogo, Rating, DubbingCredit, Intermission, EditNotice, Health, Other0-∞ OptionalThe intended general usage of the cardset list.
This is identical to md:string-Cardset-Type.
Descriptionxs:string of 1-128 characters0-1 OptionalDescription of cardset (human readable).
Sequencexs:positiveInteger0-1 OptionalOrder of display for this cardset. A higher number represents later display. Cardsets with the same sequence must not overlap Region.

Subtitle Track Details

The following fields are found in a Subtitle element:

Field NameTypeCardinalityExplanatory Notes
Format{Controlled Vocabulary: Text, Image, Combined, SDImage of xs:boolean, HDImage of xs:boolean }0-1 OptionalFormat for subtitles that are in separate components from the video. Based on md:string-Subtitle-Format. SDImage Are subtitle images targeted towards SD included? “true” means yes, “false” or absent means no. This only applies if Format is “Image” or “Combined” HDImage Are subtitle images targeted towards HD included? “true” means yes, “false” or absent means no. This only applies if Format is “Image” or “Combined”
Description{xs:string of 1-128 characters, lang of xs:language}0-1 OptionalExplanation of what the subtitle is/why it exists, etc. Description is in the language of the Description text.
TypeControlled Vocabulary: normal, SDH, large, forced, noforced, commentary, easyreader, singalong, other  1-8 RequiredSubtitle purpose. If the track has more than one intended use, then include multiple instances of Type. Based on md:string-Subtitle-Type.
normal – typically used for the actors’ dialogue
SDH – for the hearing impaired
large – for the visually impaired
forced – subtitles are always shown (regardless of whether the user has enabled subtitles) noforced – indicates subtitles do not contain forced subtitles. Must be used with another Type, but not ‘forced’. For example, a subtitle with Type of ‘normal’ and ‘noforced’ would contain all language subtitles except forced subtitles.
easyreader – complying with US Federal requirements [47CFR9.103(c)(9)] singalong – Timed text is used primarly to show words that go with song for the purpose of singing along. This should only be usd if distinct from SDH.  
FormatTypeControlled Vocabulary: 3GPP, Blu-ray, CAP, DCI, DVB, DVD, DXFP, SMPTE 21052-1 Timed Text, SCC, SRT, STL, TTML, ITT, CFF-TT, IMSC1, WebVTT  0-8 OptionalSubtitle format. If the track has more than one intended use, then include multiple instances of Type. The most common value is simply normal. Based on md:string-Subtitle-FormatType.
Languagexs:language1 RequiredLanguage of the subtitle.
CardSetListmd:DigitalAssetCardsetList-type0-∞ OptionalSee “Cardset List Details”.
TrackReference1-64 character xs:string0-1 OptionalTrack cross-reference to be used in conjunction with container-specific metadata. See explanation under .
Privatemd:PrivateData-type0-1 OptionalSee the “Private Details” section below.

Sample XML for the Subtitle element:

<md:Subtitle>
  <md:Format HDImage="false" SDImage="false">Text</md:Format>
  <md:Type>normal</md:Type>
  <md:FormatType>SMPTE 2052-1 Timed Text</md:FormatType>
  <md:Language>en</md:Language>
</md:Subtitle>

Interactive Track Details

The following fields are found in an Interactive element:

Field NameTypeCardinalityExplanatory Notes
TypeControlled vocabulary1 RequiredDescribes purpose of the track. Menu – Menu system for navigating settings, value added material and other options. Mixed-Media – Mixed Media Experience, such as Cross-Platform Extras (CPE) or iTunes Extras package Standalone Game – Playable game that runs independently of audio or video material Overlay Game – Game synchronized to audio or video material Skins – Information that customizes appearance Interactivity – Ability to choose settings, value added material and other options outside of menus. For example, pop-ups. Image – Identifies the special case where the interactive application is an image. This supports the case where no other application Type is playable. This is typically used in conjunction with Encoding/RuntimeEnvironment=‘Default’ Commerce – Commerce Experience Location – Location or Mapping application Live – Live Data feed Comic – Digital Comic VR – Virtual Reality Experience. AR – Augmented Reality Experience MR – Mixed Reality Experience 360 – Linear 360-degree video experience. This covers 360-degree experiences not encoded as a single linear video. Typically, it will fall in this category if playback requires a player not currently assumed in Common Metadata. Other – None of the above.
FormatTypeControlled vocabulary: text, executable, metadata0-1 OptionalDescribes the format of the track. Text – Instructive text. Executable – Software that is executable through a runtime environment such as those described in “Interactive Encoding Details” RuntimeEnvironment. Metadata – Declarative data that describes behavior to a runtime environment
Language{xs:language, dubbed Boolean attribute}0-1 OptionalThe language of this Track. Dubbed is false for subtitles.
Encodingmd:DigitalAssetInteractiveEncoding-type0-1 OptionalSee “Interactive Encoding Details” below.
TrackReferencexs:string of 1-128 characters0-1 OptionalTrack cross-reference to be used in conjunction with container-specific metadata. (Based on md:string-TrackReference.)
Privatemd:PrivateData-type0-1 OptionalSee the “Private Details” section above.

Interactive Encoding Details

This describes the interactive encoding details. Equivalent to md:DigitalAssetInteractiveEncoding-type.

Field NameTypeCardinalityExplanatory Notes
RuntimeEnvironmentEnumeration  1 RequiredThe execution runtime environment for the interactive content: CMX – Connected Media Experience Flash – Adobe Flash BD-J – Blu-ray Java MHEG – MHEG-5, or more formally ISO/IEC 13522-5. HTML5 – W3C HTML5 Android – Android operating system native app iOS – Apple iOS operating system native app tvOS – Apple tvOS MacOS – Apple MacOS native app Windows – Microsoft Windows native app BrightScript – Roku BrightScript native app Linux – Linux native app Default – Represents an application that can be played if nothing else can. This is typically an image. Other – may be used when there is not a type convention.
FirstVersionxs:string of 1-64 characters0-1 OptionalEarliest version of RuntimeEnvironment in which this encoding will play. If it plays in all versions, or all versions less than or equal to LastVersion, this element may be omitted.
LastVersionxs:string of 1-64 characters0-1 OptionalLast version of RuntimeEnvironment in which this encoding will play. If it plays in all versions, or all versions after FirstVersion, this element may be omitted.
Updated on April 11, 2021

Was this article helpful?

Related Articles