5168
INFORMATIONAL

XML Schema for Media Control

Authors: O. Levin, R. Even, P. Hagendorf
Date: March 2008
Working Group: NON WORKING GROUP
Stream: IETF

Abstract

This document defines an Extensible Markup Language (XML) Schema for video fast update in a tightly controlled environment, developed by Microsoft, Polycom, Radvision and used by multiple vendors. This document describes a method that has been deployed in Session Initiation Protocol (SIP) based systems over the last three years and is being used across real-time interactive applications from different vendors in an interoperable manner. New implementations are discouraged from using the method described except for backward compatibility purposes. New implementations are required to use the new Full Intra Request command in the RTP Control Protocol (RTCP) channel. This memo provides information for the Internet community.

RFC 5168: XML Schema for Media Control [RFC Home] [TEXT|PDF|HTML] [Tracker] [IPR] [Info page]

INFORMATIONAL
Network Working Group                                           O. Levin
Request for Comments: 5168                         Microsoft Corporation
Category: Informational                                          R. Even
                                                                 Polycom
                                                            P. Hagendorf
                                                               RADVISION
                                                              March 2008


                      <span class="h1">XML Schema for Media Control</span>

Status of This Memo

   This memo provides information for the Internet community.  It does
   not specify an Internet standard of any kind.  Distribution of this
   memo is unlimited.

Abstract

   This document defines an Extensible Markup Language (XML) Schema for
   video fast update in a tightly controlled environment, developed by
   Microsoft, Polycom, Radvision and used by multiple vendors.  This
   document describes a method that has been deployed in Session
   Initiation Protocol (SIP) based systems over the last three years and
   is being used across real-time interactive applications from
   different vendors in an interoperable manner.  New implementations
   are discouraged from using the method described except for backward
   compatibility purposes.  New implementations are required to use the
   new Full Intra Request command in the RTP Control Protocol (RTCP)
   channel.





















<span class="grey">Levin, et al.                Informational                      [Page 1]</span>

<span id="page-2" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


Table of Contents

   <a href="#section-1">1</a>. Introduction ....................................................<a href="#page-2">2</a>
   <a href="#section-2">2</a>. Conventions .....................................................<a href="#page-2">2</a>
   <a href="#section-3">3</a>. Background ......................................................<a href="#page-3">3</a>
   <a href="#section-4">4</a>. The Video Control Commands ......................................<a href="#page-3">3</a>
   <a href="#section-5">5</a>. The Schema Definition ...........................................<a href="#page-4">4</a>
   <a href="#section-6">6</a>. Error Handling ..................................................<a href="#page-5">5</a>
   <a href="#section-7">7</a>. Examples ........................................................<a href="#page-5">5</a>
      <a href="#section-7.1">7.1</a>. The Fast Update Command for the Full Picture ...............<a href="#page-5">5</a>
      <a href="#section-7.2">7.2</a>. Reporting an Error .........................................<a href="#page-5">5</a>
   <a href="#section-8">8</a>. Transport .......................................................<a href="#page-6">6</a>
   <a href="#section-9">9</a>. IANA Considerations .............................................<a href="#page-6">6</a>
      <a href="#section-9.1">9.1</a>. Application/media_control+xml Media Type Registration ......<a href="#page-6">6</a>
   <a href="#section-10">10</a>. Security Considerations ........................................<a href="#page-7">7</a>
   <a href="#section-11">11</a>. References .....................................................<a href="#page-8">8</a>
      <a href="#section-11.1">11.1</a>. Normative References ......................................<a href="#page-8">8</a>
      <a href="#section-11.2">11.2</a>. Informative References ....................................<a href="#page-8">8</a>

<span class="h2"><a class="selflink" id="section-1" href="#section-1">1</a>.  Introduction</span>

   This document defines an Extensible Markup Language (XML) Schema for
   video fast update request in a tightly controlled environment,
   developed by Microsoft, Polycom, Radvision and used by multiple
   vendors.  Implementation of this schema for interactive video
   applications in Session Initiation Protocol (SIP) [<a href="#ref-5" title=""SIP: Session Initiation Protocol"">5</a>] environments
   was designed in order to improve user experience.  This mechanism is
   being used by both end user video conferencing terminals and
   conferencing servers in shipping products.  This document describes
   the current method, but new implementations are discouraged from
   using this method, except for backward compatibility with legacy
   systems.  Shipping products and new products SHOULD use the Full
   Intra Request, described in [<a href="#ref-7" title=""Codec Control Messages in the RTP Audio-Visual Profile with Feedback (AVPF)"">7</a>].

   Sending video fast update using the SIP signaling path, as described
   in this document, is inferior to using the RTP Control Protocol
   (RTCP) feedback method [<a href="#ref-7" title=""Codec Control Messages in the RTP Audio-Visual Profile with Feedback (AVPF)"">7</a>], since the command flows through all the
   proxies in the signaling path adding delay to the messages and
   causing unnecessary overload to the proxies.  RTCP messages flow
   end-to-end and not through the signaling proxies.  The RTCP feedback
   document [<a href="#ref-7" title=""Codec Control Messages in the RTP Audio-Visual Profile with Feedback (AVPF)"">7</a>] also adds other required control functions, such as the
   flow control command, which is missing from this document.

<span class="h2"><a class="selflink" id="section-2" href="#section-2">2</a>.  Conventions</span>

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in <a href="./rfc2119">RFC 2119</a> [<a href="#ref-2" title=""Key words for use in RFCs to Indicate Requirement Levels"">2</a>].



<span class="grey">Levin, et al.                Informational                      [Page 2]</span>

<span id="page-3" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


<span class="h2"><a class="selflink" id="section-3" href="#section-3">3</a>.  Background</span>

   SIP typically uses the Real-time Transport Protocol (RTP) [<a href="#ref-6" title=""RTP: A Transport Protocol for Real-Time Applications"">6</a>] for the
   transferring of real-time media.

   RTP is augmented by a control protocol (RTCP) to allow monitoring of
   the data delivery in a manner scalable to large multicast networks.
   The RTCP feedback mechanism [<a href="#ref-8" title=""Extended RTP Profile for Real-time Transport Control Protocol (RTCP)-Based Feedback (RTP/AVPF)"">8</a>] has been introduced in order to
   improve basic RTCP feedback time in case of loss conditions across
   different coding schemes.  This technique addresses signaling of loss
   conditions and the recommended recovery steps.

   Just recently, an extension to the feedback mechanism has been
   proposed [<a href="#ref-7" title=""Codec Control Messages in the RTP Audio-Visual Profile with Feedback (AVPF)"">7</a>] to express control operations on media streams as a
   result of application logic rather than a result of loss conditions.
   Note that in the decomposed systems, the implementation of the new
   mechanism will require proprietary communications between the
   applications/call control components and the media components.

   This document describes a technology that has been deployed in
   SIP-based systems over the last three years and is being used across
   real-time interactive applications from different vendors in an
   interoperable manner.  This memo documents this technology for the
   purpose of describing current practice and new implementation MUST
   use the RTCP Full Intra Request command specified in the RTCP-based
   codec control messages document[7].

<span class="h2"><a class="selflink" id="section-4" href="#section-4">4</a>.  The Video Control Commands</span>

   Output of a video codec is a frame.  The frame can carry complete
   information about a picture or about a picture segment.  These frames
   are known as "Intra" frames.  In order to save bandwidth, other
   frames can carry only changes relative to previously sent frames.
   Frames carrying relative information are known as "Inter" frames.

   Based on application logic (such as need to present a new video
   source), the application needs to have an ability to explicitly
   request from a remote encoder the complete information about a "full"
   picture.

   An "Intra" frame may be of large size.  In order to prevent causing
   network congestion, the current media capacity and network conditions
   MUST be validated before sending an "Intra" frame when receiving a
   fast update command, defined in this document.

   In order to meet the presented requirements, a video primitive is
   defined by this document.




<span class="grey">Levin, et al.                Informational                      [Page 3]</span>

<span id="page-4" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


   The following command is sent to the remote encoder:

   o  Video Picture Fast Update

<span class="h2"><a class="selflink" id="section-5" href="#section-5">5</a>.  The Schema Definition</span>

   <?xml version="1.0" encoding="utf-8" ?>

   <xs:schema id="TightMediaControl"
    elementFormDefault="qualified"
     xmlns:xs="http://www.w3.org/2001/XMLSchema">

           <xs:element name="media_control">
               <xs:complexType>
                  <xs:sequence>
                     <xs:element name="vc_primitive"
                                           type="vc_primitive"
                                           minOccurs="0"
                                           maxOccurs="unbounded" />
                     <xs:element name="general_error"
                                           type="xs:string"
                                           minOccurs="0"
                                           maxOccurs="unbounded" />
                  </xs:sequence>
               </xs:complexType>
           </xs:element>

           <!-- Video control primitive.  -->

           <xs:complexType name="vc_primitive">
                   <xs:sequence>
                     <xs:element name="to_encoder" type="to_encoder" />
                      <xs:element name="stream_id"
                                       type="xs:string"
                                       minOccurs="0"
                                       maxOccurs="unbounded" />
                           </xs:sequence>
           </xs:complexType>

           <!-- Encoder Command:
                Picture Fast Update
           -->

           <xs:complexType name="to_encoder">
                   <xs:choice>
                           <xs:element name="picture_fast_update"/>
                   </xs:choice>
           </xs:complexType>



<span class="grey">Levin, et al.                Informational                      [Page 4]</span>

<span id="page-5" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


   </xs:schema>

<span class="h2"><a class="selflink" id="section-6" href="#section-6">6</a>.  Error Handling</span>

   Currently, only a single general error primitive is defined.  It MAY
   be used for indicating errors in free-text format.  The general error
   primitive MAY report problems regarding XML document parsing,
   inadequate level of media control support, inability to perform the
   requested action, etc.

   The general error primitive MUST NOT be used for the indication of
   errors other than those related to media control parsing or to
   resultant execution.  The general error primitive MUST NOT be sent
   back as a result of getting an error primitive.

   When receiving the general error response, the user agent client
   (UAC) that sent the request SHOULD NOT send further fast update
   requests in the current dialog.

   According to <a href="./rfc2976">RFC 2976</a> [<a href="#ref-3" title=""The SIP INFO Method"">3</a>], the only allowable final response to a SIP
   INFO containing a message body is a 200 OK.  If SIP INFO is used to
   carry the request, the error message should be carried in a separate
   INFO request.

<span class="h2"><a class="selflink" id="section-7" href="#section-7">7</a>.  Examples</span>

<span class="h3"><a class="selflink" id="section-7.1" href="#section-7.1">7.1</a>.  The Fast Update Command for the Full Picture</span>

   In the following example, the full picture "Fast Update" command is
   issued towards the remote video decoder(s).

   <?xml version="1.0" encoding="utf-8" ?>

   <media_control>

      <vc_primitive>
       <to_encoder>
         <picture_fast_update/>
       </to_encoder>
     </vc_primitive>

   </media_control>

<span class="h3"><a class="selflink" id="section-7.2" href="#section-7.2">7.2</a>.  Reporting an Error</span>

   If an error occurs during the parsing of the XML document, the
   following XML document would be sent back to the originator of the
   original Media Control document.



<span class="grey">Levin, et al.                Informational                      [Page 5]</span>

<span id="page-6" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


   <?xml version="1.0" encoding="utf-8" ?>

   <media_control>

     <general_error>
      Parsing error: The original XML segment is:...
     </general_error>

   </media_control>

<span class="h2"><a class="selflink" id="section-8" href="#section-8">8</a>.  Transport</span>

   The defined XML document is conveyed using the SIP INFO method [<a href="#ref-3" title=""The SIP INFO Method"">3</a>]
   with the "Content-Type" set to "application/media_control+xml".  This
   approach benefits from the SIP built-in reliability.

<span class="h2"><a class="selflink" id="section-9" href="#section-9">9</a>.  IANA Considerations</span>

   This document registers a new media type.

<span class="h3"><a class="selflink" id="section-9.1" href="#section-9.1">9.1</a>.  Application/media_control+xml Media Type Registration</span>

   Type name:   application
   Subtype name:   media_control+xml
   Required parameters:   None
   Optional parameters:   charset

   Indicates the character encoding of enclosed XML.

   Encoding considerations:   8bit
      Uses XML, which can employ 8-bit characters, depending on the
      character encoding used.  See <a href="./rfc3023">RFC 3023</a> [<a href="#ref-4" title=""XML Media Types"">4</a>], Section 3.2.

   Security considerations:   Security considerations specific to uses
      of this type are discussed in <a href="./rfc5168">RFC 5168</a>.  <a href="./rfc1874">RFC 1874</a> [<a href="#ref-1" title=""SGML Media Types"">1</a>] and <a href="./rfc3023">RFC 3023</a>
      [<a href="#ref-4" title=""XML Media Types"">4</a>] discuss security issues common to all uses of XML.

   Interoperability considerations:   None.

   Published specification:   <a href="./rfc5168">RFC 5168</a>











<span class="grey">Levin, et al.                Informational                      [Page 6]</span>

<span id="page-7" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


   Applications that use this media type:   This media type is used to
      convey information regarding media control commands and responses
      between SIP endpoints particularly for allowing a Video Fast
      Update intra-frame request.

   Additional information:

   Magic Number(s):   None.
   File Extension(s):   None.
   Macintosh File Type Code(s):   None.

   Person and email address to contact for further information:

   Name:  Roni Even
   E-Mail:  [email protected]

   Intended usage: LIMITED USE

   Restrictions on usage: None.

   Author: Roni Even. <even.roni@gmail.com>

   Change Controller: Roni Even. <even.roni@gmail.com>

<span class="h2"><a class="selflink" id="section-10" href="#section-10">10</a>.  Security Considerations</span>

   The video control command transported using the method described in
   the document may cause the sender of the video data to send more data
   within the allowed bandwidth, as described in <a href="#section-4">Section 4</a>.

   This document defines one control message; changing the content of
   the message will cause the video sender to ignore the request and
   send an error response.  This may prevent the display of a video
   stream.  The control message itself does not carry any sensitive
   information.

   An eavesdropper may inject messages or change them, which may lead to
   either more data on the network or loss of video image.  Using data
   integrity validation will prevent adding or changing of messages.

   If the video media is sent over a secure transport, it is recommended
   to secure the signaling using TLS as explained in [<a href="#ref-5" title=""SIP: Session Initiation Protocol"">5</a>].  In most
   cases, securing the media will require a secure signaling path.

   The security considerations of [<a href="#ref-3" title=""The SIP INFO Method"">3</a>] and [<a href="#ref-5" title=""SIP: Session Initiation Protocol"">5</a>] apply here.






<span class="grey">Levin, et al.                Informational                      [Page 7]</span>

<span id="page-8" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


<span class="h2"><a class="selflink" id="section-11" href="#section-11">11</a>.  References</span>

<span class="h3"><a class="selflink" id="section-11.1" href="#section-11.1">11.1</a>.  Normative References</span>

   [<a id="ref-1">1</a>]  Levinson, E., "SGML Media Types", <a href="./rfc1874">RFC 1874</a>, December 1995.

   [<a id="ref-2">2</a>]  Bradner, S., "Key words for use in RFCs to Indicate Requirement
        Levels", <a href="https://www.rfc-editor.org/bcp/bcp14">BCP 14</a>, <a href="./rfc2119">RFC 2119</a>, March 1997.

   [<a id="ref-3">3</a>]  Donovan, S., "The SIP INFO Method", <a href="./rfc2976">RFC 2976</a>, October 2000.

   [<a id="ref-4">4</a>]  Murata, M., St. Laurent, S., and D. Kohn, "XML Media Types", <a href="./rfc3023">RFC</a>
        <a href="./rfc3023">3023</a>, January 2001.

   [<a id="ref-5">5</a>]  Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A.,
        Peterson, J., Sparks, R., Handley, M., and E. Schooler, "SIP:
        Session Initiation Protocol", <a href="./rfc3261">RFC 3261</a>, June 2002.

   [<a id="ref-6">6</a>]  Schulzrinne, H., Casner, S., Frederick, R., and V. Jacobson,
        "RTP: A Transport Protocol for Real-Time Applications", STD 64,
        <a href="./rfc3550">RFC 3550</a>, July 2003.

   [<a id="ref-7">7</a>]  Wenger, S., Chandra, U., Westerlund, M., and B. Burman, "Codec
        Control Messages in the RTP Audio-Visual Profile with Feedback
        (AVPF)", <a href="./rfc5104">RFC 5104</a>, February 2008.

<span class="h3"><a class="selflink" id="section-11.2" href="#section-11.2">11.2</a>.  Informative References</span>

   [<a id="ref-8">8</a>]  Ott, J., Wenger, S., Sato, N., Burmeister, C., and J. Rey,
        "Extended RTP Profile for Real-time Transport Control Protocol
        (RTCP)-Based Feedback (RTP/AVPF)", <a href="./rfc4585">RFC 4585</a>, July 2006.




















<span class="grey">Levin, et al.                Informational                      [Page 8]</span>

<span id="page-9" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


Authors' Addresses

   Orit Levin
   Microsoft Corporation
   One Microsoft Way
   Redmond, WA  98052
   USA

   EMail: [email protected]


   Roni Even
   Polycom
   94 Derech Em Hamoshavot
   Petach Tikva,   49130
   Israel

   EMail: [email protected]


   Pierre Hagendorf
   RADVISION
   24, Raul Wallenberg St.
   Tel-Aviv,   69719
   Israel

   EMail: [email protected]
























<span class="grey">Levin, et al.                Informational                      [Page 9]</span>

<span id="page-10" ></span>
<span class="grey"><a href="./rfc5168">RFC 5168</a>                     Media Control                    March 2008</span>


Full Copyright Statement

   Copyright (C) The IETF Trust (2008).

   This document is subject to the rights, licenses and restrictions
   contained in <a href="https://www.rfc-editor.org/bcp/bcp78">BCP 78</a>, and except as set forth therein, the authors
   retain all their rights.

   This document and the information contained herein are provided on an
   "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
   OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY, THE IETF TRUST AND
   THE INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS
   OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF
   THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
   WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Intellectual Property

   The IETF takes no position regarding the validity or scope of any
   Intellectual Property Rights or other rights that might be claimed to
   pertain to the implementation or use of the technology described in
   this document or the extent to which any license under such rights
   might or might not be available; nor does it represent that it has
   made any independent effort to identify any such rights.  Information
   on the procedures with respect to rights in RFC documents can be
   found in <a href="https://www.rfc-editor.org/bcp/bcp78">BCP 78</a> and <a href="https://www.rfc-editor.org/bcp/bcp79">BCP 79</a>.

   Copies of IPR disclosures made to the IETF Secretariat and any
   assurances of licenses to be made available, or the result of an
   attempt made to obtain a general license or permission for the use of
   such proprietary rights by implementers or users of this
   specification can be obtained from the IETF on-line IPR repository at
   <a href="http://www.ietf.org/ipr">http://www.ietf.org/ipr</a>.

   The IETF invites any interested party to bring to its attention any
   copyrights, patents or patent applications, or other proprietary
   rights that may cover technology that may be required to implement
   this standard.  Please address the information to the IETF at
   [email protected].












Levin, et al.                Informational                     [Page 10]

Additional Resources