U.S. patent application number 16/036703 was filed with the patent office on 2018-11-08 for apparatus and method for providing streaming content.
This patent application is currently assigned to Electronics and Telecommunications Research Institute. The applicant listed for this patent is Electronics and Telecommunications Research Institute, INDUSTRY-UNIVERSITY COOPERATION FOUNDATION KOREA AEROSPACE UNIVERSITY. Invention is credited to Seong Jun BAE, Soon Heung JUNG, Jung Won KANG, Jae Gon KIM, Jin Young LEE, Sang Taick PARK, Won RYU, Truong Cong THANG.
Application Number | 20180324241 16/036703 |
Document ID | / |
Family ID | 50548518 |
Filed Date | 2018-11-08 |
United States Patent
Application |
20180324241 |
Kind Code |
A1 |
THANG; Truong Cong ; et
al. |
November 8, 2018 |
APPARATUS AND METHOD FOR PROVIDING STREAMING CONTENT
Abstract
A method and apparatus for an adaptive Hypertext Transfer
Protocol (HTTP) streaming service using metadata of content are
provided. The metadata may include one or more BaseURL elements.
Uniform Resource Locators (URLs) of segments forming media may be
generated based on the BaseURL elements. Additionally, a URL of a
segment may be generated by mapping a BaseURL element among the
BaseURL elements to a sourceURL attribute of the segment. Segments
indicated by generated URLs may be identical to each other.
Inventors: |
THANG; Truong Cong;
(Daejeon, KR) ; LEE; Jin Young; (Daejeon, KR)
; BAE; Seong Jun; (Daejeon, KR) ; KANG; Jung
Won; (Daejeon, KR) ; JUNG; Soon Heung;
(Daejeon, KR) ; PARK; Sang Taick; (Daejeon,
KR) ; RYU; Won; (Daejeon, KR) ; KIM; Jae
Gon; (Goyang-si Gyeonggi-do, KR) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
Electronics and Telecommunications Research Institute
INDUSTRY-UNIVERSITY COOPERATION FOUNDATION KOREA AEROSPACE
UNIVERSITY |
Daejeon
Goyang-si Gyeonggi-do |
|
KR
KR |
|
|
Assignee: |
Electronics and Telecommunications
Research Institute
Daejeon
KR
INDUSTRY-UNIVERSITY COOPERATION FOUNDATION KOREA AEROSPACE
UNIVERSITY
Goyang-si Gyeonggi-do
KR
|
Family ID: |
50548518 |
Appl. No.: |
16/036703 |
Filed: |
July 16, 2018 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
15287260 |
Oct 6, 2016 |
10027736 |
|
|
16036703 |
|
|
|
|
14146500 |
Jan 2, 2014 |
9467493 |
|
|
15287260 |
|
|
|
|
13820955 |
Mar 5, 2013 |
8645562 |
|
|
PCT/KR2011/006573 |
Sep 6, 2011 |
|
|
|
14146500 |
|
|
|
|
61434036 |
Jan 19, 2011 |
|
|
|
61417931 |
Nov 30, 2010 |
|
|
|
61414462 |
Nov 17, 2010 |
|
|
|
61405674 |
Oct 22, 2010 |
|
|
|
61390328 |
Oct 6, 2010 |
|
|
|
61380277 |
Sep 6, 2010 |
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
H04L 29/06027 20130101;
H04L 65/4084 20130101; H04L 65/607 20130101; H04N 21/47202
20130101; H04L 67/02 20130101; H04L 65/608 20130101; H04N 21/6175
20130101; H04N 21/84 20130101; H04N 21/4305 20130101; H04N 21/8543
20130101; H04N 21/6125 20130101; H04N 21/26258 20130101; H04N
21/23439 20130101; H04N 21/6581 20130101 |
International
Class: |
H04L 29/08 20060101
H04L029/08; H04L 29/06 20060101 H04L029/06; H04N 21/8543 20060101
H04N021/8543; H04N 21/84 20060101 H04N021/84; H04N 21/658 20060101
H04N021/658; H04N 21/61 20060101 H04N021/61; H04N 21/262 20060101
H04N021/262; H04N 21/2343 20060101 H04N021/2343; H04N 21/472
20060101 H04N021/472 |
Foreign Application Data
Date |
Code |
Application Number |
Sep 6, 2011 |
KR |
10-2011-0089923 |
Claims
1. A terminal comprising: one or more processor is configured to:
receive metadata of media content, the metadata comprising an
attribute with multiple BaseURL elements, select a BaseURL element
from the multiple BaseURL elements; send a request for a segment of
the media content using a URL of the segment to a server, the URL
being generated based on the selected BaseURL element; receive the
segment from the server, wherein the metadata is Media Presentation
Description (MPD), wherein the MPD describes one or more periods,
wherein the period includes one or more groups wherein the group
includes one or more representation, wherein the representation
includes one or more segments of the media content, wherein the
media content is requested using a URI generated based on a
resource type including bitrate, vertical resolution, or horizontal
resolution.
2. The terminal of claim 1, wherein the BaseURL element include an
MPD level Base URL element of MPD, a period level Base URL element
of each period, a group level Base URL element of each group, and a
representation level Base URL element of each representation.
3. The terminal of claim 2, wherein the BaseURL element of a
specific level is resolved with respect to a BaseURL element of a
higher level, wherein the period level BaseURL element is resolved
with respect to the MPD level BaseURL element, wherein the group
level BaseURL element is resolved with respect to the period level
BaseURL element, wherein the representation level BaseURL element
is resolved with respect to the group level BaseURL element.
4. The terminal of claim 1, wherein identical segments are
accessible at multiple locations indicated by URLs resolved with
respect to the respective BaseURL elements, and a first BaseURL
element among the BaseURL elements is used as a base Universal
Resource Indicator (URI).
5. The terminal of claim 1, wherein the request is sent using an
HTTP GET method.
6. The terminal of claim 1, wherein the metadata comprises a range
attribute, and wherein the request comprises a request for bytes of
a resource indicated by the URL that are designated by the range
attribute.
7. The terminal of claim 1, wherein the URL is an absolute URL or a
relative URL.
8. The terminal of claim 1, wherein the metadata selectively
comprises a sourceURL attribute of the segment, and wherein, when
the metadata selectively comprises the sourceURL attribute of the
segment, a BaseURL element among the BaseURL elements is mapped to
the sourceURL attribute, so that the URL is generated.
9. A server comprising: one or more processor is configured to:
receive a request for a segment of the media content using a URL of
the segment from a terminal, the URL being generated based on the
selected BaseURL element; provide the segment to the terminal,
wherein the terminal selects a BaseURL element from the multiple
BaseURL elements based on the metadata of the media content,
wherein the metadata is Media Presentation Description (MPD),
wherein the MPD describes one or more periods, wherein the period
includes one or more groups wherein the group includes one or more
representation, wherein the representation includes one or more
segments of the media content, wherein the media content is
requested using a URI generated based on a resource type including
bitrate, vertical resolution, or horizontal resolution.
10. The server of claim 9, wherein the BaseURL element include an
MPD level Base URL element of MPD, a period level Base URL element
of each period, a group level Base URL element of each group, and a
representation level Base URL element of each representation.
11. The server of claim 10, wherein the BaseURL element of a
specific level is resolved with respect to a BaseURL element of a
higher level, wherein the period level BaseURL element is resolved
with respect to the MPD level BaseURL element, wherein the group
level BaseURL element is resolved with respect to the period level
BaseURL element, wherein the representation level BaseURL element
is resolved with respect to the group level BaseURL element.
12. The server of claim 9, wherein identical segments are
accessible at multiple locations indicated by URLs resolved with
respect to the respective BaseURL elements, and a first BaseURL
element among the BaseURL elements is used as a base Universal
Resource Indicator (URI).
13. The server of claim 9, wherein the request is sent using an
HTTP GET method.
14. The server of claim 9, wherein the metadata comprises a range
attribute, and wherein the request comprises a request for bytes of
a resource indicated by the URL that are designated by the range
attribute.
15. The server of claim 9, wherein the URL is an absolute URL or a
relative URL.
16. The server of claim 9, wherein the metadata selectively
comprises a sourceURL attribute of the segment, and wherein, when
the metadata selectively comprises the sourceURL attribute of the
segment, a BaseURL element among the BaseURL elements is mapped to
the sourceURL attribute, so that the URL is generated.
Description
TECHNICAL FIELD
[0001] The present invention relates to a technology for providing
streaming content, and more particularly, to an apparatus and
method for providing media content using adaptive streaming.
BACKGROUND ART
[0002] Streaming is one of schemes for transmitting and playing
back multimedia content such as sounds, moving images, and the
like. A client may play back content while receiving the content
through the streaming.
[0003] An adaptive streaming service refers to providing a
streaming service employing a communication scheme with a request
of a client and a response of a server in response to the
request.
[0004] The client may request a media sequence suitable for an
environment of the client (for example, a transmission channel of
the client), using the adaptive streaming service. The server may
provide a media sequence matched to the request of the client among
media sequences with various qualities that are included in the
server.
[0005] The adaptive streaming service may be provided based on
various protocols.
[0006] A Hypertext Transfer Protocol (HTTP) adaptive streaming
service refers to an adaptive streaming service provided based on
an HTTP protocol. A client of the HTTP adaptive streaming service
may receive content from a server using the HTTP protocol, and may
transmit a request associated with a streaming service to the
server.
DISCLOSURE OF INVENTION
Technical Goals
[0007] An aspect of the present invention provides an apparatus and
method that may interpret a Uniform Resource Locator (URL) of a
segment using one or more BaseURL elements during playback of
content.
[0008] Another aspect of the present invention provides an
apparatus and method that may generate a URL of a segment by
mapping a BaseURL element among one or more BaseURL elements to a
sourceURL attribute of the segment.
Technical Solutions
[0009] According to an aspect of the present invention, there is
provided a method for providing media, the method including:
receiving metadata of media, the metadata including one or more
BaseURL elements; sending a request for a segment of the media
using a Uniform Resource Locator (URL) of the segment, the URL
being resolved with respect to a BaseURL element; receiving the
segment; and decoding and rendering data of the media that is
included in the segment.
[0010] The request may be sent using an HTTP GET method.
[0011] The metadata may include a range attribute.
[0012] The request may include a request for bytes of a resource
indicated by the URL that are designated by the range
attribute.
[0013] The URL may be an absolute URL or a relative URL.
[0014] Identical segments may be accessible at multiple locations
indicated by URLs resolved with respect to the respective BaseURL
elements.
[0015] A first BaseURL element among the BaseURL elements may be
used as a basic Universal Resource Indicator (URI), and BaseURL
elements other than the first BaseURL element may be used as
alternative BaseURL elements.
[0016] The metadata may selectively include a sourceURL attribute
of the segment. When the metadata selectively includes the
sourceURL attribute of the segment, a BaseURL element among the
BaseURL elements may be mapped to the sourceURL attribute, so that
the URL may be generated.
[0017] The metadata may be a Media Presentation Description (MPD)
of the media.
[0018] The media may include a sequence of one or more periods.
[0019] A BaseURL element may include one or more MPD level BaseURL
elements of the MPD, and one or more period level BaseURL elements
of the periods.
[0020] A URL of a segment included in each of the periods may be
resolved with respect to a period level BaseURL element.
[0021] The period level BaseURL elements may be resolved with
respect to the MPD level BaseURL elements.
[0022] Each of the periods may include one or more groups.
[0023] The BaseURL element may further include one or more group
level BaseURL elements of the groups.
[0024] A URL of a segment included in each of the groups may be
resolved with respect to a group level BaseURL element.
[0025] The group level BaseURL elements may be resolved with
respect to the period level BaseURL elements.
[0026] Each of the groups may include one or more
representations.
[0027] Each of the representations may be a structured collection
of one or more components of the media within a period.
[0028] The BaseURL element may further include one or more
representation level BaseURL elements of the representations
[0029] A URL of a segment included in each of the representations
may be resolved with respect to a representation level BaseURL
element.
[0030] The representation level BaseURL elements may be resolved
with respect to the group level BaseURL elements or the period
level BaseURL elements.
[0031] According to another aspect of the present invention, there
is provided a terminal, including: an access engine to receive
metadata of media, to send a request for a segment of the media
using a Uniform Resource Locator (URL) of the segment, to receive
the segment, and to decode data of the media that is included in
the segment, the metadata including one or more BaseURL elements,
and the URL being resolved with respect to a BaseURL element; and a
media engine to receive the data of the media from the access
engine, and to output the media.
Effect of the Invention
[0032] According to embodiments of the present invention, it is
possible to interpret a Uniform Resource Locator (URL) of a segment
using one or more BaseURL elements during playback of content.
[0033] Additionally, according to embodiments of the present
invention, it is possible to generate a URL of a segment by mapping
a BaseURL element among one or more BaseURL elements to a sourceURL
attribute of the segment.
BRIEF DESCRIPTION OF DRAWINGS
[0034] FIG. 1 is a diagram illustrating categories of signaling
information according to an embodiment of the present
invention.
[0035] FIG. 2 is a diagram illustrating categories of signaling
information according to an embodiment of the present
invention.
[0036] FIG. 3 is a diagram illustrating a hierarchy of content
division and levels of signaling information according to an
embodiment of the present invention.
[0037] FIG. 4 is a diagram illustrating detection of virtual
boundaries in a Moving Picture Experts Group-2 Transport Stream
(MPEG-2 TS) according to an embodiment of the present
invention.
[0038] FIG. 5 is a diagram illustrating a configuration of a
terminal 100 according to an embodiment of the present
invention.
[0039] FIG. 6 is a diagram illustrating a configuration of a
terminal 100 according to an embodiment of the present
invention.
BEST MODE FOR CARRYING OUT THE INVENTION
[0040] Reference will now be made in detail to embodiments of the
present invention, examples of which are illustrated in the
accompanying drawings, wherein like reference numerals refer to the
like elements throughout. The embodiments are described below in
order to explain the present invention by referring to the
figures.
[0041] A Dynamic Adaptive Streaming over HTTP (DASH) may specify
formats that enable 1) delivery of media content from an HTTP
server to an HTTP client, and enable 2) caching of content by
standard HTTP cashes.
[0042] A media component may be an encoded version of individual
media types, such as audios, videos, or timed texts with specific
attributes, for example bandwidths, languages, or resolutions.
[0043] Media content may be a set of media components having a
common timeline, for example audios, videos, or timed texts.
Additionally, media components may have relationships on how the
media components may be presented (for example, individually,
jointly, or mutually exclusive) as programs or movies.
[0044] Media content and content may be used as interchangeable
terms.
[0045] A media presentation (or media) may be a structured
collection of data used to establish bounded or unbounded
presentation of media content including components of continuous
media.
[0046] In other words, the media presentation may be a structured
collection of data that is accessible to a DASH client in order to
provide a streaming service to a user.
[0047] A Media Presentation Description (MPD) may be a formalized
description for a media presentation.
[0048] The media presentation may be described by an MPD including
possible updates of the MPD.
[0049] Content may be content on demand, or live content.
[0050] The content may be divided into one or more intervals. In
other words, the content may include one or more intervals.
[0051] Intervals may be interchangeable with periods. The term
period may be used as a term of 3.sup.rd Generation Partnership
Project (3GPP) adaptive HTTP streaming
[0052] A period may be an interval of a media presentation. A
continuous sequence of all periods may constitute the media
presentation.
[0053] In other words, the media presentation may include a
sequence of one or more periods.
[0054] One or more intervals may be a basic unit. One or more
intervals may be described by signaling metadata. In other words,
metadata may describe each of the one or more intervals.
[0055] The metadata may be an MPD.
[0056] The MPD may define a format to announce resource identifiers
for segments. The MPD may provide a context for identified
resources within the media presentation. The resource identifiers
may be HTTP-Uniform Resource Locators (URLs). URLs may be
restricted by a byte range attribute.
[0057] Each interval may be divided into fragments.
[0058] Fragments may be interchangeable with segments. The term
segment may be used as a term of 3GPP adaptive HTTP streaming
[0059] A segment may refer to an entity body of a response to an
HTTP/1.1 GET request for an HTTP-URL, for example as defined in RFC
2616, (or a GET request for a part indicated by a byte range).
[0060] A terminal may play back media content using received bytes
(namely, a segment).
[0061] A sub-segment may refer to a smallest unit within segments
that may be indexed by a segment index at the segment level.
[0062] Two or more sets of fragments corresponding to a single
interval may exist. Each of the sets may be called an
alternative.
[0063] An alternative may be interchangeable with a representation
(or an expression).
[0064] Each period may include one or more groups.
[0065] Each group may include one or more representations of the
same media content.
[0066] A representation may refer to a structured collection of one
or more media components within a single period. A representation
may be one of alternative choices of the media content or a subset
of the media content typically differing by the encoding choice,
for example by a bitrate, a resolution, a language, a codec, and
the like.
[0067] An MPD (or an MPD element) may provide descriptive
information that enables a client to select one or more
representations.
[0068] A Random Access Point (RAP) may be a specific location in a
media segment. The RAP may be identified as a location in which
playback may be started continuously from a location of the RAP
using only information included in a media segment.
[0069] Each representation may be formed of one or more segments.
In other words, a representation may include one or more
segments.
[0070] An MPD may be a document including metadata required to a
DASH client to form appropriate HTTP-URLs in order to 1) access
segments and to 2) provide a user with a streaming service. The
HTTP-URLs may be absolute or relative.
[0071] The MPD may be an Extensible Markup Language
(XML)-document.
[0072] The MPD may include an MPD element. The MPD may include only
a single MPD element.
[0073] FIG. 1 is a signal flowchart illustrating a content
processing method according to an embodiment of the present
invention.
[0074] A terminal 100 may be a DASH client.
[0075] The DASH client may be compatible with a client specified in
RFC 2616.
[0076] The DASH client may typically use an HTTP GET method or an
HTTP partial GET method, as specified in RFC 2616, to access
segments or parts of segments.
[0077] A server 110 may perform hosting on DASH segments. The
server 110 may be compatible with a server specified in RFC
2616.
[0078] In operation 120, the terminal 100 may receive metadata of
media (or content) from the server 110. In other words, the server
110 may transmit the metadata of the media to the terminal 100.
[0079] The metadata may include BaseURL elements. One or more
BaseURL elements may be provided.
[0080] In operation 130, the terminal 100 may process the received
metadata. In operation 130, the terminal 100 may extract
information provided by the metadata, or information included in
the metadata.
[0081] In operations 140 through 150, the terminal 100 may access a
segment of the media based on the information provided by the
metadata.
[0082] Each period may include one or more groups, and each of the
groups may include one or more representations of the media. Each
of the representations may include one or more segments.
[0083] The metadata may describe a group element describing each of
the groups.
[0084] In operation 140, the terminal 100 may send a request for a
segment of the media to the server 110 using a URL of the segment.
The URL may be resolved with respect to one of the above-described
BaseURL elements. For example, the URL of the segment may be
generated based on a BaseURL element.
[0085] The terminal 100 may request the server 110 to transmit a
segment suitable for a specific interval based on the processed
metadata. In other words, the requested segment may be selected
based on the metadata. The request may be sent using the HTTP GET
method.
[0086] The metadata may include a range attribute. The request may
include a request for bytes of a resource indicated by a URL that
are designated by the range attribute.
[0087] The URL of the segment may be an absolute URL or a relative
URL.
[0088] Identical segments may be accessible at multiple locations
indicated by URLs resolved with respect to the respective BaseURL
elements. In other words, identical segments may be selectively
accessible by the URLs provided by the BaseURL elements.
[0089] Additionally, a first BaseURL element among the BaseURL
elements may be used as a basic Universal Resource Indicator (URI),
and BaseURL elements other than the first BaseURL element may be
used as alternative BaseURL elements.
[0090] The metadata may selectively include a sourceURL attribute
of the segment. When the metadata selectively includes the
sourceURL attribute of the segment, a BaseURL element among the
BaseURL elements may be mapped to the sourceURL attribute, so that
the URL of the segment may be generated.
[0091] In operation 145, in several cases, the server 110 may need
to parse content (for example, a Moving Picture Experts Group
(MPEG) layer 4 (MP4) file for Scalable Video Coding (SVC)), and may
extract a data part suitable for the requested segment.
[0092] In operation 150, the server 110 may transmit, to the
terminal 100, segments suitable for each request from the terminal
100. The terminal 100 may receive the segments from the server.
[0093] In operation 160, the terminal 100 may perform decoding and
rendering on data of the media included in the segment, to play
back the media.
[0094] The terminal 100 may play back the media using the received
segments by repeating operations 120 through 160.
[0095] Here, a BaseURL element may include an MPD level BaseURL
element of an MPD, and a period level BaseURL element of each
period. The period level BaseURL element may refer to a BaseURL
element applied to a period to which the period level BaseURL
element belongs. In other words, a URL of a segment included in
each period may be resolved with respect to the period level
BaseURL element.
[0096] One or more MPD level BaseURL elements may be provided, and
one or more period level BaseURL elements may be provided.
[0097] Additionally, the BaseURL element may further include a
group level BaseURL element of a group. A URL of a segment included
in each group may be resolved with respect to the group level
BaseURL element. One or more group level BaseURL elements may be
provided.
[0098] The BaseURL element may further include a representation
level BaseURL element of a representation. A URL of a segment
included in each representation may be resolved with respect to the
representation level BaseURL element.
[0099] A BaseURL element of a specific level may be resolved with
respect to a BaseURL element of a higher level. For example, a
period level BaseURL element may be resolved with respect to an MPD
level BaseURL element. A group level BaseURL element may be
resolved with respect to a period level BaseURL element. A
representation level BaseURL element may be resolved with respect
to a group level BaseURL element or a period level BaseURL
element.
[0100] FIG. 2 is a diagram illustrating categories of signaling
information according to an embodiment of the present
invention.
[0101] The signaling information (namely, metadata) may be divided
into the following categories 1) through 4):
[0102] 1) General information 210: includes common description of
content, and general description of each interval, such as a
duration, and a start time.
[0103] 2) Quality of Service (QoS) information 220: describes
characteristics of each alternative, such as a bitrate, a
resolution, and a quality. In other words, the QoS information
describes characteristics of each of alternatives of content.
[0104] An alternative may be physical (namely, created in advance),
or may be virtual (namely, to be created on the fly). Based on
information of alternatives, the client may select a fragment of an
appropriate alternative. Accordingly, adaptivity to contexts of
terminals and networks may be supported.
[0105] 3) Mapping information 230: describes locations to retrieve
content. Depending on specific cases, different alternatives may
have the same or different locations.
[0106] 4) Client request 240: this type of signaling information
may conform to a format of HTTP 1.1 request message. As shown in
FIG. 1, parameters requested by the client may be derived from the
information of categories 1) through 3).
[0107] FIG. 3 is a diagram illustrating a hierarchy of content
division and levels of signaling information according to an
embodiment of the present invention.
[0108] Signaling of metadata according to an embodiment of the
present invention may be physically separated into content-level
information 310, interval-level information 320, QoS information
330, and mapping information 340. Linking of related parts of the
content-level information 310, the interval-level information 320,
the QoS information 330, and the mapping information 340 may be
performed by reference.
[0109] These parts of signaling information may be combined in
different ways to support the flexibility.
[0110] For example, when only the content-level information 310 and
interval-level information 320 are sent to a client, all
computations for deciding alternatives and resolving locations may
be performed by a server. Accordingly, when only the content-level
information 310 and interval-level information 320 are sent to the
client, a processing model may be "server-based."
[0111] When the content-level information 310, the interval-level
information 320, and the QoS information 330 are sent to the
client, all computations for deciding alternatives and resolving
locations may be distributed and performed by the client and the
server. Accordingly, when the content-level information 310, the
interval-level information 320, and the QoS information 330 are
sent to the client, the model may be "distributed."
[0112] When all the signaling information (namely, the
content-level information 310, the interval-level information 320,
the QoS information 330, and the mapping information 340) is sent
to the client, the model may be client-based, because most (or all)
processing (namely, computations for deciding alternatives and
resolving locations) is performed by the client.
[0113] The separation of metadata parts may enable efficiency in
storage and delivery. For example, during a session, metadata of
the content-level information 310 may be sent once, and only the
interval-level information 320 may be periodically updated.
Similarly, a single file containing the QoSInfo 330 may be used for
different intervals and different contents.
[0114] There are different ways to represent a set of metadata, for
example XML, pseudo-code, a Session Description Protocol (SDP), and
the like.
[0115] In embodiments of the present invention, both XML and
pseudo-code may be used to represent signaling syntax. XML syntax
may be used for clients that support an XLM profile based on an
MPEG-21 and similar schema. On the other hand, pseudo-code syntax
may be based on a "language" of an International Organization for
Standardization (ISO) base media file format and the like, and may
be used in non-XML clients. In particular, a profile of the
pseudo-code syntax may employ a parsing module similar to a parsing
module of file-format parsing. To design the syntax for the above
purpose, a table of common elements may be defined and elements in
each format may be represented, in the following embodiments.
[0116] The proposed syntax may be represented by any other
languages.
[0117] Hereinafter, tables of general syntax elements will be
described.
[0118] In the following tables, the syntax elements may be
described hierarchically. In other words, a child element may be
shown in a column on the right of a parent element of the child
element. "Leaf"-level elements may be shown in italic, and parent
elements may be shown in bold letter. A parent element may be
represented by a corresponding XML type and a file box.
[0119] In the column of occurrence, "0 . . . N" may mean that the
number of instances of an occurrence element may be from 0 to
"unbounded." The minimum occurrence of 0 may mean that an element
may be optional (namely, not present). A minimum occurrence equal
to or greater than 1 may mean that an element is mandatory in the
syntax.
[0120] Occurrence may be interchangeable with cardinality.
[0121] In a row of type, A indicates an attribute, and E indicates
an element. In a row of optionality, M indicates mandatory, and O
indicates optional. For attributes, M indicates mandatory, O
indicates optional, OD indicates optional with default value, and
CM indicates conditionally mandatory. For elements, elements may be
represented as <minOccurs> . . . <maxOccurs>. Here, N
may be unbounded.
[0122] The above meaning may equally be applied to other tables in
the present specification.
[0123] Additionally, QoSlnfo may be also called AdaptationInfo to
make QoSlnfo more specific. Moreover, a few elements may be revised
to increase flexibility of the syntax.
[0124] The following Table 1 describes general information.
TABLE-US-00001 TABLE 1 XML syntax Pseudo-code (based on (File
format Occurrence Semantics MPEG-21) box) HttpStr 1 Describes the
top-level element of HttpStreamingType `htps` signaling metadata
for HTTP streaming box GeneralInfo 0 . . . N Contains the general
information of GeneralInfoType `geni` the described content box
TimeScale 0 . . . 1 Describes the number of time units integer
unsigned in 1 second. int(32) This value is used with time-related
elements, when a time unit is not specified. LiveStartTime 0 . . .
1 If LiveStartTime element is not dateTime unsigned present, the
content is of VoD type. int(64) The presence of LiveStartTime
element indicates a live content that is to be displayed at a time
value of LiveStartTime. If LiveStartTime has a time value of 0, the
display time is unknown. Duration 0 . . . 1 If present, indicates
duration of the integer unsigned content. Otherwise, the duration
is int(32) unknown. DefaultIntDuration 0 . . . 1 If present,
indicates a default integer unsigned duration of each interval of
the int(32) content. MinUpdateTime 0 . . . 1 If present, indicates
the minimum integer unsigned waiting time before requesting the
int(32) main description file again. ConsistentQoSInfo 0 . . . 1 If
true, indicates that QoS boolean flag of information is the same as
the the box whole content duration. DefaultContentLoc 0 . . . 1
Provides a default location for the anyURI string content. of URL
IntervalsRef 0 . . . N Provides reference to description Intervals
`iref` containing one or more instances of RefType box Interval
element. One or more instances of Interval element represent a
sequence of consecutive interval(s). Interval 0 . . . N Provides
information of an interval IntervalType `intv` of content. box The
information of the interval may be either included as an instance
of Interval element or referenced by IntervalsRef element.
IntervalInfo 0 . . . 1 Provides general information of an
IntervalInfoType `inti` interval. box QoSInfoRef 0 . . . 1 Provides
reference to description dia:ReferenceType `qref` represented by
QoSInfo element. box If QosInfoRef element is present, QoSInfo
element may not be present at the same level. QoSInfo 0 . . . 1
Provides information about QoSInfoType `QoSi` alternatives of
content, such as box resource characteristics and quality/utility.
If QoSInfo element is present, QoSInfoRef element may not be
present. MappingInfoRef 0 . . . 1 Provides reference to description
dia:ReferenceType `mref` represented by MappingInfo box element. If
MappingInfoRef element is present, MappingInfo element may not be
present at the same level. MappingInfo 0 . . . 1 Provides
information about MappingInfoType `mapi` locations of content
alternatives. box If the information is not provided,
DefaultContentIntLoc element (if not, DefaultContentLoc) can be
used to retrieve content. If MappingInfo element is present,
MappingInfoRef element may not be present. NextIntervalsRef 0 . . .
1 Provides reference to information of Intervals `nref` next
interval(s). RefType, may box The information of next interval(s)
be extended from is description containing one or dia:ReferenceType
more instances of Interval element. The information of next
interval(s) is description represented by Interval element. Using
NextIntervalsRef element, the client does not need to reload the
main description represented by HttpStr element. Within the current
time window, only the final interval may contain NextIntervalsRef
element. PreviousIntervalsRef 0 . . . 1 Provides reference to
information of Intervals `nref` previous interval(s). RefType, may
box The information of next interval(s) be extended from is
description containing one or dia:ReferenceType more instances of
Interval element. The information of next interval(s) is
description represented by Interval element. Using
PreviousIntervalsRef element, the client does not need to reload
the main description represented by HttpStr element. Within the
current time window, only the first interval may contain
NextIntervalsRef element.
[0125] The following Table 2 describes IntervalsRef,
NextlntervalsRef, PreviouslntervalsRef, QoSlnfoRef, MappinglnfoRef
a, and IntervalInfo.
TABLE-US-00002 TABLE 2 XML syntax Pseudo- (based on code (File
Occurrence Semantics MPEG-21) format box) QoSInfoRef, Index 1
Indicates the order (starting from 1) not unsigned MappingInfoRef
of the referenced description or box applicable int(8) (Interval,
QoSInfo, MappingInfo) in the description file referenced by the
next Location element. Location 1 Provides reference to description
uri element in string represented by Interval, QoSInfo, or
dia:ReferenceType (representing MappingInfo. url) IntervalInfo
TimeScale 0 . . . 1 Describes the number of time units in integer
unsigned one second. This value is used with int(32) time-related
elements, when a time unit is not specified. TimeScale element, if
present, overrides the time scale provided by GeneralInfo.
StartTime 0 . . . 1 Indicates the start time of the interval.
Duration 0 . . . 1 Indicates the duration of the interval. integer
unsigned int(32) DefaultFragDuration 0 . . . 1 Indicates the
default duration of integer unsigned fragments of the interval
(except the int(32) last fragment). DefaultContentIntLoc 0 . . . 1
Provides a default location for the anyURItype string content
interval. Last 0 . . . 1 If true, indicates the final interval of
boolean by flag the content. IntervalsRef, startTime Indicates the
start time of the xs:duration PreviousIntervalsRef referenced
sequence of NextIntervalRef intervals/periods relative to the start
time of the content (LiveStartTime for live content and 0 for
on-demand content). AvailableTime 0 . . . 1 Indicates the time the
description of integer unsigned the next interval is available. The
is int(32) the relative time from the start time of the content.
Index 1 Indicates the order (starting from 1) not unsigned of the
referenced interval description applicable int(8) (or box) in the
description file referenced by the next Location element. Location
1 Provides reference to description file sx:anyURI string that
contains Interval descriptions. type or (representing uri element
in url) dia:ReferenceType
[0126] The following Table 3 describes the QoSlnfo element.
TABLE-US-00003 TABLE 3 XML syntax Pseudo-code (based on (File
format Occurrence Semantics MPEG-21) box) QoSInfo 1 Provides
information about a list of QoSInfoType `QoSi` content
alternatives, such as containing a box resource characteristics and
UtilityFunction of quality/utility. dia:AdaptiveQoSType
ClassSchemeRef 0 . . . 1 Provides a list of classification
dia:Description `csmr` schemes. MetadataType box The classification
schemes provide semantics for some terms or names. scheme 1 . . . N
Provides reference to a Attr. alias & href a url classification
scheme. in dia:Description string MetadataType (1) Resource 0 . . .
N Each instance of Resource element Element constraint `resi`
describes, for a list of alternatives, of dia:UF box characteristic
values of a certain DataType resource type (e.g., bitrate). in DIA
Utility- FunctionType (2) AdaptationOperator 0 . . . N Each
instance of dia:UF `adpo` AdaptationOperator element DataType box
describes, for a list of alternatives, values of a certain
adaptation type (e.g., remove temporal layers). (3) Utility 0 . . .
N Each instance of Utility element dia:UF `util` describes, for a
list of alternatives, DataType box values in a certain
quality/utility type (e.g., MOS). UtilityRank 0 . . . 1 Describes
the quality ranking for a dia:UtilityRank `utir` list of
alternatives. Type box Value 1 . . . N Indicates the
quality/utility rank integer unsigned of an alternative. int(16)
The number of instances of Value element is equal to the number of
alternatives.
[0127] The following Table 4 shows common semantics of (1)
Resource, (2)
[0128] AdaptationOperator, and (3) Utility of Table 3.
TABLE-US-00004 TABLE 4 XML syntax Pseudo-code (based on (File
format Occurrence Semantics MPEG-21) box) QoSInfo 1 Provides
information about a list QoSInfoType `QoSi` of content
alternatives, such as containing a box resource characteristics and
UtilityFunction of quality/utility. dia:AdaptiveQoSType
ClassSchemeRef 0 . . . 1 Provides a list of classification
dia:Description `csmr` schemes. MetadataType box The classification
schemes provide semantics for some terms or names. scheme 1 . . . N
Provides reference to a Attr. alias & href a url classification
scheme. in dia:Description string MetadataType (1) Resource 0 . . .
N Each instance of Resource element Element constraint `resi`
describes, for a list of alternatives, of dia:UF box characteristic
values of a certain DataType resource type (e.g., bitrate). in DIA
Utility- FunctionType (2) AdaptationOperator 0 . . . N Each
instance of dia:UF `adpo` AdaptationOperator element DataType box
describes, for a list of alternatives, values of a certain
adaptation type (e.g., remove temporal layers). (3) Utility 0 . . .
N Each instance of Utility element dia:UF `util` describes, for a
list of alternatives, DataType box values in a certain
quality/utility type (e.g., MOS). UtilityRank 0 . . . 1 Describes
the quality ranking for a dia:UtilityRankType `utir` list of
alternatives. box Value 1 . . . N Indicates the quality/utility
rank of integer unsigned an alternative. int(16) The number of
instances of Value element is equal to the number of alternatives.
Element XML syntax Pseudo-code (1), (2), (based on (File format (3)
Semantics MPEG-21) box) Name 1 Describes an identifier for a Att.
unsigned certain type of the element. `iOPinRef`, ref. int(32) When
the identifier is not a CS term semantically defined by the above
embodiment, next three elements are used to find semantics of the
identifier in a classification scheme. CSref_ind 0 . . . 1
Indicates the reference index of a Not unsigned classification
scheme in the list applicable int(16) provided by ClassSchemeRef
element. LevelNum 0 . . . 1 Indicates the number of levels. Not
unsigned applicable int(16) LevelIndex 1 . . . N Each instance of
LevelIndex Not unsigned element represents an index value
applicable int(16) at a level of the classification scheme. Value 1
. . . N Indicates the value of a resource a component in unsigned
type (adaptation operator, or dia:VectorDataType int(32) utility)
of an alternative. The number of instances of Value element is
equal to the number of alternatives.
[0129] The following Table 5 shows mapping information.
TABLE-US-00005 TABLE 5 XML syntax (based Pseudocode on (File MPEG-
format Occurrence Semantics 21) box) Mapping 0 . . . 1 Mapping
`mapi`box Info InfoType AlterLoc 0 . . . 1 Provides a location ID
for each dia:Integer `aloc` ID alternative described in QoSInfo.
Vector box If AlterLocID element is not Type present, the first
location in the location list may be used for all alternatives.
Value 1 . . . N Indicates a location ID for an integer unsigned
alternative. int(16) The number of instances of this element is
equal to the number of alternatives. The n.sup.th instance of Value
element corresponds to the n.sup.th alternative of QoSInfo
description. ReqQoS 0 . . . N Indicates a parameter of QoSInfo
ReqQoS `reqp` Para that may be put in the request (for ParaType box
an alternative) sent by the client to that the server. extends A
parameter may be an instance of dia:Boolean Resource,
AdaptationOperator, Vector Utility, or UtilityRank elements. Type
RefIndex 1 Indicates instance index/reference represented unsigned
in the instance list of Resource, by int(16) AdaptationOperator,
Utility and attribute UtilityRank elements. `iOPinRef` that
references an IOPin in QoSInfo All 1 If true, the parameter needs
to be boolean flag requested for all alternatives and ReqFlag may
be skipped. ReqFlag 0 . . . N Each instance of ReqFlag element
component unsigned corresponds to an alternative. of int(8) If
ReqFlag is true, the request for Boolean the corresponding
alternative has VectorType the parameter identified above. Location
1 Provides a list of locations for Location `locl` List retrieving
content alternatives ListType box Location 1 . . . N Provides
information of a location Location `loca` Type box
[0130] Semantics of Location elements may be further provided as
shown in Table 6.
TABLE-US-00006 TABLE 6 XML syntax (based Pseudocode Occurrence
Semantics on DIA) (MP4) Location 0 . . . N Location `loca` Type box
LocID 1 Indicate an ID of an instance of integer unsigned Location
element. Location int(16) element is referred to by AlterLocID.
StrLocation 0 . . . N Provides location information StrLocation
`stlo` of a stream of a content Type box interval. Each stream is
provided by either a stream URL or a number of fragment URLs.
FragNum 0 . . . 1 Provides the number of integer unsigned fragments
int(16) Stream 0 . . . 1 Describes a URL of a stream anyURI string
Url type Fragment 0 . . . N Describes a URL of a fragment. anyURI
string Url The number of instances of type FragmentUrl element is
the number of fragments. FragTime 0 . . . 1 Provides durations of
dia:Vector `frtm` fragments. DataType box Value 1 . . . N Indicates
a duration of a integer unsigned fragment. int(32) The number of
instances of Value element is the number of fragments. RandAccess 0
. . . 1 Describes fragments that dia:Vector `rdac` support random
access. DataType box Value 1 . . . N Indicates the order of a
random- integer unsigned access fragment. int(16) MP2TS 0 . . . 1
Describes additional parameters MP2TS `mp2p` Para (beside URL) for
locating a ParaType box content/program in a MPEG-2 TS. PID 0 . . .
N Describes values of PIDs of integer unsigned content/program in a
MPEG-2 int(16) TS. FragBoundaries 0 . . . 1 Describes boundaries of
FragBoundaries `frbd` (virtual) fragments in a stream. Type box The
number of instances of FragBoundaries element is equal to the
number of fragments of the stream. Only one type of following
elements is present in a FragBoundaries instance. MP2TS 0 . . . N
Describes parameters for MP2TS `mp2b` Boundary detecting (virtual)
fragment Boundary box boundary in a MPEG-2 TS. Type If there are
two instances of MP2TSBoundary element, the two instances are
starting and ending boundaries of a fragment. If there is only one
instance of MP2TSBoundary element, the instance is the starting
boundary. The ending boundary is right before the starting boundary
of the next fragment. ISOFile 1 . . . 2 Describes parameters for
ISOFile `isfb` Boundary detecting (virtual) fragment Boundary box
boundary in a file based on ISO Type base media file format. If
there are two instances of ISOFileBoundary element, the two
instances are starting and ending boundaries of a fragment. If
there is only one instance of ISOFileBoundary element, the instance
is the starting boundary. The ending boundary is right before the
starting boundary of the next fragment. ByteRanges 1 Describes byte
ranges that ByteRanges `brag` identify a part/fragment of a Type
box file. Parameters provided by ByteRanges element may be used for
byte range options in an HTTP request.
[0131] Semantics of MP2TSBoundary, ISOFileBoundary, and ByteRanges
may be further provided as shown in Table 7.
TABLE-US-00007 TABLE 7 XML syntax Pseudo- (based on code Occurrence
Semantics DIA) (MP4) MP2TSBoundary MP2TSBoundaryType `mp2b` box
PCR_PID 1 Describes PID carrying PCR of the integer unsigned
concerned content/program. int(16) PCR_base 1 Describes a value of
a PCR base long unsigned field int(40) PCR_ext 1 Describes a value
of a PCR integer unsigned extension field. int(16) Appearance 1
Describes the appearance order integer unsigned (e.g., 1.sup.st,
2.sup.nd) of the TS packet int(16) containing the PCR value
identified by the above two elements. If there is
resetting/discontinuity of PCR, a PCR value may appear more than
once during an interval. Media_PID 1 . . . N Describes PID of a
media (e.g., integer unsigned video) of a program. int(16) The
number of instances of Media_PID element is equal to the number of
media of the program/content. Media_Offset 1 . . . N Describes the
offset (in TS packets integer unsigned of the same media PID) from
the int(16) above identified PCR packet to the first media packet
of a fragment. The n.sup.th instance of Media_Offset is associated
with the n.sup.th instance of Media_PID. ISOFileBoundary
ISOFileBoundaryType `isfb` box SequenceNo 1 Describes the sequence
number integer unsigned provided in the mfhd box. int(16) The mfhd
box defines a fragment of an MP4 file. SequenceNo with a value of 0
indicates the beginning of the file. ByteRanges ByteRangesType
`brag` box Start 1 . . . N Describes the starting value of a
integer unsigned byte range. int(32) A value of `-1` means that
this value in the HTTP request is missing. End 1 . . . N Describes
the ending value of a byte integer unsigned range. int(32) A value
of `-1` means that this value in the HTTP request is missing.
Start-End instances are present in pair. The n.sup.th instance of
End is associated with the n.sup.th instance of Start. Media_PID 0
. . . N Describes PID of a media (e.g., integer unsigned video)
that needs to be extracted int(16) from the byte range of the above
pair of Start-End. Media_PID element is used when the byte range is
a segment of MPEG-2 TS, and all PIDs do not need to be
delivered.
[0132] Hereinafter, a client request will be described.
[0133] Signaling of metadata obtained by a client may include
different parts or levels of signaling information. Accordingly, a
request from the client to a server may include parameters of
different levels of details.
[0134] Main parameters of the client may be URIs, and may be
associated with a query part.
[0135] Three main scenarios are examined as follows:
[0136] 1) Server-based scenario
[0137] In the server-based scenario, the metadata provided from the
server to the client may include general content information 310
and general interval information 320.
[0138] For an URI of requested content, DefaultContentlntLoc (if
not, DefaultContentlntLoc) may be used. To enable the client to
request a specific fragment of content, the following parameters a)
and b) are defined in the query part (of the request in operation
140):
[0139] a) "fragno": Order value of the fragment in the interval
[0140] b) "fragti": Start time of the fragment in the interval
[0141] For example, a request URI may be
"HTTP://server.com/file.mp4?fragno=5."
[0142] 2) Distributed scenario
[0143] In the distributed scenario, the metadata provided from the
server to the client may include general content information 310,
general interval information 320, and QoS information 330.
[0144] In addition to the above parameters, the following
QoS-related parameters a) through c) are defined in the query part
(of the request in operation 140) to enable the client to request
an appropriate alternative:
[0145] a) "alter": Order value of an alternative. Based on the
order value of the alternative, the alternative may appear in the
QoS information.
[0146] b) "oper1", "oper2", . . . , and "operN": "operi" carries a
value of an i.sup.th adaptation operation that appears in the QoS
information.
[0147] c) "res1", "res2", . . . , "resN": "resi" carries a value of
an i.sup.th resource that appears in the QoS information.
[0148] Only one of the above three options may be used in a single
request.
[0149] With typical adaptation operators and resource types,
specific parameter names for better intelligibility and
interoperability may be defined.
[0150] Adaptation operators are as the following a) through e).
[0151] a) audiolayers: indicates the number of scalable audio
layers to be discarded.
[0152] b) temporallayers: indicates the number of temporal layers
of scalable video to be discarded.
[0153] c) spatiallayers: indicates the number of spatial layers of
scalable video to be discarded.
[0154] d) qualitylayers: indicates the number of quality layers of
scalable video to be discarded.
[0155] e) prioritylayers: indicates the number of priority layers
of scalable video to be discarded.
[0156] Resource types are as the following a) through d).
[0157] a) bitrate: indicates the average bitrate (in Kbps) of the
requested alternative.
[0158] b) vertresolution: indicates the vertical resolution of the
requested alternative.
[0159] c) horiresolution: indicates the horizontal resolution of
the requested alternative.
[0160] d) framerate: indicates the framerate of the requested
alternative.
[0161] Using the pre-defined parameters, an example of a request
URI based on the bitrate may be
"http://server.com/file.mp4?fragno=5&bitrate=550."
[0162] 3) Client-based scenario
[0163] In the client-based scenario, the metadata provided from the
server to the client may include general content, general interval
information, QoS information, and mapping information.
[0164] The QoS-related parameters used in the request may be
indicated by a ReqQoSPara part of QoSlnfo metadata. For example,
when Reflndex of ReqQoSPara is 0 or null, the "alter" parameter may
be used instead of other options.
[0165] When ReqQoSPara is not present in the QoSlnfo metadata, the
QoS-related parameters may not be used. Alternatives in this case
may be implied by locations of MappingInfo.
[0166] A URI of content may be derived from rich description of
MappingInfo. When content/program is conveyed in an MPEG-2 TS, one
or more PIDs may be used to locate the content in the stream.
[0167] When additional information for detecting fragment
boundaries are provided, the following parameters 1) through 3) for
the query part (of the request in operation 140) may be used.
[0168] 1) For an MPEG-2 TS boundary, Appearance, PCR_PID, PCR_base,
PCR_ext, Media_PID, and Media_Offset
[0169] 2) For an ISO media file boundary, SequenceNo
[0170] 3) For a file considered as a raw byte-sequence, Start and
End
[0171] Semantics of the above parameters may be provided in
semantics of a FragBoundaries element.
[0172] Start-End pairs may be used by a range header of an HTTP
request message. For example, if {(Start=0, End=99); (Start=200,
End=299)}, the header may be "Range: bytes=0-99,200-299."
[0173] Hereinafter, syntax representation in an XML format will be
described. Representations of the above syntax elements may be
provided in the XML format. Semantics of each element may be traced
back in the above Tables 1 through 7.
[0174] Several elements may be extensions of several types defined
in an MPEG-21 DIA. Several few elements may take several types
defined in the MPEG-21 DIA.
[0175] The following Table 8 shows syntax representation of
HTTPStreamingType in the XML format.
TABLE-US-00008 TABLE 8 <complexType name="HTTPStreamingType">
<complexContent> <extension
base=''dia:DIADescriptionType''> <sequence> <element
name="GeneralInfo'' type="GeneralInfoType'' minOccurs=''0''/>
<choice minOccurs=''0'' maxOccurs=''unbounded''> <element
name=''IntervalsRef'' type=''IntervalsRefType''/> <element
name=''Interval'' type=''IntervalType''/> </choice>
</sequence> </complexContent> </complexType>
[0176] The following Table 9 shows syntax representation of
GenerallnfoType in the XML format.
TABLE-US-00009 TABLE 9 <complexType name="GeneralInfoType''>
<complexContent> <extension
base=''dia:DIADescriptionType''> <sequence> <element
name="TimeScale" type="integer'' minOccurs=''0''/> <element
name="LiveStartTime" type="dateTime'' minOccurs=''0''/>
<element name="Duration" type="integer'' minOccurs=''0''/>
<element name="DefaultIntDuration" type="integer''
minOccurs=''0''/> <element name="MinUpdateTime"
type="integer'' minOccurs=''0''/> <element
name="ConsistentQoSInfo" type="boolean'' minOccurs=''0''/>
<element name="DefaultContentLoc" type="anyURI''
minOccurs=''0''/> </sequence> </extension>
</complexContent> </complexType>
[0177] The following Table 10 shows syntax representation of
IntervalRefType in the XML format.
TABLE-US-00010 TABLE 10 <complexType name="IntervalsRefType">
<complexContent> <extension base="dia:ReferenceType">
<sequence> <element name="AvaliableTime" type="integer"
minOccurs="0"/> </sequence> <attribute name="startTime"
type="xs:duration" use="optional"/> </extension>
</complexContent> </complexType>
[0178] The following Table 11 shows syntax representation of
IntervalType in the XML format.
TABLE-US-00011 TABLE 11 <complexType name="IntervalType">
<complexContent> <extension
base=''dia:DIADescriptionType''> <sequence> <element
name=''IntervalInfo'' type=''IntervalInfoType''
minOccurs=''0''/> <choice minOccurs=''0''> <element
name=''QoSInfo'' type=''QoSInfoType''/> <element
name=''QoSInfoRef'' type=''dia:ReferenceType''/> </choice>
<choice minOccurs=''0''> <element name=''MappingInfo''
type=''MappingInfoType''/> <element name=''MappingInfoRef''
type=''dia:ReferenceType''/> </choice> <element
name=''PreviousIntervalsRef'' type=''IntervalsRefType''
minOccurs=''0''/> <element name=''NextIntervalsRef''
type=''IntervalsRefType'' minOccurs=''0''/> </sequence>
</extension> </complexContent> </complexType>
[0179] The following Table 12 shows syntax representation of
IntervalInfoType in the XML format.
TABLE-US-00012 TABLE 12 <complexType
name="IntervalInfoType''> <sequence> <element
name="TimeScale" type="integer'' minOccurs=''0''/> <element
name="StartTime" type="dateTime'' minOccurs=''0''/> <element
name="Duration" type="integer'' minOccurs=''0''/> <element
name="DefaultFragDuration" type="integer'' minOccurs=''0''/>
<element name="DefaultContentIntLoc" type="anyURI''
minOccurs=''0''/> <element name="Last" type="boolean''
minOccurs=''0''/> </sequence> </complexType>
[0180] The following Table 13 shows syntax representations of
ISOFileBoundaryType and ByteRangesType in the XML format.
TABLE-US-00013 TABLE 13 <complexType
name="ISOFileBoundaryType"> <sequence> <element
name="SequenceNo" type="integer" maxOccurs=''unbounded''/>
</sequence> </complexType> <complexType
name="ByteRangesType"> <sequence maxOccurs=''unbounded''>
<element name="Start" type="integer"/> <element name="End"
type="integer"/> <element name="Media_PID" type="integer"
minOccurs=''0''/> </sequence> </complexType>
[0181] Hereinafter, syntax representation in an MP4 pseudo-code
format will be described. Representation of the above syntax
elements may be provided in the MP4 pseudo-code format.
[0182] The following Table 14 shows syntax representation of
HTTPStreamingBox in the MP4 pseudo-code format.
TABLE-US-00014 TABLE 14 HTTPStreamingBox Box Type: `htps`
Container: Signaling file Mandatory: Yes Quantity: One Aligned(8)
class HTTPStreamingBox extends Box(`htps`) { }
[0183] FIG. 4 is a diagram illustrating detecting of virtual
boundaries in an MPEG-2 TS according to an embodiment of the
present invention.
[0184] In a TS, PCR packets of a given program may be carried with
fixed PIDs (namely, PCR_PID), and may be inserted at least every
100 ms.
[0185] The PCT packets (with increasing values) may be considered
as anchor points of the program. On the other hand, each media of a
program may be carried by packets of a given PID (namely,
Media_PID).
[0186] Accordingly, a fragment boundary of a media stream may be
defined or identified by 1) a specific anchor point and 2) an
offset from the anchor to the packet at the boundary.
[0187] The offset may be counted by the packets of the same
Media_PID.
[0188] PCR values may be occasionally reset (discontinuity). For
example, when one or more PCR packets have the same PCR value in an
interval, an appearance order of PCR packets used as anchors may be
indicated.
[0189] A sourceURL attribute may be changed from required to
optional. This is because baseURL already provides a complete URL.
The sourceURL may be unnecessary.
[0190] The use of multiple byte ranges may provide flexibility in
downloading "virtual segments." For example, a segment of a low
frame rate (that is able to be used in a trickmode) may be
extracted on-the-fly from a stream or an original segment.
[0191] In addition, to support using multiple URLs for a
Representation, the following modifications may be applied to the
schema of 3GPP Adaptive HTTP Streaming
[0192] Hereinafter, multiple locations for the same
resource/content will be described.
[0193] Each level of description (a top-level, a Period level, and
a Representation level) may provide only a single BaseURL for
building absolute URLs from the description.
[0194] Multiple BaseURLs may be provided at each description level.
Multiple BaseURLs may signal availability of resources at multiple
locations.
[0195] Depending on an actual location of a client, the client may
select one or more BaseURLs in a process of retrieving
resources.
[0196] Such a modification may be implemented by different ways.
One way may be to use an additional attribute called
"morebaseURLs," or an element called "BaseURLs."
[0197] The attribute or element may be a string formed of multiple
(base) URLs. The string may be separated by several special
characters, for example ";" (namely, a semicolon and a space).
[0198] For example, when a semicolon or space appears within a URL,
the semicolon or space may be encoded by the rules of RFC 2616.
[0199] The morebaseURLs attribute (or BaseURLs element) of a lower
description level may override the same attribute (or element) of
the higher description level.
[0200] For clarity, the morebaseURLs attribute and BaseURLs element
may be restricted to be mutually exclusive. In other words, only a
single type may exist in a whole description.
[0201] Another way may be to use a MoreBaseURL element of any URI
type with multiple instances, where each instance provides a
BaseURL.
[0202] The different ways may be merely examples of ideas for
providing multiple BaseURLs. The ideas may be implemented in many
other ways or even other languages.
[0203] Hereinafter, multiple locations for resource/content
components will be described.
[0204] Resource/content may be divided into one or more
components/streams. Each of the one or more components/streams may
be delivered from a location. The delivering may be supported by
allowing multiple instances of an UrlTemplate element or a Url
element set in SegmentlnfoType. A modification "<xs:choice
maxOccurs="unbounded">" in SegmentlnfoType may be used for the
above purpose.
[0205] The appearance order of an UrlTemplate instance or a Url set
instance may indicate importance of "location/stream." A more
important location may appear before a less important location. For
example, a video Representation may include two streams (for
example, a spatial base layer, and a spatial enhancement layer).
Each of the two streams may be delivered from a location described
by UrlTemplate. Subsequently, the first instance of UrlTemplate may
be a location for the spatial base layer.
[0206] Moreover, multiple instances of InitialisationSegmentURL may
be allowed. An n.sup.th instance of InitialisationSegmentURL may
correspond to an n.sup.th instance of a location (by either the
UrlTemplate element or the Url element set).
[0207] For example, when only a single instance of
InitialisationSegmentURL exists, the instance may be used for all
locations.
[0208] The following Tables 15 through 19 show the schema of 3GPP
AdaptiveHTTPStreaming.
TABLE-US-00015 TABLE 15 <?xml version=''1.0''
encoding=''UTF-8''?> <xs:schema
targetNamespace=''urn:3GPP:ns:PSS:AdaptiveHTTPStreamingMPD:2009''
attributeFormDefault=''unqualified''
elementFormDefault=''qualified''
xmlns:xs=''http://www.w3.org/2001/XMLSchema''
xmlns=''urn:3GPP:ns:PSS:AdaptiveHTTPStreamingMPD:2009''>
<xs:annotation> <xs:appinfo>Media Presentation
Description</xs:appinfo> <xs:documentation
xml:lang=''en''> This Schema defines 3GPP Media Presentation
Description! </xs:documentation> </xs:annotation>
<!-- MPD: main element --> <xs:element name=''MPD''
type=''MPDtype''/> <!-- MPD Type --> <xs:complexType
name=''MPDtype''> <xs:sequence> <xs:element
minOccurs=''0'' name=''ProgramInformation''
type=''ProgramInformationType''/> <xs:choice
maxOccurs=''unbounded''> <xs:element name=''Period''
type=''PeriodType''/> <xs:element name=''PeriodsRef''
type=''PeriodsRefType''/> </xs:choice> <xs:element
minOccurs=''0'' name=''BaseUrls'' type=''xs:string''/>
<xs:element minOccurs=''0'' maxOccurs=''unbounded''
name=''MoreBaseUrl'' type=''xs:anyURI''/> <xs:any
namespace=''##other'' processContents=''lax'' minOccurs=''0''
maxOccurs=''unbounded''/> </xs:sequence> <xs:attribute
default=''OnDemand'' name=''type'' type=''PresentationType''/>
<xs:attribute name=''availabilityStartTime''
type=''xs:dateTime''/> <xs:attribute
name=''availabilityEndTime'' type=''xs:dateTime''/>
<xs:attribute name=''mediaPresentationDuration''
type=''xs:duration''/> <xs:attribute
name=''minimumUpdatePeriodMPD'' type=''xs:duration''/>
<xs:attribute name=''minBufferTime'' type=''xs:duration''
use=''required''/> <xs:attribute
name=''timeShiftBufferDepth'' type=''xs:duration''/>
<xs:attribute name=''baseUrl'' type=''xs:anyURI''/>
<xs:attribute name=''morebaseUrls'' type=''xs:string''/>
<xs:anyAttribute namespace=''##other''
processContents=''lax''/> </xs:complexType>
<xs:complexType name="PeriodsRefType''> <xs:sequence>
<xs:element name="Location" type="xs:anyURI''/>
</xs:sequence> <xs:attribute name=''startTime''
type=''xs:duration''/> <xs:attribute name=''duration''
type=''xs:duration''/> <xs:attribute name=''availableTime''
type=''xs:duration''/> </xs:complexType>
TABLE-US-00016 TABLE 16 <!-- Type of presentation - live or
on-demand --> <xs:simpleType name=''PresentationType''>
<xs:restriction base=''xs:string''> <xs:enumeration
value=''OnDemand''/> <xs:enumeration value=''Live''/>
</xs:restriction> </xs:simpleType> <!-- Period of a
presentation --> <xs:complexType name=''PeriodType''>
<xs:sequence> <xs:element minOccurs=''0''
name=''SegmentInfoDefault'' type=''SegmentInfoDefaultType''/>
<xs:element maxOccurs=''unbounded'' name=''Representation''
type=''RepresentationType''/> <xs:element minOccurs=''0''
name=''PreviousPeriodsRef'' type=''xs:PeriodsRef"/>
<xs:element minOccurs=''0'' name=''NextPeriodsRef''
type=''xs:PeriodsRef"/> <xs:any namespace=''##other''
processContents=''lax'' minOccurs=''0''
maxOccurs=''unbounded''/> </xs:sequence> <xs:attribute
name=''start'' type=''xs:duration''/> <xs:attribute
default=''false'' name=''segmentAlignmentFlag''
type=''xs:boolean''/> <xs:attribute default=''false''
name=''bitStreamSwitchingFlag'' type=''xs:boolean''/>
<xs:attribute default=''false'' name=''lastPeriodFlag''
type=''xs:boolean''/> <xs:anyAttribute namespace=''##other''
processContents=''lax''/> </xs:complexType> <!--
Program information for a presentation --> <xs:complexType
name=''ProgramInformationType''> <xs:sequence>
<xs:element minOccurs=''0'' name=''Title''
type=''xs:string''/> <xs:element minOccurs=''0''
name=''Source'' type=''xs:string''/> <xs:element
minOccurs=''0'' name=''Copyright'' type=''xs:string''/>
<xs:any namespace=''##other'' processContents=''lax''
minOccurs=''0'' maxOccurs=''unbounded''/> </xs:sequence>
<xs:attribute name=''moreInformationURL''
type=''xs:anyURI''/> <xs:anyAttribute namespace=''##other''
processContents=''lax''/> </xs:complexType>
TABLE-US-00017 TABLE 17 <!-- Default Segment access information
--> <xs:complexType name="SegmentInfoDefaultType">
<xs:sequence> <xs:element minOccurs="0" name="BaseUrls"
type="xs:string"/> <xs:element minOccurs="0"
maxOccurs="unbounded" name="MoreBaseUrl" type="xs:anyURI"/>
<xs:any namespace="##other" processContents="lax" minOccurs="0"
maxOccurs="unbounded"/> </xs:sequence> <xs:attribute
name="baseURL" type="xs:anyURI"/> <xs:attribute
name="morebaseUrls" type="xs:string"/> <xs:attribute
name="duration" type="xs:duration"/> <xs:attribute
name="sourceUrlTemplatePeriod" type="xs:string"/>
<xs:anyAttribute namespace="##other" processContents="lax"/>
</xs:complexType> <!-- A Representation of the
presentation content for a specific Period -->
<xs:complexType name="RepresentationType">
<xs:sequence> <xs:element name="SegmentInfo"
type="SegmentInfoType"/> <xs:element minOccurs="0"
name="ContentProtection" type="ContentProtectionType"/>
<xs:element minOccurs="0" name="TrickMode"
type="TrickModeType"/> <xs:element minOccurs="0"
maxOccurs="unbounded" name="Quality" type="QualityType"/>
<xs:any namespace="##other" processContents="lax" minOccurs="0"
maxOccurs="unbounded"/> </xs:sequence> <xs:attribute
name="bandwidth" type="xs:unsignedInt" use="required"/>
<xs:attribute default="0" name="group"
type="xs:unsignedInt"/> <xs:attribute name="width"
type="xs:unsignedInt"/> <xs:attribute name="height"
type="xs:unsignedInt"/> <xs:attribute name="lang"
type="xs:string"/> <xs:attribute name="mimeType"
type="xs:string" use="required"/> <xs:attribute
default="false" name="startWithRAP" type="xs:boolean"/>
<xs:attribute name="qualityRanking" type="xs:unsignedInt"/>
<xs:attribute name="requestPara" type="xs:string"/>
<xs:anyAttribute namespace="##other" processContents="lax"/>
</xs:complexType> <xs:complexType name="QualityType">
<xs:sequence> <xs:any namespace="##other"
processContents="lax" minOccurs="0" maxOccurs="unbounded"/>
</xs:sequence> <xs:attribute name="name"
type="xs:QualityNameType"/> <xs:attribute name="value"
type="xs:float"/> </xs:complexType>
TABLE-US-00018 TABLE 18 <xs:simpleType
name=''QualityNameType''> <xs:restriction
base=''xs:string''> <xs:enumeration value=''PSNR''/>
<xs:enumeration value=''MOS''/> <xs:enumeration
value=''ODG''/> <xs:enumeration value=''DI''/>
</xs:restriction> </xs:simpleType> <!-- Segment
access information --> <xs:complexType
name=''SegmentInfoType''> <xs:sequence> <xs:element
minOccurs=''0'' name=''BaseUrls'' type=''xs:string''/>
<xs:element minOccurs=''0'' maxOccurs=''unbounded''
name=''MoreBaseUrl'' type=''xs:anyURI''/> <xs:element
minOccurs=''0'' maxOccurs="unbounded"
name=''InitialisationSegmentURL'' type=''UrlType''/>
<xs:choice maxOccurs="unbounded"> <xs:element
minOccurs=''0'' name=''UrlTemplate'' type=''UrlTemplateType''/>
<xs:sequence> <xs:element maxOccurs=''unbounded''
name=''Url'' type=''UrlType''/> <xs:any namespace=''##other''
processContents=''lax'' minOccurs=''0''
maxOccurs=''unbounded''/> </xs:sequence> <xs:any
namespace=''##other'' processContents=''lax'' minOccurs=''0''
maxOccurs=''unbounded''/> </xs:choice>
</xs:sequence> <xs:attribute name=''baseURL''
type=''xs:anyURI''/> <xs:attribute name=''morebaseUrls''
type=''xs:string''/> <xs:attribute name=''duration''
type=''xs:duration''/> <xs:attribute name=''randAccess''
type=''xs:string''/> <xs:anyAttribute namespace=''##other''
processContents=''lax''/> </xs:complexType> <!-- A
Segment URL --> <xs:complexType name=''UrlType''>
<xs:sequence> <xs:any namespace=''##other''
processContents=''lax'' minOccurs=''0''
maxOccurs=''unbounded''/> </xs:sequence> <xs:attribute
name=''sourceURL'' type=''xs:anyURI'' use=''optional''/>
<xs:attribute name=''range'' type=''xs:string''/>
<xs:anyAttribute namespace=''##other''
processContents=''lax''/> </xs:complexType>
TABLE-US-00019 TABLE 19 <!-- A URL template -->
<xs:complexType name="UrlTemplateType"> <xs:sequence>
<xs:any namespace="##other" processContents="lax" minOccurs="0"
maxOccurs="unbounded"/> </xs:sequence> <xs:attribute
name="sourceURL" type="xs:anyURI"/> <xs:attribute name="id"
type="xs:string"/> <xs:attribute default="1"
name="startIndex" type="xs:unsignedInt"/> <xs:attribute
name="endIndex" type="xs:unsignedInt"/> <xs:anyAttribute
namespace="##other" processContents="lax"/>
</xs:complexType> <!-- Gives information about the content
protection --> <xs:complexType
name="ContentProtectionType"> <xs:sequence> <xs:element
minOccurs="0" name="SchemeInformation" type="xs:string"/>
<xs:any namespace="##other" processContents="lax" minOccurs="0"
maxOccurs="unbounded"/> </xs:sequence> <xs:attribute
name="schemeIdUri" type="xs:anyURI"/> <xs:anyAttribute
namespace="##other" processContents="lax"/>
</xs:complexType> <!-- Gives information about trick mode
--> <xs:complexType name="TrickModeType">
<xs:sequence> <xs:any namespace="##other"
processContents="lax" minOccurs="0" maxOccurs="unbounded"/>
</xs:sequence> <xs:attribute name="alternatePlayoutRate"
type="xs:string"/> <xs:anyAttribute namespace="##other"
processContents="lax"/> </xs:complexType>
</xs:schema>
[0209] FIG. 5 is a diagram illustrating a configuration of the
terminal 100 according to an embodiment of the present
invention.
[0210] A controller 510 may perform operations 130 and 140.
Specifically, the controller 510 may process the metadata of the
interval.
[0211] A transceiver 520 that selects a fragment suitable for the
interval based on the processing may perform operations 120, 140,
and 150. Specifically, the transceiver 520 may receive the metadata
of the interval of the content from the server 110, may send a
request for the fragment suitable for the interval to the server,
and may receive the fragment from the server.
[0212] FIG. 6 is a diagram illustrating a configuration of the
terminal 100 according to an embodiment of the present
invention.
[0213] The terminal 100 may include an access engine 610, and a
media engine 620.
[0214] The access engine 610 may be a DASH access engine.
[0215] The access engine 610 may receive metadata (for example, an
MPD) from the server 110.
[0216] The access engine 610 may form requests, and may issue the
formed requests to the server 110. The access engine 610 may
receive media (for example, segments or parts of the segments) from
the server 110. The access engine may request a segment of the
media using a URL of the segment.
[0217] The access engine 610 may receive segments of the media
based on information provided by the metadata. Here, each period
may include one or more groups, and each of the groups may include
one or more representations of the media. Each of the
representations may include one or more segments.
[0218] The access engine 610 may provide the media to the media
engine 620. The access engine 610 may decode data of the media
included in the segments.
[0219] An output of the access engine 610 may include media (or a
part of the media) of an MPEG container (for example, an ISO/IEC
14492-12 ISO base media file format, or an ISO/IEC 13818-2 MPEG-2
TS). Additionally, the output of the access engine 610 may include
timing information used to map internal timing of the media to a
timeline of a media presentation.
[0220] The media engine 620 may play back the provided media.
Specifically, the media engine 620 may receive data of the media
from the access engine, and may output the media. The media engine
620 may output the media using the data of the media and the timing
information that are output from the access engine 610.
[0221] Technical information according to the embodiments of the
present invention described above with reference to FIGS. 1 through
5 may equally be applied to the present embodiment. Accordingly,
further description thereof will be omitted.
[0222] The method according to the above-described embodiments of
the present invention may be recorded in computer-readable media
including program instructions to implement various operations
embodied by a computer. The media may also include, alone or in
combination with the program instructions, data files, data
structures, and the like. The program instructions recorded on the
media may be those specially designed and constructed for the
purposes of the embodiments, or they may be of the kind well-known
and available to those having skill in the computer software arts.
Examples of computer-readable media include magnetic media such as
hard disks, floppy disks, and magnetic tape; optical media such as
CD ROM disks and DVDs; magneto-optical media such as floptical
disks; and hardware devices that are specially configured to store
and perform program instructions, such as read-only memory (ROM),
random access memory (RAM), flash memory, and the like. Examples of
program instructions include both machine code, such as produced by
a compiler, and files containing higher level code that may be
executed by the computer using an interpreter. The described
hardware devices may be configured to act as one or more software
modules in order to perform the operations of the above-described
embodiments of the present invention, or vice versa.
[0223] Although a few embodiments of the present invention have
been shown and described, the present invention is not limited to
the described embodiments. Instead, it would be appreciated by
those skilled in the art that changes may be made to these
embodiments without departing from the principles and spirit of the
invention, the scope of which is defined by the claims and their
equivalents.
* * * * *
References