U.S. patent application number 13/276833 was filed with the patent office on 2012-02-09 for closed captioning language translation.
This patent application is currently assigned to Rockstar BIDCO LP. Invention is credited to William Bishop, M. Neil Harrington, Steve J. McKinnon.
Application Number | 20120033133 13/276833 |
Document ID | / |
Family ID | 39171300 |
Filed Date | 2012-02-09 |
United States Patent
Application |
20120033133 |
Kind Code |
A1 |
Bishop; William ; et
al. |
February 9, 2012 |
CLOSED CAPTIONING LANGUAGE TRANSLATION
Abstract
The present invention provides an architecture for translating
closed captioning text originally provided with a video program
from one language to another and presenting the translated closed
captioning text with the video program to a viewer. As such, the
viewers are able to receive the closed captioning text in languages
other than that used for the closed captioning originally provided
with the video program. The original closed captioning text may be
translated from one language to another by a centralized closed
captioning processor, such that the customer equipment for various
subscribers can take advantage of centralized translation services.
Once, the original closed captioning text is translated, the
translated closed captioning text may be delivered to the customer
equipment in different ways.
Inventors: |
Bishop; William; (Raleigh,
NC) ; Harrington; M. Neil; (Raleigh, NC) ;
McKinnon; Steve J.; (Cary, NC) |
Assignee: |
Rockstar BIDCO LP
New York
NY
|
Family ID: |
39171300 |
Appl. No.: |
13/276833 |
Filed: |
October 19, 2011 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
11531562 |
Sep 13, 2006 |
8045054 |
|
|
13276833 |
|
|
|
|
Current U.S.
Class: |
348/468 ;
348/E7.033 |
Current CPC
Class: |
H04N 5/445 20130101;
H04N 21/4884 20130101; H04N 7/0885 20130101; G06F 40/58 20200101;
H04N 21/4314 20130101; H04N 21/4355 20130101; H04N 21/2355
20130101; H04N 21/4312 20130101 |
Class at
Publication: |
348/468 ;
348/E07.033 |
International
Class: |
H04N 7/00 20110101
H04N007/00 |
Claims
1. A video display system, comprising: a user interface operable to
receive a user input requesting information presentation in a
user-specified format; at least one communication interface
operable to: in response to the user input, transmit a request for
information presentation in the user-specified format over a
communications network to a remote source of requested information;
and receive a video signal and at least one requested information
signal over a communications network from at least one remote
signal source, the at least one requested information signal
comprising information in the user-specified format; a video
display; and a control system operable to present the video signal
and the at least one requested information signal in the
user-specified format on the video display.
2. The video display system of claim 1, wherein the requested
information is textual information.
3. The video display system of claim 2, wherein the textual
information is closed captioning.
4. The video display system of claim 1, wherein the user-specified
format is a user-specified language.
5. The video display system of claim 1, wherein the at least one
communication interface comprises a video interface operable to
receive the video signal from a remote video signal source.
6. The video display system of claim 5, wherein the video interface
is operable to receive the video signal and the at least one
requested information signal from the remote video signal
source.
7. The video display system of claim 6, wherein the at least one
requested information signal is incorporated in the video
signal.
8. The video display system of claim 6, wherein the remote video
signal source is the remote source of the requested
information.
9. The video display system of claim 6, wherein the at least one
communication interface further comprises at least one requested
information signal interface operable to receive the at least one
requested information signal from the remote source of the
requested information, the at least one requested information
signal comprising the requested information in the user-specified
format.
10. The video display system of claim 1, wherein the control system
is operable to synchronize the requested information in the
user-specified format with video information from the video
signal.
11. The video display system of claim 10, wherein the control
system is operable to use synchronization information in the at
least one requested information signal to synchronize the requested
information in the user-specified format with the video
information.
12. The video display system of claim 10, wherein the control
system is operable to communicate with the remote source of the
requested information over the at least one requested information
signal interface to synchronize the requested information in the
user-specified format with the video information.
13. The video display system of claim 10, further comprising a
video buffer controlled by the control system to synchronize the
requested information with the video information.
14. A method of operating a video display system, comprising:
receiving a user input requesting information presentation in a
user-specified format in response to, the user input, transmitting
a request for information presentation in the user-specified format
over a communications network to a remote source of requested
information; and receiving a video signal and at least one
requested information signal over a communications network from at
least one remote signal source, the at least one requested
information signal comprising information in the user-specified
format; presenting the video signal and the requested information
in the user-specified format on a video display.
15. The method of claim 14, wherein the requested information is
textual information.
16. The method of claim 15, wherein the textual information is
closed captioning.
17. The method of claim 14, wherein the user-specified format is a
user-specified language.
18. The method of claim 14, further comprising synchronizing the
requested information in the user-specified format with video
information from the video signal.
19. The method of claim 14, comprising using synchronization
information in the at least one requested information signal to
synchronize the requested information in the user-specified format
with the video information.
20. The method of claim 14, comprising communicating with the
remote source of the requested information to synchronize the
requested information in the user-specified format with the video
information.
21. An apparatus for providing requested information in a
user-requested format system, comprising: at least one
communication interface operable to receive a request for
information presentation in a user-specified format over a
communications network from a remote user video display system; and
a control system operable to provide a requested information signal
comprising requested information in the user-specified format; the
at least one communication interface being further operable to
transmit the requested information signal to the remote user video
display system.
22. The apparatus of claim 21, wherein the control system is
operable to provide the requested information signal by generating
the requested information signal in response to the received
request.
23. The apparatus of claim 21, wherein the requested information is
textual information.
24. The apparatus of claim 21, wherein the user-specified format is
a user-specified language.
25. The apparatus of claim 24, wherein the control system is
operable to translate an information signal into the user-specified
language in response to the request.
26. The apparatus of claim 21, wherein the at least one
communication interface is further operable to receive at least one
video signal and to transmit at least one video signal to the
remote user video display system.
27. The apparatus of claim 26, wherein the control system is
operable to incorporate the requested information signal in an
associated video signal, and the communication interface is
operable to transmit the video signal with the incorporated
requested information signal to the remote user video display
system.
28. The apparatus of claim 26, wherein the control system is
operable to provide the requested information signal by generating
the requested information signal from a selected received video
signal in response to the received request.
29. The apparatus of claim 28, wherein the user-specified format is
a user-specified language, and the control system is operable to
translate an information signal into the user-specified language in
response to the request.
30. The apparatus of claim 29, wherein the requested information is
textual information.
31. A method of providing requested information in a user-requested
format system from a central information source, the method
comprising: receiving a request for information presentation in a
user-specified format over a communications network from a remote
user video display system; generating a requested information
signal comprising the requested information in the user-specified
format; and transmitting the requested information signal to the
remote user video display system.
32. The method of claim 31, wherein generating the requested
information signal comprises generating the requested information
signal in response to the request.
33. The method of claim 31, wherein the requested information is
textual information.
34. The apparatus of claim 31, wherein the user-specified format is
a user-specified language.
35. The method of claim 34, wherein generating the requested
information signal comprises translating an information signal into
the user-specified language in response to the received
request.
36. The method of claim 31, wherein further comprising receiving at
least one video signal and transmitting at least one video signal
to the remote user video display system.
37. The method of claim 36, comprising incorporating the requested
information signal in an associated video signal, and transmitting
the associated video signal with the incorporated requested
information signal to the remote user video display system.
38. The method of claim 36, wherein generating the requested
information signal comprises generating the requested information
signal from a selected received video signal in response to the
request.
39. The method of claim 38, wherein the user-specified format is a
user-specified language, and generating the requested information
signal comprises translating an information signal into the
user-specified language in response to the request.
40. The method of claim 39, wherein the requested information is
textual information.
Description
RELATED APPLICATIONS
[0001] The present application is a continuation of co-pending U.S.
patent application Ser. No. 11/531,562, filed Sep. 13, 2006,
entitled "CLOSED CAPTIONING LANGUAGE TRANSLATION."
FIELD OF THE INVENTION
[0002] The present invention relates to closed captioning, and in
particular to translating closed captioning text provided in a
first language into a second language, wherein the translated
closed captioning text is presented, to a viewer along with the
corresponding video.
BACKGROUND OF THE INVENTION
[0003] Closed captioning allows deaf, hard of hearing, and hearing
impaired people to read a transcript of an audio portion of a
television, video, or film presentation. As the video is presented
to the viewer, text captions are displayed identifying who is
speaking, transcribing what is being said, and indicating relevant
sounds, such as laughing, crying, crashes, explosions, and the
like. Closed captioning is also used to assist people who are
learning an additional language, learning to read, or for those in
a noisy environment.
[0004] For the present disclosure, television, video, and film
presentations are referred to as "video," and the text captions
representing the closed captioning of the video are referred to as
"closed captioning text." Closed captioning text is encoded into
the video using any number of closed captioning techniques. In many
instances, different types of video programming employ different
types of closed captioning encoding and decoding.
[0005] In North America, National Television Systems Committee
(NTSC)-based programming encodes closed captioning text into line
21 of the vertical blanking interval. The vertical blanking
interval is a portion of the analog television picture that resides
just above the visible portion of the video, and is not seen by the
viewer. The viewer's set-top box or television is able to decode
the encoded closed captioning text provided in line 21 of the
vertical blanking interval and present it to the viewer in
association with the video. For digital television, the Advanced
Television Systems Committee (ATSC)-based programming encodes three
data streams into the video to support closed captioning. One of
the streams can support up to 63 unique closed captioning streams,
which are encoded in an EIAA-708 format as set forth by the
Electronic Industries Alliance (EIA). The other two streams are
encoded such that when the digital video is converted to analog
video, the closed captioning text appears as encoded closed
captioning in line 21 of the vertical blanking interval. Outside of
North America, Phase Alternation Line (PAL) and Sequential Color
With Memory (SCCAM) video standards transmit and store closed
captioning information in a different manner, but the overall
result is the same.
[0006] Regardless of the encoding and delivery technique, closed
captioning is extremely beneficial in providing a transcript of an
audio portion of a video program. Unfortunately, closed captioning
text is generally only available in one language, although most
closed captioning standards support different closed captioning
streams for different languages. However, the significant effort
and expense associated with providing closed captioning generally
limits the closed captioning text to the most prevalent language in
which the video will be presented. In the United States, for
example, closed captioning is generally only provided in English,
even though there are significant Hispanic, Asian, and European
contingents who would benefit from closed captioning in their
native languages.
[0007] Accordingly, there is a need for a way to efficiently and
effectively translate closed captioning text presented in a first
language into a second language, and make the translated closed
captioning text available to viewers of the associated video.
SUMMARY OF THE INVENTION
[0008] The present invention provides an architecture for
translating closed captioning text originally provided with a video
program from one language to another and presenting the translated
closed captioning text with the video program to a viewer. As such,
the viewers are able to receive the closed captioning text in a
language other than that used for the closed captioning originally
provided with the video program. The original closed captioning
text may be translated from one language to another by a
centralized closed captioning processor, such that the customer
equipment for various subscribers can take advantage of centralized
translation services. Once the original closed captioning text is
translated, the translated closed captioning text may be delivered
to the customer equipment in different ways.
[0009] In a first embodiment, the video program is sent to the
closed captioning processor and the customer equipment at the same
time. The closed captioning processor will translate the original
closed captioning text from one language to another. After
translation, the closed captioning processor will deliver the
translated closed captioning text to the customer equipment, which
will present the translated closed captioning text with the video
program.
[0010] In another embodiment, the video program is initially sent
to the closed captioning processor. The closed captioning processor
will translate the original closed captioning text from one
language to another. After translation, the closed captioning
processor will encode the translated closed captioning text into
the video program as closed captioning. The closed captioning
processor will then deliver the video program with the new closed
captioning content to the customer equipment, which will recover
the translated closed captioning text using traditional closed
captioning decoding techniques.
[0011] Those skilled in the art will appreciate the scope of the
present invention and realize additional aspects thereof after
reading the following detailed description of the preferred
embodiments in association with the accompanying drawing
figures.
BRIEF DESCRIPTION OF THE DRAWING FIGURES
[0012] The accompanying drawing figures incorporated in and forming
a part of this specification illustrate several aspects of the
invention, and together with the description serve to explain the
principles of the invention.
[0013] FIG. 1 is a block representation of a closed captioning
architecture according to a first embodiment of the present
invention.
[0014] FIG. 2 is a block representation of a closed captioning
translation process according to the embodiment of FIG. 1.
[0015] FIG. 3 is a flow diagram illustrating the closed captioning
translation process according to the embodiment of FIG. 1.
[0016] FIG. 4 is a block representation of a closed captioning
architecture according to a second embodiment of the present
invention.
[0017] FIG. 5 is a block representation of, a closed captioning
translation process according to the embodiment of FIG. 4.
[0018] FIG. 6 is a flow diagram illustrating the closed captioning
translation process according to the embodiment of FIG. 4.
[0019] FIG. 7 is a block representation of a closed captioning
processor according to one embodiment of the present invention.
[0020] FIG. 8 is a block representation of customer equipment
according to one embodiment of the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0021] The embodiments set forth below represent the necessary
information to enable those skilled in the art to practice the
invention and illustrate the best mode of practicing the invention.
Upon reading the following description in light of the accompanying
drawing figures, those skilled in the art will understand the
concepts of the invention and will recognize applications of these
concepts not particularly addressed herein. It should be understood
that these concepts and applications fall within the scope of the
disclosure and the accompanying claims.
[0022] The present invention provides an architecture for
translating closed captioning (CC) text originally provided with a
video program from one language to another and presenting the
translated closed captioning text with the video program to a
viewer. As such, the viewers are able to receive the closed
captioning text in a language other than that used for the closed
captioning originally provided with the video program. The original
closed captioning text may be translated from one language to
another by a centralized closed captioning processor, such that the
customer equipment for various subscribers can take advantage of
centralized translation services. Once the original closed
captioning text is translated, the translated closed captioning
text may be delivered to the customer equipment in different ways.
In a first embodiment, the video program is sent to the closed
captioning processor and the customer equipment at the same time.
The closed captioning processor will translate the original closed
captioning text from one language to another. After translation,
the closed captioning processor will deliver the translated closed
captioning, text to the customer equipment, which will present the
translated closed captioning text with the video program.
[0023] In another embodiment, the video program is initially sent
to the closed captioning processor. The closed captioning processor
will translate the original closed captioning text from one
language to another. After translation, the closed captioning
processor will encode the translated closed captioning text into
the video program as closed captioning. The closed captioning
processor will then deliver the video program with the new closed
captioning content to the customer equipment, which will recover
the translated closed captioning text using traditional closed
captioning decoding techniques.
[0024] In either embodiment, any number of translations may be
provided for original closed captioning text. The translation
process and the desired translation or translations may be
requested by the customer equipment, the service provider, or
another subscriber device. The video program may be provided in an
analog or digital format via broadcast or recorded medium.
[0025] With reference to FIG. 1, a closed captioning translation
architecture is illustrated according to a first embodiment of the
present invention. The closed captioning translation architecture
10 includes a video source 12 capable of delivering video programs
to various customer equipment 14, which may be associated with
different subscribers at different locations. The customer
equipment 14 may take various forms, including a set-top box (STB)
14A, an associated television or monitor 14B, a personal computer
(PC) 14C, as well as any other mobile device, such as a personal
digital assistant (PDA) 14D or mobile telephone (not shown). The
video programs that require closed captioning translation are also
sent to a closed captioning (CC) processor 16 at the same time they
are being sent to the customer equipment 14.
[0026] In FIG. 1, video programs are delivered to the customer
equipment 14 and the CC processor 16 via an appropriate video
signal 18, which provides closed captioning in a first language.
The CC processor 16 will translate the closed captioning text to a
second language and will provide corresponding translated CC
information 20 to the appropriate customer equipment 14, as
indicated above. To reduce the computational requirements of the
customer equipment 14, and to minimize delays associated with
translating closed captioning text from one language to another,
the CC processor 16 is provided in a central network 22. As such,
the CC processor 22 is separate from the customer equipment 14.
Preferably, the CC processor 16 is able to provide translation
services to customer equipment 14 at numerous customer premises
24.
[0027] FIG. 2 illustrates a simplified block representation of the
closed captioning translation architecture of the embodiment
illustrated in FIG. 1. As illustrated, a video source 12 will
simultaneously provide the video signal 18 for the video program to
the CC processor 16 and the customer equipment 14. The video signal
18 will have a video, audio, and closed captioning component having
closed captioning text (CC1), regardless of the signal type or
closed captioning encoding technique.
[0028] With reference to FIG. 3, operation of the CC processor 16
is illustrated. The CC processor 16 may receive a CC translation
request from the customer equipment 14 to identify a language to
which the closed captioning text (CC1) provided in the video signal
18 should be translated (step 100). The CC processor 16 will then
receive the video signal 18 with the original closed captioning for
the video program (step 102), and extract the original closed
captioning text (CC1) from the video signal 18 using the
appropriate closed captioning decoding techniques (step 104). Once
the original dosed captioning text (CC1) is extracted, the CC
processor 16 will translate the original closed captioning text
from one language to another to create translated closed captioning
text (CC2) (step 106). The translated closed captioning text is
delivered to the customer equipment 14 in translated closed
captioning information 20, which is based on the translated, closed
captioning text (step 108), as also illustrated in FIG. 2.
[0029] Translation of the closed captioning text may be provided
using different techniques. For example, the translation may be
provided on a word-by-word or phrase-by-phrase basis.
Alternatively, the closed captioning text over a certain period of
time may be accumulated and then translated as an entire segment.
The particular type of translation is beyond the scope of the
present invention, and those skilled in the art will recognize
various translation techniques that will be beneficial in various
video delivery environments.
[0030] In one embodiment, the translated closed captioning
information is not re-encoded into a closed captioning format, and
is sent to the customer equipment 14 without the video or audio
components of the video program. The translated closed captioning
information 20 is formatted such that the customer equipment 14 can
readily recover the translated closed captioning text (CC2) and
overlay the text on the video program being presented to the
viewer. The overlay procedure may be configured to emulate
traditional closed captioning, or may be presented in any desired
fashion. Notably, the customer equipment 14 need not have closed
captioning decoding capabilities, since the translated closed
captioning information is not necessarily provided in a closed
captioning format, although it represents a translation of the
original closed captioning text (CC1).
[0031] Since the translated closed captioning text (CC2) is being
delivered to the customer equipment 14 in the translated closed
captioning information 20 separately from the video signal 18,
steps must be taken to synchronize the presentation of the
translated closed captioning text (CC2) with the video of the video
program. In the translated closed captioning information 20, the CC
processor 16 may provide markers or like synchronization
information corresponding to the video in the video program, such
that the translated closed captioning text is presented at the
appropriate time and rate in association with the video of the
video program. Alternatively, the customer equipment 14 and the CC
processor 16 may periodically or continuously communicate with each
other to ensure that the translated closed captioning text (CC2) is
presented to the viewer along with the video of the video program
in a synchronized fashion.
[0032] The CC processor 16 will inevitably inject some delay in
presenting the translated closed captioning text (CC2) to the
customer equipment 14. The customer equipment 14 may employ at
least a video buffer to buffer the video of the video program for a
time sufficient to receive the translated closed captioning text
(CC2) from the CC processor 16. The synchronization information in
the closed captioning text will control the amount of buffering.
The customer equipment 14 will then present the translated closed
captioning text (CC2) and the video of the video program in a
synchronized fashion to the viewer.
[0033] Given the centralized nature of the CC processor 16, any
customer equipment 14 may receive translation services. Further,
the CC processor 16 may be configured to translate between any
number of languages, such that closed captioning text may be
translated into any number of languages and may be presented to any
amount of customer equipment 14 in an effective and efficient
manner. Preferably, the customer equipment 14 is able to request a
particular type of translation for a particular video program or
for all programming in light of the viewer's needs or desires.
[0034] With reference to FIG. 4, an alternative closed captioning
translation architecture is provided. As illustrated, the video
signal 18 is sent to the CC processor 16 from the video source 12.
The video signal 18 is not directly delivered to the customer
equipment 14 when closed captioning translation services are
employed. Instead, the CC processor 16 will extract the closed
captioning text from the video signal 18, translate the closed
captioning text from one language to another, and encode translated
closed captioning text into the video signal 18 as traditional
closed captioning. The CC processor 16 will then deliver a video
signal 26 having closed captioning with the translated closed
captioning text to the customer equipment 14.
[0035] This process is further illustrated in FIG. 5. Notably, the
CC processor 16 presents video, audio, and closed captioning
aspects of the video program in the video signal 26 to the customer
equipment 14. Notably, the translated closed captioning text (CC2)
may replace the closed captioning text (CC1) that was provided in
the original video signal 18, or may be added in a separate closed
captioning field, as illustrated. Upon receipt of the video signal
26, the customer equipment 14 may use traditional closed captioning
decoding techniques to present the translated closed captioning
text CC2 or original closed captioning text CC1 to the viewer.
Unlike the previous embodiment where the translated closed
captioning text (CC2) required a different process to overlay the
translated closed captioning text on the video of the video
program, this embodiment embeds translated closed captioning text
into the video signal 26 prior to the customer equipment 14
receiving the video signal 26. This embodiment is particularly
beneficial in environments where the customer equipment 14 is not
configured to receive translated closed captioning information via
a separate source and overlay the corresponding translated closed
captioning text (CC2) over the video of the video signal 18.
However, this embodiment requires that the customer equipment 14
have closed captioning decoding capabilities.
[0036] With reference to FIG. 6, operation of the CC processor 16
is provided according to the second embodiment. Initially, the CC
processor 16 may receive closed captioning translation instructions
from the customer equipment 14, a service provider, or a subscriber
via another piece of customer equipment 14 (step 200). The
instructions may identify a desired language for closed captioning,
as well as the video program or programs for which closed
captioning translation services are desired. The CC processor 16
may then receive a video signal with closed captioning for a video
program (step 202). Again, the closed captioning will include
closed captioning text (CC1) in a first language. The CC processor
16 will extract the closed captioning text (CC1) (step 204) and
translate the closed captioning text (CC1) into translated closed
captioning text (CC2) (step 206). The translated closed captioning
text is then encoded into the video signal 18 of the video program
using appropriate closed captioning encoding techniques (step 208).
The CC processor 16 will deliver the resultant video signal 26 to
the customer equipment 14 (step 210). Again, the video signal 26
will have closed captioning including the translated closed
captioning text (CC2), and perhaps the original closed captioning
text (CC1).
[0037] In this embodiment, synchronization of the translated closed
captioning text (CC2) with the video of the video program is
provided in the CC processor 16 during the closed captioning
encoding process. The customer equipment 14 will simply decode the
appropriate closed captioning stream, which includes the translated
closed captioning text (CC2), and present the translated closed
captioning text (CC2) to the viewer along with the video in
traditional fashion.
[0038] Those skilled in the art will recognize that different
closed captioning encoding and decoding techniques are available
and known in the art. In light of the different closed captioning
processes and the various manners in which video programs may be
recorded, broadcast, or delivered to customer equipment 14, the
concepts of the present invention may take corresponding forms, as
will be appreciated by those skilled in the art.
[0039] With reference to FIG. 7, a block representation of a CC
processor 16 is illustrated. The CC processor 16 may include a
control system 30 associated with sufficient memory 32 for the
requisite software 34 and data 36 to operate as described above.
The control system 30 may also be associated with one or more
communication interfaces 38 to facilitate communications directly
or indirectly with the video source 12 and the customer equipment
14.
[0040] A block representation of customer equipment 14 is provided
in FIG. 8. The customer equipment 14 may include a control system
40 having sufficient memory 42 for the requisite software 44 and
data 46 to operate as described above. The control system 40 may
also be associated with one or more communication interfaces 48 to
facilitate communications with other communication equipment 14,
the CC processor 16, and the video source 12, in a direct or
indirect fashion. The control system 40 may also be associated with
a user interface 50, which may represent a display, monitor,
keypad, mouse, remote control input, or the like.
[0041] Those skilled in the art will recognize improvements and
modifications to the preferred embodiments of the present
invention. All such improvements and modifications are considered
within the scope of the concepts disclosed herein and the claims
that follow.
* * * * *