U.S. patent application number 10/902010 was filed with the patent office on 2004-12-30 for document processing apparatus and document processing method.
This patent application is currently assigned to FUJITSU LIMITED. Invention is credited to Shiotsu, Makoto, Yamamoto, Yuki.
Application Number | 20040268243 10/902010 |
Document ID | / |
Family ID | 18612909 |
Filed Date | 2004-12-30 |
United States Patent
Application |
20040268243 |
Kind Code |
A1 |
Yamamoto, Yuki ; et
al. |
December 30, 2004 |
Document processing apparatus and document processing method
Abstract
A document processing apparatus has a language tag setting
section for setting a language tag designating a kind of language
at each of constituent unit positions in an original and those in a
translated version corresponding to the original, and a document
forming section for forming a tagged document including the
original and the translated version each having the language tag
set therein. This apparatus enables a user to easily form and
maintain a home page (Web page data) expressed in a plurality of
languages.
Inventors: |
Yamamoto, Yuki; (Nei-gun,
JP) ; Shiotsu, Makoto; (Kawasaki, JP) |
Correspondence
Address: |
STAAS & HALSEY LLP
SUITE 700
1201 NEW YORK AVENUE, N.W.
WASHINGTON
DC
20005
US
|
Assignee: |
FUJITSU LIMITED
Kawasaki
JP
|
Family ID: |
18612909 |
Appl. No.: |
10/902010 |
Filed: |
July 30, 2004 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
10902010 |
Jul 30, 2004 |
|
|
|
09748177 |
Dec 27, 2000 |
|
|
|
Current U.S.
Class: |
715/234 ;
715/256 |
Current CPC
Class: |
G06F 40/117 20200101;
G06F 40/151 20200101; G06F 40/58 20200101 |
Class at
Publication: |
715/513 |
International
Class: |
G06F 017/00 |
Foreign Application Data
Date |
Code |
Application Number |
Mar 31, 2000 |
JP |
2000-98427 |
Claims
What is claimed is:
1. A document processing apparatus comprising: a language tag
setting unit setting a language tag designating a kind of language
at each of constituent unit positions in an original and those in a
translated version corresponding to the original; and a document
forming unit forming a tagged document including the original and
the translated version each having the language tag set
therein.
2. A document processing apparatus according to claim 1, wherein
said document forming unit forms a tagged document in which the
original and the translated version each having the language tag
set therein are described in an original-versus-version form.
3. A document processing apparatus according to claim 1 or 2,
further comprising a translation unit translating the original to
make the translated version.
4. A document processing apparatus according to any one of claims 1
to 3, wherein the original is contained in the processing-object
tagged document.
5. A document processing apparatus according to any one of claims 1
to 4, further comprising a visibility control tag setting unit
setting a visibility control tag for controlling any document
portion so that the document portion is invisible, said visibility
control tag setting unit setting the visibility control tag at such
a position that one of the original and the translated version is
in an invisible state.
6. A document processing apparatus according to claim 5, wherein
said visibility control tag setting unit setting the visibility
control tag at such a position that one of the original and the
translated version is set in an invisible state and the language
tag is also set in an invisible state.
7. A document processing apparatus according to claim 1, further
comprising a display processing unit interpreting the tag contained
in the tagged document and for enabling a browser for displaying
the tagged document to display the formed tagged document on the
basis of a result of the interpretation.
8. A document processing apparatus comprising: a language tag
setting unit setting a language tag at each of constituent unit
positions in a first text described in a first language and those
in a second text described in a second language of the first text;
and a document forming unit forming a tagged document including the
first text and the second text each having the language tag set
therein.
9. A document processing apparatus according to claim 8, wherein
said document forming unit forms a tagged document in which the
first text and the corresponding second text each having the
language tag set therein are described by being related to each
other.
10. A document processing apparatus according to claim 8, further
comprising a visibility control tag setting unit setting a
visibility control tag for controlling a text so that the text is
invisible, said visibility control tag setting unit setting the
visibility control tag at such a position that one of the first
text and the second text is in an invisible state.
11. A document processing apparatus according to claim 10, wherein
said visibility control tag setting unit sets the visibility
control tag at such a position that one of the first text and the
second text is set in an invisible state and the language tag is
also set in an invisible state.
12. A document processing apparatus according to claim 8, further
comprising a display processing unit interpreting the tag contained
in the tagged document and for enabling a browser for displaying
the tagged document to display the formed tagged document on the
basis of a result of the interpretation.
13. A document processing apparatus comprising: an analysis unit
determining and extracting an original text from a
processing-object tagged document; and an editing unit enabling
edit processing of the original text extracted from the tagged
document by displaying the original text.
14. A document processing apparatus according to claim 13, wherein
said analysis unit determines the original text according to a
language tag contained in the processing-object tagged
document.
15. A document processing apparatus according to claim 13 or 14,
further comprising a translation processing unit making a
translated version by translating the original text edited by said
editing unit.
16. A document processing apparatus according to claim 15, further
comprising a restoration unit restoring, from the original text
after the editing and the translated version made by said
translation processing unit, a tagged document in the same format
as the processing-object tagged document, according to tags
contained in the processing-object tagged document, said tags
including the language tag.
17. A document processing apparatus comprising: an analysis unit
determining an original text and a translated version corresponding
to the original text in a processing-object tagged document; and an
editing unit enabling editing of the original text and the
translated version by extracting the original text and the
translated version from the tagged document and by displaying the
original text and the translated version in an
original-versus-version form.
18. A document processing apparatus according to claim 17, wherein
said analysis unit determines the original text and the translated
version according to language tags contained in the
processing-object tagged document.
19. A document processing apparatus according to claim 17 or 18,
further comprising a translation processing unit making a
translated version by translating the original text edited by said
editing unit.
20. A document processing apparatus according to claim 19, further
comprising a restoration unit restoring, from the original text
after the edit processing and the translated version made by said
translation processing unit, a tagged document in the same format
as the processing-object tagged document, according to tags
contained in the processing-object tagged document, said tags
including the language tag.
21. A recording medium readable by a computer, said recording
medium having a program recorded thereon, said program for enabling
the computer to execute steps of: setting a language tag
designating a kind of language at each of constituent unit
positions in an original and those in a translated version
corresponding to the original; and forming a tagged document
including the original and the translated version each having the
language tag set therein.
22. A recording medium according to claim 21, wherein said program
recorded on said recording medium enables the computer to further
execute a step of setting a visibility control tag for controlling
any document portion so that the document portion is invisible,
said visibility control tag being set at such a position that one
of the original and the translated version is in an invisible
state.
23. A recording medium readable by a computer, said recording
medium having a program recorded thereon, said program for enabling
the computer to execute steps of: setting a language tag
designating a kind of language at each of constituent unit
positions in a first text described in a first language and those
in a second text described in a second language of the first text;
and forming a tagged document including the first text and the
second text each having the language tag set therein.
24. A recording medium readable by a computer according to claim
23, wherein said program recorded on said recording medium enables
the computer to further execute a step of setting a visibility
control tag for controlling a text so that the document is
invisible, said visibility control tag being set at such a position
that one of the first text and the second text is in an invisible
state.
25. A recording medium readable by a computer, said recording
medium having a program recorded thereon, said program for enabling
the computer to execute steps of: determining and extracting an
original text from a processing-object tagged document; and
enabling editing of the original text extracted from the tagged
document by displaying the original text.
26. A recording medium readable by a computer according to claim
25, wherein said program recorded on said recording medium enables
the computer to further execute steps of: making a translated
version by translating the original text after editing of the
original text; and restoring, from the original text after the edit
processing and the translated version formed by said translation, a
tagged document in the same format as the processing-object tagged
document according to tags contained in the processing-object
tagged document, said tags including a language tag designating a
kind of language.
27. A recording medium readable by a computer, said recording
medium having a program recorded thereon, said program for enabling
the computer to execute steps of: determining an original text and
a translated version corresponding to the original text in a
processing-object tagged document; and enabling edit processing of
the original text and the translated version by extracting the
original text and the translated version from the tagged document
and by displaying the original text and the translated version in
an original-versus-version form.
28. A recording medium readable by a computer according to claim
27, wherein said program recorded on said recording medium enables
the computer to further execute steps of: forming a translated
version by translating the original text after editing of the
original text; and restoring, from the original text after the edit
processing and the translated version made by said translation, a
tagged document in the same format as the processing-object tagged
document, according to tags contained in the processing-object
tagged document, said tags including a language tag designating a
kind of language.
29. A recording medium readable by a computer, said recording
medium having a tagged document recorded thereon, wherein said
recorded tagged document comprising: a first language tag
designating a first kind of language; a first text following said
first language tag, said first text being described in a first
language; a second language tag following said first text, said
second language tag designating a second kind of language; and a
second text following said second language tag, said second text
being described in a second language, said second text
corresponding to said first text.
30. A recording medium readable by a computer, said recording
medium having a tagged document recorded thereon, wherein said
recorded tagged document comprising: an invisibility start tag for
setting a following text in an invisible state; a first language
tag following said invisibility start tag, said first language tag
designating a first kind of language; a first text following said
first language tag, said first text being described in a first
language; a second language tag following said first text, said
second language tag designating a second kind of language; an
invisibility end tag following said second language tag, said
invisibility end tag canceling the invisible state; and a second
text following said invisibility end tag, said second text being
described in a second language, said second text corresponding to
said first text.
31. A document processing method comprising steps of: setting a
language tag designating a kind of language at each of constituent
unit positions in an original and those in a translated version
corresponding to the original; and forming a tagged document
including the original and the translated version each having the
language tag set therein.
32. A document processing method according to claim 31, wherein
further comprising a step of setting a visibility control tag for
controlling any document portion so that the document portion is
invisible, said visibility control tag being set at such a position
that one of the original and the translated version is in an
invisible state.
33. A document processing method comprising steps of: setting a
language tag designating a kind of language at each of constituent
unit positions in a first text described in a first language and
those in a second text described in a second language of the first
text; and forming a tagged document including the first text and
the second text each having the language tag set therein.
34. A document processing method according to claim 33, wherein
further comprising a step of setting a visibility control tag for
controlling a text so that the document is invisible, said
visibility control tag being set at such a position that one of the
first text and the second text is in an invisible state.
35. A document processing method comprising steps of: determining
and extracting an original text from a processing-object tagged
document; and enabling editing of the original text extracted from
the tagged document by displaying the original text.
36. A document processing method according to claim 35, wherein
further comprising steps of: making a translated version by
translating the original text after editing of the original text;
and restoring, from the original text after the edit processing and
the translated version formed by said translation, a tagged
document in the same format as the processing-object tagged
document according to tags contained in the processing-object
tagged document, said tags including a language tag designating a
kind of language.
37. A document processing method comprising steps of: determining
an original text and a translated version corresponding to the
original text in a processing-object tagged document; and enabling
edit processing of the original text and the translated version by
extracting the original text and the translated version from the
tagged document and by displaying the original text and the
translated version in an original-versus-version form.
38. A document processing method according to claim 37, wherein
further comprising steps of: forming a translated version by
translating the original text after editing of the original text;
and restoring, from the original text after the edit processing and
the translated version made by said translation, a tagged document
in the same format as the processing-object tagged document,
according to tags contained in the processing-object tagged
document, said tags including a language tag designating a kind of
language.
Description
BACKGROUND OF THE INVENTION
[0001] The present invention relates to a document processing
apparatus and method which analyzes a tagged document, e.g., a
Hyper Text Markup Language (HTML) document and forms another tagged
document containing original sentences and translated sentences.
The present invention also relates to a recording medium for
recording such a tagged document.
[0002] With the recent proliferation of personal computers and
communication apparatus, people have become able to use
communication networks represented by the Internet, i.e., Internet
protocol (IP) communication networks, and to easily obtain various
sorts of information through the networks. Ordinarily, the World
Wide Web in the Internet generally uses HTML as a language for
describing information. Dynamic HTML (DHTML) and Extensible Markup
Language (XML) are other languages presently used to form tagged
documents.
[0003] Conventionally, to form home pages (also called Web pages)
containing the same information described in different languages,
e.g., English and Japanese, the process of separately forming each
home page is required. That is, the steps of forming sentences
separately in each language, pasting common images including graphs
and figures, separately setting links from each of the English home
page and the Japanese home page, etc., are required.
[0004] Home pages are open to the public on the Internet and can be
read by people everywhere in the world. Therefore, people who set
up their home pages for various readers to read are making home
pages having the same format and contents but having sentences
written in different languages.
[0005] The English and Japanese home pages formed as described
above need scrupulous attention for maintenance because their
English and Japanese sentences after correction must be equivalent
in meaning and format.
[0006] To form such English and Japanese home pages, it is
necessary to make and manage two kinds of HTML document files for
the English home page and the Japanese home page. That is, the
number of files to be managed is increased in proportion to the
number of languages, and management and maintenance becomes
difficult.
[0007] A user who wishes to read document information in an English
home page in Japanese may translate the English document
information by using Internet translation software. However, if the
user wishes to edit the results of the translation displayed as an
HTML document, he or she must give up the idea of doing so or is
further required to translate the translation results by different
translation software, because direct editing of the translated HTML
document is impossible.
[0008] If the user dares to edit the translation-result HTML
document, he or she must perform the steps of storing the
translation-result HTML document on a local disk, opening the HTML
document file stored on the local disk by using HTML document
editing software, displaying the HTML document source, directly
editing the HTML document source, and storing the results of the
editing on the local disk. This process enables editing of the
translation results to some effect. However, it is difficult to
edit a document in which HTML tags, original sentences, and
translated sentences are mixed.
[0009] Further, in a case where an HTML document intended as an
object of translation is prepared in advance and, from this
document, another HTML document described in a different language
is formed by translation processing using Internet translation
software, a need may arise to edit the HTML document in the second
language formed by the translation processing and, if necessary,
the translation-object HTML document, if the author of the HTML
document is not satisfied with the results of the translation.
[0010] In this editing, it is difficult to determine document
portions to be edited and to confirm the correspondence between
original and translated sentences, since the translation-object
HTML document and the translation-result HTML document exist in
separate files. It is also possible that, through editing, the page
configuration (format) of one document will become different from
that of the other.
[0011] As described above, the conventional HTML document
processing apparatus can be designed to enable translation of an
original home page on the Internet using Internet translation
software and visual display of original and translated sentences in
a juxtaposed form. However, in editing translation results, an HTML
document itself cannot be edited. There is a way to directly edit
the HTML document source, but editing in such a way is extremely
troublesome and not satisfactorily effective.
SUMMARY OF THE INVENTION
[0012] In view of the above-described circumstances, an object of
the present invention is to provide a document processing apparatus
and a recording medium which make it possible to easily form and
maintain a home page (Web page data) expressed in two or more
languages.
[0013] Another object of the present invention is to provide a
document processing apparatus and a recording medium which make it
possible to easily edit translated sentences obtained as a result
of translation of a tagged document.
[0014] Still another object of the present invention is to provide
a document processing apparatus and a recording medium which make
it possible to selectively display original sentences in a tagged
document and translated sentences obtained as a result of
translation of the tagged document.
[0015] A further object of the present invention is to provide a
document processing apparatus and a recording medium which make it
possible to return the edited translation sentences of the result
of translation into a tagged document and to use it.
[0016] A further object of the present invention is to provide a
document processing apparatus and a recording medium which make it
possible to easily edit original sentences in a tagged document to
from a more favorable translation.
[0017] To achieve the above-described objects, according to one
aspect of the present invention, there is provided a document
processing apparatus comprising a language tag setting unit for
setting a language tag designating a kind of language at each of
constituent unit positions in an original and those in a translated
version corresponding to the original, and a document forming unit
for forming a tagged document including the original and the
translated version each having the language tag set therein.
[0018] The above-described document forming unit may form a tagged
document in which the original and the translated version each
having the language tag set therein are described in an
original-versus-version form.
[0019] The above-described document processing apparatus further
comprises a translation unit for translating the original to make
the corresponding translated version.
[0020] The original may be contained in the processing-object
tagged document.
[0021] The above-described document processing apparatus further
comprises a visibility control tag setting unit for setting a
visibility control tag for controlling any document portion so that
the document portion is invisible, the visibility control tag
setting unit setting the visibility control tag at such a position
that one of the original and the translated version is in an
invisible state.
[0022] The visibility control tag setting unit may set the
visibility control tag at such a position that one of the original
and the translated version is set in an invisible state and the
language tag is also set in an invisible state.
[0023] The above-described document processing apparatus further
comprises a display processing unit for interpreting the tag
contained in the tagged document and for enabling a browser for
displaying the tagged document to display the formed tagged
document on the basis of a result of the interpretation in the
above document processing apparatus.
[0024] According to another aspect of the present invention, there
is provided a document processing apparatus comprising a language
tag setting unit for setting a language tag at each of constituent
unit positions in a first text described in a first language and
those in a second text described in a second language of the first
text, and a document forming unit for forming a tagged document
including the first text and the second text each having the
language tag set therein.
[0025] The above-described document forming unit may form a tagged
document in which the first text and the corresponding second text
each having the language tag set therein are described by being
related to each other.
[0026] The above-described document processing apparatus further
comprises a visibility control tag setting unit for setting a
visibility control tag for controlling a text so that the text is
invisible, the visibility control tag setting unit setting the
visibility control tag at such a position that one of the first
text and the second text is in an invisible state.
[0027] The visibility tag setting unit may set the visibility
control tag at such a position that one of the first text and the
second text is set in an invisible state and the language tag is
also set in an invisible state.
[0028] The above-described document processing apparatus further
comprises a display processing unit for interpreting the tag
contained in the tagged document and for enabling a browser for
displaying the tagged document to display the formed tagged
document on the basis of a result of the interpretation.
[0029] According to still another aspect of the present invention,
there is provided a document processing apparatus comprising an
analysis unit for determining and extracting an original text from
a processing-object tagged document, and an editing unit for
enabling edit processing of the original text extracted from the
tagged document by displaying the original text.
[0030] The analysis unit may determine the original text according
to a language tag contained in the processing-object tagged
document.
[0031] The above-described document processing apparatus further
comprises a translation processing unit for making a translated
version by translating the original text edited by the editing
unit.
[0032] The above-described document processing apparatus further
comprises a restoration unit for restoring, from the original text
after the editing and the translated version made by the
translation processing unit, a tagged document in the same format
as the processing-object tagged document according to tags
contained in the processing-object tagged document, the tags
including the language tag.
[0033] According to yet another aspect of the present invention,
there is provided a document processing apparatus comprising an
analysis unit for determining an original text and a translated
version corresponding to the original text in a processing-object
tagged document, and an editing unit for enabling edit processing
of the original text and the translated version by extracting the
original text and the translated version from the tagged document
and by displaying the original text and the translated version in
an original-versus-version form.
[0034] The analysis unit may determine the original text and the
translated version according to language tags contained in the
processing-object tagged document.
[0035] The above-described document processing apparatus further
comprises a translation processing unit for making a translated
version by translating the original text edited by the editing
unit.
[0036] The above-described document processing apparatus further
comprises a restoration unit for restoring, from the original text
after the editing and the translated version made by the
translation processing unit, a tagged document in the same format
as the processing-object tagged document according to tags
contained in the processing-object tagged document, the tags
including the language tag.
[0037] According to a further aspect of the present invention,
there is provided a recording medium readable by a computer, the
recording medium having a program recorded thereon, the program for
enabling the computer to execute a step of setting a language tag
designating a kind of language at each of constituent unit
positions in an original and those in a translated version
corresponding to the original, and a step of forming a tagged
document including the original and the translated version each
having the language tag set therein.
[0038] The program recorded on this recording medium may enable the
computer to further execute a step of setting a visibility control
tag for controlling any document portion so that the document
portion is invisible, the visibility control tag being set at such
a position that one of the original and the translated version is
in an invisible state.
[0039] According to still a further aspect of the present
invention, there is provided a recording medium readable by a
computer, the recording medium having a program recorded thereon,
the program for enabling the computer to execute a step of setting
a language tag designating a kind of language at each of
constituent unit positions in a first text described in a first
language and those in a second text described in a second language
of the first text, and a step of forming a tagged document
including the first text and the second text each having the
language tag set therein.
[0040] The program recorded on this recording medium may enable the
computer to further execute a step of setting a visibility control
tag for controlling a text so that the document is invisible, the
visibility control tag being set at such a position that one of the
first text and the second text is in an invisible state.
[0041] According to still a further aspect of the present
invention, there is provided a recording medium readable by a
computer, the recording medium having a program recorded thereon,
the program for enabling the computer to execute a step of
determining and extracting an original text from a
processing-object tagged document, and a step of enabling editing
of the text extracted from the tagged document by displaying the
original text.
[0042] The program recorded on this recording medium may enable the
computer to further execute a step of making a translated version
by translating the original text after editing of the original
text, and a step of restoring, from the original text after the
editing and the translated version made by the translation, a
tagged document in the same format as the processing-object tagged
document according to tags contained in the processing-object
tagged document, the tags including a language tag designating a
kind of language.
[0043] According to still a further aspect of the present
invention, there is provided a recording medium readable by a
computer, the recording medium having a program recorded thereon,
the program for enabling the computer to execute a step of
determining an original text and a translated version corresponding
to the original text in a processing-object tagged document, and a
step of enabling editing of the original text and the translated
version by extracting the original text and the translated version
from the tagged document and by displaying the original text and
the translated version in an original-versus-version form.
[0044] The program recorded on this recording medium may enable the
computer to further execute a step of forming a translated version
by translating the original text after editing of the original
text, and a step of restoring, from the original text after the
editing and the translated version made by the translation, a
tagged document in the same format as the processing-object tagged
document according to tags contained in the processing-object
tagged document, the tags including a language tag designating a
kind of language.
[0045] According to still a further aspect of the present
invention, there is provided a recording medium readable by a
computer, the recording medium having a tagged document recorded
thereon, the recorded tagged document comprising a first language
tag designating a first kind of language; a first text following
the first language tag, the first text being described in a first
language; a second language tag following the first text, the
second language tag designating a second kind of language; and a
second text following the second language tag, the second text
being described in a second language and corresponding to the first
text.
[0046] According to still a further aspect of the present
invention, there is provided a recording medium readable by a
computer, the recording medium having a tagged document recorded
thereon, the recorded tagged document comprising an invisibility
start tag for setting a following text in an invisible state; a
first language tag following the invisibility start tag, the first
language tag designating a first kind of language; a first text
following the first language tag, the first text being described in
a first language; a second language tag following the first text,
the second language tag designating a second kind of language; an
invisibility end tag following the second language tag, the
invisibility end tag canceling the invisible state; and a second
text following the invisibility end tag, the second text being
described in a second language and corresponding to the first
text.
[0047] According to a further aspect of the present invention,
there is provided a document processing method comprising a step of
setting a language tag designating a kind of language at each of
constituent unit positions in an original and those in a translated
version corresponding to the original; and a step of forming a
tagged document including the original and the translated version
each having the language tag set therein.
[0048] The above-described document processing method further
comprises a step of setting a visibility control tag for
controlling any document portion so that the document portion is
invisible, said visibility control tag being set at such a position
that one of the original and the translated version is in an
invisible state.
[0049] According to still a further aspect of the present
invention, there is provided a document processing method
comprising a step of setting a language tag designating a kind of
language at each of constituent unit positions in a first text
described in a first language and those in a second text described
in a second language of the first text; and a step of forming a
tagged document including the first text and the second text each
having the language tag set therein.
[0050] The above-described document processing method further
comprises a step of setting a visibility control tag for
controlling a text so that the document is invisible, said
visibility control tag being set at such a position that one of the
first text and the second text is in an invisible state.
[0051] According to still a further aspect of the present
invention, there is provided a document processing method
comprising a step of determining and extracting an original text
from a processing-object tagged document; and a step of enabling
editing of the original text extracted from the tagged document by
displaying the original text.
[0052] The above-described document processing method further
comprises a step of making a translated version by translating the
original text after editing of the original text; and a step of
restoring, from the original text after the edit processing and the
translated version formed by said translation, a tagged document in
the same format as the processing-object tagged document according
to tags contained in the processing-object tagged document, said
tags including a language tag designating a kind of language.
[0053] According to still a further aspect of the present
invention, there is provided a document processing method
comprising a step of determining an original text and a translated
version corresponding to the original text in a processing-object
tagged document; and a step of enabling edit processing of the
original text and the translated version by extracting the original
text and the translated version from the tagged document and by
displaying the original text and the translated version in an
original-versus-version form.
[0054] The above-described document processing method further
comprises a step of forming a translated version by translating the
original text after editing of the original text; and a step of
restoring, from the original text after the edit processing and the
translated version made by said translation, a tagged document in
the same format as the processing-object tagged document, according
to tags contained in the processing-object tagged document, said
tags including a language tag designating a kind of language.
[0055] According to the present invention, it is not necessary to
form and manage a file with respect to each of a plurality of
languages, so that the maintenance can be easily carried out.
[0056] According to the present invention, an original text and a
translated version of the original text obtained by translation are
displayed in an original-versus-version form such that edit
processing and version can be easily carried out.
[0057] According to the present invention, an original text in a
tagged document and a translated version of the original text
obtained by translation can be selectively displayed.
[0058] Further, according to the present invention, an edited
translated version of the result of translation can be reformatted
to the tagged document and used.
[0059] Further, according to the present invention, an original
text in a tagged document can be easily edited to obtain a more
favorable translated version.
BRIEF DESCRIPTION OF THE DRAWINGS
[0060] These objects and advantages of the present invention will
become more apparent and more readily appreciated from the
following detailed description of the presently preferred exemplary
embodiments, taken in conjunction with the accompanying drawings of
which:
[0061] FIG. 1 is a block diagram showing the configuration of an
HTML document processing apparatus which represents a first
embodiment of the present invention;
[0062] FIG. 2 is a diagram schematically showing the format and
contents of an HTML document;
[0063] FIG. 3 is a diagram explaining a state where HTML documents
are combined into one file and stored, and an edited state by a
translation editor;
[0064] FIG. 4 is a flowchart for explaining the operation of the
HTML document processing apparatus of the first embodiment;
[0065] FIG. 5 is another flowchart for explaining the operation of
the HTML document processing apparatus of the first embodiment;
[0066] FIG. 6 is a diagram showing a specific example of an
original-versus-version HTML document source;
[0067] FIG. 7 is a block diagram showing the configuration of an
HTML document processing apparatus which represents second and
third embodiments of the present invention;
[0068] FIG. 8 is a flowchart for explaining the operation of the
HTML document processing apparatus of the second embodiment;
[0069] FIG. 9 is another flowchart for explaining the operation of
the HTML document processing apparatus of the second
embodiment;
[0070] FIG. 10 is a diagram showing an example of an initial window
of an English-Japanese translation editor (original-versus-version
editor program);
[0071] FIG. 11 is a flowchart for explaining the operation of the
HTML document processing apparatus of the third embodiment; and
[0072] FIG. 12 is another flowchart for explaining the operation of
the HTML document processing apparatus of the third embodiment.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0073] Embodiments of the present invention will be described with
reference to the accompanying drawings.
[0074] [First Embodiment]
[0075] [Configuration of HTML Document Processing Apparatus]
[0076] FIG. 1 shows functional components of an HTML document
processing apparatus 1 which represents a first embodiment of the
present invention. As shown in FIG. 1, the HTML document processing
apparatus 1 is connected to a World Wide Web (WWW) server 3 through
an IP communication network, i.e., Internet 2. In this embodiment,
the WWW server 3 holds English home pages (Web pages in which
contents are described in English).
[0077] The HTML document processing apparatus 1 can be realized by
a personal computer, and has an input section 10, a display section
11, a display processing section 12, a communication control
section 13, an automatic translation processing section 14, an
original-versus-version HTML document forming processing section
15, an individual-language HTML document conversion processing
section 16, a data storage section 17, etc. The HTML document
processing apparatus 1 is connected to a WWW browser 4 provided as
an Internet display tool.
[0078] The personal computer has, as is well known, a central
processing unit (CPU), a random access memory (RAM), a hard disk, a
drive unit for accessing a portable recording medium, such as a
floppy disk or a compact disk-read only memory (CD-ROM), to read
out a program or data recorded thereon, a communication control
unit, such as a modern or a local area network (LAN) board, a
display, a keyboard, and a mouse.
[0079] The input section 10 of the HTML document processing
apparatus 1 in this embodiment corresponds to the keyboard and the
mouse; the display section 11 to the display; the data storage
section 17 to the RAM, the hard disk or the portable recording
medium; and the communication control section 13 to the
communication control unit.
[0080] The display processing section 12, the automatic translation
processing section 14, the original-versus-version HTML document
forming processing section 15, and the individual-language HTML
document conversion processing section 16 are realized by making
the CPU execute a program of the present invention and a program of
Internet translation software.
[0081] The program of the present invention and the Internet
translation software are provided by being recorded on the portable
recording medium, and are read out by the drive unit and stored in
the hard disk in advance.
[0082] The above-described program and software stored in an
external unit, may be downloaded to the HTML document processing
apparatus 1 through the network using the communication control
unit, and stored in the hard disk.
[0083] The program of the present invention and the Internet
translation software stored in the hard disk are read to the RAM to
be executed by the CPU. The programs of the present invention and
the Internet translation software may be directly read to the RAM
without being stored in the hard disk, instead of being temporality
stored in the hard disk and thereafter read to the RAM as described
above.
[0084] HTML documents formed by the HTML document processing
apparatus 1 in this embodiment of the present invention are stored
in the RAM, the hard disk or the portable recording medium.
[0085] In the HTML document processing apparatus 1 arranged as
described above, the automatic translation processing section 14
translates an English HTML document (an original including
sentences and figures) of an English home page described in English
into a Japanese HTML document (a translated version) described in
Japanese by using the home page translation functions of the
Internet translation software, e.g., "ATLAS" (trademark of FUJITSU
LIMITED) and the original and the results of the translation are
visually displayed so that pairs of original and translated
paragraphs or sentences are juxtaposed in upper and lower
positions, as described below in detail.
[0086] In this translation and display process, the
original-versus-version HTML document forming processing section 15
attaches a language tag, which is a special tag for identification
of the kind of language, to the beginning of each of paragraphs or
sentences which are constituent units of the original document and
the translated document. The individual-language HTML document
conversion processing section 16 recognizes the attached language
tag to enable selective display of only one of the original and
translated documents by further adding an invisibility tag
(visibility control tag) to instruct to make no visual display.
[0087] [Operation of HTML Document Processing Apparatus]
[0088] The operation of the HTML document processing apparatus 1 in
the first embodiment of the present invention shown in FIG. 1 will
now be described. FIG. 2 schematically shows the compositions and
contents of original and translated documents and a state where
these documents are stored and displayed by being juxtaposed in the
above-mentioned manner. FIG. 3 shows a state where an original HTML
document and a version of the HTML document are stored and
displayed while being combined into one file in such a form that
each of pairs of original and translated sentences are juxtaposed
in left and right positions. FIGS. 4 and 5 are flowcharts for
explaining the operation of the HTML document processing apparatus
1.
[0089] (Original-Versus-Version HTML Document Forming
Processing)
[0090] The operation will be described by way of example with
respect to a case where, in the HTML document processing apparatus
1, an English HTML document (original) of an English home page
described in English is translated into a Japanese HTML document
(translated version) described in Japanese, and an
original-versus-version HTML document is formed from the original
and version and is visually displayed.
[0091] Referring also to FIGS. 1 through 5, a user using the HTML
document processing apparatus 1 searches the Internet for a home
page described in a language which he or she wishes to translate,
by operating the input section 10 and based on the contents of a
display made by the display section 11 (WWW browser). In this
example, the object of this search (processing step S401 in FIG. 4)
is an English home page held by the WWW server 3.
[0092] Next, the user initiates a program for translation in the
desired direction by operating the input section 10 and based on
the contents of the display made by the display section 11. In this
example, the user wishes to translate an English HTML document of
the English home page into a Japanese HTML document described in
Japanese and therefore initiates an English-into-Japanese
translation program (S402).
[0093] The user clicks a "translation" button displayed in an
initial window (dialogue window) (not shown) of the translation
program to make the HTML document processing apparatus 1 translate
and arrange the English HTML document in such a manner that each of
pairs of original and translated paragraphs or sentences can be
displayed by being juxtaposed in upper and lower positions (S403).
According to the user's "translation" button clicking operation,
the automatic translation processing section 14 having the
translation program read out from the hard disk, translates the
English HTML document supplied from the WWW server 3 through the
communication control section 13 into a Japanese document, and
thereby forms an HTML document in which each of a certain number of
English paragraphs or sentences are followed by a line break tag
and a corresponding Japanese paragraph or sentence, that is, each
of pairs of original and translated paragraphs or sentences are
displayed by being juxtaposed in upper and lower positions. Known
techniques can be used for the process of translation in the
automatic translation processing section 14, and therefore such
translation techniques will not be described in detail in this
specification.
[0094] The original-versus-version HTML document forming processing
section 15, supplied with the results of translation performed by
the automatic translation section 14, i.e., the HTML document
containing Japanese sentences, starts analyzing the HTML document
(S404) to add language tags for enabling discrimination between the
original and translated paragraphs or sentences in the
individual-language HTML document conversion processing section 16
described below in detail.
[0095] The original-versus-version HTML document forming processing
section 15 first recognizes the part of a text of the inputted HTML
document. The original-versus-version HTML document forming
processing section 15 detects an HTML tag such as specifically
shown in FIG. 6 (shown in the space between symbols < and >
in FIG. 6) to recognize the section following this tag as English
texts and Japanese texts (S405).
[0096] Next, the original-versus-version HTML document forming
processing section 15 determines, if the texts recognized in step
S405 are an English paragraph or sentence (original) or a Japanese
paragraph or sentence (translated version) according to the
position of the paragraphs or sentences (in the order of an English
paragraph or sentence, a line break tag, and a Japanese paragraph
or sentence) (step S406). If the original-versus-version HTML
document forming processing section 15 recognizes an English
paragraph or sentence as a result of this determination, it embeds
a language tag <E> at the beginning of the
document-component-unit paragraph or sentence (S407). If it
recognizes a Japanese paragraph or sentence, it embeds a language
tag <J> at the beginning of the document-component-unit
paragraph or sentence (S408).
[0097] The original-versus-version HTML document forming processing
section 15 proceeds to processing of the next paragraph in the
original-versus-version HTML document, and makes a determination as
to whether an HTML tag </html> indicating the end of the HTML
document is detected or not (steps S409, S410). The
original-versus-version HTML document forming processing section 15
repeats the above-described steps S404 to S410 until the HTML tag
indicating the end of the HTML document is detected.
[0098] The original-versus-version HTML document forming processing
section 15 stores in the data storage section 17 an
original-versus-version HTML document obtained as a result of the
original-versus-version HTML document forming processing based on
the above-described sequence of document analysis steps, when the
HTML tag indicating the end of the HTML document is detected. The
state of data thereby stored is shown as an original-versus-version
HTML document source 21 in FIG. 2. In FIG. 2, symbols .DELTA., and
.gradient. indicate English sentences and symbols .largecircle.,
and .circleincircle. indicate Japanese sentences.
[0099] After the original-versus-version HTML document forming
processing performed by the original-versus-version HTML document
forming processing section 15 based on the above-described sequence
of document analysis steps, the display processing section 12
visually displays the stored original-versus-version HTML document
in the upper-lower juxtaposition form on the screen of the display
section 11 (WWW browser 4) (step S411).
[0100] The original-versus-version HTML document source 21 is
displayed in the upper-lower juxtaposition form, as indicated by
reference numerals 20 in FIG. 2. In FIG. 2, "image1" and "image2"
represent figures in the document.
[0101] (Processing for Selectively Displaying English HTML Document
(Original) and Japanese HTML Document (Version))
[0102] In the HTML document processing apparatus, processing
described below is performed after the original-versus-version HTML
document forming processing based on the above-described sequence
of document analysis steps. The processing described below enables
selective display of only one of the English HTML document
(original) and the Japanese HTML document (translated version)
displayed in the upper-lower juxtaposition form on the screen of
the display section 11.
[0103] The user determines display of the document in the
particular language for selective display of the document, and
clicks a button corresponding to the language to be displayed (a
"Japanese" button in this example) (S412 in FIG. 5). In response to
the user's "Japanese" button clicking operation, the HTML document
processing apparatus 1 hands over processing to the
individual-language HTML document conversion processing section 16
to enable selective display of only the English HTML document
(original) or the Japanese HTML document (translated version). In
the case where the objective languages are English and Japanese as
in this example, "English" and "Japanese" buttons (not shown) are
displayed.
[0104] The individual-language HTML document conversion processing
section 16 reads out the original-versus-version HTML document
source 21 to the data storage section 17 as shown in FIG. 2, and
starts analyzing the document (S413). The individual-language HTML
document conversion processing section 16 makes a determination as
to whether or not a language tag <E> or <J> embedded in
the original-versus-version HTML document source 21 has been
detected (S414).
[0105] If the individual-language HTML document conversion
processing section 16 has detected a language tag in the English
original-vs.-Japanese version HTML document, it sets an
invisibility start tag <INVISIBILITY START> as a visibility
control tag immediately before the detected language tag
(S415).
[0106] Thereafter, to enable visual display of only the Japanese
sentences in the English original-vs.-Japanese version HTML
document, the individual-language HTML document conversion
processing section 16 makes a determination as to whether or not
the language tag <J> is detected (S416). If the
individual-language HTML document conversion processing section 16
has detected a language tag <J>, it sets an invisibility end
tag <INVISIBILITY END> as a visibility control tag
immediately after the detected tag (immediately before a translated
sentence) (S417).
[0107] The well-known comment tag may be used as the
above-mentioned visibility control tag. A sentence or paragraph
bracketed by a pair of comment tags is not displayed by the WWW
browser 4.
[0108] If the individual-language HTML document conversion
processing section 16 recognizes the detected language tag as a
language tag other than the language tag <J> in above step
S416, it sets an invisibility end tag <INVISIBILITY END> at
the end of the paragraph (S418). This process is effective in
processing an original-versus-version HTML document containing
sentences written in three or more languages. For example, if the
original-versus-version HTML document contains German sentences
(original) together with English sentences (original) and Japanese
sentences (translated version), this process makes the German
sentences invisible as well as the English sentences.
[0109] An original-versus-version HTML document 22 having
invisibility tags <INVISIBILITY START>, <INVISIBILITY
END> as visibility control tags added thereto as well as
language tags <E> and <J>, as shown in FIG. 2, is
formed in this manner to display Japanese sentences only. FIG. 2
shows this HTML document with source codes.
[0110] If no language tag is detected in above step S414, or after
the completion of processing of the step S417 or 418, the
individual-language HTML document conversion processing section 16
proceeds to processing of the next paragraph in the
original-versus-version HTML document, and makes a determination as
to whether or not an HTML tag </html> indicating the end of
the HTML document has been detected (steps S419, S420). The
individual-language HTML document conversion processing section 16
repeats the above-described steps S413 to S420 until the HTML tag
indicating the end of the HTML document is detected.
[0111] The individual-language HTML document conversion processing
section 16 stores in the data storage section 17 the HTML document
with invisibility tags obtained as a result of the above-described
sequence of process steps for selective display of the
original-versus-version HTML document when the HTML tag indicating
the end of the original-versus-version HTML document is
detected.
[0112] The display processing section 12 then opens the HTML
document with invisibility tags by the WWW browser 4, and visually
displays the document on the screen of the display section 11
(S421). A Japanese sentence HTML display 23 shown in FIG. 2
represents the state of this display on the screen.
[0113] Further, FIG. 2 shows the source code for enabling display
of English sentences only in an original-versus-version HTML
document 24, and shows an HTML display 25 which represents the
state of display on the screen when the HTML document 24 is
displayed by the WWW browser 4.
[0114] [Second Embodiment]
[0115] [Configuration of HTML Document Processing Apparatus]
[0116] FIG. 7 shows functional components of an HTML document
processing apparatus 1 which represents a Embodiment 2 of the
present invention. The HTML document processing apparatus 1 is
connected to a World Wide Web (WWW) server 3 through an IP
communication network, i.e., Internet 2. In this embodiment, the
WWW server 3 holds English home pages.
[0117] The HTML document processing apparatus 1 can be realized by
a personal computer, and has an input section 10, a display section
11, a display processing section 12, a communication control
section 13, an automatic translation processing section 14, an
original-versus-version HTML document forming processing section
15, an individual-language HTML document conversion processing
section 16, a data storage section 17, an HTML document analysis
and conversion processing section 18, an English-Japanese
translation editor (original-versus-version editor program) 19, an
original-versus-version HTML document conversion processing section
40, etc. The HTML document processing apparatus 1 is connected to a
WWW browser 4 provided as an Internet display tool.
[0118] In the HTML document processing apparatus arranged as
described above, the automatic translation processing section 14
translates an English HTML document (an original including
sentences and figures) of an English home page described in English
into a Japanese HTML document (a translated version) described in
Japanese, by using the home page translation functions of the
above-mentioned Internet translation software, and the original and
the results of the translation are visually displayed in such a
form that each of pairs of original and translated paragraphs or
sentences are juxtaposed in upper and lower positions.
[0119] In this translation and display process, the
original-versus-version HTML document forming processing section 15
attaches a language tag, which is a special tag for determination
of the kind of language, to the beginning of each of paragraphs or
sentences which are constituent units of the original document and
the translated document.
[0120] The HTML document analysis and conversion processing section
18 and the English-Japanese translation editor
(original-versus-version editor program) 19 extract data with
respect to each language, when data is edited in an
original-versus-version form. After editing of
original-versus-version data performed by the editor, HTML tags are
restored to form an HTML document from the edited data.
[0121] If an HTML document is translated by Internet translation
software, translation results outputted in an
original-versus-version form cannot be edited. However, document
data prepared by removing HTML tags from the translation results is
supplied to the editor capable of editing data in an
original-versus-version form to enable editing of the translation
results. Also, HTML tags are restored in edit results to enable the
edit results to be used as an HTML document.
[0122] [Operation of HTML Document Processing Apparatus]
[0123] The operation of the HTML document processing apparatus 1 in
Embodiment 2 of the present invention shown in FIG. 7 will now be
described. FIGS. 8 and 9 are flowcharts for explaining the
operation of the HTML document processing apparatus 1.
[0124] (Original-Versus-Version HTML Document Forming
Processing)
[0125] Referring to FIGS. 7 through 9, in the HTML document
processing apparatus 1, an English HTML document (original) of an
English home page described in English is translated, an HTML
document including the formed original and translated version is
analyzed, and an original-versus-version HTML document having
language tags embedded therein is formed and visually displayed.
This document forming and displaying processing is the same as that
performed by the HTML document processing apparatus 1 Embodiment 1
described above, i.e., the process steps S401 to S414 shown in FIG.
4.
[0126] (Original-Versus-Version HTML Document Data Conversion
Processing and Translated Version Edit Processing)
[0127] In the HTML document processing apparatus 1, after the
above-described processing of forming an original-versus-version
HTML document displayed in an upper-lower juxtaposition form on the
screen of the display section 11, data conversion processing and
translated version edit processing can be performed in a process
described below.
[0128] When a user clicks an "original-versus-version edit
processing start" button, the automatic translation processing
section 14 having translation software hands over processing to the
HTML document analysis and conversion processing section 18 (S812
in FIG. 8) to enable editing of the Japanese HTML document
(version) in the original-versus-version HTML document 20 shown in
FIG. 2.
[0129] The HTML document analysis and conversion processing section
18 starts processing for enabling editing using the
English-Japanese translation editor (original-versus-version editor
program) 19 by removing language tags from the
original-versus-version HTML document source 21 having language
tags <E> and <J> added thereto (S813).
[0130] First, the HTML document analysis and conversion processing
section 18 makes a determination as to whether a language tag
<E> or <J> embedded in the original-versus-version HTML
document in the original-versus-version HTML document forming
processing section 15 has been detected (S814).
[0131] If the HTML document analysis and conversion processing
section 18 has detected a language tag <E> or <J> in
the English original-versus-Japanese version HTML document, it
stores in the data storage section 17 language data together with
language tag start and end addresses (S815).
[0132] Thereafter, to enable display and edit of English sentences
in the English original-vs.-Japanese version HTML document, the
HTML document analysis and conversion processing section 18 makes a
determination as to whether or not the detected language tag is a
language tag <E> (S816). If the detected language tag is a
language tag <E>, the HTML document analysis and conversion
processing section 18 stores in an original document storage area
in the data storage section 17, the start address of a document
constituent unit paragraph (or sentence) and the contents of the
paragraph (S817).
[0133] If the result of above determination in step S816 detects a
language tag <J>, the HTML document analysis and conversion
processing section 18 stores in a version storage area in the data
storage section 17, the start address of a document constituent
unit paragraph (or sentence) and the contents of the paragraph
(S818) to enable Japanese sentence display and edit. As a result of
the above process steps S815 to S818, English and Japanese
paragraphs or sentences from which HTML tags have been removed and
the start and end addresses of the paragraphs or sentences are
stored in the data storage section 17.
[0134] If no language tag is detected in above step S814, or after
the completion of processing in above step S817 or 818, the HTML
document analysis and conversion processing section 18 proceeds to
processing of the next paragraph in the original-versus-version
HTML document 21, and makes a determination as to whether or not an
HTML tag </html> indicating the end of the HTML document is
detected (steps S819, S820).
[0135] The HTML document analysis and conversion processing section
18 repeats the above-described steps S813 to S820 in cooperation
with the English-Japanese translation editor
(original-versus-version editor program) 19, until the HTML tag
indicating the end of the HTML document is detected. FIG. 10 shows
an example of an initial window of the English-Japanese translation
editor.
[0136] When the HTML tag indicating the end of the
original-versus-version HTML document is detected, the HTML
document analysis and conversion processing section 18 visually
displays (S821), in the window of the English-Japanese translation
editor (display section 11), the results of the above-described
sequence of process steps for converting the
original-versus-version HTML document data, i.e., English
paragraphs or sentences (original) and Japanese paragraphs or
sentences (translated version) from which the language tags and the
HTML tags have been removed.
[0137] The user edits the Japanese sentences according to his or
her need on the basis of the English and Japanese paragraphs or
sentences visually displayed in the left-right juxtaposition form
in the window of the English-Japanese translation editor. The
symbol .circle-solid. in the translated sentences shown in FIG. 3
designates edited portions. When the user clicks an "end" button in
the English-Japanese translation editor window after suitably
editing the Japanese version, the English-Japanese translation
editor (original-versus-version editor program) 19 stores the
results of editing in the data storage section 17, thereby
completing the edit processing (S822).
[0138] (Processing for Restoration to HTML Document Format)
[0139] In the HTML document processing apparatus 1, after the
above-described original-versus-version HTML document data
conversion processing and translated version editing processing, a
process described below is further performed to enable restoration
to the HTML document format.
[0140] When the user clicks an "HTML display" button in the
English-Japanese translation editor window, the English-Japanese
translation editor (original-versus-version editor program) 19 is
started (S823). By an instruction from the English-Japanese
translation editor 19, the original-versus-version HTML document
conversion processing section 40 starts analyzing the edited
document (S824).
[0141] First, the original-versus-version HTML document conversion
processing section 40 reads out one of the English paragraphs or
sentences (original) or the Japanese paragraphs or sentences
(translated version) stored in the data storage section 17, and
makes a determination as to whether the document constituent unit,
i.e., the paragraph or sentence, is English (original) or Japanese
(translated version) (S825). This determination can be definitely
made based on the language tags stored in the data storage section
17.
[0142] If the result of this determination is that the paragraph or
sentence read out is English, the original-versus-version HTML
document conversion processing section 40 adds the language tag
<E> to the beginning of the paragraph (S826). If the
paragraph read out is Japanese, the original-versus-version HTML
document conversion processing section 40 adds the language tag
<J> to the beginning of the paragraph (S827). The
original-versus-version HTML document conversion processing section
40 substitutes (S828) the paragraph of the edited document for the
corresponding paragraph of the original-versus-version HTML
document before editing by referring to the paragraph start address
stored in the data storage section 17.
[0143] The original-versus-version HTML document conversion
processing section 40 then proceeds to processing of the next
paragraph and makes a determination as to whether or not the end of
the edited document has been reached (steps S829, S830). The
original-versus-version HTML document conversion processing section
40 repeats the above-described steps S824 to S830 until the end of
the edited document is reached.
[0144] The original-versus-version HTML document conversion
processing section 40 stores in the data storage section 17 the
original-versus-version HTML document obtained as a result of the
above-described sequence of HTML document restoration processing
steps. The display processing section 12 opens, by the WWW browser
4, the original-versus-version HTML document stored in the data
storage section 17, thereby making a display reflecting the editing
results (S831).
[0145] When the original-versus-version HTML document is displayed
on the editor, the English paragraphs or sentences and the
positions of the same, and the Japanese paragraphs or sentences and
the positions of the same in the original-versus-version HTML
document are stored in the respective storage areas in the data
storage section 17, and the original-versus-version HTML document
before editing is also held by the data storage section 17. From
the information on the positions of the English and Japanese
paragraphs or sentences stored in the storage areas, the other
portions can be identified as the tags. In this manner, the
positions of the tags can be obtained reciprocally from the
positions of the English and Japanese paragraphs or sentences, and
HTML document restoration processing can be performed by using the
tags in the original-versus-version HTML document before editing
with reference to the tag position information obtained from the
paragraph or sentence positions.
[0146] The formed original-versus-version HTML document can be
displayed in the form of an original-versus-version home page by
the Internet display tool WWW browser 4 if it may formed without
invisibility tags. Selective display processing (see FIG. 5) of the
original-versus-version HTML document may be performed by the
individual-language HTML document conversion processing section 16
if necessary, as in the HTML document processing apparatus 1 in
above-described Embodiment 1, thereby enabling only the Japanese
HTML document (version) obtained by this processing to be visually
displayed by the display section 11.
[0147] [Embodiment 3]
[0148] [Configuration of HTML Document Processing Apparatus]
[0149] An HTML document processing apparatus 1 which represents
Embodiment 3 of the present invention has basically the same
functions and configuration as the HTML document processing
apparatus 1 of Embodiment 2 shown in FIG. 7.
[0150] In the HTML document processing apparatus 1 in the third
embodiment, however, there is no need for the automatic translation
processing section 14 and the original-versus-version HTML document
forming processing section 15.
[0151] In this HTML document processing apparatus 1, the HTML
document analysis and conversion processing section 18 and the
English-Japanese translation editor (original-versus-version editor
program) 19 cooperate to perform data conversion by removing HTML
tags from an English HTML document (original) of an English home
page described in English, thereby enabling editing of the original
and inputting of a translated version. A user edits the editable
English sentences according to his or her need and inputs Japanese
sentences (translated version).
[0152] Also, the original-versus-version HTML document conversion
processing section 40 substitutes edited English and Japanese
sentences for the original English sentences, restores the removed
HTML tags, and sets translation-result Japanese sentences
(translated version) in correspondence with the edited sentences,
thereby enabling visual display of an original-versus-version HTML
document containing a more favorable version.
[0153] [Operation of HTML Document Processing Apparatus]
[0154] (Data Conversion Processing and Original Edit Processing of
English HTML Document (Original))
[0155] FIG. 7 shows the configuration of this HTML document
processing apparatus 1, and FIGS. 11 and 12 are flowcharts for
explaining the operation of this HTML document processing apparatus
1. Referring to FIGS. 7, 11, and 12, a user using the HTML document
processing apparatus 1 searches the Internet for a home page
described in a language which he or she wishes to translate, by
operating the input section 10 and based on the contents of a
display made by the display section 11. In this example, the object
of this search (processing step S1101 in FIG. 11) is an English
home page held by the WWW server 3.
[0156] Next, the user initiates a program for translation in the
desired direction by operating the input section 10 and based on
the contents of the display made by the display section 11. In this
example, the English home page is to be finally translated into a
Japanese HTML document described in Japanese. Therefore, an
English-into-Japanese translation program is started (S1102).
[0157] The user clicks an "original-versus-version edit processing
start" button in an initial window (dialogue window) (not shown) of
the translation program to start the English-Japanese translation
editor (original-versus-version editor program) 19 (S1103).
[0158] The HTML document analysis and conversion processing section
18 starts analyzing (S1104) the English HTML document (original)
supplied from the WWW server 3 through the communication control
section 13. First, the HTML document analysis and conversion
processing section 18 recognizes a text portion of the English HTML
document The HTML document analysis and conversion processing
section 18 detects an HTML tag to recognize the section following
this tag as an English text (S1105).
[0159] Next, the HTML document analysis and conversion processing
section 18 stores in the original storage area in the data storage
section 17 the start address of a document constituent unit
paragraph (or sentence) and the contents of the paragraph (S1106).
As a result of the processing in the above steps S1105 and S1106,
the English paragraph or sentence from which the HTML tag has been
removed is stored in the data storage section 17.
[0160] If HTML tag is detected in above step S1105, or after the
completion of processing in the step S1106, the HTML document
analysis and conversion processing section 18 proceeds to
processing of the next paragraph in the English HTML document, and
makes a determination as to whether or not an HTML tag
</html> indicating the end of the HTML document is detected
(steps S1107, S1108).
[0161] The HTML document analysis and conversion processing section
18 repeats the above-described steps S1104 to S1108 in cooperation
with the English-Japanese translation editor
(original-versus-version editor program) 19 until the HTML tag
indicating the end of the English HTML document is detected.
[0162] When the HTML tag indicating the end of the English HTML
document is detected, the HTML document analysis and conversion
processing section 18 visually displays (S1109) in a left-hand
section of the window of the English-Japanese translation editor
(display section 11), the results of the above-described sequence
of process steps for converting the English HTML document data.
[0163] The user performs manual translation on the basis of the
English paragraphs or sentences (original) visually displayed in
the window of the English-Japanese translation editor, and forms
translated sentences (Japanese sentences) by operating the input
section 10. In this translation processing, the user edits the
English sentences and, if necessary the Japanese translated
sentences according to his or her need. When the user clicks an
"end" button in the English-Japanese translation editor window
after suitably editing the original and completing the Japanese
version, the English-Japanese translation editor
(original-versus-version editor program) 19 stores the results of
editing and translation in the data storage section 17, thereby
completing the processing (S1110).
[0164] (Processing for Restoration to HTML Document Format)
[0165] In the HTML document processing apparatus 1, after the
above-described English HTML document (original) data conversion
processing and original editing processing, a process described
below is next performed to enable restoration processing of the
HTML document format.
[0166] When the user clicks an "HTML display" button in the
English-Japanese translation editor window, the English-Japanese
translation editor (original-versus-version editor program) 19 is
again started (S1111). By an instruction from the English-Japanese
translation editor 19, the original-versus-version HTML document
conversion processing section 40 starts analyzing the resulting
edited document (S1112).
[0167] First, the original-versus-version HTML document conversion
processing section 40 reads out one of the English paragraphs or
sentences (original) stored in the data storage section 17, and
makes a determination as to whether or not the document constituent
unit, i.e., the paragraph (or sentence), is English (original)
(S1113). This determination can be definitely made based on the
storage positions stored in the left-right juxtaposition form in
the data storage section 17.
[0168] If the result of this determination is that the paragraph or
sentence read out is English, the original-versus-version HTML
document conversion processing section 40 adds the language tag
<E> to the beginning of the paragraph (S1114). If the
paragraph read out is Japanese, the original-versus-version HTML
document conversion processing section 40 adds the language tag
<J> to the beginning of the paragraph (S1115).
[0169] The original-versus-version HTML document conversion
processing section 40 substitutes (S1116) the paragraph of the
resulting edited original with the language tag added thereto for
the corresponding original paragraph of the HTML document before
editing, by referring to the paragraph start address stored in the
data storage section 17, and inserts the translated sentence formed
in the above-described step S1110 immediately after the substituted
original paragraph (S1117).
[0170] The original-versus-version HTML document conversion
processing section 40 then proceeds to processing of the next
paragraph and makes a determination as to whether or not the end of
the edited document has been reached (steps S1118, S1119). The
original-versus-version HTML document conversion processing section
40 repeats the above-described steps S1112 to S1119 until the end
of the edited document is reached.
[0171] The original-versus-version HTML document conversion
processing section 40 stores in the data storage section 17 the
original-versus-version HTML document obtained as a result of the
above-described sequence of HTML document restoration processing
steps. The display processing section 12 then opens, by the WWW
browser 4, the original-versus-version HTML document stored in the
data storage section 17, thereby making a display reflecting the
editing results (S1120).
[0172] Also in this embodiment, HTML document restoration
processing can be performed by using the tags in the
original-versus-version HTML document in the same manner as that in
Embodiment 2.
[0173] Also, selective display processing (see FIG. 5) of the
original-versus-version HTML document may be performed by the
individual-language HTML document conversion processing section 16
if necessary, as in the HTML document processing apparatus 1 in the
above-described Embodiment 1, thereby enabling only the Japanese
HTML document (translated version) obtained by this processing to
be visually displayed in the display section 11.
[0174] [Examples of Modification]
[0175] In the HTML document processing apparatus 1 in the
above-described third embodiment, translated sentences are formed
by manual translation. However, an automatic translation process
using the automatic translation processing section 14 may also be
performed in the third embodiment, similar to that in the HTML
document processing apparatuses in the first and second
embodiments. Conversely, manual translation may be performed in the
HTML document processing apparatuses in the first and second
embodiments.
[0176] In each of the HTML document processing apparatuses in the
above-described embodiments, an original written in English, is
translated into Japanese. However, the HTML document processing
apparatus of the present invention can also operate in the same
manner with respect to the respective languages, and can similarly
process an HTML document having sentences written in three or more
languages.
[0177] In the HTML document processing apparatus 1 in the first
embodiment, invisibility tags designating inhibition of visual
display are set as visibility control tags in an HTML document.
However, it is also possible to print only a document expressed in
a particular language selected by setting such tags.
[0178] Each of the above-described processes according to the
present invention can be applied in association with a
computer-readable medium.
[0179] Although only a few embodiments of the present invention
have been described in detail above, those skilled in the art will
readily appreciate that many modifications are possible in the
preferred embodiments without departing from the novel teachings
and advantages of the present invention. Accordingly, all such
modifications are intended to be included within the scope of the
present invention as defined by the following claims.
* * * * *