U.S. patent application number 09/525020 was filed with the patent office on 2001-11-29 for translating apparatus, dictionary search apparatus, and translating method.
This patent application is currently assigned to FUJITSU LIMITED. Invention is credited to Fuji, Masaru.
Application Number | 20010047255 09/525020 |
Document ID | / |
Family ID | 17963588 |
Filed Date | 2001-11-29 |
United States Patent
Application |
20010047255 |
Kind Code |
A1 |
Fuji, Masaru |
November 29, 2001 |
Translating apparatus, dictionary search apparatus, and translating
method
Abstract
If a context process range extending unit cannot obtain context
information required by a context processing unit, from a range to
be translated, it extends the context process range. Then, the
context processing unit performs a context process, and passes
extracted context information to a translation processing unit in
order to perform translation, based on the extended context process
range.
Inventors: |
Fuji, Masaru; (Kawasaki-shi,
JP) |
Correspondence
Address: |
STAAS & HALSEY LLP
700 11TH STREET, NW
SUITE 500
WASHINGTON
DC
20001
US
|
Assignee: |
FUJITSU LIMITED
|
Family ID: |
17963588 |
Appl. No.: |
09/525020 |
Filed: |
March 14, 2000 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
09525020 |
Mar 14, 2000 |
|
|
|
08728852 |
Oct 10, 1996 |
|
|
|
6151570 |
|
|
|
|
Current U.S.
Class: |
704/10 ;
707/999.003; 715/259 |
Current CPC
Class: |
G06F 40/55 20200101;
G06F 40/242 20200101; G06F 40/44 20200101 |
Class at
Publication: |
704/10 ; 707/3;
707/532 |
International
Class: |
G06F 017/21 |
Foreign Application Data
Date |
Code |
Application Number |
Nov 27, 1995 |
JP |
07-306981 |
Claims
What is claimed is:
1. A translating apparatus, comprising: translation target range
setting means for setting a range to be translated in an original
text; context process range setting means for setting a context
process range in the original text which is at least partially
different from the range to be translated; and translation
processing means for translating an original text in the range to
be translated into a translated text, based on a context in the
context process range.
2. A translating apparatus, comprising: input means for inputting a
range to be translated in an original text; display means for
displaying the original text; context process range setting means
for setting a context process range in the original text; context
processing means for performing a context process for the original
text; and translation processing means for translating an original
text in the range to be translated into translated text, based on a
context in the context process range.
3. The translating apparatus according to claim 2, wherein said
context processing means makes a field estimation.
4. The translating apparatus according to claim 2, wherein said
context processing means makes a document type estimation.
5. The translating apparatus according to claim 2, wherein said
context processing means estimates an abbreviation.
6. The translating apparatus according to claim 2, wherein said
context processing means estimates a reference destination.
7. The translating apparatus according to claim 2, further
comprising: translation range identifying means for identifying a
sentence to be translated, based on the range to be translated,
when the range to be translated in the original text, which is
displayed by said displaying means, is input from said input
means.
8. The translating apparatus according to claim 2, further
comprising: translation range identifying means for identifying a
group of sentences to be translated, based on the range to be
translated, when the range to be translated in the original text,
which is displayed by said displaying means, is input from said
input means.
9. The translating apparatus according to claim 7, wherein: if an
inline tag is included in the original text, said translation range
identifying means identifies the sentence to be translated by
ignoring the inline tag.
10. The translating apparatus according to claim 7, wherein: said
displaying means highlights the sentence to be translated, which is
identified by said translation range identifying means.
11. The translating apparatus according to claim 2, wherein: said
displaying means highlights a first word having a plurality of
translated words, and a second word closest to a location of the
first word, among words which identify a translated word of the
first word.
12. The translating apparatus according to claim 7, wherein: said
displaying means highlights the sentence to be translated, which is
identified by said translation range identifying means, by using a
first means, and highlights the context process range set by said
context process range setting means, by using a second means.
13. The translating apparatus according to claim 2, wherein: said
displaying means highlights a word at a reference destination and a
word at a reference source.
14. The translating apparatus according to claim 2, wherein: said
displaying means highlights an abbreviation and its complete
name.
15. The translating apparatus according to claim 2, wherein: if the
translated text cannot be identified for the original text, said
context process range setting means extends the context process
range.
16. The translating apparatus according to claim 15, wherein: said
context process range setting means extends the context process
range in sentences until the translated text can be determined for
the original text.
17. The translating apparatus according to claim 16, wherein: said
context process range setting means extends the context process
range in paragraphs which correspond to the range to be translated,
until the translated text can be determined for the original
text.
18. The translating apparatus according to claim 15, wherein: said
context process range setting means determines a direction to which
the context process range is extended, according to a type of a
context process.
19. The translating apparatus according to claim 16, wherein: said
context process range setting means extends the context process
range in sentences in a forward direction until the translated text
can be determined for the original text.
20. The translating apparatus according to claim 2, wherein: if the
range to be translated in the original text straddles a plurality
of paragraphs, said context process range setting means sets each
context process range in correspondence with each of ranges to be
translated, which are divided in paragraphs.
21. The translating apparatus according to claim 2, wherein: if an
inline tag is included in the original text, said context process
range setting means removes the inline tag from the context process
range.
22. The translating apparatus according to claim 2, wherein: said
context process range setting means comprises: boundary detecting
means for detecting a boundary of context; and terminating means
for terminating an extension of the context process range at the
boundary, when said boundary detecting means detects the boundary
of context.
23. A translating apparatus, comprising: input means for inputting
a range to be translated in an original text; context process range
setting means for setting a context process range in the original
text; context processing means for performing a context process for
the original text; translation processing means for translating at
least that part of the original text which is in the range to be
translated into a translated text, based on context in the context
process range; and displaying means for displaying context
information in correspondence with the range to be translated in
the original text.
24. The translating apparatus according to claim 23, wherein: if
the translated text cannot be determined for the original text,
said displaying means displays a plurality of pieces of context
information, as candidates.
25. The translating apparatus according to claim 23, wherein: the
context information includes a field name, a document type name, a
complete name corresponding to an abbreviation, or a word at a
reference destination.
26. A translating apparatus, comprising: translation target range
setting means for setting a range to be translated in an original
text; context processing means for dividing the range to be
translated into paragraphs in order to perform a context process,
if the range to be translated in the original text straddles a
plurality of paragraphs; and translation processing means for
translating the original text in the range to be translated into a
translated text, based on context of each paragraph.
27. A dictionary search apparatus, comprising: searching means for
searching a translated word for an original word; context process
range setting means for setting a context process range in an
original text; context processing means for performing a context
process for the original text; and determining means for
determining a translated word for the original word, based on
context in the context process range.
28. The dictionary search apparatus according to claim 27, wherein:
if the translated word for the original word cannot be determined,
said context process range setting means extends the context
process range.
29. The dictionary search apparatus according to claim 28, wherein:
said context process range setting means extends the context
process range in sentences until the translated word for the
original word can be determined.
30. The dictionary search apparatus according to claim 27, wherein:
said context processing means makes a field estimation.
31. The dictionary search apparatus according to claim 27, wherein:
said context processing means estimates an abbreviation.
32. A dictionary search apparatus, comprising: specifying means for
specifying a word to be searched; context process range setting
means for setting a context process range; context processing means
for performing a context process in the context process range; and
determining means for determining a definition of the word to be
searched, based on context of the context process range.
33. A translating method, comprising the steps of: specifying a
portion of an original text as a range to be translated;
referencing context of the original text outside the range to be
translated, which is stored in a storage apparatus; and translating
the original text within the range to be translated.
34. A translating method, comprising the steps of: referencing a
second original word outside a range to be translated in an
original text, if a first translated word belonging to a first
field and a second translated word belonging to a second field
exist for a first original word in the range to be translated in
the original text; and selecting the first translated word for the
first original word, if the second original word belongs to the
first field.
35. A data conversion apparatus, comprising: conversion range
setting means for setting a data conversion range; context process
range setting means for setting a context process range where a
context process is performed; and conversion processing means for
converting the data, based on context of the context process
range.
36. A computer-readable storage medium used to direct a computer to
perform the functions of: specifying a portion of an original text
as a range to be translated; referencing context of the original
text outside the range to be translated, which is stored in a
storage apparatus; and translating the original text within the
range to be translated.
Description
FIELD OF THE INVENTION
[0001] The present invention relates to a translating apparatus, a
dictionary search apparatus, and a translating method, which are
preferably applied for when translation is to be efficiently
performed based on context in a machine translating process.
DESCRIPTION OF THE RELATED ART
[0002] Conventional machine translation is performed in sentences
due to the limitation of a processing capability. For example, if a
text written in a foreign language is translated, a grammatical
structure of the text to be translated and a knowledge of the words
used are necessary.
[0003] However, an understandable text can often not be obtained if
context information, that is, a meaning of one sentence to be
translated in the entire text, is not properly interpreted, even if
the grammatical structure of the text to be translated, and the
knowledge of words, etc. used are known.
[0004] If it cannot be determined whether the English word "car"
means either an "automobile" or a "railroad car" from the
information included in a sentence, a sentence preceding or
succeeding the sentence in question, or the context of the entire
text must be referenced in order to determine the meaning of the
one word.
[0005] Actually, not only the simple selection of a translated
word, but also an interpretation of a complex meaning is required
to be made based on the context.
[0006] In recent years, a short sentence has been able to be
analyzed with high accuracy, thereby performing translation with
higher accuracy according to a context process.
[0007] This context process enables the following processes to be
performed, which could not be realized by using only a
sentence.
[0008] Selecting a translated word by estimating a field of a
document to be translated
[0009] Selecting a suitable expression method by estimating a type
of a document to be translated
[0010] Removing ambiguity existing in a reference relationship
[0011] Completing an abbreviation
[0012] A mechanism and a method for performing a translation
process using context information for a document to be translated,
have been developed. Additionally, a method or apparatus for
learning context information, and a translating apparatus for
performing a translation process using learned context information,
have been developed.
[0013] However, the translation process using a conventional
context process is performed only when an entire document is
specified as a translation target.
[0014] That is, if a portion of a document is specified as a
translation target, the context process is not performed.
[0015] Therefore, when a portion of a document such as one
sentence, one paragraph, etc. is translated, the entire document
must be specified as a translation target in order to obtain
accurate translation using the context process. As a result, it
takes a large amount of time to perform the process.
SUMMARY OF THE INVENTION
[0016] An object of the present invention is to provide a
translating apparatus, a dictionary search apparatus, and a
translating method for translating a portion of a document
efficiently and accurately.
[0017] According to a feature of the present invention, a range to
be translated and a context process range in original text are set
separately. Therefore, even if only a portion of a document is
specified as a range to be translated, it can be translated
efficiently and accurately by referencing context; outside the
range to be translated.
[0018] According to an additional feature of the present invention,
a range to be translated and a context process range in original
text are separately set to perform a context process. As a result,
a translation process is performed by referencing context outside
the range to be translated, thereby translating the portion of a
document efficiently and accurately.
[0019] According to a further feature of the present invention,
field estimation is made as a context process. Even if there is no
word identifying a field in a range to be translated, the field can
be identified according to context outside the range to be
translated. As a result, a suitable translation process depending
on a field can be performed.
[0020] According to a still further feature of the present
invention, document type estimation is made as a context process.
Even if there is no expression which allows a document type to be
identified in a range to be translated, a suitable translation
process depending on a document type can be performed by
identifying the document type according to context outside the
range to be translated.
[0021] According to a still further feature of the present
invention, an abbreviation is estimated as a context process. Even
if there is no complete form of the abbreviation in a range to be
translated, it can be searched according to context outside the
range to be translated. As a result, a translation process can be
performed with completion of an abbreviation.
[0022] According to a still further feature of the present
invention, a reference destination is estimated as a context
process. Even if a range to be translated includes no word pointed
to by a pronoun included in this range, a translation process can
be performed while replacing the pronoun depending on need, by
searching the word pointed to by the pronoun according to context
outside the range to be translated.
[0023] According to a still further feature of the present
invention, a sentence to be translated is identified according to a
range to be translated. As a result, a translation process can be
performed accurately.
[0024] According to a still further feature of the present
invention, a group of sentences to be translated is identified
according to a range to be translated. As a result, a translation
process can be performed accurately.
[0025] According to a still further feature of the present
invention, a sentence to be translated is identified by ignoring an
inline tag. That is, a translation process can be accurately
performed by removing a portion unrelated to translation.
[0026] According to a still further feature of the present
invention, a sentence to be translated is highlighted. As a result,
a translation process can be performed while recognizing the
sentence to be translated.
[0027] According to a still further feature of the present
invention, a first word having a plurality of translated words, and
one of second words closest to the location of the first word among
second words representing the translated words of the first word,
are highlighted. As a result, a state of a context process can be
recognized with ease by displaying context information simply.
[0028] According to a still further feature of the present
invention, a sentence to be translated is highlighted with a first
method, and a context process range is highlighted with a second
method. As a result, a translation process can be performed while
recognizing the context process range, by making a distinction
between the sentence to be translated and the context process
range.
[0029] According to a still further feature of the present
invention, a word at a reference destination and a word at a
reference source are highlighted. As a result, a translation
process can be performed while recognizing a state of a context
process.
[0030] According to a still further feature of the present
invention, an abbreviation and its complete form are highlighted.
As a result, a translation process can be performed while
recognizing a state of a context process.
[0031] According to a still further feature of the present
invention, a context process range is extended if a translated
sentence corresponding to an original sentence cannot be
identified. If context information which allows a translated
sentence to be identified is not included in a range to be
translated, the translated sentence can be identified according to
context outside the range to be translated. As a result, a portion
of a document can be translated efficiently and accurately.
[0032] According to a still further feature of the present
invention, a context process range is extended in sentences until a
sentence to be translated corresponding to an original sentence can
be determined. As a result, a portion of a document can be
translated efficiently and accurately by performing a context
process such as estimating an abbreviation, estimating a reference
destination, etc. efficiently.
[0033] According to a still further feature of the present
invention, a context process range is extended in paragraphs in
correspondence with a range to be translated until a translated
sentence corresponding to an original sentence can be determined.
As a result, a portion of a document can be translated efficiently
and accurately by performing a context process such as estimating a
field, estimating a document type, etc. efficiently.
[0034] According to a still further feature of the present
invention, a direction in which a context process range is extended
is determined according to a type of a context process. As a
result, the context process is efficiently performed, thereby
translating a portion of a document efficiently and accurately.
[0035] According to a still further feature of the present
invention, a context process range is extended in sentences in a
forward direction until a translated sentence corresponding to an
original sentence can be determined. The context process such as
estimating a reference destination is performed efficiently,
thereby translating a portion of a document efficiently and
accurately.
[0036] According to a still further feature of the present
invention, a context process range is set in correspondence with a
range to be translated separated into paragraphs, if the range to
be translated in original text straddles a plurality of paragraphs.
As a result, even if a topic changes at the beginning of a next
paragraph, a context process can be performed accurately by
extracting context corresponding to the range to be translated
correctly, thereby performing a context process correctly.
[0037] According to a still further feature of the present
invention, an inline tag is removed from a context process range.
That is, by removing a portion unrelated to context, a context
process can be performed efficiently.
[0038] According to a still further feature of the present
invention, a context process range is stopped from extending at a
boundary of context. As a result, a context process can be
performed accurately and quickly by performing the context process
in a context process range including context corresponding to a
range to be translated.
[0039] According to a still further feature of the present
invention, context information is displayed in correspondence with
a range to be translated in original text. As a result, a
translation process can be performed while recognizing a state of a
context process in the range to be translated.
[0040] According to a still further feature of the present
invention, a plurality of pieces of context information are
displayed as candidates if a translated sentence corresponding to
an original sentence cannot be determined. As a result, a
translation process can be performed while recognizing a state of a
context process being performed in a range to be translated.
[0041] According to a still further feature of the present
invention, context information includes a field name, document
type, complete form of an abbreviation, or a word at a reference
destination. As a result, a translation process can be performed
while recognizing the field name, document type, complete form, or
the reference destination.
[0042] According to a still further feature of the present
invention, if a range to be translated in original text straddles a
plurality of paragraphs, it is divided into a plurality of
paragraphs to perform a context process. As a result, even if a
topic changes at the beginning of a next paragraph, context
corresponding to a range to be translated can be extracted
correctly. Therefore, a translation process can be performed
accurately.
[0043] According to a still further feature of the present
invention, a context process of original text is performed based on
a context process range which is set, and a translated word
corresponding to an original word is determined based on text in
the context process range. As a result, a dictionary search is
properly performed in consideration of context.
[0044] According to a still further feature of the present
invention, a context process range is extended if a translated word
corresponding to an original word cannot be determined. As a
result, a dictionary search can be properly made by allowing a
context process to be performed efficiently.
[0045] According to a still further feature of the present
invention, a context process range is extended in sentences until a
translated word corresponding to an original word can be
determined. That is, by allowing a context process such as
estimating an abbreviation, etc., to be performed efficiently, a
dictionary search can be made efficiently and accurately.
[0046] According to a still further feature of the present
invention, field estimation is made as a context process. If there
are a plurality of translated words corresponding to an original
word, the field of the original word is estimated with a context
process in order to select a translated word corresponding to the
estimated field. As a result, a dictionary search is performed
accurately.
[0047] According to a still further feature of the present
invention, an abbreviated expression is estimated as a context
process. By allowing a complete form of an abbreviation to be
searched, a dictionary search can be performed accurately.
[0048] According to a still further feature of the present
invention, a meaning of a word to be searched is determined
according to context in a context process range. As a result, a
dictionary search is performed in consideration of context, thereby
performing the dictionary search accurately.
[0049] According to a still further feature of the present
invention, context of original text outside a range to be
translated is referenced when a portion of the original text is
translated. As a result, even if the portion of the original text
is specified as the range to be translated, the context outside the
range to be translated is referenced. Therefore, the portion of the
document can be translated efficiently and accurately.
[0050] According to a still further feature of the present
invention, if there are first and second translated words
respectively belonging to first and second fields, which correspond
to a first original word in a range to be translated of original
text, a second original word outside the range to be translated of
the original text is referenced. If the second original word
belongs to the first field, the first translated word is selected
for the first original word. As a result, the field of the first
original word in the range to be translated is estimated based on
context outside the range to be translated, thereby determining a
translated word of the first original word based on the estimated
field.
[0051] According to a still further feature of the present
invention, data in a conversion range is converted based on context
in a context process range. Data is converted in consideration of
information other than the data in the conversion range, thereby
converting the data accurately.
BRIEF DESCRIPTION OF THE DRAWINGS
[0052] FIG. 1 is a block diagram showing the configuration of a
data conversion apparatus according to an embodiment of the present
invention;
[0053] FIG. 2 is a block diagram showing the configuration of a
translating apparatus according to the embodiment of the present
invention;
[0054] FIG. 3 is a schematic diagram showing the configuration of a
computer system of the translation apparatus according to the
embodiment of the present invention;
[0055] FIG. 4 is a flowchart showing operations performed by a
translating apparatus according to a first embodiment of the
present invention;
[0056] FIG. 5 shows an example of original text input to the
translating apparatus according to the first embodiment of the
present invention;
[0057] FIG. 6 shows a correspondence between an original sentence
(English) input to the translating apparatus according to the first
embodiment of the present invention and a translated sentence
(Japanese) output from the translating apparatus;
[0058] FIG. 7 shows a correspondence table between original words
(English) and corresponding translated words (Japanese) provided by
the translating apparatus according to the first embodiment of the
present invention;
[0059] FIG. 8 shows a correspondence table between an original
sentence (English) input to the translating apparatus according to
the first embodiment of the present invention and a translated
sentence (French) output from the translating apparatus;
[0060] FIG. 9 shows a correspondence table between an original word
(English) and corresponding translated words in respective fields
(French), provided by the translating apparatus according to the
first embodiment of the present invention;
[0061] FIG. 10 shows a correspondence table between a document type
and an expression method of translation, implemented by the
translating apparatus according to a second embodiment of the
present invention;
[0062] FIG. 11 is a flowchart showing operations performed in a
context process according to a third embodiment of the present
invention;
[0063] FIG. 12 is a flowchart showing operations performed in a
context process according to a fourth embodiment of the present
invention;
[0064] FIG. 13 is a schematic diagram showing a method for
specifying a translation portion of an original text input to the
translation apparatus according to the first embodiment of the
present invention;
[0065] FIG. 14 shows a method for specifying a translation range of
original text input to the translating apparatus according to the
second embodiment of the present invention;
[0066] FIG. 15 shows an example of original text into which inline
tags are input to the translating apparatus according to the first
embodiment of the present invention;
[0067] FIG. 16 shows an example of original text into which inline
tags are input to the translating apparatus according to the second
embodiment of the present invention;
[0068] FIGS. 17A to 17C show a method for extending a context
process range by the translating apparatus according to the second
embodiment of the present invention;
[0069] FIG. 18 shows a correspondence table between a method for
extending a context process range and a context process type,
implemented by the translating apparatus according to the third
embodiment of the present invention;
[0070] FIG. 19 is a flowchart showing the method for extending a
context process range according to the third embodiment of the
present invention;
[0071] FIG. 20 shows a state where context information is displayed
for original text input to the translating apparatus according to
the first embodiment of the present invention;
[0072] FIG. 21 shows a state where context information is displayed
for the original text input to the translating apparatus according
to the second embodiment of the present invention;
[0073] FIG. 22 shows a state where context information is displayed
for the original text input to the translating apparatus according
to the third embodiment of the present invention;
[0074] FIG. 23 shows a state where context information is displayed
for the original text input to the translating apparatus according
to the fourth embodiment of the present invention;
[0075] FIG. 24 is a block diagram showing the configuration of a
dictionary search apparatus according to an embodiment of the
present invention;
[0076] FIG. 25 shows a method for specifying an original word input
to the dictionary search apparatus according to the embodiment of
the present invention;
[0077] FIG. 26 is a flowchart showing operations performed by the
dictionary search apparatus according to a first embodiment of the
present invention; and
[0078] FIG. 27 is a flowchart showing operations performed by the
dictionary search apparatus according to a second embodiment of the
present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0079] FIG. 1 is a block diagram showing the configuration of a
data conversion apparatus according to an embodiment of the present
invention. The data conversion apparatus shown in this figure
comprises a conversion range setting unit 100 for setting a
conversion range of data, a context process range setting unit 101
for setting a context process range where a context process is
performed, and a conversion processing unit 102 for converting the
data in consideration of context in the context process range. As a
result, data can be converted in consideration of context
information obtained from other than data in a conversion range,
thereby converting the data accurately.
[0080] Provided next is the explanation about a translating
apparatus according to an embodiment of the present invention. FIG.
2 is a simplified block diagram showing configuration of the
translating apparatus according to the embodiment of the present
invention. This apparatus is intended to quickly perform a
translation process by obtaining appropriate context from a
neighborhood of a range to be translated depending on need, when a
portion of a document is specified as the range to be
translated.
[0081] In FIG. 2, a translation processing apparatus 9 comprises a
user interface 8, translation range identifying unit 4, context
processing unit 5, translation processing unit 6, and a context
process range extending unit 7. The user interface 8 comprises an
original text displaying unit 1 and an input unit 2.
[0082] The original text displaying unit 1 is intended to display
original text stored in a storing unit 3 for a translation
operator. The original text displaying unit 1 is implemented by,
for example, a CRT display.
[0083] The input unit 2 is intended to accept a translation
instruction given by a translation operator. The translation
operator instructs the input unit 2 of a range to be translated
while watching the original text displaying unit 1. The input unit
2 is implemented by, for example, a mouse, a keyboard, etc.
[0084] The storing unit 3 is intended to store original text, which
is implemented by, for example, a hard disk unit, optical disk
unit, etc.
[0085] The translation range identifying unit 4 is intended to
extract a portion to be translated in original text from the
storing unit 3, according to a translation instruction from the
input unit 2. Note that a system instructing a sentence to be
translated using a sentence number, etc. does not always require
the translation range identifying unit 4.
[0086] The context processing unit 5 is intended to perform a
context process in a context process range specified by the context
process range extending unit 7 for a sentence to be translated
identified by the translation range identifying unit 4.
[0087] The translation processing unit 6 is intended to translate a
sentence to be translated according to context obtained by the
context processing unit 5, and output a translated sentence.
[0088] The context process range extending unit 7 is intended to
extend a context process range where a context is retrieved by
reading an original text around a range to be translated from the
storing unit 3, if context information required by the context
processing unit 5 cannot be obtained from a sentence to be
translated which is identified by the translation range identifying
unit 4.
[0089] Provided next is the explanation about a computer system
implemented as the translating apparatus according to the
embodiment of the present invention. FIG. 3 shows an example of the
translating apparatus implemented by a computer system according to
the embodiment of the present invention.
[0090] In FIG. 3, a CPU 111, a memory 112, an input device 113, a
display 114, and a disk device 115 are connected via a bus 110. For
example, the input unit; 2 is implemented by the input device 113,
the original text displaying unit 1 is implemented by the display
114, and the original text storing unit 3 is; implemented by the
disk device 115. The translation range identifying unit 4, context
processing unit 5, translation processing unit 6 and the context
process range extending unit 7 are implemented by, for example, the
CPU 111 and the memory 112.
[0091] Provided next is the explanation about operations performed
by the translating apparatus according to the embodiment of the
present invention, by referring to the drawings. FIG. 4 is a
flowchart showing the operations performed by the translating
apparatus according to the embodiment of the present invention.
[0092] In FIG. 4, a translation operator inputs a translation
instruction to an input unit 2, and the input unit 2 accepts the
translation instruction in step S1.
[0093] In step 2, the translation range identifying unit 4
identifies a sentence to be translated in a document to be
translated according to the translation instruction input to the
input unit 2.
[0094] In step S3, a context processing unit 5 performs a context
process for the sentence to be translated identified by the
translation range identifying unit 4 to extract context
information.
[0095] In step S4, it is determined whether or not the context
information extracted in step S3 is sufficient for performing a
translation process by the translation processing unit 6. If the
context information is insufficient for the translation process,
the process goes to step S5.
[0096] Then, in step S5, a context process range extending unit 7
extends a context process range where the context process will be
performed, beyond a sentence to be translated, and passes the
original text in the extended context process range to a context
processing unit 5.
[0097] If it is determined that the context information is
sufficient for performing the translation process in step S4, the
context processing unit 5 passes sufficient context information for
performing the translation process as well as the document
including the sentence to be translated identified by the
translation range identifying unit 4, to the translation processing
unit 6. Then, the process goes to step S6.
[0098] In step S6, the translation processing unit 6 translates the
sentence to be translated included in the document to be
translated, based on the context information passed from the
context processing unit 5.
[0099] As described above, by separately setting a range to be
translated and a context process range in original text, context
outside the range to be translated can be referenced even if only a
portion of a document is specified as a range to be translated. As
a result, the portion of the document can be translated efficiently
and accurately.
[0100] Provided next is the explanation about a context process
according to the first embodiment of the present invention, by
referring to the drawings. The context process according to the
first embodiment of the present invention is intended to estimate a
field. Note that the following explanation is based on the
assumption that a document to be translated includes sentences,
each of which terminates in one line and is numbered.
[0101] Provided first is the explanation about an example of
translating English text into the equivalent Japanese.
[0102] As shown in FIG. 5, an original text displaying unit 1 is
assumed to display an original sentence numbered 31 "The talks were
held in the Upper House.", and an original sentence numbered 32
"This has forced the party to change its stance."
[0103] A translation operator inputs the number of a desired
sentence to be translated to an input unit 2, and issues a
translation instruction. If the translation operator specifies the
sentence numbered 32, the original sentence numbered 32 "This has
forced the party to change its stance." will be a translation
target.
[0104] Then, the context processing unit 5 performs a context
process for the original sentence numbered 32. In the context
process, a dictionary is consulted for each original word included
in the original sentence numbered 32. As a result of consulting the
dictionary, the translated word "" is selected for the original
word "This"; the translated word "" is selected for the original
word "forced"; the translated word "" or "" is selected for the
original word "party"; the translated word "" is selected for the
original word "change"; the translated word "" is selected for the
original word "its"; and the translated word "" or "" is selected
for the original word "stance", as shown in FIG. 6.
[0105] Furthermore, the original word "has" is recognized as an
auxiliary verb, the original word "the" is recognized as a definite
article, and the original word "to" is recognized as a function
word.
[0106] Since there are a plurality of candidates for the original
words "party" and "stance", a suitable translated word must be
selected using a context process. In the meantime, there is only
one translated word corresponding to each of the original words
"This", "forced", "change", and "its". Accordingly, a Japanese
equivalent for each of the original words is determined.
[0107] Then, field estimation is made in order to select a suitable
translated word for each of the original words "party" and
"stance".
[0108] FIG. 7 shows a correspondence table between the translated
words of each of the original words "party" and "stance" and the
corresponding fields.
[0109] In FIG. 7, if the Japanese word "" is selected as a
translated word of the original word "party", the original sentence
indicates a "general" field. If the translated word "" is selected
as a translated word of the original word "party", the original
sentence indicates a "political" field.
[0110] Furthermore, if the word "" is selected as a translated word
of the original word "stance", the original sentence indicates the
"general" field. Or, if the word "" is selected as a translated
word of the original word "stance", the original sentence indicates
a "sports" field. Note that the "general" field indicates a word
available for all fields.
[0111] Accordingly, since the "political" and "sports" fields are
candidates for the fields of the original words "party" and
"stance" for which a translated word must be selected, a
corresponding field cannot be determined uniquely.
[0112] As a result, if field estimation is made within a range of
the original sentence numbered 32, this range includes no
translated word indicating the political field or the sports field
only. Therefore, a suitable translated word cannot be determined
uniquely for each of the original words "party" and "stance".
[0113] If context cannot be determined in a range to be translated
included in the original text, a context process range is extended
in order to analyze context beyond the context process range of the
original sentence to be translated.
[0114] Assume that a sentence preceding the sentence to be
translated shown in FIG. 5 is added to the context: process range.
That is, the context process range extending unit 7 extracts the
original sentence numbered 31 "The talks were held in the Upper
House.", and passes the extracted sentence to the context
processing unit 5.
[0115] The context processing unit 5 which receives the original
sentence numbered 31, consults a dictionary for each original word
of this sentence. As a result, a compound "Upper House" is
retrieved from the original sentence numbered 31. Since the
compound "Upper House" indicates the "political" field only, it is
determined that the original sentence to be translated includes
context related to the political field. Note that context
information other than field information can be also extracted from
the original sentence numbered 31.
[0116] As a result of the context process in the extended context
process range, sufficient context related to the original sentence
to be translated can be obtained. This context information is
therefore passed to the translation processing unit 6 to perform
the translation process.
[0117] Then, the translation processing unit 6 performs the
translation process according to the context information, that is,
the political field. The word "" used in the political field is
selected as a translated word of the original word "party". In
addition, the translated word "" used in the sports field is
deleted, and the translated word "" available for all fields is
selected.
[0118] As a result, the translated sentence "" can be obtained for
the original sentence numbered 32 "This has forced the party to
change its stance." Provided next is the explanation about an
example of translating English text into the equivalent French.
[0119] As shown in FIG. 5, the original text displaying unit 1 is
assumed to display the original sentence numbered 31 "The talks
were held in the Upper House.", and the original sentence numbered
32 "This has forced the party to change its stance.".
[0120] A translation operator inputs a number of a sentence to be
translated to the input unit 2, and issues a translation
instruction. Assuming that the translation operator specifies the
sentence number 32, the original sentence numbered 32 "This has
forced the party to change its stance." will be a translation
target.
[0121] The context processing unit 5 then performs a context
process for the original sentence numbered 32. With the context
process, a dictionary is consulted for each of the original words
included in the original sentence numbered 32. As a result of
consulting the dictionary: a translated word "Ceci" is selected for
the original word "This"; a translated word "a" is selected for the
original word "has"; a translated word "force" is selected for the
original word "forced"; a translated word "la" is selected for the
original word "the"; a translated word "parti" or "groupe" is
selected for the original word "party"; a translated word "a" is
selected for the original word "to"; a translated word "changer" is
selected for the original word "change"; a translated word "sa" is
selected for the original word "its"; and a translated word
"position" is selected for the original word "stance", as shown in
FIG. 8.
[0122] Since there are a plurality of candidates for the original
word "party", a suitable translated word must be selected with the
context process. In the meantime, there is only one equivalent for
the original words "This", "has", "forced", "the", "to", "change",
"its", and "stance". Accordingly, a translated word for each of the
original words is determined uniquely.
[0123] Then, field estimation is made for selecting a suitable
translated word for the original word "party".
[0124] FIG. 9 shows a correspondence table between a translated
word of the original word "party" and a field where the translated
word is used.
[0125] In FIG. 9, if a word "parti" is selected as a translated
word of the original word "party", the original sentence indicates
the "political" field. Or, if the word "groupe" is selected as the
translated word of the original word "party", the original sentence
indicates the "general" field. The "general" field indicates a word
available for all fields.
[0126] That is, the "political" and "general" fields will be
candidates for the field of the original word "party" for which a
translated word must be selected. Therefore, a corresponding field
cannot be determined uniquely.
[0127] Therefore, even if the field estimation is made within a
range to be translated in the original sentence numbered 32, this
range includes no translated word indicating the political field
only. As a result, a suitable translated word cannot be uniquely
determined for the original word "party".
[0128] If context cannot be determined within a range to be
translated in desired original text, a context process range is
extended for analyzing the context beyond the range to be
translated in the desired original text.
[0129] Assume that a sentence preceding the sentence to be
translated is added to the context process range in FIG. 5. That
is, the context process range extending unit 7 extracts the
original sentence numbered 31 "The talks were held in the Upper
House.", and passes the extracted sentence to the context
processing unit 5.
[0130] The context processing unit 5 which receives the original
sentence numbered 31, consults a dictionary for each original word
of the sentence numbered 31 As a result, the compound "Upper House"
is retrieved from the original sentence numbered 31. Since the
compound "Upper House" indicates the "political" field only, it is
determined that the original sentence to be translated includes
context related to the political field. Note that context
information other than field information can be also extracted from
the original sentence numbered 31.
[0131] As a result of the context process performed for the
extended context process range, sufficient context related to the
original sentence to be translated can be obtained. Therefore, this
context information is passed to the translation processing unit 6
in order to perform a translation process.
[0132] Then, the translation processing unit 6 performs the
translation process based on the context information, that is, the
political field. The word "parti" used in the political field is
selected as a translated word of the original word "party".
[0133] As a result, a translated sentence "Ceci a force la parti
changer sa position." can be obtained for the original sentence
"This has forced the party to change its stance."
[0134] As described above, even if there is no translated word
identifying a field in a range to be translated, the field can be
identified from context outside the range to be translated.
Therefore, a suitable translation process depending on a field can
be performed.
[0135] Provided next is the explanation about a context process
according to the second embodiment of the present invention, by
referring to the drawings. This context process is intended to
estimate a document type.
[0136] As a document type, a political article, thesis, operating
manual, etc. can be specified. Translation is sometimes performed
in a "da/dearu" tone, or performed without using a personal
pronoun, depending on a document type.
[0137] FIG. 10 shows a correspondence table between a field and a
document type when English text is translated into Japanese text.
In this figure, if the original text is a political article,
translation is performed in a "da/dearu" tone, and in a passive
voice. If the original text is an operating manual, translation is
performed in a "desu/masu" tone, and in an imperative form. Here,
the imperative form and the passive voice are default expressions
of translated text corresponding to an English expression such as
"must be", etc.
[0138] For example, the field of the original text shown in FIG. 5
is the political field. Accordingly, the context processing unit 5
references the column of the political article shown in FIG. 10,
and instructs the translation processing unit 6 to translate the
article in the "da/dearu" tone and in a passive voice.
[0139] The translation processing unit 6 receives document type
information as well as other context information from the context
processing unit 5, and performs the translation process.
[0140] As described above, even if a range to be translated
includes no expression identifying a document type, it can be
identified based on context outside the range to be translated by
estimating a document type. As a result, a suitable translation
process depending on a document type can be performed.
[0141] Provided next is the explanation about a context process
according to the third embodiment of the present invention, by
referring to the drawings. This context process is intended to
complete an abbreviation.
[0142] The original text displaying unit 1 is assumed to display
the original text "In the last election, the Japan Socialist Party
(JSP) enjoyed victory. The reason for JSP's victory was . . .
".
[0143] A translation operator instructs the input unit 2 to
translate the sentence beginning with "The reason . . . " as a
sentence to be translated.
[0144] Then, the context processing unit 5 performs a context
process for this sentence. With the context process, a dictionary
is consulted for each of the words included in the sentence
beginning with "The reason . . . ". As a result of consulting the
dictionary, the word "JSP" is recognized as an unregistered
word.
[0145] The meaning of the unregistered word "JSP" cannot be
determined according to this sentence only.
[0146] Therefore, if context cannot be determined within a range to
be translated in original text to be translated, a context process
range in the original text to be translated is extended in order to
analyze the context beyond the range to be translated in the
original text to be translated.
[0147] Assume that a sentence preceding the sentence to be
translated is added to the context process range. That is, the
context process range extending unit 7 extracts the original
sentence "In the last election, the Japan Socialist Party (JSP)
enjoyed victory.", and passes the extracted sentence to the context
processing unit 5.
[0148] The context processing unit 5 makes a comparison between the
original sentence "In the last election, the Japan Socialist Party
(JSP) enjoyed victory." and the sentence to be translated including
the unregistered word "JSP".
[0149] As a result, the unregistered word "JSP" is determined as an
abbreviation of the proper noun "Japan Socialist Party".
[0150] To make such a comparison, factors such as that each of the
words in the proper noun "Japan Socialist Party" begins with an
uppercase letter, that the words beginning with the uppercase
letters are in sequence, and that the abbreviation "JSP" generated
by combining only the uppercase letters of the respective words is
included, are used.
[0151] As described above, the meaning of the abbreviation can be
clarified according to context of a neighborhood.
[0152] As a result of the context process performed in the extended
context process range, the meaning of the abbreviation can be
identified. Therefore, the result of the identification is passed
to the translation processing unit 6 to perform a translation
process.
[0153] The translation processing unit 6 then performs the
translation process by using a translated word of the noun "Japan
Socialist Party" as that of "JSP".
[0154] As a result, the original sentence is translated as ". . . "
if the context process is not performed. In the meantime, the
sentence is translated as ". . . " if the context process is
performed. That is, a translated sentence that is easy to read can
be obtained.
[0155] FIG. 11 is a flowchart showing a context process according
to the third embodiment of the present invention.
[0156] In FIG. 11, the context processing unit 5 consults a
dictionary for each word included in a sentence to be translated in
step S11.
[0157] Then, it is determined whether or not an abbreviation is
included in the sentence to be translated in step S12. If "YES",
the process goes to step S13.
[0158] In step S13, it is determined whether or not an expression
which completes the abbreviation exists in a context process range.
If "NO", the process goes to step S14.
[0159] In step S14, the context process range extending unit 7
extends the context process range.
[0160] The context processing unit 5 then consults a dictionary for
each word included in a sentence in the extended context process
range in step S15. After that, the process goes back to step
S13.
[0161] If it is determined that the expression which completes the
abbreviation exists in the context process range in step S13, the
process goes to step S16.
[0162] In step S16, the abbreviation is completed in the sentence
to be translated according to the expression which completes the
abbreviation in the context process range.
[0163] In step S17, the translation processing unit 6 performs a
translation process in consideration of the completed form of the
abbreviation.
[0164] In the meantime, if there is no abbreviation in the sentence
to be translated in step S12, the process skips to step S17 to
perform a translation process without considering a completion of
the abbreviation.
[0165] As described above, even if a range to be translated
includes no complete form of an abbreviation, it can be retrieved
from context outside the range to be translated by estimating the
abbreviation. As a result, a translation process can be performed
with completion of the abbreviation, depending on need.
[0166] Provided next is the explanation about a context process
according to the fourth embodiment of the present invention, by
referring to the drawings. This context process is intended to
identify a reference destination.
[0167] Assume that the original text displaying unit 1 displays the
original text "In the last election, the Japan Socialist Party
enjoyed victory. The reason for its victory was . . . " .
[0168] A translation operator instructs the input unit 2 to
translate the sentence beginning with "The reason . . . " as a
sentence to be translated.
[0169] Then, the context processing unit 5 performs a context
process for this sentence. With the context process, a dictionary
is consulted for each of the original words included in the
sentence beginning with "The reason . . . " . As a result of
consulting the dictionary, what the pronoun "its" indicates is
determined as being indefinite.
[0170] What the pronoun "its" indicates cannot be determined from
only the sentence to be translated which begins with "The reason .
. . " .
[0171] Therefore, if context cannot be determined within a range to
be translated in original text to be translated, a context process
range is extended in order to analyze context beyond the range to
be translated in the original text to be translated.
[0172] Assume that a sentence preceding the sentence to be
translated is added to the context process range. That is, the
context process range extending unit 7 extracts the original
sentence "In the last election, the Japan Socialist Party enjoyed
victory.", and passes the extracted sentence to the context
processing unit 5.
[0173] The context processing unit 5 then performs a context
process for the original sentence "In the last election, the Japan
Socialist Party enjoyed victory." As a result, it is determined
that the pronoun "its;" included in the phrase "its victory"
indicates the "Japan Socialist Party". This is because the winner
of the "victory" is the "Japan Socialist Party".
[0174] The following paper refers to such a context process.
[0175] Source Document:
[0176] Information Processing Society Study Report
[0177] Natural Language Processing Study Report
[0178] Issue Date: November 19, 1993
[0179] Title of Paper:
[0180] Context Processing System DIANA Using Context Restriction
and Context Selection
[0181] Number of Paper: 98-8
[0182] Author:
[0183] Tetsuya Nasukawa
[0184] As described above, a reference destination can be clarified
according to context of a neighborhood.
[0185] As a result of the context process performed in the extended
context process range, the reference destination can be identified.
Then, the result of the identification is passed to the translation
processing unit 6 to perform a translation process.
[0186] Then, the translation processing unit 6 performs the
translation process using the context information indicating that
the pronoun "its" points to "the Japan Socialist Party", depending
on need.
[0187] As a result, the phrase "its victory" is translated as "" if
the context process is not performed. In the meantime, this phrase
is translated as "" if the context process is performed. That is, a
translated sentence which is easy to read can be obtained.
[0188] FIG. 12 is a flowchart showing a context process according
to the fourth embodiment of the present invention.
[0189] In FIG. 12, the context processing unit 5 first consults a
dictionary for each word in a sentence to be translated in step
S21.
[0190] Then, it is determined whether or not there is a word having
an ambiguous reference relationship in a range to be translated in
step S22. If "YES", the process goes to step S23.
[0191] In step S23, it is determined whether or not a context
process range includes a word or a word sequence which could
possibly be a reference destination. If "NO", the process goes to
step S24.
[0192] In step S24, the context process range extending unit 7
extends the context process range.
[0193] After the context processing unit 5 consults a dictionary
for each word in a sentence in the extended context process range
in step S25, the process goes to step S23.
[0194] If it is determined that the word or the word sequence could
possibly be a reference destination in step S23, the process goes
to step S26.
[0195] In step S26, the reference destination of the sentence in
the extended context process range is identified.
[0196] In step S27, a process such as replacing the word having the
ambiguous reference relationship in the range to be translated with
a word at a reference destination, etc. is performed.
[0197] In step S28, the translation processing unit 6 performs a
translation process in consideration of the word at the reference
destination.
[0198] In step S22, if it Ls determined that the range to be
translated includes no word having the ambiguous reference
relationship, the process goes to step S28 to perform the
translation process without considering the word at the reference
destination.
[0199] As described above, even if a range to be translated
includes no word pointed to by a demonstrative pronoun included in
that range, a word pointed to by the demonstrative pronoun can be
retrieved from context outside the range to be translated, by
estimating a reference destination. As a result, a translation
process can be performed while replacing a demonstrative pronoun,
depending on need.
[0200] Provided next is the explanation about operations performed
by the translation range identifying unit 4 included in the
translating apparatus according to the first embodiment of the
present invention, by referring to the drawings. This unit is
intended to identify a sentence to be translated included in
original text where one sentence does not terminate in one line,
according to a position of a cursor 10.
[0201] Assume that the original text displaying unit 1 displays the
original text "The talks were held in the Upper House. This has
forced the party to change its stance.", as shown in FIG. 13.
[0202] In this case, the sentences do not terminate in one line,
and they are also not numbered.
[0203] A translation operator operates the input unit 2 to move the
cursor 10 to an arbitrary position of a sentence to be translated
for specification.
[0204] Then, the translation range identifying unit 4 identifies
the sentence to be translated. That is, this unit detects the
position of the cursor 10, and automatically determines which
portion the translation operator intends to translate according to
the position of the cursor 10. Since translation is performed by
sentence, the sentence pointed to by the cursor 10 is
extracted.
[0205] For example, if the cursor is positioned at the character
"e" as shown in FIG. 13, backward scanning is started from the
character "e". Then, the word beginning with an uppercase letter
"This" is searched. Since the period "." immediately follows the
word "House" which immediately precedes the word "This", the word
"This" is determined to be the beginning of the sentence in which
the cursor 10 is positioned.
[0206] Then, forward scanning is performed from the character "e"
at which the cursor 10 is positioned. Since the period "."
immediately follows the word "stance", this word is determined to
be the last word of the sentence in which the cursor 10 is
positioned.
[0207] As a result, the sentence "This has forced the party to
change its stance" is determined as the sentence to be
translated.
[0208] Then, the context processing unit 5 performs a context
process for the original sentence "This has forced the party to
change its stance.", which is identified by the translation range
identifying unit 4. If context cannot be determined from this
sentence only, the context process range extending unit 7 extends
the context process range where the context process is performed,
to the original sentence "The talks were held in the Upper
House."
[0209] The translation processing unit 6 then performs a
translation process based on the context information obtained by
the context processing unit 5.
[0210] As a result, a translated sentence "" can be obtained for
the original sentence "This has forced the party to change its
stance."
[0211] Provided next is the explanation about operations performed
by the translation range identifying unit 4 included in the
translating apparatus according to the second embodiment of the
present invention, by referring to the drawings. The translation
range identifying unit 4 is intended to identify a range to be
translated in original text where one sentence does not terminate
in one line, according to shaded area 11 produced by dragging a
mouse, etc.
[0212] Assume that the original text displaying unit 1 displays the
original text "The talks were held in the Upper House. This has
forced the party to change its stance.", as shown in FIG. 14.
[0213] In this case, the sentences do not terminate in one line,
and they are also not numbered.
[0214] A translation operator operates the input unit 2, and
produces a shaded area 11 in order to select a sentence to be
translated.
[0215] Then, the translation range identifying unit 4 identifies
the sentence to be translated. That is, the translation range
identifying unit 4 detects the range of the shaded area 11, and
automatically determines which portion the translation operator
intends to translate according to the shaded area 11.
[0216] For example, if the shaded area 11 is made over the range
"The talks were held in the Upper House. This has forced the party
to change" in FIG. 14, it is determined whether or not the
beginning of the range contained in the shaded area 11, is the
beginning of a sentence. Since the word beginning with the
uppercase letter "The" is located at the beginning, this word is
determined as the beginning of the sentence.
[0217] Then, forward scanning is started from the word "The" in
order to detect a termination of the sentence. Since the period "."
exists between the word "House" and the word "This", it is
determined that the termination of the sentence exists between the
words "House" and "This".
[0218] The forward scanning finally reaches the last word "change"
in the selected range of the shaded area 11, without detecting an
additional termination of a sentence other than the above described
termination, as a result of continuing the forward scanning.
[0219] Then, it is determined whether or not the last word "change"
is the end of the sentence. Since there is no period "."
immediately after the last word "change", the last word "change" is
not determined as the end of the sentence.
[0220] Accordingly, the range from the most recently detected
termination to the last word "change" is determined as being
incomplete, and the range "This has forced the party to change." is
removed from the range selected by the shaded area 11.
[0221] As a result, the sentence "The talks were held in the Upper
House." is determined as the sentence to be translated.
[0222] The above described embodiment refers to the case where a
sentence is completely included in the range selected by the shaded
area 11. However, a sentence partially included in a range selected
by the shaded area 11 may also be a sentence to be translated.
[0223] Then, the context processing unit 5 performs a context
process for the original sentence "The talks were held in the Upper
House.", which is identified by the translation range identifying
unit 4. If context cannot be determined based on only this
sentence, the context process range extending unit 7 extends a
context process range where the context process is performed.
[0224] The translation processing unit 6 then performs a
translation process based on context information obtained by the
context processing unit 5.
[0225] As a result, a translated sentence "" can be obtained for
the original sentence "The talks were held in the Upper
House.".
[0226] As described above, a sentence to be translated can be
extracted by being suitably identified, thereby performing a
translation process accurately.
[0227] Furthermore, a group of sentences to be translated is
identified, thereby performing a translation process
accurately.
[0228] Provided next is the explanation about a method for
processing an inline tag performed by the translating apparatus
according to the first embodiment of the present invention, by
referring to the drawings. This method is intended to detect an
inline tag and ignore it as a non-target in a range to be
translated, if the inline tag of a markup language, etc. exists in
original text.
[0229] Assume that the original text displaying unit 1 displays the
original sentence "This is <B> a </B> pen.", as shown
in FIG. 15. In this case, "<B>" and "</B>" are inline
tags used to emphasize the indefinite article "a".
[0230] A translation operator operates the input unit 2, moves a
cursor 41 to a character "i", and issues a translation
instruction.
[0231] The translation range identifying unit 4 then identifies a
sentence to be translated. That is, this unit detects the position
of the cursor 41, and automatically determines which portion the
translation operator intends to translate, according to the
position of the cursor 41. Since translation is performed by
sentence, a sentence where the cursor 41 is positioned is
extracted.
[0232] For example, since the cursor 41 is positioned at the
character "i", backward scanning is started from the character "i".
Then, the word beginning with an uppercase letter "This" is located
in FIG. 15, and this word is determined as the beginning of the
sentence where the cursor 41 is positioned.
[0233] Then, forward scanning is started from the character "i" at
which the cursor 41 is positioned. Then, the inline tags
"<B>" and "</B>" are detected using an interpreter of a
markup language. The detected inline tags are ignored as a
non-target of translation. Since the period "." immediately follows
the word "pen" as a result of continuing the forward scanning, the
word "pen" is determined as the last word in the sentence in which
the cursor 41 is positioned.
[0234] As a result, the sentence "This is a pen." is determined as
the sentence to be translated.
[0235] Then, the context processing unit 5 performs a context
process for the original sentence "This is a pen.", which is
identified by the translation range identifying unit 4. If context
cannot be determined according to only the original sentence "This
is a pen.", the context process range extending unit 7 extends a
context process range where the context process is performed.
[0236] The translation processing unit 6 then performs a
translation process based on the context information obtained by
the context processing unit 5.
[0237] As a result, a translated sentence "" can be obtained for
the original sentence "This is a pen.".
[0238] As described above, a portion unrelated to translation can
be removed by ignoring an inline tag, thereby a translation process
can be performed accurately.
[0239] Provided next is the explanation about a method for
processing an inline tag performed by the translating apparatus
according to the second embodiment of the present invention, by
referring to the drawings. This method is intended to detect an
inline tag, and ignore it as a non-target in a context process
range to be extended, if the inline tag of a markup language, etc.
is included in original text.
[0240] Assume that the original text displaying unit 1 displays the
original text "This is <B> a </B> pen. And that is a
book." is displayed, as shown in FIG. 16. Here, "<B>" and
"</B>" are inline tags used to emphasize the indefinite
article "a".
[0241] The translation operator operates the input unit 2, moves a
cursor 42 to a character "b", and issues a translation
instruction.
[0242] The translation range identifying unit 4 identifies a
sentence to be translated. The translation range identifying unit 4
detects the position of the cursor 42, and automatically determines
which portion the translation operator intends to translate,
according to the position of the cursor 42. Since translation is
performed by sentence, the sentence where the cursor 42 is
positioned is extracted.
[0243] Since the cursor 42 is positioned at the character "b" in
FIG. 16, backward scanning is started from this character. The word
"And" which begins with an uppercase letter is located as a result
of the scanning, and this word is determined to be the beginning of
the sentence in which the cursor 42 is positioned.
[0244] Then, forward scanning is started from the character "b" at
which the cursor 42 is positioned. Since the period "." immediately
follows the word "book", this word is determined to be the last
word of the sentence in which the cursor 42 is positioned.
[0245] As a result, the sentence "And that is a book." is
determined as a sentence to be translated.
[0246] The context processing unit 5 performs a context process for
the original sentence "And that is a book.", which is identified by
the translation range identifying unit 4. If context cannot be
determined according to only the original sentence "And that is a
book.", the context process range extending unit 7 extends a
context process range where the context process is performed.
[0247] If the context process range is extended to a sentence
immediately preceding the original sentence "And that is a book.",
backward scanning is started from the word "pen". The inline tags
"<B>" and "</B>" are detected using an interpreter of a
markup language, and these tags are removed from the context
process range as a non-targets in the extended context process
range. The word "This" which begins with an uppercase letter, is
located as a result of continuing the backward scanning, and
determined to be the beginning of the sentence in the context
process range.
[0248] As a result, the original sentence "This is a pen." is added
to the context process range, and the context processing unit 5
detects context information from the extended context process
range.
[0249] The translation processing unit 6 then performs a
translation process based on the context information obtained by
the context processing unit 5.
[0250] As a result, a translated sentence "" is obtained for the
original sentence "And that is a book."
[0251] As described above, a portion unrelated to context can be
removed by removing an inline tag from a context process range,
thereby performing a context process accurately.
[0252] Provided next is the explanation about a method for
extending a context process range performed by the translating
apparatus according to the first embodiment of the present
invention. This method is intended to extend a context process
range sentence by sentence, backwards and forwards until required
context can be obtained.
[0253] In FIG. 5, if context information cannot be obtained from
only the original sentence numbered 32 "This has forced the party
to change its stance.", the context process range is extended to a
sentence immediately preceding or succeeding the original sentence
numbered 32.
[0254] If required context cannot be obtained even though the
context process range is extended to the sentence immediately
preceding or succeeding the sentence to be translated, the context
process range extending unit 7 further extends the context process
range to sentences preceding or succeeding the sentence to be
translated until the required context can be obtained. The method
for extending a context process range by sentence backwards and
forwards is helpful for examining a correspondence where an
interval between sentences is an important factor, etc.
[0255] If a sentence including required context is separate from a
sentence to be translated, a context process range may be extended
to before and after the sentence to be translated in units of a
plurality of sentences.
[0256] As described above, by extending a context process range
sentence by sentence, a context process such as estimating an
abbreviation, estimating a reference destination, etc. can be
performed efficiently, thereby translating a portion of a document
efficiently and accurately.
[0257] Provided next is the explanation about a method for
extending a context process range performed by the translating
apparatus according to the second embodiment of the present
invention, by referring to the drawings. This method is intended to
extend the context process range in paragraphs before or after a
sentence to be translated until required context can be
obtained.
[0258] As shown in FIG. 17A, the original text displaying unit 1 is
assumed to display original text, and shaded area 20 is made in a
sentence to be translated "The talks were held in the Upper House."
included in the original text.
[0259] The context processing unit 5 performs a context process for
the sentence to be translated in which the shaded area 20 is made.
If sufficient context information cannot be obtained from the
shaded sentence as a result of the context process, the context
process range extending unit 7 extends the context process range to
an entire paragraph including the sentence to be translated.
[0260] That is, the context process range extending unit 7
specifies the text contained in the shaded area 21 is made
"Secondly, . . . The talks were held in the Upper House. This has
forced the party to change its stance.", as a context process
range, as shown in FIG. 17B.
[0261] To identify the paragraph as the context process range,
backward scanning is started from the shaded original sentence to
be translated in order to check to see if any sentence exists. A
sentence which begins with "Secondly," is determined to precede the
sentence to be translated where the shaded area 20 is made. Then,
it is determined whether or not the sentence beginning with
"Secondly," is the beginning of the paragraph. Since this sentence
is indented and an empty line exists between the preceding sentence
beginning with "Firstly" and this line, it is determined to be the
first sentence of the paragraph.
[0262] Next, forward scanning is started from the sentence to be
translated where the shaded area 20 is made, in order to check to
see if any sentence exists. The sentence "This has forced the party
to change its stance." follows the shaded sentence to be
translated. Then, it is determined whether or not the detected
sentence is the last sentence of the paragraph. Since no sentence
immediately follows this sentence in the same line; there is an
empty line between the succeeding sentence and this sentence; and
the next sentence is indented, the sentence "This has forced the
party to change its stance." is determined to be the last sentence
of the paragraph.
[0263] With the above described method, the context process range
after extension is identified by detecting the first and the last
sentences of a paragraph.
[0264] Then, the context processing unit 5 extracts context
information from the context process range contained in the shaded
area 21 is made, as shown in FIG. 17B. If sufficient context
information cannot be obtained from the context process range
contained in the shaded area 21 is made, the context process range
extending unit 7 extends the context process range to the entire
text of the next paragraph.
[0265] Assume that the context process range extending unit 7
extends the context process range to the entire text of the
preceding paragraph, as shown in FIG. 17C, and adds the sentence
which begins with "Firstly," to the context process range. As a
result, the text contained in the shaded area 22 is made "Firstly,
. . . , secondly, . . . . The talks were held in the Upper House.
This has forced the party to change its stance." is specified as
the context process range.
[0266] The context processing unit 5 extracts context information
from the context process range contained in the shaded area 22, as
shown in FIG. 17C. If sufficient context information is extracted
from the context process range contained in the shaded area 22, the
context processing unit 5 passes the extracted context information
to the translation processing unit 6.
[0267] The translation processing unit 6 translates the sentence to
be translated where the shaded area 20 is made, based on the
context information passed from the context processing unit 5.
[0268] Note that the method for extending a context process range
in paragraphs backwards and forwards is helpful for making a field
determination where the existence/non-existence of a word
identifying that a specific field exists is important.
[0269] If a sentence including required context is separate from a
sentence to be translated, a context process range may be extended
before or after the sentence to be translated in units of a
plurality of paragraphs.
[0270] As described above, a context process range is extended in
paragraphs corresponding to a range to be translated until a
translated sentence can be determined for an original sentence.
Accordingly, a context process such as estimating a field,
estimating a document type, etc. can be performed efficiently,
thereby translating a portion of a document efficiently and
accurately.
[0271] Provided next is the explanation about the method for
extending a context process range performed by the translating
apparatus according to the third embodiment of the present
invention, by referring to the drawings. This method is intended to
change the method for extending a context process range depending
on a required type of context process.
[0272] FIG. 18 shows a correspondence table between a type of
context process and the method for extending a context process
range. This table is pre-stored in the context process range
extending unit 7.
[0273] In FIG. 18, when field estimation is made, a context process
range is extended in paragraphs before or after a sentence to be
translated.
[0274] When document type estimation is made, the context process
range is extended in paragraphs before or after the sentence to be
translated.
[0275] To complete an abbreviation, the context process range is
extended in sentences before or after the sentence to be
translated.
[0276] In the meantime, to refer to a reference destination, the
context process range is extended in sentences before the sentence
to be translated.
[0277] FIG. 19 is a flowchart showing the method for extending a
context process range performed by the translating apparatus
according to the third embodiment of the present invention.
[0278] In FIG. 19, the context processing unit 5 consults a
dictionary for each word included in a sentence to be translated in
step S31.
[0279] Then, one type of context process is selected for the
sentence to be translated, and the selected context process is
performed in step S32.
[0280] In step S33, it is determined whether or not sufficient
context information can be obtained from a range to be translated.
If "NO", the process goes to step S34.
[0281] In step S34, the correspondence table shown in FIG. 18 is
referenced to obtain a method for extending a context process range
according to the selected type of context process. If the type of
context process is a field estimation or document type estimation,
the context process range is extended in paragraphs before or after
the sentence to be translated. If the type of context process is
completion of an abbreviation, the context process range is
extended in sentences before or after the sentence to be
translated. If the type of context process is referring to a
reference destination, the context process range is extended in
sentences before the sentence to be translated.
[0282] In step S35, the context process range extending unit 7
extends the context process range based on the method for extending
a context process range obtained in step S34.
[0283] The context processing unit 5 then performs a context
process for text in the extended context process range in step
S36.
[0284] In step S37, the context processing unit 5 determines
whether or not sufficient context information can be obtained from
the extended context process range. If "NO", the process goes to
step S35 to repeatedly extend the context process range.
[0285] If it is determined that sufficient context information can
be obtained from the extended context process range, the process
goes to step S38.
[0286] The context processing unit 5 determines whether or not all
types of context processes have been performed in step S38. If
"NO", the process goes to step S32 to perform a next type of
context process, and completes the process after all types of
context processes have been performed.
[0287] If it is determined that sufficient context information can
be obtained from the context process range in step S33, the process
goes to step S38.
[0288] As described above, a context process can be efficiently
performed by determining a direction in which a context process
range is extended based on a type of context process, thereby
translating a portion of a document efficiently and accurately.
[0289] Additionally, a context process for estimating a reference
destination can be efficiently performed by extending a context
process range in sentences in a backward direction, thereby
translating a portion of a document efficiently and accurately.
[0290] Furthermore, a context process can be performed in a context
process range having a context corresponding to a range to be
translated by terminating extension of the context process range at
a boundary of the context, thereby performing the context process
accurately and quickly.
[0291] Provided next is the explanation about a method for dividing
a range to be translated performed by the translating apparatus
according to the first embodiment of the present invention. This
method is intended to perform translation by dividing the range to
be translated for each paragraph, if the range to be translated
straddles a plurality of paragraphs. This is also because context
may change at a change of paragraph.
[0292] That is, if the last sentence of a paragraph A and the first
sentence of a paragraph B are simultaneously specified as sentences
to be translated by a process of dragging a mouse, etc., the last
sentence of the paragraph A and the first sentence of the paragraph
B are scanned when a context process is performed. Then, it is
detected that these sentences respectively belong to different
paragraphs, so that the range to be translated is determined to
straddle the plurality of paragraphs.
[0293] Then, a translation process is respectively performed for
the last sentence of the paragraph A and the first sentence of the
paragraph B that are determined to straddle the plurality of
paragraphs.
[0294] As a result, the last sentence of the paragraph A can be
translated based on only the context of the paragraph A, while the
first sentence of the paragraph B can be translated based on only
the context of the paragraph B. Accordingly, even if a topic
suddenly changes at the change of paragraph from A to B,
translation can be performed accurately.
[0295] Assuming that a topic of baseball changes to the one such as
the weather of the next day, means of transportation to a baseball
ground, etc., a translation process can be performed for respective
fields such as a baseball field, a weather field, and a
transportation field, thereby performing the translation process
accurately.
[0296] As described above, context corresponding to a range to be
translated can be accurately extracted by dividing the range into
paragraphs, even if a topic suddenly changes at a change to a next
paragraph. As a result, a translation process can be performed
accurately.
[0297] Provided next is the explanation about a method for dividing
a range to be translated, performed by the translating apparatus
according to the second embodiment of the present invention. This
method is intended to divide a range to be translated for each
paragraph, and set each context process range for each of the
divided ranges to be translated for each of the paragraphs. This is
because context may often change as a paragraph changes.
[0298] If the last sentence of the paragraph A and the first
sentence of the paragraph B are simultaneously specified as
sentences to be translated by a process of dragging a mouse, etc.,
these sentences are scanned when a context process is performed.
Then, it is detected that these sentences respectively belong to
different paragraphs, so that the range to be translated is
determined to straddle the plurality of paragraphs.
[0299] A translation process is then performed respectively for the
last sentence of the paragraph A and the first sentence of the
paragraph B that are determined to straddle the plurality of
paragraphs.
[0300] If sufficient information cannot be obtained from the last
sentence of the paragraph A when a translation process is performed
for the last sentence of the paragraph A, a context process range
is extended based on only the last sentence of the paragraph A.
Then, context information is extracted from the extended context
process range, and the last sentence of the paragraph A is
translated.
[0301] In the meantime, if sufficient context information cannot be
obtained from the first sentence of the paragraph B, a context
process range is extended based on only the first sentence of the
paragraph B. Then, context information is extracted from the
extended context process range, and the first sentence of the
paragraph B is translated.
[0302] That is, the last sentence of the paragraph A can be
translated independently from the first sentence of the paragraph
B, and the first sentence of the paragraph B can be translated
independently from the last sentence of the paragraph A. Therefore,
even if a topic suddenly changes at the change of paragraph from A
to B, translation can be performed accurately.
[0303] As described above, context corresponding to a context
process range can be accurately extracted by setting a context
process range for each divided range to be translated for each
paragraph, when a topic changes at a change of paragraph.
[0304] Provided next is the explanation about a method for
displaying context information according to the first embodiment of
the present invention, by referring to the drawings. This method is
intended to display context information determined and obtained by
a context process, at the side of a sentence to be translated.
[0305] Assume that the original text displaying unit 1 displays the
original text "The talks were held in the Upper House. This has
forced the party to change its stance.", as shown in FIG. 20.
[0306] A shaded area 43 is made for a sentence to be translated,
which is identified by the translation range identifying unit
4.
[0307] Then, the context processing unit 5 performs a context
process for the sentence to be translated, and detects a compound
"Upper House" from the sentence to be translated in which the
shaded area 43 is made. Since the compound "Upper House" is used in
a political field only, it is determined that the sentence to be
translated in which the shaded area is made, relates to the
political field.
[0308] Next, the original text displaying unit 1 displays context
information 44 extracted by the context processing unit 5 from the
sentence to be translated, at the side of the sentence to be
translated. Since the sentence to be translated in which the shaded
area 43 is made, for example, relates to the political field, a
word "political" is displayed as context information 44.
[0309] As a result, the translation operator can recognize context
of a sentence to be translated, and can perform a translation
process while checking if a suitable context process is
performed.
[0310] As described above, a translation process can be performed
while recognizing a state of a context process in a range to be
translated, by displaying context information in correspondence
with a range to be translated in original text.
[0311] Furthermore, a translation process can be performed while
identifying a sentence to be translated, by highlighting the
sentence to be translated.
[0312] Provided next is the explanation about a method for
displaying context information according to the second embodiment
of the present invention, by referring to the drawings. This method
is intended to display context information yet to be determined,
which is obtained by a context process, at the side of a sentence
to be translated.
[0313] Assume that the original text displaying unit 1 displays the
original text "The talks were held in the Upper House. This has
forced the party to change its stance", as shown in FIG. 21A.
[0314] The shaded area 45 is made for the original sentence "This
has forced the party to change its stance.", which is identified as
a sentence to be translated by the translation range identifying
unit 45.
[0315] Then, the context processing unit 5 performs a context
process for the sentence to be translated in which the shaded area
45 is made, and the context processing unit 5 detects the original
words "party" and "stance", respectively having a plurality of
translated words, from the sentence to be translated in which the
shaded area 45 is made. The original word "party" is used for a
general field in addition to the political field, while the
original word "stance" is used for the general field in addition to
the sports field.
[0316] This original sentence contained in the shaded area 45,
includes no word used in one particular field. Accordingly, the
field of the sentence to be. translated in which the shaded area 45
is made, cannot be identified at this stage.
[0317] Therefore, the original text displaying unit 1 displays the
context information 46 yet to be determined, which is extracted by
the context processing unit 5 from the sentence to be translated.
For example, since the political and sports fields are candidates
of the sentence to be translated in which the shaded area 45 is
made, the words "political" and "sports" are displayed as the
context information 46.
[0318] With such a display, a translation operator can recognize
whether or not context of a sentence to be translated is
determined. As a result, the translation operator can perform a
translation process while checking to see if a suitable context
process is performed.
[0319] As described above, a translation process can be performed
while recognizing a state of a context process being performed in a
range to be translated, by displaying a plurality of pieces of
context information as candidates.
[0320] Provided next is the explanation about a method for
displaying context information according to the third embodiment of
the present invention, by referring to the drawings. This method is
intended to display a word at a reference destination, which is
obtained by a context process, in correspondence with a word at a
reference source.
[0321] Assume that the original text displaying unit 1 displays the
original text "The talks were held in the Upper House. This has
forced the party to change its stance.", as shown in FIG. 22.
[0322] The context processing unit 5 performs a context process for
a sentence to be translated, and determines that the original word
"it" refers to the original word "party".
[0323] The original text displaying unit 1 then displays the word
at the reference destination in correspondence with the word at the
reference source, based on the context information obtained by the
context processing unit 5. For example, areas 50 and 51 shaded in
identical colors are provided for the original word "party" at the
reference destination and the original word "it" at the reference
source.
[0324] As a result, the translation operator can perform a
translation process while checking to see if a process of reference
relationship is properly performed.
[0325] As described above, a translation process can be performed
while recognizing a state of a context process, by highlighting a
word at a reference destination and a word at a reference
source.
[0326] Provided next is the explanation about a method for
displaying context information according to the fourth embodiment
of the present invention, by referring to the drawings. This method
is intended to display a word having a plurality of translated
words obtained by a context process in correspondence with a word
identifying a field.
[0327] Assume that the original text displaying unit 1 displays the
original text "The talks were held in the Upper House. This has
forced the party to change its stance.", as shown in FIG. 23.
[0328] The context processing unit 5 performs a context process for
a sentence to be translated, and locates a word "" used in a
general field and a word "" used in the political field, as
translated words of the original word "party". Additionally, the
compound "Upper House" used in the political field only, is
located. Accordingly, it is determined that the sentence to be
translated relates to the political field.
[0329] As a result, the word "" used in the political field is
selected as a translated word of the original word "party".
[0330] The original text displaying unit 1 displays a word having a
plurality of translated words obtained by the context process based
on the context information obtained by the context processing unit
5, in correspondence with a word identifying a field. For example,
areas 52 an 53 shaded in identical colors are made for the original
word "party" having a plurality of translated words, and the
compound "Upper House" used as a basis for selecting one of the
plurality of translated words for the original word "party".
[0331] Accordingly, a translation operator can perform a
translation process while checking to see if a process of reference
relationship is properly performed for a sentence to be
translated.
[0332] If there are a plurality of words that could be a basis for
selecting a translated word for an original word, only a word
closest to the original word may be displayed, in order to prevent
a display from being confusing to the user.
[0333] For example, if there are a number of words used only in the
political field in addition to the compound "Upper House" in a
sentence to be translated, only the compound "Upper House" closest
to the original word "party" is displayed in correspondence with
the original word "party".
[0334] In this case, searching for the word closest to the original
word "party" to be used as the basis for selecting the translated
word, is performed as described below.
[0335] That is, the field of the context of the sentence including
the original word "party" is determined as being "political", by
making a field estimation.
[0336] Then, backward scanning is started from the original word
"party", and a word in the political field only is searched.
[0337] Next, a first searched word among words used in the
political field only is displayed. In FIG. 23, the compound "Upper
House", for example, corresponds to the first searched word.
[0338] Additionally, a translation process can be performed while
recognizing a state of a context process, by highlighting a short
substitute and its formal name.
[0339] Note that a translation process may be performed while
recognizing a field name, document type name, formal name or a word
at a reference destination, by displaying the above described names
or word.
[0340] Additionally, context information can be concisely displayed
by highlighting a word closest to the first word among second words
identifying a translated word of the first word, thereby
recognizing a state of a context process with ease.
[0341] Furthermore, a sentence to be translated and a context
process range can be separately recognized by highlighting the
sentence to be translated using a first method, and highlighting
the context process range using a second method, thereby performing
a translation process while recognizing the context process
range.
[0342] Provided next is the explanation about a dictionary search
apparatus according to an embodiment of the present invention, by
referring to the drawings.
[0343] FIG. 24 is a simplified block diagram showing configuration
of the dictionary search apparatus according to the embodiment of
the present invention. This apparatus is intended to properly
perform a dictionary search process by obtaining appropriate
context from a neighborhood of an original word for which a search
operation is performed, depending on need.
[0344] In FIG. 24, the dictionary search apparatus comprises a
searching unit 61, context process range setting unit 62, context
processing unit 63, and a determining unit 64.
[0345] The searching unit 61 is intended to search a translated
word for an original word, when specified.
[0346] The context process range setting unit 62 is intended to
read original text around a specified original word, and set a
context process range, if a plurality of translated words exist for
the specified original word.
[0347] The context processing unit 63 is intended to perform a
context process in a context process range set by the context
process range setting unit 62.
[0348] The determining unit 64 is intended to determine a
translated word for an original word, based on context obtained by
the context processing unit 63.
[0349] As described above, a dictionary search can be performed in
consideration of context by determining a translated word for an
original word, thereby performing the dictionary search
accurately.
[0350] Provided next is the explanation about operations performed
by a dictionary search apparatus according to the first embodiment
of the present invention. This apparatus is intended to estimate a
field as a context process.
[0351] Assume that the original text "The talks were held in the
Upper House. This has forced the party to change its stance." is
displayed, as shown in FIG. 25. First of all, a translation
operator who performs a search moves a cursor 70 to a character "r"
included in the original word "party", in order to specify this
word as an original word for which a search operation is to be
performed.
[0352] Then, scanning is performed from the character "r" in
backward and forward directions until a character space is
detected. As a result, the word "party" is recognized as the
original word for which the search operation is to be
performed.
[0353] The searching unit 61 then consults a dictionary for the
original word "party", and selects translated words "" or "" when
an English-Japanese search is performed. Since there are a
plurality of translated words for the original word "party" in this
case, a suitable translated word cannot be determined.
[0354] Then, a field is estimated to select the suitable translated
word for the original word "party". That is, if the original text
relates to a general field, the word "" can be selected as a
translated word of the original word "party". Or, if the original
text relates to the political field, the word "" is selected as the
translated word of the original word "party".
[0355] The context process range setting unit 62 sets the sentence
"This has forced the party to change its stance.", which includes
the original word "party", as a context process range, in order to
identify a field of the original text.
[0356] The context processing unit 63 then performs a context
process for the sentence "This has forced the party to change its
stance.", which is set by the context process range setting unit
62. Since this sentence includes no word identifying a field in
this case, the field of the original word "party" cannot be
determined.
[0357] Therefore, the context process range setting unit 62 extends
the context process range, and sets the text "The talks were held
in the Upper House. This has forced the party to change its
stance." as a context process range.
[0358] The context processing unit 63 then performs a context
process for this text set by the context process range setting unit
62. Since this text includes the compound "Upper House" used in the
political field only, the field of the original word "party" can be
determined to be the political field according to this
compound.
[0359] The determining unit 64 therefore determines the word ""
relating to the political field, as the translated word of the
original word "party" shown in FIG. 9.
[0360] The searching unit 61 consults a dictionary for the original
word "party", and selects the translated words "parti" and
"groupe", when an English-French search is performed. Since there
are a plurality of translated words for the original word "party"
in this case, a suitable translated word cannot be determined.
[0361] Next, field estimation is made in order to select the
suitable word for the original word "party". If the original text
relates to a general field, the word "groupe" can be selected as a
translated word of the original word "party". Or, if the original
text relates to the political field, the word "parti" can be
selected as the translated word of the original word "party".
[0362] The context process range setting unit 62 sets the sentence
"This has forced the party to change its stance.", which include
the original word "party", as a context process range, in order to
identify the field of the original text.
[0363] The context processing unit 63 then performs a context
process for the sentence "This has forced the party to change its
stance.", which is set by the context process range setting unit
62. Since this sentence does not include a word identifying a field
in this case, the field of the original word "party" cannot be
determined.
[0364] Therefore, the context process range setting unit 62 extends
the context process range, and sets the text "The talks were held
in the Upper House. This has forced the party to change its
stance." as a context process range.
[0365] The context processing unit 63 performs a context process
for this text set by the context process range setting unit 62.
Since this text includes the compound "Upper House" used in the
political field only in this case, the field of the original word
"party" can be determined to be the political field, based on this
compound.
[0366] The determining unit 64 therefore determines the word
"parti" relating to the political field as the translated word of
the original word "party", and displays "parti" with a higher
priority. The word "parti" is displayed as a first candidate in the
example shown in FIG. 9. If this word is displayed later, the order
of the candidates "parti" and groupe" is changed in order to
display "parti" first.
[0367] FIG. 26 is a flowchart showing operations performed by the
dictionary search apparatus according to the first embodiment of
the present invention.
[0368] In this figure, a word to be searched is first identified
according to a position of the cursor 70 in step S41.
[0369] Then, a dictionary is consulted for the word to be searched
in step S42.
[0370] In step S43, it is determined whether or not ambiguity of a
field exists in the word to be searched. If "YES", the process goes
to step S44.
[0371] In step S44, a context process range is set for original
text around the word to be searched, and a field estimation is made
for the original text included in the set context process
range.
[0372] In step S45, it is determined whether or not sufficient
context information can be obtained from the context process range.
If "NO", the process goes to step S46.
[0373] After the context process range is extended in step S46, the
process goes back to step S44 in order to repeatedly extend the
context process range until sufficient context information can be
obtained. If it is determined that sufficient context information
can be obtained from the context process range in step S45, the
process goes to step S47.
[0374] In step S47, a word definition matching an estimated field
is moved to a first location.
[0375] In step S48, a result of the search is displayed.
[0376] If it is determined that the word to be searched has no
ambiguity of a field in step S43, the process goes to step S48 to
display the word definition of the word searched.
[0377] As described above, even if a translated word cannot be
determined for an original word, a context process can be
efficiently performed for a dictionary search operation by
extending a context process range. As a result, the dictionary
search can be performed accurately.
[0378] In addition, a translated word corresponding to an estimated
field can be selected by estimating a field as a context process,
if there are a plurality of translated words for an original word.
As a result, a dictionary search can be performed accurately.
[0379] For example, if first and second translated words
respectively belonging to first and second fields for a first
original word, and if a second original word outside a range to be
translated belongs to the first field, the field of the first
original word is estimated by selecting the first translated word
for the first original word. As a result, a translated word of the
first original word can be determined.
[0380] Provided next is the explanation about operations performed
by the dictionary search apparatus according to the second
embodiment of the present invention, by referring to the drawings.
This apparatus completes an abbreviation as a context process.
[0381] Assume that the original text "In the last election, the
Japan Socialist Party enjoyed victory. The reason for JSP's victory
was . . . " is displayed.
[0382] A translation operator who makes a search moves a cursor to,
for example, the character "J" included in the word "JSP", in order
to specify this abbreviation as a word for which a search operation
of a word definition is required.
[0383] Then, scanning is performed from the character "J" in
forward and backward directions until a character space is
detected. The word "JSP" is recognized as the word for which a
search operation of a word definition is required.
[0384] The searching unit 61 consults a dictionary for the word
"JSP", and determines this word as being unregistered. Since the
word "JSP" is written in uppercase letters in this case, this word
is determined as an abbreviation.
[0385] The context process range setting unit 62 then sets the
sentence "The reason for JSP's victory was ", which includes the
abbreviation "JSP", as a context process range, in order to
complete the abbreviation "JSP".
[0386] The context processing unit 63 then performs a context
process for the sentence "The reason for JSP's victory was . . . ",
which is set by the context process range setting unit 62. Since
this sentence includes no character string which completes the
"JSP", the abbreviation "JSP" cannot be completed.
[0387] Accordingly, the context process range setting unit 62
extends the context process range, and sets the original text "In
the last election, "the Japan Socialist Party enjoyed victory. The
reason for JSP's victory was . . . ", as the context process range.
The context processing unit 63 performs a context process for this
original text set by the context process range setting unit 62.
Since this original text includes the character string "Japan
Socialist Party", each of which begins with an uppercase letter
"J", "S", and "P", this character string is determined as the
character string which completes the abbreviation "JSP".
[0388] Then, the searching unit 61 consults a dictionary for the
character string "Japan Socialist Party", and retrieves "" as the
definition of this character string.
[0389] The determining unit 64 therefore determines "" as the
definition of the word "JSP".
[0390] FIG. 27 is a flowchart showing operations performed by the
translating apparatus according to the second embodiment of the
present invention.
[0391] In this figure, a word to be searched is identified
according to a position of a cursor in step S51.
[0392] Next, a dictionary is consulted for the word to be searched
in step S52.
[0393] In step S53, it is determined whether or not the word to be
searched is an abbreviation. If "YES", the process goes to step
S54.
[0394] In step S54, a sentence including the word to be searched is
set as a context process range.
[0395] Then in step S55, it is determined whether or not included
in the context process range, and the abbreviation is
completed.
[0396] In step S56, it is determined whether or not an expression
completing the word to be searched exists in the context process
range. If "NO", the process goes to step S57.
[0397] The context process range is then extended to before and
after the sentence including the word to be searched, and the
process goes back to step S54.
[0398] In the meantime, if the expression completing the
abbreviation exists in the context process range, the process goes
to step S58.
[0399] In step S58, a dictionary is consulted for the completed
word to be searched.
[0400] In step S59, a definition of the completed word to be
searched is displayed.
[0401] If the word to be searched is not determined as an
abbreviation in step S53, the process goes to step S59 to display a
definition of the word to be searched.
[0402] As described above, a complete name of an abbreviation can
be retrieved by performing a context process such as estimating an
abbreviation, thereby performing a dictionary search operation
efficiently.
* * * * *