U.S. patent application number 13/487247 was filed with the patent office on 2012-09-20 for optically imaging stylus for interacting with substrate having imageable position-coding pattern.
This patent application is currently assigned to SILVERBROOK RESEARCH PTY LTD. Invention is credited to Paul Lapstun, Kia Silverbrook.
Application Number | 20120237140 13/487247 |
Document ID | / |
Family ID | 27424494 |
Filed Date | 2012-09-20 |
United States Patent
Application |
20120237140 |
Kind Code |
A1 |
Lapstun; Paul ; et
al. |
September 20, 2012 |
OPTICALLY IMAGING STYLUS FOR INTERACTING WITH SUBSTRATE HAVING
IMAGEABLE POSITION-CODING PATTERN
Abstract
An optically imaging stylus for interacting with a substrate
having an imageable position-coding pattern identifying a plurality
of locations on the substrate. The stylus includes: a nib; an image
sensor; and a processor configured for: (i) inferring a perspective
transform using perspective-distorted features of the imaged
position-coding pattern; (ii) determining a rotational orientation
of the stylus relative to the position-coding pattern; (iii)
acquiring and decoding coded data contained in the imaged
position-coding pattern, the decoded coded data identifying an
imaged location on the substrate; (iv) calculating a nib position
using: the imaged location, the perspective transform and the
determined rotational orientation, the nib position being different
than the imaged location; and (v) generating digital ink
identifying the nib position.
Inventors: |
Lapstun; Paul; (Balmain,
AU) ; Silverbrook; Kia; (Balmain, AU) |
Assignee: |
SILVERBROOK RESEARCH PTY
LTD
|
Family ID: |
27424494 |
Appl. No.: |
13/487247 |
Filed: |
June 3, 2012 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
12140225 |
Jun 16, 2008 |
|
|
|
13487247 |
|
|
|
|
10291559 |
Nov 12, 2002 |
7412651 |
|
|
12140225 |
|
|
|
|
09575129 |
May 23, 2000 |
7233320 |
|
|
10291559 |
|
|
|
|
Current U.S.
Class: |
382/314 |
Current CPC
Class: |
B42C 19/02 20130101;
H04N 1/00376 20130101; B41J 2/17503 20130101; H04N 1/32778
20130101; B41J 13/103 20130101; G06F 3/03545 20130101; G06K 17/00
20130101; H04N 1/00326 20130101; B41J 2/17563 20130101; H04N
1/00127 20130101; B42C 9/0006 20130101; H04N 1/32128 20130101; G06F
3/0321 20130101; G06F 16/00 20190101; B41J 2/2114 20130101; H04N
1/32122 20130101; G06F 3/1204 20130101; H04N 2201/3249 20130101;
H04N 1/32101 20130101; H04N 1/00244 20130101; G06F 3/04883
20130101; H04N 1/00567 20130101; H04N 2201/3247 20130101; G06Q
20/346 20130101; G06F 3/1265 20130101; B41J 2/17523 20130101; B41J
2/17556 20130101; G07F 7/1008 20130101; G06Q 30/00 20130101; H04N
2201/0082 20130101; H04N 1/00358 20130101; H04N 1/00968 20130101;
B41J 2/17553 20130101; G06K 9/2063 20130101; B41J 13/106 20130101;
B65H 29/34 20130101; B42C 9/0081 20130101; G06F 3/1284 20130101;
G06F 3/016 20130101; G06K 9/222 20130101; B41J 2/17546 20130101;
B65H 37/04 20130101; G06K 2009/226 20130101; G06Q 20/40 20130101;
B41J 2/1752 20130101; B42P 2261/04 20130101; G06K 2209/01 20130101;
B41J 2/17513 20130101; H04N 1/32771 20130101; H04N 2201/327
20130101; H04N 1/00204 20130101; H04N 2201/3243 20130101; H04N
2201/3269 20130101 |
Class at
Publication: |
382/314 |
International
Class: |
G06K 9/22 20060101
G06K009/22 |
Foreign Application Data
Date |
Code |
Application Number |
May 25, 1999 |
AU |
PQ0559 |
Jun 30, 1999 |
AU |
PQ1313 |
Oct 15, 1999 |
AU |
PQ3457 |
Dec 1, 1999 |
AU |
PQ4392 |
Claims
1. An optically imaging stylus for interacting with a substrate
having an imageable position-coding pattern, said position-coding
pattern identifying a plurality of locations on the substrate, said
stylus comprising: a stylus body; a nib for interacting with the
substrate; an image sensor for imaging an area of the
position-coding pattern; and a processor configured for: (i)
inferring a perspective transform using perspective-distorted
features of the imaged position-coding pattern, said features being
perspective-distorted due to a tilt of the stylus; (ii)
determining, from the imaged position-coding pattern, a rotational
orientation of the stylus relative to the position-coding pattern;
(iii) acquiring and decoding coded data contained in the imaged
position-coding pattern, the decoded coded data identifying an
imaged location on the substrate; (iv) calculating a nib position
using: the imaged location, the perspective transform and the
determined rotational orientation, said nib position being
different than the imaged location; and (v) generating digital ink
identifying the nib position.
2. The optically imaging stylus of claim 1, further comprising a
force sensor for detecting a nib force.
3. The optically imaging stylus of claim 2, wherein the processor
is configured to interpret said nib force as an indication of
contact between the nib and the substrate.
4. The optically imaging stylus of claim 1, wherein said stylus
contains a unique identity.
5. The optically imaging stylus of claim 1, wherein said digital
ink comprises a plurality of nib positions representing handwritten
input on the substrate.
6. The optically imaging stylus of claim 1, wherein said image
sensor is offset from said nib.
7. The optically imaging stylus of claim 1, further comprising a
transceiver for communicating the digital ink to a computer system,
such that the computer system can interpret the digital ink and
perform an action in response to the interaction with the
substrate.
8. The optically imaging stylus of claim 1, wherein said nib is a
non-marking stylus nib.
9. The optically imaging of claim 1, wherein said image sensor
comprises an infrared filter.
10. An imaging system comprising: a substrate having an imageable
position-coding, said position-coding pattern identifying a
plurality of locations on the substrate; and an optically imaging
stylus for interacting with the substrate, said stylus comprising:
a nib for interacting with the substrate; an image sensor for
imaging an area of the position-coding pattern; and a processor
configured for: (i) inferring a perspective transform using
perspective-distorted features of the imaged position-coding
pattern due to a tilt of the stylus; (ii) determining from the
imaged position-coding pattern a rotational orientation of the
stylus relative to the position-coding pattern; (iii) acquiring and
decoding coded data contained in the imaged position-coding
pattern, the decoded coded data identifying an imaged location on
the substrate; (iv) calculating a nib position using: the imaged
location, the perspective transform and the determined rotational
orientation, said nib position being different than the imaged
location; and (v) generating digital ink identifying the nib
position.
11. The imaging system of claim 10, wherein the position-coding
pattern is printed on the substrate.
12. The imaging system of claim 11, wherein the position-coding
pattern is printed with an infrared absorbing ink.
13. The imaging system of claim 10, wherein the stylus further
comprises a force sensor for detecting a nib force.
14. The imaging system of claim 13, wherein the processor is
configured to interpret said nib force as an indication of contact
between said nib and said substrate.
15. The imaging system of claim 10, wherein said stylus contains a
unique identity.
16. The imaging system of claim 10, wherein said digital ink
comprises a plurality of nib positions representing handwritten
input on the substrate.
17. The imaging system of claim 10, wherein said image sensor is
offset from said nib.
18. The imaging system of claim 10, wherein said stylus further
comprises a transceiver for communicating the digital ink to a
computer system, such that the computer system can interpret the
digital ink and perform an action in response to the interaction
with the substrate.
Description
CROSS-REFERENCE TO RELATED APPLICATION
[0001] This is a Continuation Application of U.S. application Ser.
No. 12/140,225 filed Jun. 16, 2008, which is a Continuation
Application of U.S. application Ser. No. 10/291,559 filed on Nov.
12, 2002, now issued U.S. Pat. No. 7,412,651, which is a
Continuation Application of U.S. application Ser. No. 09/575,129,
filed on May 23, 2000, now issued U.S. Pat. No. 7,233,320, all of
which are herein incorporated by reference.
FIELD OF INVENTION
[0002] The present invention relates to a method and system for
enabling user interaction with computer software running in a
computer system.
[0003] The invention has been developed primarily to provide a
surface-based interface which allows a user to interact with
networked information and to obtain interactive printed matter on
demand via high-speed networked color printers. Although the
invention will largely be described herein with reference to this
use, it will be appreciated that the invention is not limited to
use in this field.
CO-PENDING APPLICATIONS
[0004] Various methods, systems and apparatus relating to the
present invention are disclosed in the following co-pending
applications/granted patents filed by the applicant or assignee of
the present invention with the present application:
TABLE-US-00001 7,721,948 7,079,712 6,825,945 7,330,974 6,813,039
6,987,506 7,038,797 6,980,318 6,816,274 7,102,772 7,350,236
6,681,045 6,728,000 7,173,722 7,088,459 7,707,082 7,068,382
7,062,651 6,789,194 6,789,191 6,644,642 6,502,614 6,622,999
6,669,385 6,549,935 6,987,573 6,727,996 6,591,884 6,439,706
6,760,119 7,295,332 6,290,349 6,428,155 6,785,016 6,870,966
6,822,639 6,737,591 7,055,739 7,233,320 6,830,196 6,832,717
6,957,768 7,456,820 7,170,499 7,106,888 7,123,239 6,428,133
6,526,658 6,315,399 6,338,548 6,540,319 6,328,431 6,328,425
6,991,320 6,383,833 6,464,332 6,439,693 6,390,591 7,018,016
6,328,417 6,409,323 6,281,912 6,604,810 6,318,920 6,488,422
6,795,215 7,154,638 6,859,289
The disclosures of these co-pending applications are incorporated
herein by cross-reference.
BACKGROUND
[0005] Presently, a user wishing to interact with a computer system
usually does so using a display device such as a computer monitor
and an input device such as a keyboard or a computer mouse. Whilst
such interfaces are useful, they are relatively bulky and
non-portable, and can lack the immediacy and expressiveness
afforded by a pen applied to paper. Moreover, in many situations,
paper is a more comfortable and convenient recording medium, due to
its superior visual characteristics and its portability.
Nevertheless, information recorded on paper is generally not as
accessible and malleable as information recorded on a computer
system. Furthermore, unlike a computer interface, a pen applied to
paper generally lacks the ability to control computer software.
OBJECT
[0006] It is an object of the present invention to combine
advantages of paper-based recordal of information and computer
interfacing.
SUMMARY OF INVENTION
[0007] In a first aspect of the invention, there is provided a
method of enabling user interaction with computer software running
in a computer system via:
[0008] an interface surface containing information relating to the
computer software and including coded data indicative of an
identity of the interface surface and of at least one reference
point of the interface surface; and
[0009] a sensing device which, when placed in an operative position
relative to the interface surface, senses indicating data
indicative of: the identity of the interface surface; and a
position of the sensing device relative to the interface surface,
using at least some of the coded data;
[0010] the method including the steps of, in the computer
system:
(a) receiving the indicating data from the sensing device; (b)
using the indicating data to identify at least one interactive
element relating to the computer software; and (c) operating the
computer software in accordance with instructions associated with
the at least one interactive element.
[0011] In a second aspect of the invention, there is provided a
method of enabling user interaction with computer software running
in a computer system, the method including the steps of:
[0012] providing an interface surface containing: information
relating to the computer software; and coded data indicative of an
identity of the interface surface and of at least one reference
point of the interface surface; and
[0013] in the computer system:
(a) receiving indicating data from a sensing device, the indicating
data being indicative of: the identity of the interface surface;
and a position of the sensing device relative to the interface
surface, the indicative data being sensed when the sensing device
is placed in an operative position relative to the interface
surface, using at least some of the coded data; (b) using the
indicating data to identify at least one interactive element
relating to the computer software; and (c) operating the computer
software in accordance with instructions associated with the at
least one interactive element.
[0014] Preferably, in the first and second aspects, the interactive
element is associated with a zone of the interface surface, and
step (b) includes using the position of the sensing device to
identify the zone and thereby the interactive element.
[0015] In preferred embodiments of the first and second aspects,
the sensing device generates movement data indicative of its
movement relative to the interface surface, using at least some of
the coded data. The method then includes the step of receiving, in
the computer system, the movement data, wherein step (b) includes
using the movement data to identify the zone and thereby the
interactive element.
[0016] In a third aspect of the invention, there is provided a
system for enabling user interaction with computer software running
in a computer system via:
[0017] an interface surface containing information relating to the
computer software and including coded data indicative of an
identity of the interface surface and of at least one reference
point of the interface surface; and
[0018] a sensing device which, when placed in an operative position
relative to the interface surface, senses indicating data
indicative of the identity of the interface surface and a position
of the sensing device relative to the interface surface, using at
least some of the coded data;
[0019] the system being configured to, in the computer system:
(a) receive the indicating data from the sensing device; (b) use
the indicating data to identify at least one interactive element
relating to the computer software; and (c) operate the computer
software in accordance with instructions associated with the at
least one interactive element.
[0020] In a fourth aspect of the invention, there is provided a
system for enabling user interaction with computer software running
in a computer system, the system including:
[0021] an interface surface containing information relating to the
computer software and including coded data indicative of an
identity of the interface surface and of at least one reference
point of the interface surface;
[0022] the system being configured to, in the computer system:
(a) receive indicating data from a sensing device, the indicating
data being indicative of the identity of the interface surface and
a position of the sensing device relative to the interface surface,
wherein the sensing device, when placed in an operative position
relative to the interface surface, senses the indicating data using
at least some of the coded data; (b) use the indicating data to
identify at least one interactive element relating to the computer
software; and (c) operate the computer software in accordance with
instructions associated with the at least one interactive
element.
[0023] Preferably, in the third and fourth aspects, the interactive
element is associated with a zone of the interface surface, the
system being configured to use the position of the sensing device
to identify the zone and thereby the interactive element.
[0024] In preferred forms of the third and fourth aspects, the
sensing device generates movement data indicative of its movement
relative to the interface surface using at least some of the coded
data. The computer system is configured to receive the movement
data, wherein (b) includes using the movement data to identify the
zone and thereby the interactive element.
[0025] Preferably, the coded data takes the form of tags disposed
on the interface surface.
[0026] In a particularly preferred form, the tags are printed onto
a surface in the form of a piece of paper, and are configured to be
read by a sensing device in the form of an optical sensing stylus.
The tags are preferably printed using an ink that absorbs near
infrared light but is substantially invisible to a human viewer
under normal lighting conditions. When a user brings a sensing end
of the stylus close to the surface, one or more of the tags are
imaged, interpreted and decoded to provide an indication of the
identity of the region from which the tag was imaged, and a
position of the sensing device in relation to the surface. This
information is used to identify an interactive element, and
software in an associated computer system is then operated on the
basis of instructions associated with the interactive element.
[0027] Further aspects of the invention will become apparent from
reading the following detailed description of preferred and other
embodiments of the invention.
BRIEF DESCRIPTION OF DRAWINGS
[0028] Preferred and other embodiments of the invention will now be
described, by way of non-limiting example only, with reference to
the accompanying drawings, in which:
[0029] FIG. 1 is a schematic of a the relationship between a sample
printed netpage and its online page description;
[0030] FIG. 2 is a schematic view of a interaction between a
netpage pen, a netpage printer, a netpage page server, and a
netpage application server;
[0031] FIG. 3 illustrates a collection of netpage servers and
printers interconnected via a network;
[0032] FIG. 4 is a schematic view of a high-level structure of a
printed netpage and its online page description;
[0033] FIG. 5 is a plan view showing a structure of a netpage
tag;
[0034] FIG. 6 is a plan view showing a relationship between a set
of the tags shown in FIG. 5 and a field of view of a netpage
sensing device in the form of a netpage pen;
[0035] FIG. 7 is a flowchart of a tag image processing and decoding
algorithm;
[0036] FIG. 8 is a perspective view of a netpage pen and its
associated tag-sensing field-of-view cone;
[0037] FIG. 9 is a perspective exploded view of the netpage pen
shown in FIG. 8;
[0038] FIG. 10 is a schematic block diagram of a pen controller for
the netpage pen shown in FIGS. 8 and 9;
[0039] FIG. 11 is a perspective view of a wall-mounted netpage
printer;
[0040] FIG. 12 is a section through the length of the netpage
printer of FIG. 11;
[0041] FIG. 12A is an enlarged portion of FIG. 12 showing a section
of the duplexed print engines and glue wheel assembly;
[0042] FIG. 13 is a detailed view of the ink cartridge, ink, air
and glue paths, and print engines of the netpage printer of FIGS.
11 and 12;
[0043] FIG. 14 is a schematic block diagram of a printer controller
for the netpage printer shown in FIGS. 11 and 12;
[0044] FIG. 15 is a schematic block diagram of duplexed print
engine controllers and Memjet.TM. printheads associated with the
printer controller shown in FIG. 14;
[0045] FIG. 16 is a schematic block diagram of the print engine
controller shown in FIGS. 14 and 15;
[0046] FIG. 17 is a perspective view of a single Memjet.TM.
printing element, as used in, for example, the netpage printer of
FIGS. 10 to 12;
[0047] FIG. 18 is a perspective view of a small part of an array of
Memjet.TM. printing elements;
[0048] FIG. 19 is a series of perspective views illustrating the
operating cycle of the Memjet.TM. printing element shown in FIG.
13;
[0049] FIG. 20 is a perspective view of a short segment of a
pagewidth Memjet.TM. printhead;
[0050] FIG. 21 is a schematic view of a user class diagram;
[0051] FIG. 22 is a schematic view of a printer class diagram;
[0052] FIG. 23 is a schematic view of a pen class diagram;
[0053] FIG. 24 is a schematic view of an application class
diagram;
[0054] FIG. 25 is a schematic view of a document and page
description class diagram;
[0055] FIG. 26 is a schematic view of a document and page ownership
class diagram;
[0056] FIG. 27 is a schematic view of a terminal element
specialization class diagram;
[0057] FIG. 28 is a schematic view of a static element
specialization class diagram;
[0058] FIG. 29 is a schematic view of a hyperlink element class
diagram;
[0059] FIG. 30 is a schematic view of a hyperlink element
specialization class diagram;
[0060] FIG. 31 is a schematic view of a hyperlinked group class
diagram;
[0061] FIG. 32 is a schematic view of a form class diagram;
[0062] FIG. 33 is a schematic view of a digital ink class
diagram;
[0063] FIG. 34 is a schematic view of a field element
specialization class diagram;
[0064] FIG. 35 is a schematic view of a checkbox field class
diagram;
[0065] FIG. 36 is a schematic view of a text field class
diagram;
[0066] FIG. 37 is a schematic view of a signature field class
diagram;
[0067] FIG. 38 is a flowchart of an input processing algorithm;
[0068] FIG. 38A is a detailed flowchart of one step of the
flowchart of FIG. 38;
[0069] FIG. 39 is a schematic view of a page server command element
class diagram;
[0070] FIG. 40 is a schematic view of a resource description class
diagram;
[0071] FIG. 41 is a schematic view of a favorites list class
diagram;
[0072] FIG. 42 is a schematic view of a history list class
diagram;
[0073] FIG. 43 is a schematic view of a subscription delivery
protocol;
[0074] FIG. 44 is a schematic view of a hyperlink request class
diagram;
[0075] FIG. 45 is a schematic view of a hyperlink activation
protocol;
[0076] FIG. 46 is a schematic view of a form submission
protocol;
[0077] FIG. 47 is a schematic view of a commission payment
protocol;
[0078] FIG. 48 is a schematic view of a set of radial wedges making
up a symbol;
[0079] FIG. 49 is a schematic view of a ring A and B symbol
allocation scheme;
[0080] FIG. 50 is a schematic view of a first ring C and D symbol
allocation scheme;
[0081] FIG. 51 is a schematic view of a second ring C and D symbol
allocation scheme;
[0082] FIG. 52 is a schematic view of a triangular tag packing;
[0083] FIG. 53 is a perspective view of an icosahedron;
[0084] FIG. 54 is a perspective view of an icosahedral geodesic
with frequency 3;
[0085] FIG. 55 is a schematic view of a minimum tag spacing;
[0086] FIG. 56 is a schematic view of a minimum tag spacing which
avoids overlap;
[0087] FIG. 57 is a schematic view of a first tag insertion
case;
[0088] FIG. 58 is a schematic view of a second tag insertion
case;
[0089] FIG. 59 is a schematic view of a third tag insertion
case;
[0090] FIG. 60 is a schematic view of a fourth tag insertion
case;
[0091] FIG. 61 is a schematic view of a pen orientation relative to
a surface;
[0092] FIG. 62 is a schematic view of a pen pitch geometry;
[0093] FIG. 63 is a schematic view of a pen roll geometry;
[0094] FIG. 64 is a schematic view of a pen coordinate space
showing physical and optical axes of a pen;
[0095] FIG. 65 is a schematic view of a curved nib geometry;
[0096] FIG. 66 is a schematic view of an interaction between
sampling frequency and tag frequency;
[0097] FIG. 67 is a table containing equations numbered 1 to
10;
[0098] FIG. 68 is a table containing equations numbered 11 to
20;
[0099] FIG. 69 is a table containing equations numbered 21 to
26;
[0100] FIG. 70 is a table containing equations numbered 27 to
34;
[0101] FIG. 71 is a table containing equations numbered 35 to
41;
[0102] FIG. 72 is a table containing equations numbered 42 to
44;
[0103] FIG. 73 is a table containing equations numbered 45 to
47;
[0104] FIG. 74 is a table containing equations numbered 48 to
51;
[0105] FIG. 75 is a table containing equations numbered 52 to
54;
[0106] FIG. 76 is a table containing equations numbered 55 to
57;
[0107] FIG. 77 is a table containing equations numbered 58 to
59;
[0108] FIG. 78 is a table containing equations numbered 60 to
63;
[0109] FIG. 79 is a table containing equations numbered 64 to
74;
[0110] FIG. 80 is a table containing equations numbered 75 to
86;
[0111] FIG. 81 is a table containing equations numbered 87 to
99;
[0112] FIG. 82 is a table containing equations numbered 100 to
111;
[0113] FIG. 83 is a table containing equations numbered 112 to
120;
[0114] FIG. 84 is a table containing equations numbered 121 to
129;
[0115] FIG. 85 is a table containing a set of degenerate forms of
equations 64 to 71;
[0116] FIG. 86 is a first part of a table containing conditions and
special handling for zero pitch and zero roll; and
[0117] FIG. 87 is a the second part of the table of FIG. 86.
DETAILED DESCRIPTION OF PREFERRED AND OTHER EMBODIMENTS
[0118] Note: Memjet.TM. is a trade mark of Silverbrook Research Pty
Ltd, Australia.
[0119] In the preferred embodiment, the invention is configured to
work with the netpage networked computer system, a detailed
overview of which follows. It will be appreciated that not every
implementation will necessarily embody all or even most of the
specific details and extensions discussed below in relation to the
basic system. However, the system is described in its most complete
form to reduce the need for external reference when attempting to
understand the context in which the preferred embodiments and
aspects of the present invention operate.
[0120] In brief summary, the preferred form of the netpage system
employs a computer interface in the form of a mapped surface, that
is, a physical surface which contains references to a map of the
surface maintained in a computer system. The map references can be
queried by an appropriate sensing device. Depending upon the
specific implementation, the map references may be encoded visibly
or invisibly, and defined in such a way that a local query on the
mapped surface yields an unambiguous map reference both within the
map and among different maps. The computer system can contain
information about features on the mapped surface, and such
information can be retrieved based on map references supplied by a
sensing device used with the mapped surface. The information thus
retrieved can take the form of actions which are initiated by the
computer system on behalf of the operator in response to the
operator's interaction with the surface features.
[0121] In its preferred form, the netpage system relies on the
production of, and human interaction with, netpages. These are
pages of text, graphics and images printed on ordinary paper, but
which work like interactive web pages. Information is encoded on
each page using ink which is substantially invisible to the unaided
human eye. The ink, however, and thereby the coded data, can be
sensed by an optically imaging pen and transmitted to the netpage
system.
[0122] In the preferred form, active buttons and hyperlinks on each
page can be clicked with the pen to request information from the
network or to signal preferences to a network server. In one
embodiment, text written by hand on a netpage is automatically
recognized and converted to computer text in the netpage system,
allowing forms to be filled in. In other embodiments, signatures
recorded on a netpage are automatically verified, allowing
e-commerce transactions to be securely authorized.
[0123] As illustrated in FIG. 1, a printed netpage 1 can represent
a interactive form which can be filled in by the user both
physically, on the printed page, and "electronically", via
communication between the pen and the netpage system. The example
shows a "Request" form containing name and address fields and a
submit button. The netpage consists of graphic data 2 printed using
visible ink, and coded data 3 printed as a collection of tags 4
using invisible ink. The corresponding page description 5, stored
on the netpage network, describes the individual elements of the
netpage. In particular it describes the type and spatial extent
(zone) of each interactive element (i.e. text field or button in
the example), to allow the netpage system to correctly interpret
input via the netpage. The submit button 6, for example, has a zone
7 which corresponds to the spatial extent of the corresponding
graphic 8.
[0124] As illustrated in FIG. 2, the netpage pen 101, a preferred
form of which is shown in FIGS. 8 and 9 and described in more
detail below, works in conjunction with a netpage printer 601, an
Internet-connected printing appliance for home, office or mobile
use. The pen is wireless and communicates securely with the netpage
printer via a short-range radio link 9.
[0125] The netpage printer 601, a preferred form of which is shown
in FIGS. 11 to 13 and described in more detail below, is able to
deliver, periodically or on demand, personalized newspapers,
magazines, catalogs, brochures and other publications, all printed
at high quality as interactive netpages. Unlike a personal
computer, the netpage printer is an appliance which can be, for
example, wall-mounted adjacent to an area where the morning news is
first consumed, such as in a user's kitchen, near a breakfast
table, or near the household's point of departure for the day. It
also comes in tabletop, desktop, portable and miniature
versions.
[0126] Netpages printed at their point of consumption combine the
ease-of-use of paper with the timeliness and interactivity of an
interactive medium.
[0127] As shown in FIG. 2, the netpage pen 101 interacts with the
coded data on a printed netpage 1 and communicates, via a
short-range radio link 9, the interaction to a netpage printer. The
printer 601 sends the interaction to the relevant netpage page
server 10 for interpretation. In appropriate circumstances, the
page server sends a corresponding message to application computer
software running on a netpage application server 13. The
application server may in turn send a response which is printed on
the originating printer.
[0128] The netpage system is made considerably more convenient in
the preferred embodiment by being used in conjunction with
high-speed microelectromechanical system (MEMS) based inkjet
(Memjet.TM.) printers. In the preferred form of this technology,
relatively high-speed and high-quality printing is made more
affordable to consumers. In its preferred form, a netpage
publication has the physical characteristics of a traditional
newsmagazine, such as a set of letter-size glossy pages printed in
full color on both sides, bound together for easy navigation and
comfortable handling.
[0129] The netpage printer exploits the growing availability of
broadband Internet access. Cable service is available to 95% of
households in the United States, and cable modem service offering
broadband Internet access is already available to 20% of these. The
netpage printer can also operate with slower connections, but with
longer delivery times and lower image quality. Indeed, the netpage
system can be enabled using existing consumer inkjet and laser
printers, although the system will operate more slowly and will
therefore be less acceptable from a consumer's point of view. In
other embodiments, the netpage system is hosted on a private
intranet. In still other embodiments, the netpage system is hosted
on a single computer or computer-enabled device, such as a
printer.
[0130] Netpage publication servers 14 on the netpage network are
configured to deliver print-quality publications to netpage
printers. Periodical publications are delivered automatically to
subscribing netpage printers via pointcasting and multicasting
Internet protocols. Personalized publications are filtered and
formatted according to individual user profiles.
[0131] A netpage printer can be configured to support any number of
pens, and a pen can work with any number of netpage printers. In
the preferred implementation, each netpage pen has a unique
identifier. A household may have a collection of colored netpage
pens, one assigned to each member of the family. This allows each
user to maintain a distinct profile with respect to a netpage
publication server or application server.
[0132] A netpage pen can also be registered with a netpage
registration server 11 and linked to one or more payment card
accounts. This allows e-commerce payments to be securely authorized
using the netpage pen. The netpage registration server compares the
signature captured by the netpage pen with a previously registered
signature, allowing it to authenticate the user's identity to an
e-commerce server. Other biometrics can also be used to verify
identity. A version of the netpage pen includes fingerprint
scanning, verified in a similar way by the netpage registration
server.
[0133] Although a netpage printer may deliver periodicals such as
the morning newspaper without user intervention, it can be
configured never to deliver unsolicited junk mail. In its preferred
form, it only delivers periodicals from subscribed or otherwise
authorized sources. In this respect, the netpage printer is unlike
a fax machine or e-mail account which is visible to any junk mailer
who knows the telephone number or email address.
1 Netpage System Architecture
[0134] Each object model in the system is described using a Unified
Modeling Language (UML) class diagram. A class diagram consists of
a set of object classes connected by relationships, and two kinds
of relationships are of interest here: associations and
generalizations. An association represents some kind of
relationship between objects, i.e. between instances of classes. A
generalization relates actual classes, and can be understood in the
following way: if a class is thought of as the set of all objects
of that class, and class A is a generalization of class B, then B
is simply a subset of A. The UML does not directly support
second-order modelling--i.e. classes of classes.
[0135] Each class is drawn as a rectangle labelled with the name of
the class. It contains a list of the attributes of the class,
separated from the name by a horizontal line, and a list of the
operations of the class, separated from the attribute list by a
horizontal line. In the class diagrams which follow, however,
operations are never modelled.
[0136] An association is drawn as a line joining two classes,
optionally labelled at either end with the multiplicity of the
association. The default multiplicity is one. An asterisk (*)
indicates a multiplicity of "many", i.e. zero or more. Each
association is optionally labelled with its name, and is also
optionally labelled at either end with the role of the
corresponding class. An open diamond indicates an aggregation
association ("is-part-of"), and is drawn at the aggregator end of
the association line.
[0137] A generalization relationship ("is-a") is drawn as a solid
line joining two classes, with an arrow (in the form of an open
triangle) at the generalization end.
[0138] When a class diagram is broken up into multiple diagrams,
any class which is duplicated is shown with a dashed outline in all
but the main diagram which defines it. It is shown with attributes
only where it is defined.
1.1 Netpages
[0139] Netpages are the foundation on which a netpage network is
built. They provide a paper-based user interface to published
information and interactive services.
[0140] A netpage consists of a printed page (or other surface
region) invisibly tagged with references to an online description
of the page. The online page description is maintained persistently
by a netpage page server. The page description describes the
visible layout and content of the page, including text, graphics
and images. It also describes the input elements on the page,
including buttons, hyperlinks, and input fields. A netpage allows
markings made with a netpage pen on its surface to be
simultaneously captured and processed by the netpage system.
[0141] Multiple netpages can share the same page description.
However, to allow input through otherwise identical pages to be
distinguished, each netpage is assigned a unique page identifier.
This page ID has sufficient precision to distinguish between a very
large number of netpages.
[0142] Each reference to the page description is encoded in a
printed tag. The tag identifies the unique page on which it
appears, and thereby indirectly identifies the page description.
The tag also identifies its own position on the page.
Characteristics of the tags are described in more detail below.
[0143] Tags are printed in infrared-absorptive ink on any substrate
which is infrared-reflective, such as ordinary paper. Near-infrared
wavelengths are invisible to the human eye but are easily sensed by
a solid-state image sensor with an appropriate filter.
[0144] A tag is sensed by an area image sensor in the netpage pen,
and the tag data is transmitted to the netpage system via the
nearest netpage printer. The pen is wireless and communicates with
the netpage printer via a short-range radio link. Tags are
sufficiently small and densely arranged that the pen can reliably
image at least one tag even on a single click on the page. It is
important that the pen recognize the page ID and position on every
interaction with the page, since the interaction is stateless. Tags
are error-correctably encoded to make them partially tolerant to
surface damage.
[0145] The netpage page server maintains a unique page instance for
each printed netpage, allowing it to maintain a distinct set of
user-supplied values for input fields in the page description for
each printed netpage.
[0146] The relationship between the page description, the page
instance, and the printed netpage is shown in FIG. 4. The page
instance is associated with both the netpage printer which printed
it and, if known, the netpage user who requested it.
1.2 Netpage Tags
1.2.1 Tag Data Content
[0147] In a preferred form, each tag identifies the region in which
it appears, and the location of that tag within the region. A tag
may also contain flags which relate to the region as a whole or to
the tag. One or more flag bits may, for example, signal a tag
sensing device to provide feedback indicative of a function
associated with the immediate area of the tag, without the sensing
device having to refer to a description of the region. A netpage
pen may, for example, illuminate an "active area" LED when in the
zone of a hyperlink.
[0148] As will be more clearly explained below, in a preferred
embodiment, each tag contains an easily recognized invariant
structure which aids initial detection, and which assists in
minimizing the effect of any warp induced by the surface or by the
sensing process. The tags preferably tile the entire page, and are
sufficiently small and densely arranged that the pen can reliably
image at least one tag even on a single click on the page. It is
important that the pen recognize the page ID and position on every
interaction with the page, since the interaction is stateless.
[0149] In a preferred embodiment, the region to which a tag refers
coincides with an entire page, and the region ID encoded in the tag
is therefore synonymous with the page ID of the page on which the
tag appears. In other embodiments, the region to which a tag refers
can be an arbitrary subregion of a page or other surface. For
example, it can coincide with the zone of an interactive element,
in which case the region ID can directly identify the interactive
element.
TABLE-US-00002 TABLE 1 Tag data Field Precision (bits) Region ID
100 Tag ID 16 Flags 4 Total 120
[0150] Each tag contains 120 bits of information, typically
allocated as shown in Table 1. Assuming a maximum tag density of 64
per square inch, a 16-bit tag ID supports a region size of up to
1024 square inches. Larger regions can be mapped continuously
without increasing the tag ID precision simply by using abutting
regions and maps. The 100-bit region ID allows 2.sup.100
(.about.10.sup.30 or a million trillion trillion) different regions
to be uniquely identified.
1.2.2 Tag Data Encoding
[0151] The 120 bits of tag data are redundantly encoded using a
(15, 5) Reed-Solomon code. This yields 360 encoded bits consisting
of 6 codewords of 15 4-bit symbols each. The (15, 5) code allows up
to 5 symbol errors to be corrected per codeword, i.e. it is
tolerant of a symbol error rate of up to 33% per codeword.
[0152] Each 4-bit symbol is represented in a spatially coherent way
in the tag, and the symbols of the six codewords are interleaved
spatially within the tag. This ensures that a burst error (an error
affecting multiple spatially adjacent bits) damages a minimum
number of symbols overall and a minimum number of symbols in any
one codeword, thus maximising the likelihood that the burst error
can be fully corrected.
1.2.3 Physical Tag Structure
[0153] The physical representation of the tag, shown in FIG. 5,
includes fixed target structures 15, 16, 17 and variable data areas
18. The fixed target structures allow a sensing device such as the
netpage pen to detect the tag and infer its three-dimensional
orientation relative to the sensor. The data areas contain
representations of the individual bits of the encoded tag data.
[0154] To achieve proper tag reproduction, the tag is rendered at a
resolution of 256.times.256 dots. When printed at 1600 dots per
inch this yields a tag with a diameter of about 4 mm. At this
resolution the tag is designed to be surrounded by a "quiet area"
of radius 16 dots. Since the quiet area is also contributed by
adjacent tags, it only adds 16 dots to the effective diameter of
the tag.
[0155] The tag includes six target structures: a detection ring 15;
an orientation axis target 16; and four perspective targets 17.
[0156] The detection ring 15 allows the sensing device to initially
detect the tag 4. The ring is easy to detect because it is
rotationally invariant and because a simple correction of its
aspect ratio removes most of the effects of perspective distortion.
The orientation axis 16 allows the sensing device to determine the
approximate planar orientation of the tag due to the yaw of the
sensor. The orientation axis is skewed to yield a unique
orientation. The four perspective targets 17 allow the sensing
device to infer an accurate two-dimensional perspective transform
of the tag and hence an accurate three-dimensional position and
orientation of the tag relative to the sensor.
[0157] All target structures are redundantly large to improve their
immunity to noise.
[0158] The overall tag shape is circular. This supports, amongst
other things, optimal tag packing on an irregular triangular grid.
In combination with the circular detection ring 15, this makes a
circular arrangement of data bits within the tag optimal. To
maximise its size, each data bit is represented by a radial wedge
510 in the form of an area bounded by two radial lines 512, a
radially inner arc 514 and a radially outer arc 516. Each wedge 510
has a minimum dimension of 8 dots at 1600 dpi and is designed so
that its base (i.e. its inner arc 514), is at least equal to this
minimum dimension. The radial height of the wedge 510 is always
equal to the minimum dimension. Each 4-bit data symbol is
represented by an array 518 of 2.times.2 wedges 510, as best shown
in FIG. 48.
[0159] The 15 4-bit data symbols of each of the six codewords are
allocated to the four concentric symbol rings 18a to 18d, shown in
FIG. 5, in interleaved fashion as shown in FIGS. 49 to 51. Symbols
of first to sixth codewords 520-525 are allocated alternately in
circular progression around the tag.
[0160] The interleaving is designed to maximise the average spatial
distance between any two symbols of the same codeword.
[0161] In order to support "single-click" interaction with a tagged
region via a sensing device, the sensing device must be able to see
at least one entire tag in its field of view no matter where in the
region or at what orientation it is positioned. The required
diameter of the field of view of the sensing device is therefore a
function of the size and spacing of the tags.
[0162] Assuming a circular tag shape, the minimum diameter of the
sensor field of view is obtained when the tags are tiled on a
equilateral triangular grid, as shown in FIG. 6.
1.2.4 Tag Image Processing and Decoding
[0163] The tag image processing and decoding performed by a sensing
device such as the netpage pen is shown in FIG. 7. While a captured
image is being acquired from the image sensor, the dynamic range of
the image is determined (at 20). The center of the range is then
chosen as the binary threshold for the image 21. The image is then
thresholded and segmented into connected pixel regions (i.e. shapes
23) (at 22). Shapes which are too small to represent tag target
structures are discarded. The size and centroid of each shape is
also computed.
[0164] Binary shape moments 25 are then computed (at 24) for each
shape, and these provide the basis for subsequently locating target
structures. Central shape moments are by their nature invariant of
position, and can be easily made invariant of scale, aspect ratio
and rotation.
[0165] The ring target structure 15 is the first to be located (at
26). A ring has the advantage of being very well behaved when
perspective-distorted. Matching proceeds by aspect-normalizing and
rotation-normalizing each shape's moments. Once its second-order
moments are normalized the ring is easy to recognize even if the
perspective distortion was significant. The ring's original aspect
and rotation 27 together provide a useful approximation of the
perspective transform.
[0166] The axis target structure 16 is the next to be located (at
28). Matching proceeds by applying the ring's normalizations to
each shape's moments, and rotation-normalizing the resulting
moments. Once its second-order moments are normalized the axis
target is easily recognized. Note that one third order moment is
required to disambiguate the two possible orientations of the axis.
The shape is deliberately skewed to one side to make this possible.
Note also that it is only possible to rotation-normalize the axis
target after it has had the ring's normalizations applied, since
the perspective distortion can hide the axis target's axis. The
axis target's original rotation provides a useful approximation of
the tag's rotation due to pen yaw 29.
[0167] The four perspective target structures 17 are the last to be
located (at 30). Good estimates of their positions are computed
based on their known spatial relationships to the ring and axis
targets, the aspect and rotation of the ring, and the rotation of
the axis. Matching proceeds by applying the ring's normalizations
to each shape's moments. Once their second-order moments are
normalized the circular perspective targets are easy to recognize,
and the target closest to each estimated position is taken as a
match. The original centroids of the four perspective targets are
then taken to be the perspective-distorted corners 31 of a square
of known size in tag space, and an eight-degree-of-freedom
perspective transform 33 is inferred (at 32) based on solving the
well-understood equations relating the four tag-space and
image-space point pairs (see Heckbert, P., Fundamentals of Texture
Mapping and Image Warping, Masters Thesis, Dept. of EECS, U. of
California at Berkeley, Technical Report No. UCB/CSD 89/516, June
1989, the contents of which are herein incorporated by
cross-reference).
[0168] The inferred tag-space to image-space perspective transform
is used to project (at 36) each known data bit position in tag
space into image space where the real-valued position is used to
bilinearly interpolate (at 36) the four relevant adjacent pixels in
the input image. The previously computed image threshold 21 is used
to threshold the result to produce the final bit value 37.
[0169] Once all 360 data bits 37 have been obtained in this way,
each of the six 60-bit Reed-Solomon codewords is decoded (at 38) to
yield 20 decoded bits 39, or 120 decoded bits in total. Note that
the codeword symbols are sampled in codeword order, so that
codewords are implicitly de-interleaved during the sampling
process.
[0170] The ring target 15 is only sought in a subarea of the image
whose relationship to the image guarantees that the ring, if found,
is part of a complete tag. If a complete tag is not found and
successfully decoded, then no pen position is recorded for the
current frame. Given adequate processing power and ideally a
non-minimal field of view 193, an alternative strategy involves
seeking another tag in the current image.
[0171] The obtained tag data indicates the identity of the region
containing the tag and the position of the tag within the region.
An accurate position 35 of the pen nib in the region, as well as
the overall orientation 35 of the pen, is then inferred (at 34)
from the perspective transform 33 observed on the tag and the known
spatial relationship between the pen's physical axis and the pen's
optical axis.
1.2.5 Tag Map
[0172] Decoding a tag results in a region ID, a tag ID, and a
tag-relative pen transform. Before the tag ID and the tag-relative
pen location can be translated into an absolute location within the
tagged region, the location of the tag within the region must be
known. This is given by a tag map, a function which maps each tag
ID in a tagged region to a corresponding location. The tag map
class diagram is shown in FIG. 22, as part of the netpage printer
class diagram.
[0173] A tag map reflects the scheme used to tile the surface
region with tags, and this can vary according to surface type. When
multiple tagged regions share the same tiling scheme and the same
tag numbering scheme, they can also share the same tag map.
[0174] The tag map for a region must be retrievable via the region
ID. Thus, given a region ID, a tag ID and a pen transform, the tag
map can be retrieved, the tag ID can be translated into an absolute
tag location within the region, and the tag-relative pen location
can be added to the tag location to yield an absolute pen location
within the region.
1.2.6 Tagging Schemes
[0175] Two distinct surface coding schemes are of interest, both of
which use the tag structure described earlier in this section. The
preferred coding scheme uses "location-indicating" tags as already
discussed. An alternative coding scheme uses object-indicating
tags.
[0176] A location-indicating tag contains a tag ID which, when
translated through the tag map associated with the tagged region,
yields a unique tag location within the region. The tag-relative
location of the pen is added to this tag location to yield the
location of the pen within the region. This in turn is used to
determine the location of the pen relative to a user interface
element in the page description associated with the region. Not
only is the user interface element itself identified, but a
location relative to the user interface element is identified.
Location-indicating tags therefore trivially support the capture of
an absolute pen path in the zone of a particular user interface
element.
[0177] An object-indicating tag contains a tag ID which directly
identifies a user interface element in the page description
associated with the region. All the tags in the zone of the user
interface element identify the user interface element, making them
all identical and therefore indistinguishable. Object-indicating
tags do not, therefore, support the capture of an absolute pen
path. They do, however, support the capture of a relative pen path.
So long as the position sampling frequency exceeds twice the
encountered tag frequency, the displacement from one sampled pen
position to the next within a stroke can be unambiguously
determined.
[0178] Assume a sampling wavelength of .lamda..sub.S and a tag
wavelength of .lamda..sub.T, with a relationship as defined in EQ
38. For two adjacent position samples P.sub.i and P.sub.i+1, one of
EQ 39 and EQ 40 will hold.
[0179] Assuming both equations hold leads to the relationship
defined in EQ 41.
[0180] Since EQ 41 contradicts EQ 38, the assumption that both EQ
39 and EQ 40 hold must be incorrect, and the choice is therefore
unambiguous, as stated.
[0181] The illustration in FIG. 60 shows four tags 500 and a
one-dimensional stroke of six sample positions 582 which satisfy EQ
38. Possible aliases 584 of the sample positions are also shown.
From inspection, if the distance from one sample position to the
next is .lamda..sub.S, then the distance from a sample position to
the alias of the next sample position exceeds .lamda..sub.S.
[0182] If the tag wavelength .lamda..sub.T is 4.7 mm, as discussed
in earlier, then the sampling wavelength .lamda..sub.S must be less
than 2.35 mm. If the temporal sampling frequency is 100 Hz as
required for accurate handwriting recognition, then the pen speed
must be less than 235 mm/s to satisfy EQ 38.
[0183] With either tagging scheme, the tags function in cooperation
with associated visual elements on the netpage as user interactive
elements in that a user can interact with the printed page using an
appropriate sensing device in order for tag data to be read by the
sensing device and for an appropriate response to be generated in
the netpage system.
1.3 Document and Page Descriptions
[0184] A preferred embodiment of a document and page description
class diagram is shown in FIGS. 25 and 26.
[0185] In the netpage system a document is described at three
levels. At the most abstract level the document 836 has a
hierarchical structure whose terminal elements 839 are associated
with content objects 840 such as text objects, text style objects,
image objects, etc. Once the document is printed on a printer with
a particular page size and according to a particular user's scale
factor preference, the document is paginated and otherwise
formatted. Formatted terminal elements 835 will in some cases be
associated with content objects which are different from those
associated with their corresponding terminal elements, particularly
where the content objects are style-related. Each printed instance
of a document and page is also described separately, to allow input
captured through a particular page instance 830 to be recorded
separately from input captured through other instances of the same
page description.
[0186] The presence of the most abstract document description on
the page server allows a user to request a copy of a document
without being forced to accept the source document's specific
format. The user may be requesting a copy through a printer with a
different page size, for example. Conversely, the presence of the
formatted document description on the page server allows the page
server to efficiently interpret user actions on a particular
printed page.
[0187] A formatted document 834 consists of a set of formatted page
descriptions 5, each of which consists of a set of formatted
terminal elements 835. Each formatted element has a spatial extent
or zone 58 on the page. This defines the active area of input
elements such as hyperlinks and input fields.
[0188] A document instance 831 corresponds to a formatted document
834. It consists of a set of page instances 830, each of which
corresponds to a page description 5 of the formatted document. Each
page instance 830 describes a single unique printed netpage 1, and
records the page ID 50 of the netpage. A page instance is not part
of a document instance if it represents a copy of a page requested
in isolation.
[0189] A page instance consists of a set of terminal element
instances 832. An element instance only exists if it records
instance-specific information. Thus, a hyperlink instance exists
for a hyperlink element because it records a transaction ID 55
which is specific to the page instance, and a field instance exists
for a field element because it records input specific to the page
instance. An element instance does not exist, however, for static
elements such as textflows.
[0190] A terminal element can be a static element 843, a hyperlink
element 844, a field element 845 or a page server command element
846, as shown in FIG. 27. A static element 843 can be a style
element 847 with an associated style object 854, a textflow element
848 with an associated styled text object 855, an image element 849
with an associated image element 856, a graphic element 850 with an
associated graphic object 857, a video clip element 851 with an
associated video clip object 858, an audio clip element 852 with an
associated audio clip object 859, or a script element 853 with an
associated script object 860, as shown in FIG. 28.
[0191] A page instance has a background field 833 which is used to
record any digital ink captured on the page which does not apply to
a specific input element.
[0192] In the preferred form of the invention, a tag map 811 is
associated with each page instance to allow tags on the page to be
translated into locations on the page.
1.4 The Netpage Network
[0193] In a preferred embodiment, a netpage network consists of a
distributed set of netpage page servers 10, netpage registration
servers 11, netpage ID servers 12, netpage application servers 13,
netpage publication servers 14, and netpage printers 601 connected
via a network 19 such as the Internet, as shown in FIG. 3.
[0194] The netpage registration server 11 is a server which records
relationships between users, pens, printers, applications and
publications, and thereby authorizes various network activities. It
authenticates users and acts as a signing proxy on behalf of
authenticated users in application transactions. It also provides
handwriting recognition services. As described above, a netpage
page server 10 maintains persistent information about page
descriptions and page instances. The netpage network includes any
number of page servers, each handling a subset of page instances.
Since a page server also maintains user input values for each page
instance, clients such as netpage printers send netpage input
directly to the appropriate page server. The page server interprets
any such input relative to the description of the corresponding
page.
[0195] A netpage ID server 12 allocates document IDs 51 on demand,
and provides load-balancing of page servers via its ID allocation
scheme.
[0196] A netpage printer uses the Internet Distributed Name System
(DNS), or similar, to resolve a netpage page ID 50 into the network
address of the netpage page server handling the corresponding page
instance.
[0197] A netpage application server 13 is a server which hosts
interactive netpage applications. A netpage publication server 14
is an application server which publishes netpage documents to
netpage printers. They are described in detail in Section 2.
[0198] Netpage servers can be hosted on a variety of network server
platforms from manufacturers such as IBM, Hewlett-Packard, and Sun.
Multiple netpage servers can run concurrently on a single host, and
a single server can be distributed over a number of hosts. Some or
all of the functionality provided by netpage servers, and in
particular the functionality provided by the ID server and the page
server, can also be provided directly in a netpage appliance such
as a netpage printer, in a computer workstation, or on a local
network.
1.5 The Netpage Printer
[0199] The netpage printer 601 is an appliance which is registered
with the netpage system and prints netpage documents on demand and
via subscription. Each printer has a unique printer ID 62, and is
connected to the netpage network via a network such as the
Internet, ideally via a broadband connection.
[0200] Apart from identity and security settings in non-volatile
memory, the netpage printer contains no persistent storage. As far
as a user is concerned, "the network is the computer". Netpages
function interactively across space and time with the help of the
distributed netpage page servers 10, independently of particular
netpage printers.
[0201] The netpage printer receives subscribed netpage documents
from netpage publication servers 14. Each document is distributed
in two parts: the page layouts, and the actual text and image
objects which populate the pages. Because of personalization, page
layouts are typically specific to a particular subscriber and so
are pointcast to the subscriber's printer via the appropriate page
server. Text and image objects, on the other hand, are typically
shared with other subscribers, and so are multicast to all
subscribers' printers and the appropriate page servers.
[0202] The netpage publication server optimizes the segmentation of
document content into pointcasts and multicasts. After receiving
the pointcast of a document's page layouts, the printer knows which
multicasts, if any, to listen to.
[0203] Once the printer has received the complete page layouts and
objects that define the document to be printed, it can print the
document.
[0204] The printer rasterizes and prints odd and even pages
simultaneously on both sides of the sheet. It contains duplexed
print engine controllers 760 and print engines utilizing Memjet.TM.
printheads 350 for this purpose.
[0205] The printing process consists of two decoupled stages:
rasterization of page descriptions, and expansion and printing of
page images. The raster image processor (RIP) consists of one or
more standard DSPs 757 running in parallel. The duplexed print
engine controllers consist of custom processors which expand,
dither and print page images in real time, synchronized with the
operation of the printheads in the print engines.
[0206] Printers not enabled for IR printing have the option to
print tags using IR-absorptive black ink, although this restricts
tags to otherwise empty areas of the page. Although such pages have
more limited functionality than IR-printed pages, they are still
classed as netpages.
[0207] A normal netpage printer prints netpages on sheets of paper.
More specialised netpage printers may print onto more specialised
surfaces, such as globes. Each printer supports at least one
surface type, and supports at least one tag tiling scheme, and
hence tag map, for each surface type. The tag map 811 which
describes the tag tiling scheme actually used to print a document
becomes associated with that document so that the document's tags
can be correctly interpreted.
[0208] FIG. 2 shows the netpage printer class diagram, reflecting
printer-related information maintained by a registration server 11
on the netpage network.
[0209] A preferred embodiment of the netpage printer is described
in greater detail in Section 6 below, with reference to FIGS. 11 to
16.
1.5.1 Memjet.TM. Printheads
[0210] The netpage system can operate using printers made with a
wide range of digital printing technologies, including thermal
inkjet, piezoelectric inkjet, laser electrophotographic, and
others. However, for wide consumer acceptance, it is desirable that
a netpage printer have the following characteristics:
[0211] photographic quality color printing
[0212] high quality text printing
[0213] high reliability
[0214] low printer cost
[0215] low ink cost
[0216] low paper cost
[0217] simple operation
[0218] nearly silent printing
[0219] high printing speed
[0220] simultaneous double sided printing
[0221] compact form factor
[0222] low power consumption
[0223] No commercially available printing technology has all of
these characteristics.
[0224] To enable to production of printers with these
characteristics, the present applicant has invented a new print
technology, referred to as Memjet.TM. technology. Memjet.TM. is a
drop-on-demand inkjet technology that incorporates pagewidth
printheads fabricated using microelectromechanical systems (MEMS)
technology. FIG. 17 shows a single printing element 300 of a
Memjet.TM. printhead. The netpage wallprinter incorporates 168960
printing elements 300 to form a 1600 dpi pagewidth duplex printer.
This printer simultaneously prints cyan, magenta, yellow, black,
and infrared inks as well as paper conditioner and ink
fixative.
[0225] The printing element 300 is approximately 110 microns long
by 32 microns wide. Arrays of these printing elements are formed on
a silicon substrate 301 that incorporates CMOS logic, data
transfer, timing, and drive circuits (not shown).
[0226] Major elements of the printing element 300 are the nozzle
302, the nozzle rim 303, the nozzle chamber 304, the fluidic seal
305, the ink channel rim 306, the lever arm 307, the active
actuator beam pair 308, the passive actuator beam pair 309, the
active actuator anchor 310, the passive actuator anchor 311, and
the ink inlet 312.
[0227] The active actuator beam pair 308 is mechanically joined to
the passive actuator beam pair 309 at the join 319. Both beams
pairs are anchored at their respective anchor points 310 and 311.
The combination of elements 308, 309, 310, 311, and 319 form a
cantilevered electrothermal bend actuator 320.
[0228] FIG. 18 shows a small part of an array of printing elements
300, including a cross section 315 of a printing element 300. The
cross section 315 is shown without ink, to clearly show the ink
inlet 312 that passes through the silicon wafer 301.
[0229] FIGS. 19(a), 19(b) and 19(c) show the operating cycle of a
Memjet.TM. printing element 300.
[0230] FIG. 19(a) shows the quiescent position of the ink meniscus
316 prior to printing an ink droplet Ink is retained in the nozzle
chamber by surface tension at the ink meniscus 316 and at the
fluidic seal 305 formed between the nozzle chamber 304 and the ink
channel rim 306.
[0231] While printing, the printhead CMOS circuitry distributes
data from the print engine controller to the correct printing
element, latches the data, and buffers the data to drive the
electrodes 318 of the active actuator beam pair 308. This causes an
electrical current to pass through the beam pair 308 for about one
microsecond, resulting in Joule heating. The temperature increase
resulting from Joule heating causes the beam pair 308 to expand. As
the passive actuator beam pair 309 is not heated, it does not
expand, resulting in a stress difference between the two beam
pairs. This stress difference is partially resolved by the
cantilevered end of the electrothermal bend actuator 320 bending
towards the substrate 301. The lever arm 307 transmits this
movement to the nozzle chamber 304. The nozzle chamber 304 moves
about two microns to the position shown in FIG. 19(b). This
increases the ink pressure, forcing ink 321 out of the nozzle 302,
and causing the ink meniscus 316 to bulge. The nozzle rim 303
prevents the ink meniscus 316 from spreading across the surface of
the nozzle chamber 304.
[0232] As the temperature of the beam pairs 308 and 309 equalizes,
the actuator 320 returns to its original position. This aids in the
break-off of the ink droplet 317 from the ink 321 in the nozzle
chamber, as shown in FIG. 19(c). The nozzle chamber is refilled by
the action of the surface tension at the meniscus 316.
[0233] FIG. 20 shows a segment of a printhead 350. In a netpage
printer, the length of the printhead is the full width of the paper
(typically 210 mm) in the direction 351. The segment shown is 0.4
mm long (about 0.2% of a complete printhead). When printing, the
paper is moved past the fixed printhead in the direction 352. The
printhead has 6 rows of interdigitated printing elements 300,
printing the six colors or types of ink supplied by the ink inlets
312.
[0234] To protect the fragile surface of the printhead during
operation, a nozzle guard wafer 330 is attached to the printhead
substrate 301. For each nozzle 302 there is a corresponding nozzle
guard hole 331 through which the ink droplets are fired. To prevent
the nozzle guard holes 331 from becoming blocked by paper fibers or
other debris, filtered air is pumped through the air inlets 332 and
out of the nozzle guard holes during printing. To prevent ink 321
from drying, the nozzle guard is sealed while the printer is
idle.
1.6 The Netpage Pen
[0235] The active sensing device of the netpage system is typically
a pen 101, which, using its embedded controller 134, is able to
capture and decode IR position tags from a page via an image
sensor. The image sensor is a solid-state device provided with an
appropriate filter to permit sensing at only near-infrared
wavelengths. As described in more detail below, the system is able
to sense when the nib is in contact with the surface, and the pen
is able to sense tags at a sufficient rate to capture human
handwriting (i.e. at 200 dpi or greater and 100 Hz or faster).
Information captured by the pen is encrypted and wirelessly
transmitted to the printer (or base station), the printer or base
station interpreting the data with respect to the (known) page
structure.
[0236] The preferred embodiment of the netpage pen operates both as
a normal marking ink pen and as a non-marking stylus. The marking
aspect, however, is not necessary for using the netpage system as a
browsing system, such as when it is used as an Internet interface.
Each netpage pen is registered with the netpage system and has a
unique pen ID 61. FIG. 23 shows the netpage pen class diagram,
reflecting pen-related information maintained by a registration
server 11 on the netpage network.
[0237] When either nib is in contact with a netpage, the pen
determines its position and orientation relative to the page. The
nib is attached to a force sensor, and the force on the nib is
interpreted relative to a threshold to indicate whether the pen is
"up" or "down". This allows a interactive element on the page to be
`clicked` by pressing with the pen nib, in order to request, say,
information from a network. Furthermore, the force is captured as a
continuous value to allow, say, the full dynamics of a signature to
be verified.
[0238] The pen determines the position and orientation of its nib
on the netpage by imaging, in the infrared spectrum, an area 193 of
the page in the vicinity of the nib. It decodes the nearest tag and
computes the position of the nib relative to the tag from the
observed perspective distortion on the imaged tag and the known
geometry of the pen optics. Although the position resolution of the
tag may be low, because the tag density on the page is inversely
proportional to the tag size, the adjusted position resolution is
quite high, exceeding the minimum resolution required for accurate
handwriting recognition.
[0239] Pen actions relative to a netpage are captured as a series
of strokes. A stroke consists of a sequence of time-stamped pen
positions on the page, initiated by a pen-down event and completed
by the subsequent pen-up event. A stroke is also tagged with the
page ID 50 of the netpage whenever the page ID changes, which,
under normal circumstances, is at the commencement of the
stroke.
[0240] Each netpage pen has a current selection 826 associated with
it, allowing the user to perform copy and paste operations etc. The
selection is timestamped to allow the system to discard it after a
defined time period. The current selection describes a region of a
page instance.
[0241] It consists of the most recent digital ink stroke captured
through the pen relative to the background area of the page. It is
interpreted in an application-specific manner once it is submitted
to an application via a selection hyperlink activation.
[0242] Each pen has a current nib 824. This is the nib last
notified by the pen to the system. In the case of the default
netpage pen described above, either the marking black ink nib or
the non-marking stylus nib is current. Each pen also has a current
nib style 825. This is the nib style last associated with the pen
by an application, e.g. in response to the user selecting a color
from a palette. The default nib style is the nib style associated
with the current nib. Strokes captured through a pen are tagged
with the current nib style. When the strokes are subsequently
reproduced, they are reproduced in the nib style with which they
are tagged.
[0243] Whenever the pen is within range of a printer with which it
can communicate, the pen slowly flashes its "online" LED. When the
pen fails to decode a stroke relative to the page, it momentarily
activates its "error" LED. When the pen succeeds in decoding a
stroke relative to the page, it momentarily activates its "ok"
LED.
[0244] A sequence of captured strokes is referred to as digital
ink. Digital ink forms the basis for the digital exchange of
drawings and handwriting, for online recognition of handwriting,
and for online verification of signatures.
[0245] The pen is wireless and transmits digital ink to the netpage
printer via a short-range radio link. The transmitted digital ink
is encrypted for privacy and security and packetized for efficient
transmission, but is always flushed on a pen-up event to ensure
timely handling in the printer.
[0246] When the pen is out-of-range of a printer it buffers digital
ink in internal memory, which has a capacity of over ten minutes of
continuous handwriting. When the pen is once again within range of
a printer, it transfers any buffered digital ink.
[0247] A pen can be registered with any number of printers, but
because all state data resides in netpages both on paper and on the
network, it is largely immaterial which printer a pen is
communicating with at any particular time.
[0248] A preferred embodiment of the pen is described in greater
detail in Section 6 below, with reference to FIGS. 8 to 10.
1.7 Netpage Interaction
[0249] The netpage printer 601 receives data relating to a stroke
from the pen 101 when the pen is used to interact with a netpage 1.
The coded data 3 of the tags 4 is read by the pen when it is used
to execute a movement, such as a stroke. The data allows the
identity of the particular page and associated interactive element
to be determined and an indication of the relative positioning of
the pen relative to the page to be obtained. The indicating data is
transmitted to the printer, where it resolves, via the DNS, the
page ID 50 of the stroke into the network address of the netpage
page server 10 which maintains the corresponding page instance 830.
It then transmits the stroke to the page server. If the page was
recently identified in an earlier stroke, then the printer may
already have the address of the relevant page server in its cache.
Each netpage consists of a compact page layout maintained
persistently by a netpage page server (see below). The page layout
refers to objects such as images, fonts and pieces of text,
typically stored elsewhere on the netpage network.
[0250] When the page server receives the stroke from the pen, it
retrieves the page description to which the stroke applies, and
determines which element of the page description the stroke
intersects. It is then able to interpret the stroke in the context
of the type of the relevant element.
[0251] A "click" is a stroke where the distance and time between
the pen down position and the subsequent pen up position are both
less than some small maximum. An object which is activated by a
click typically requires a click to be activated, and accordingly,
a longer stroke is ignored. The failure of a pen action, such as a
"sloppy" click, to register is indicated by the lack of response
from the pen's "ok" LED.
[0252] There are two kinds of input elements in a netpage page
description: hyperlinks and form fields. Input through a form field
can also trigger the activation of an associated hyperlink.
1.7.1 Hyperlinks
[0253] A hyperlink is a means of sending a message to a remote
application, and typically elicits a printed response in the
netpage system.
[0254] A hyperlink element 844 identifies the application 71 which
handles activation of the hyperlink, a link ID 54 which identifies
the hyperlink to the application, an "alias required" flag which
asks the system to include the user's application alias ID 65 in
the hyperlink activation, and a description which is used when the
hyperlink is recorded as a favorite or appears in the user's
history. The hyperlink element class diagram is shown in FIG.
29.
[0255] When a hyperlink is activated, the page server sends a
request to an application somewhere on the network. The application
is identified by an application ID 64, and the application ID is
resolved in the normal way via the DNS. There are three types of
hyperlinks: general hyperlinks 863, form hyperlinks 865, and
selection hyperlinks 864, as shown in FIG. 30. A general hyperlink
can implement a request for a linked document, or may simply signal
a preference to a server. A form hyperlink submits the
corresponding form to the application. A selection hyperlink
submits the current selection to the application. If the current
selection contains a single-word piece of text, for example, the
application may return a single-page document giving the word's
meaning within the context in which it appears, or a translation
into a different language. Each hyperlink type is characterized by
what information is submitted to the application.
[0256] The corresponding hyperlink instance 862 records a
transaction ID 55 which can be specific to the page instance on
which the hyperlink instance appears. The transaction ID can
identify user-specific data to the application, for example a
"shopping cart" of pending purchases maintained by a purchasing
application on behalf of the user.
[0257] The system includes the pen's current selection 826 in a
selection hyperlink activation. The system includes the content of
the associated form instance 868 in a form hyperlink activation,
although if the hyperlink has its "submit delta" attribute set,
only input since the last form submission is included. The system
includes an effective return path in all hyperlink activations.
[0258] A hyperlinked group 866 is a group element 838 which has an
associated hyperlink, as shown in FIG. 31. When input occurs
through any field element in the group, the hyperlink 844
associated with the group is activated. A hyperlinked group can be
used to associate hyperlink behavior with a field such as a
checkbox. It can also be used, in conjunction with the "submit
delta" attribute of a form hyperlink, to provide continuous input
to an application. It can therefore be used to support a
"blackboard" interaction model, i.e. where input is captured and
therefore shared as soon as it occurs.
1.7.2 Forms
[0259] A form defines a collection of related input fields used to
capture a related set of inputs through a printed netpage. A form
allows a user to submit one or more parameters to an application
software program running on a server.
[0260] A form 867 is a group element 838 in the document hierarchy.
It ultimately contains a set of terminal field elements 839. A form
instance 868 represents a printed instance of a form. It consists
of a set of field instances 870 which correspond to the field
elements 845 of the form. Each field instance has an associated
value 871, whose type depends on the type of the corresponding
field element. Each field value records input through a particular
printed form instance, i.e. through one or more printed netpages.
The form class diagram is shown in FIG. 32.
[0261] Each form instance has a status 872 which indicates whether
the form is active, frozen, submitted, void or expired. A form is
active when first printed. A form becomes frozen once it is signed.
A form becomes submitted once one of its submission hyperlinks has
been activated, unless the hyperlink has its "submit delta"
attribute set. A form becomes void when the user invokes a void
form, reset form or duplicate form page command. A form expires
when the time the form has been active exceeds the form's specified
lifetime. While the form is active, form input is allowed. Input
through a form which is not active is instead captured in the
background field 833 of the relevant page instance. When the form
is active or frozen, form submission is allowed. Any attempt to
submit a form when the form is not active or frozen is rejected,
and instead elicits an form status report.
[0262] Each form instance is associated (at 59) with any form
instances derived from it, thus providing a version history. This
allows all but the latest version of a form in a particular time
period to be excluded from a search.
[0263] All input is captured as digital ink. Digital ink 873
consists of a set of timestamped stroke groups 874, each of which
consists of a set of styled strokes 875. Each stroke consists of a
set of timestamped pen positions 876, each of which also includes
pen orientation and nib force. The digital ink class diagram is
shown in FIG. 33.
[0264] A field element 845 can be a checkbox field 877, a text
field 878, a drawing field 879, or a signature field 880. The field
element class diagram is shown in FIG. 34. Any digital ink captured
in a field's zone 58 is assigned to the field.
[0265] A checkbox field has an associated boolean value 881, as
shown in FIG. 35. Any mark (a tick, a cross, a stroke, a fill
zigzag, etc.) captured in a checkbox field's zone causes a true
value to be assigned to the field's value.
[0266] A text field has an associated text value 882, as shown in
FIG. 36. Any digital ink captured in a text field's zone is
automatically converted to text via online handwriting recognition,
and the text is assigned to the field's value. Online handwriting
recognition is well-understood see, for example, Tappert, C., C. Y.
Suen and T. Wakahara, "The State of the Art in On-Line Handwriting
Recognition", IEEE Transactions on Pattern Analysis and Machine
Intelligence, Vol. 12, No. 8, August 1990, the contents of which
are herein incorporated by cross-reference).
[0267] A signature field has an associated digital signature value
883, as shown in FIG. 37. Any digital ink captured in a signature
field's zone is automatically verified with respect to the identity
of the owner of the pen, and a digital signature of the content of
the form of which the field is part is generated and assigned to
the field's value. The digital signature is generated using the pen
user's private signature key specific to the application which owns
the form. Online signature verification is well-understood (see,
for example, Plamondon, R. and G. Lorette, "Automatic Signature
Verification and Writer Identification--The State of the Art",
Pattern Recognition, Vol. 22, No. 2, 1989, the contents of which
are herein incorporated by cross-reference).
[0268] A field element is hidden if its "hidden" attribute is set.
A hidden field element does not have an input zone on a page and
does not accept input. It can have an associated field value which
is included in the form data when the form containing the field is
submitted.
[0269] "Editing" commands, such as strike-throughs indicating
deletion, can also be recognized in form fields.
[0270] Because the handwriting recognition algorithm works "online"
(i.e. with access to the dynamics of the pen movement), rather than
"offline" (i.e. with access only to a bitmap of pen markings), it
can recognize run-on discretely-written characters with relatively
high accuracy, without a writer-dependent training phase. A
writer-dependent model of handwriting is automatically generated
over time, however, and can be generated up-front if necessary,
[0271] Digital ink, as already stated, consists of a sequence of
strokes. Any stroke which starts in a particular element's zone is
appended to that element's digital ink stream, ready for
interpretation. Any stroke not appended to an object's digital ink
stream is appended to the background field's digital ink
stream.
[0272] Digital ink captured in the background field is interpreted
as a selection gesture. Circumscription of one or more objects is
generally interpreted as a selection of the circumscribed objects,
although the actual interpretation is application-specific.
[0273] Table 2 summarises these various pen interactions with a
netpage.
TABLE-US-00003 TABLE 2 Summary of pen interactions with a netpage
Object Type Pen input Action Hyperlink General Click Submit action
to application Form Click Submit form to application Selection
Click Submit selection to application Form field Checkbox Any mark
Assign true to field Text Handwriting Convert digital ink to text;
assign text to field Drawing Digital ink Assign digital ink to
field Signature Signature Verify digital ink signature; generate
digital signature of form; assign digital signature to field None
-- Circumscription Assign digital ink to current selection
[0274] The system maintains a current selection for each pen. The
selection consists simply of the most recent stroke captured in the
background field. The selection is cleared after an inactivity
timeout to ensure predictable behavior.
[0275] The raw digital ink captured in every field is retained on
the netpage page server and is optionally transmitted with the form
data when the form is submitted to the application. This allows the
application to interrogate the raw digital ink should it suspect
the original conversion, such as the conversion of handwritten
text. This can, for example, involve human intervention at the
application level for forms which fail certain application-specific
consistency checks. As an extension to this, the entire background
area of a form can be designated as a drawing field. The
application can then decide, on the basis of the presence of
digital ink outside the explicit fields of the form, to route the
form to a human operator, on the assumption that the user may have
indicated amendments to the filled-in fields outside of those
fields.
[0276] FIG. 38 shows a flowchart of the process of handling pen
input relative to a netpage. The process consists of receiving (at
884) a stroke from the pen; identifying (at 885) the page instance
830 to which the page ID 50 in the stroke refers; retrieving (at
886) the page description 5; identifying (at 887) a formatted
element 839 whose zone 58 the stroke intersects; determining (at
888) whether the formatted element corresponds to a field element,
and if so appending (at 892) the received stroke to the digital ink
of the field value 871, interpreting (at 893) the accumulated
digital ink of the field, and determining (at 894) whether the
field is part of a hyperlinked group 866 and if so activating (at
895) the associated hyperlink; alternatively determining (at 889)
whether the formatted element corresponds to a hyperlink element
and if so activating (at 895) the corresponding hyperlink;
alternatively, in the absence of an input field or hyperlink,
appending (at 890) the received stroke to the digital ink of the
background field 833; and copying (at 891) the received stroke to
the current selection 826 of the current pen, as maintained by the
registration server.
[0277] FIG. 38a shows a detailed flowchart of step 893 in the
process shown in FIG. 38, where the accumulated digital ink of a
field is interpreted according to the type of the field. The
process consists of determining (at 896) whether the field is a
checkbox and (at 897) whether the digital ink represents a
checkmark, and if so assigning (at 898) a true value to the field
value; alternatively determining (at 899) whether the field is a
text field and if so converting (at 900) the digital ink to
computer text, with the help of the appropriate registration
server, and assigning (at 901) the converted computer text to the
field value; alternatively determining (at 902) whether the field
is a signature field and if so verifying (at 903) the digital ink
as the signature of the pen's owner, with the help of the
appropriate registration server, creating (at 904) a digital
signature of the contents of the corresponding form, also with the
help of the registration server and using the pen owner's private
signature key relating to the corresponding application, and
assigning (at 905) the digital signature to the field value.
1.7.3 Page Server Commands
[0278] A page server command is a command which is handled locally
by the page server. It operates directly on form, page and document
instances.
[0279] A page server command 907 can be a void form command 908, a
duplicate form command 909, a reset form command 910, a get form
status command 911, a duplicate page command 912, a reset page
command 913, a get page status command 914, a duplicate document
command 915, a reset document command 916, or a get document status
command 917, as shown in FIG. 39.
[0280] A void form command voids the corresponding form instance. A
duplicate form command voids the corresponding form instance and
then produces an active printed copy of the current form instance
with field values preserved. The copy contains the same hyperlink
transaction IDs as the original, and so is indistinguishable from
the original to an application. A reset form command voids the
corresponding form instance and then produces an active printed
copy of the form instance with field values discarded. A get form
status command produces a printed report on the status of the
corresponding form instance, including who published it, when it
was printed, for whom it was printed, and the form status of the
form instance.
[0281] Since a form hyperlink instance contains a transaction ID,
the application has to be involved in producing a new form
instance. A button requesting a new form instance is therefore
typically implemented as a hyperlink.
[0282] A duplicate page command produces a printed copy of the
corresponding page instance with the background field value
preserved. If the page contains a form or is part of a form, then
the duplicate page command is interpreted as a duplicate form
command. A reset page command produces a printed copy of the
corresponding page instance with the background field value
discarded. If the page contains a form or is part of a form, then
the reset page command is interpreted as a reset form command. A
get page status command produces a printed report on the status of
the corresponding page instance, including who published it, when
it was printed, for whom it was printed, and the status of any
forms it contains or is part of.
[0283] The netpage logo which appears on every netpage is usually
associated with a duplicate page element.
[0284] When a page instance is duplicated with field values
preserved, field values are printed in their native form, i.e. a
checkmark appears as a standard checkmark graphic, and text appears
as typeset text. Only drawings and signatures appear in their
original form, with a signature accompanied by a standard graphic
indicating successful signature verification.
[0285] A duplicate document command produces a printed copy of the
corresponding document instance with background field values
preserved. If the document contains any forms, then the duplicate
document command duplicates the forms in the same way a duplicate
form command does. A reset document command produces a printed copy
of the corresponding document instance with background field values
discarded. If the document contains any forms, then the reset
document command resets the forms in the same way a reset form
command does. A get document status command produces a printed
report on the status of the corresponding document instance,
including who published it, when it was printed, for whom it was
printed, and the status of any forms it contains.
[0286] If the page server command's "on selected" attribute is set,
then the command operates on the page identified by the pen's
current selection rather than on the page containing the command.
This allows a menu of page server commands to be printed. If the
target page doesn't contain a page server command element for the
designated page server command, then the command is ignored.
[0287] An application can provide application-specific handling by
embedding the relevant page server command element in a hyperlinked
group. The page server activates the hyperlink associated with the
hyperlinked group rather than executing the page server
command.
[0288] A page server command element is hidden if its "hidden"
attribute is set. A hidden command element does not have an input
zone on a page and so cannot be activated directly by a user. It
can, however, be activated via a page server command embedded in a
different page, if that page server command has its "on selected"
attribute set.
1.8 Standard Features of Netpages
[0289] In the preferred form, each netpage is printed with the
netpage logo at the bottom to indicate that it is a netpage and
therefore has interactive properties. The logo also acts as a copy
button. In most cases pressing the logo produces a copy of the
page. In the case of a form, the button produces a copy of the
entire form. And in the case of a secure document, such as a ticket
or coupon, the button elicits an explanatory note or advertising
page.
[0290] The default single-page copy function is handled directly by
the relevant netpage page server. Special copy functions are
handled by linking the logo button to an application.
1.9 User Help System
[0291] In a preferred embodiment, the netpage printer has a single
button labelled "Help". When pressed it elicits a single page of
information, including:
[0292] status of printer connection
[0293] status of printer consumables
[0294] top-level help menu
[0295] document function menu
[0296] top-level netpage network directory
[0297] The help menu provides a hierarchical manual on how to use
the netpage system.
[0298] The document function menu includes the following
functions:
[0299] print a copy of a document
[0300] print a clean copy of a form
[0301] print the status of a document
[0302] A document function is initiated by simply pressing the
button and then touching any page of the document. The status of a
document indicates who published it and when, to whom it was
delivered, and to whom and when it was subsequently submitted as a
form.
[0303] The netpage network directory allows the user to navigate
the hierarchy of publications and services on the network. As an
alternative, the user can call the netpage network "900" number
"yellow pages" and speak to a human operator. The operator can
locate the desired document and route it to the user's printer.
Depending on the document type, the publisher or the user pays the
small "yellow pages" service fee.
[0304] The help page is obviously unavailable if the printer is
unable to print. In this case the "error" light is lit and the user
can request remote diagnosis over the network.
2 Personalized Publication Model
[0305] In the following description, news is used as a canonical
publication example to illustrate personalization mechanisms in the
netpage system. Although news is often used in the limited sense of
newspaper and newsmagazine news, the intended scope in the present
context is wider.
[0306] In the netpage system, the editorial content and the
advertising content of a news publication are personalized using
different mechanisms. The editorial content is personalized
according to the reader's explicitly stated and implicitly captured
interest profile. The advertising content is personalized according
to the reader's locality and demographic.
2.1 Editorial Personalization
[0307] A subscriber can draw on two kinds of news sources: those
that deliver news publications, and those that deliver news
streams. While news publications are aggregated and edited by the
publisher, news streams are aggregated either by a news publisher
or by a specialized news aggregator. News publications typically
correspond to traditional newspapers and newsmagazines, while news
streams can be many and varied: a "raw" news feed from a news
service, a cartoon strip, a freelance writer's column, a friend's
bulletin board, or the reader's own e-mail.
[0308] The netpage publication server supports the publication of
edited news publications as well as the aggregation of multiple
news streams. By handling the aggregation and hence the formatting
of news streams selected directly by the reader, the server is able
to place advertising on pages over which it otherwise has no
editorial control.
[0309] The subscriber builds a daily newspaper by selecting one or
more contributing news publications, and creating a personalized
version of each. The resulting daily editions are printed and bound
together into a single newspaper. The various members of a
household typically express their different interests and tastes by
selecting different daily publications and then customizing
them.
[0310] For each publication, the reader optionally selects specific
sections. Some sections appear daily, while others appear weekly.
The daily sections available from The New York Times online, for
example, include "Page One Plus", "National", "International",
"Opinion", "Business", "Arts/Living", "Technology", and "Sports".
The set of available sections is specific to a publication, as is
the default subset.
[0311] The reader can extend the daily newspaper by creating custom
sections, each one drawing on any number of news streams. Custom
sections might be created for e-mail and friends' announcements
("Personal"), or for monitoring news feeds for specific topics
("Alerts" or "Clippings").
[0312] For each section, the reader optionally specifies its size,
either qualitatively (e.g. short, medium, or long), or numerically
(i.e. as a limit on its number of pages), and the desired
proportion of advertising, either qualitatively (e.g. high, normal,
low, none), or numerically (i.e. as a percentage).
[0313] The reader also optionally expresses a preference for a
large number of shorter articles or a small number of longer
articles. Each article is ideally written (or edited) in both short
and long forms to support this preference.
[0314] An article may also be written (or edited) in different
versions to match the expected sophistication of the reader, for
example to provide children's and adults' versions. The appropriate
version is selected according to the reader's age. The reader can
specify a "reading age" which takes precedence over their
biological age.
[0315] The articles which make up each section are selected and
prioritized by the editors, and each is assigned a useful lifetime.
By default they are delivered to all relevant subscribers, in
priority order, subject to space constraints in the subscribers'
editions.
[0316] In sections where it is appropriate, the reader may
optionally enable collaborative filtering. This is then applied to
articles which have a sufficiently long lifetime. Each article
which qualifies for collaborative filtering is printed with rating
buttons at the end of the article. The buttons can provide an easy
choice (e.g. "liked" and "disliked"), making it more likely that
readers will bother to rate the article.
[0317] Articles with high priorities and short lifetimes are
therefore effectively considered essential reading by the editors
and are delivered to most relevant subscribers.
[0318] The reader optionally specifies a serendipity factor, either
qualitatively (e.g. do or don't surprise me), or numerically. A
high serendipity factor lowers the threshold used for matching
during collaborative filtering. A high factor makes it more likely
that the corresponding section will be filled to the reader's
specified capacity. A different serendipity factor can be specified
for different days of the week.
[0319] The reader also optionally specifies topics of particular
interest within a section, and this modifies the priorities
assigned by the editors.
[0320] The speed of the reader's Internet connection affects the
quality at which images can be delivered. The reader optionally
specifies a preference for fewer images or smaller images or both.
If the number or size of images is not reduced, then images may be
delivered at lower quality (i.e. at lower resolution or with
greater compression).
[0321] At a global level, the reader specifies how quantities,
dates, times and monetary values are localized. This involves
specifying whether units are imperial or metric, a local timezone
and time format, and a local currency, and whether the localization
consist of in situ translation or annotation. These preferences are
derived from the reader's locality by default.
[0322] To reduce reading difficulties caused by poor eyesight, the
reader optionally specifies a global preference for a larger
presentation. Both text and images are scaled accordingly, and less
information is accommodated on each page.
[0323] The language in which a news publication is published, and
its corresponding text encoding, is a property of the publication
and not a preference expressed by the user. However, the netpage
system can be configured to provide automatic translation services
in various guises.
2.2 Advertising Localization and Targeting
[0324] The personalization of the editorial content directly
affects the advertising content, because advertising is typically
placed to exploit the editorial context. Travel ads, for example,
are more likely to appear in a travel section than elsewhere. The
value of the editorial content to an advertiser (and therefore to
the publisher) lies in its ability to attract large numbers of
readers with the right demographics.
[0325] Effective advertising is placed on the basis of locality and
demographics. Locality determines proximity to particular services,
retailers etc., and particular interests and concerns associated
with the local community and environment. Demographics determine
general interests and preoccupations as well as likely spending
patterns.
[0326] A news publisher's most profitable product is advertising
"space", a multi-dimensional entity determined by the publication's
geographic coverage, the size of its readership, its readership
demographics, and the page area available for advertising.
[0327] In the netpage system, the netpage publication server
computes the approximate multi-dimensional size of a publication's
saleable advertising space on a per-section basis, taking into
account the publication's geographic coverage, the section's
readership, the size of each reader's section edition, each
reader's advertising proportion, and each reader's demographic.
[0328] In comparison with other media, the netpage system allows
the advertising space to be defined in greater detail, and allows
smaller pieces of it to be sold separately. It therefore allows it
to be sold at closer to its true value.
[0329] For example, the same advertising "slot" can be sold in
varying proportions to several advertisers, with individual
readers' pages randomly receiving the advertisement of one
advertiser or another, overall preserving the proportion of space
sold to each advertiser.
[0330] The netpage system allows advertising to be linked directly
to detailed product information and online purchasing. It therefore
raises the intrinsic value of the advertising space.
[0331] Because personalization and localization are handled
automatically by netpage publication servers, an advertising
aggregator can provide arbitrarily broad coverage of both geography
and demographics. The subsequent disaggregation is efficient
because it is automatic. This makes it more cost-effective for
publishers to deal with advertising aggregators than to directly
capture advertising. Even though the advertising aggregator is
taking a proportion of advertising revenue, publishers may find the
change profit-neutral because of the greater efficiency of
aggregation. The advertising aggregator acts as an intermediary
between advertisers and publishers, and may place the same
advertisement in multiple publications.
[0332] It is worth noting that ad placement in a netpage
publication can be more complex than ad placement in the
publication's traditional counterpart, because the publication's
advertising space is more complex. While ignoring the full
complexities of negotiations between advertisers, advertising
aggregators and publishers, the preferred form of the netpage
system provides some automated support for these negotiations,
including support for automated auctions of advertising space.
Automation is particularly desirable for the placement of
advertisements which generate small amounts of income, such as
small or highly localized advertisements.
[0333] Once placement has been negotiated, the aggregator captures
and edits the advertisement and records it on a netpage ad server.
Correspondingly, the publisher records the ad placement on the
relevant netpage publication server. When the netpage publication
server lays out each user's personalized publication, it picks the
relevant advertisements from the netpage ad server.
2.3 User Profiles
2.3.1 Information Filtering
[0334] The personalization of news and other publications relies on
an assortment of user-specific profile information, including:
[0335] publication customizations
[0336] collaborative filtering vectors
[0337] contact details
[0338] presentation preferences
[0339] The customization of a publication is typically
publication-specific, and so the customization information is
maintained by the relevant netpage publication server.
[0340] A collaborative filtering vector consists of the user's
ratings of a number of news items. It is used to correlate
different users' interests for the purposes of making
recommendations. Although there are benefits to maintaining a
single collaborative filtering vector independently of any
particular publication, there are two reasons why it is more
practical to maintain a separate vector for each publication: there
is likely to be more overlap between the vectors of subscribers to
the same publication than between those of subscribers to different
publications; and a publication is likely to want to present its
users' collaborative filtering vectors as part of the value of its
brand, not to be found elsewhere. Collaborative filtering vectors
are therefore also maintained by the relevant netpage publication
server.
[0341] Contact details, including name, street address, ZIP Code,
state, country, telephone numbers, are global by nature, and are
maintained by a netpage registration server.
[0342] Presentation preferences, including those for quantities,
dates and times, are likewise global and maintained in the same
way.
[0343] The localization of advertising relies on the locality
indicated in the user's contact details, while the targeting of
advertising relies on personal information such as date of birth,
gender, marital status, income, profession, education, or
qualitative derivatives such as age range and income range.
[0344] For those users who choose to reveal personal information
for advertising purposes, the information is maintained by the
relevant netpage registration server. In the absence of such
information, advertising can be targeted on the basis of the
demographic associated with the user's ZIP or ZIP+4 Code.
[0345] Each user, pen, printer, application provider and
application is assigned its own unique identifier, and the netpage
registration server maintains the relationships between them, as
shown in FIGS. 21, 22, 23 and 24. For registration purposes, a
publisher is a special kind of application provider, and a
publication is a special kind of application.
[0346] Each user 800 may be authorized to use any number of
printers 802, and each printer may allow any number of users to use
it. Each user has a single default printer (at 66), to which
periodical publications are delivered by default, whilst pages
printed on demand are delivered to the printer through which the
user is interacting. The server keeps track of which publishers a
user has authorized to print to the user's default printer. A
publisher does not record the ID of any particular printer, but
instead resolves the ID when it is required.
[0347] When a user subscribes 808 to a publication 807, the
publisher 806 (i.e. application provider 803) is authorized to
print to a specified printer or the user's default printer. This
authorization can be revoked at any time by the user. Each user may
have several pens 801, but a pen is specific to a single user. If a
user is authorized to use a particular printer, then that printer
recognizes any of the user's pens.
[0348] The pen ID is used to locate the corresponding user profile
maintained by a particular netpage registration server, via the DNS
in the usual way.
[0349] A Web terminal 809 can be authorized to print on a
particular netpage printer, allowing Web pages and netpage
documents encountered during Web browsing to be conveniently
printed on the nearest netpage printer.
[0350] The netpage system can collect, on behalf of a printer
provider, fees and commissions on income earned through
publications printed on the provider's printers. Such income can
include advertising fees, click-through fees, e-commerce
commissions, and transaction fees. If the printer is owned by the
user, then the user is the printer provider.
[0351] Each user also has a netpage account 820 which is used to
accumulate micro-debits and credits (such as those described in the
preceding paragraph); contact details 815, including name, address
and telephone numbers; global preferences 816, including privacy,
delivery and localization settings; any number of biometric records
817, containing the user's encoded signature 818, fingerprint 819
etc; a handwriting model 819 automatically maintained by the
system; and SET payment card accounts 821 with which e-commerce
payments can be made.
2.3.2 Favorites List
[0352] A netpage user can maintain a list 922 of "favorites"--links
to useful documents etc. on the netpage network. The list is
maintained by the system on the user's behalf. It is organized as a
hierarchy of folders 924, a preferred embodiment of which is shown
in the class diagram in FIG. 41.
2.3.3 History List
[0353] The system maintains a history list 929 on each user's
behalf, containing links to documents etc. accessed by the user
through the netpage system. It is organized as a date-ordered list,
a preferred embodiment of which is shown in the class diagram in
FIG. 42.
2.4 Intelligent Page Layout
[0354] The netpage publication server automatically lays out the
pages of each user's personalized publication on a
section-by-section basis. Since most advertisements are in the form
of pre-formatted rectangles, they are placed on the page before the
editorial content.
[0355] The advertising ratio for a section can be achieved with
wildly varying advertising ratios on individual pages within the
section, and the ad layout algorithm exploits this. The algorithm
is configured to attempt to co-locate closely tied editorial and
advertising content, such as placing ads for roofing material
specifically within the publication because of a special feature on
do-it-yourself roofing repairs.
[0356] The editorial content selected for the user, including text
and associated images and graphics, is then laid out according to
various aesthetic rules.
[0357] The entire process, including the selection of ads and the
selection of editorial content, must be iterated once the layout
has converged, to attempt to more closely achieve the user's stated
section size preference. The section size preference can, however,
be matched on average over time, allowing significant day-to-day
variations.
2.5 Document Format
[0358] Once the document is laid out, it is encoded for efficient
distribution and persistent storage on the netpage network.
[0359] The primary efficiency mechanism is the separation of
information specific to a single user's edition and information
shared between multiple users' editions. The specific information
consists of the page layout. The shared information consists of the
objects to which the page layout refers, including images,
graphics, and pieces of text.
[0360] A text object contains fully-formatted text represented in
the Extensible Markup Language (XML) using the Extensible
Stylesheet Language (XSL). XSL provides precise control over text
formatting independently of the region into which the text is being
set, which in this case is being provided by the layout. The text
object contains embedded language codes to enable automatic
translation, and embedded hyphenation hints to aid with paragraph
formatting.
[0361] An image object encodes an image in the JPEG 2000
wavelet-based compressed image format. A graphic object encodes a
2D graphic in Scalable Vector Graphics (SVG) format.
[0362] The layout itself consists of a series of placed image and
graphic objects, linked textflow objects through which text objects
flow, hyperlinks and input fields as described above, and watermark
regions. These layout objects are summarized in Table 3. The layout
uses a compact format suitable for efficient distribution and
storage.
TABLE-US-00004 TABLE 3 Netpage layout objects Layout Format of
object Attribute linked object Image Position -- Image object ID
JPEG 2000 Graphic Position -- Graphic object ID SVG Textflow
Textflow ID -- Zone -- Optional text object ID XML/XSL Hyperlink
Type -- Zone -- Application ID, etc. -- Field Type -- Meaning --
Zone -- Watermark Zone --
2.6 Document Distribution
[0363] As described above, for purposes of efficient distribution
and persistent storage on the netpage network, a user-specific page
layout is separated from the shared objects to which it refers.
[0364] When a subscribed publication is ready to be distributed,
the netpage publication server allocates, with the help of the
netpage ID server 12, a unique ID for each page, page instance,
document, and document instance.
[0365] The server computes a set of optimized subsets of the shared
content and creates a multicast channel for each subset, and then
tags each user-specific layout with the names of the multicast
channels which will carry the shared content used by that layout.
The server then pointcasts each user's layouts to that user's
printer via the appropriate page server, and when the pointcasting
is complete, multicasts the shared content on the specified
channels. After receiving its pointcast, each page server and
printer subscribes to the multicast channels specified in the page
layouts. During the multicasts, each page server and printer
extracts from the multicast streams those objects referred to by
its page layouts. The page servers persistently archive the
received page layouts and shared content.
[0366] Once a printer has received all the objects to which its
page layouts refer, the printer re-creates the fully-populated
layout and then rasterizes and prints it.
[0367] Under normal circumstances, the printer prints pages faster
than they can be delivered. Assuming a quarter of each page is
covered with images, the average page has a size of less than 400
KB. The printer can therefore hold in excess of 100 such pages in
its internal 64 MB memory, allowing for temporary buffers etc. The
printer prints at a rate of one page per second. This is equivalent
to 400 KB or about 3 Mbit of page data per second, which is similar
to the highest expected rate of page data delivery over a broadband
network.
[0368] Even under abnormal circumstances, such as when the printer
runs out of paper, it is likely that the user will be able to
replenish the paper supply before the printer's 100-page internal
storage capacity is exhausted.
[0369] However, if the printer's internal memory does fill up, then
the printer will be unable to make use of a multicast when it first
occurs. The netpage publication server therefore allows printers to
submit requests for re-multicasts. When a critical number of
requests is received or a timeout occurs, the server re-multicasts
the corresponding shared objects.
[0370] Once a document is printed, a printer can produce an exact
duplicate at any time by retrieving its page layouts and contents
from the relevant page server.
2.7 On-Demand Documents
[0371] When a netpage document is requested on demand, it can be
personalized and delivered in much the same way as a periodical.
However, since there is no shared content, delivery is made
directly to the requesting printer without the use of
multicast.
[0372] When a non-netpage document is requested on demand, it is
not personalized, and it is delivered via a designated netpage
formatting server which reformats it as a netpage document. A
netpage formatting server is a special instance of a netpage
publication server. The netpage formatting server has knowledge of
various Internet document formats, including Adobe's Portable
Document Format (PDF), and Hypertext Markup Language (HTML). In the
case of HTML, it can make use of the higher resolution of the
printed page to present Web pages in a multi-column format, with a
table of contents. It can automatically include all Web pages
directly linked to the requested page. The user can tune this
behavior via a preference.
[0373] The netpage formatting server makes standard netpage
behavior, including interactivity and persistence, available on any
Internet document, no matter what its origin and format. It hides
knowledge of different document formats from both the netpage
printer and the netpage page server, and hides knowledge of the
netpage system from Web servers.
3 Security
3.1 Cryptography
[0374] Cryptography is used to protect sensitive information, both
in storage and in transit, and to authenticate parties to a
transaction. There are two classes of cryptography in widespread
use: secret-key cryptography and public-key cryptography. The
netpage network uses both classes of cryptography.
[0375] Secret-key cryptography, also referred to as symmetric
cryptography, uses the same key to encrypt and decrypt a message.
Two parties wishing to exchange messages must first arrange to
securely exchange the secret key.
[0376] Public-key cryptography, also referred to as asymmetric
cryptography, uses two encryption keys. The two keys are
mathematically related in such a way that any message encrypted
using one key can only be decrypted using the other key. One of
these keys is then published, while the other is kept private. The
public key is used to encrypt any message intended for the holder
of the private key. Once encrypted using the public key, a message
can only be decrypted using the private key. Thus two parties can
securely exchange messages without first having to exchange a
secret key. To ensure that the private key is secure, it is normal
for the holder of the private key to generate the key pair.
[0377] Public-key cryptography can be used to create a digital
signature. The holder of the private key can create a known hash of
a message and then encrypt the hash using the private key. Anyone
can then verify that the encrypted hash constitutes the "signature"
of the holder of the private key with respect to that particular
message by decrypting the encrypted hash using the public key and
verifying the hash against the message. If the signature is
appended to the message, then the recipient of the message can
verify both that the message is genuine and that it has not been
altered in transit.
[0378] To make public-key cryptography work, there has to be a way
to distribute public keys which prevents impersonation. This is
normally done using certificates and certificate authorities. A
certificate authority is a trusted third party which authenticates
the connection between a public key and someone's identity. The
certificate authority verifies the person's identity by examining
identity documents, and then creates and signs a digital
certificate containing the person's identity details and public
key. Anyone who trusts the certificate authority can use the public
key in the certificate with a high degree of certainty that it is
genuine. They just have to verify that the certificate has indeed
been signed by the certificate authority, whose public key is
well-known.
[0379] In most transaction environments, public-key cryptography is
only used to create digital signatures and to securely exchange
secret session keys. Secret-key cryptography is used for all other
purposes.
[0380] In the following discussion, when reference is made to the
secure transmission of information between a netpage printer and a
server, what actually happens is that the printer obtains the
server's certificate, authenticates it with reference to the
certificate authority, uses the public key-exchange key in the
certificate to exchange a secret session key with the server, and
then uses the secret session key to encrypt the message data. A
session key, by definition, can have an arbitrarily short
lifetime.
3.2 Netpage Printer Security
[0381] Each netpage printer is assigned a pair of unique
identifiers at time of manufacture which are stored in read-only
memory in the printer and in the netpage registration server
database. The first ID 62 is public and uniquely identifies the
printer on the netpage network. The second ID is secret and is used
when the printer is first registered on the network.
[0382] When the printer connects to the netpage network for the
first time after installation, it creates a signature
public/private key pair. It transmits the secret ID and the public
key securely to the netpage registration server. The server
compares the secret ID against the printer's secret ID recorded in
its database, and accepts the registration if the IDs match. It
then creates and signs a certificate containing the printer's
public ID and public signature key, and stores the certificate in
the registration database.
[0383] The netpage registration server acts as a certificate
authority for netpage printers, since it has access to secret
information allowing it to verify printer identity.
[0384] When a user subscribes to a publication, a record is created
in the netpage registration server database authorizing the
publisher to print the publication to the user's default printer or
a specified printer. Every document sent to a printer via a page
server is addressed to a particular user and is signed by the
publisher using the publisher's private signature key. The page
server verifies, via the registration database, that the publisher
is authorized to deliver the publication to the specified user. The
page server verifies the signature using the publisher's public
key, obtained from the publisher's certificate stored in the
registration database.
[0385] The netpage registration server accepts requests to add
printing authorizations to the database, so long as those requests
are initiated via a pen registered to the printer.
3.3 NETPAGE PEN SECURITY
[0386] Each netpage pen is assigned a unique identifier at time of
manufacture which is stored in read-only memory in the pen and in
the netpage registration server database. The pen ID 61 uniquely
identifies the pen on the netpage network.
[0387] A netpage pen can "know" a number of netpage printers, and a
printer can "know" a number of pens. A pen communicates with a
printer via a radio frequency signal whenever it is within range of
the printer. Once a pen and printer are registered, they regularly
exchange session keys. Whenever the pen transmits digital ink to
the printer, the digital ink is always encrypted using the
appropriate session key. Digital ink is never transmitted in the
clear.
[0388] A pen stores a session key for every printer it knows,
indexed by printer ID, and a printer stores a session key for every
pen it knows, indexed by pen ID. Both have a large but finite
storage capacity for session keys, and will forget a session key on
a least-recently-used basis if necessary.
[0389] When a pen comes within range of a printer, the pen and
printer discover whether they know each other. If they don't know
each other, then the printer determines whether it is supposed to
know the pen. This might be, for example, because the pen belongs
to a user who is registered to use the printer. If the printer is
meant to know the pen but doesn't, then it initiates the automatic
pen registration procedure.
If the printer isn't meant to know the pen, then it agrees with the
pen to ignore it until the pen is placed in a charging cup, at
which time it initiates the registration procedure.
[0390] In addition to its public ID, the pen contains a secret
key-exchange key. The key-exchange key is also recorded in the
netpage registration server database at time of manufacture. During
registration, the pen transmits its pen ID to the printer, and the
printer transmits the pen ID to the netpage registration server.
The server generates a session key for the printer and pen to use,
and securely transmits the session key to the printer. It also
transmits a copy of the session key encrypted with the pen's
key-exchange key. The printer stores the session key internally,
indexed by the pen ID, and transmits the encrypted session key to
the pen. The pen stores the session key internally, indexed by the
printer ID.
[0391] Although a fake pen can impersonate a pen in the pen
registration protocol, only a real pen can decrypt the session key
transmitted by the printer.
[0392] When a previously unregistered pen is first registered, it
is of limited use until it is linked to a user. A registered but
"un-owned" pen is only allowed to be used to request and fill in
netpage user and pen registration forms, to register a new user to
which the new pen is automatically linked, or to add a new pen to
an existing user.
[0393] The pen uses secret-key rather than public-key encryption
because of hardware performance constraints in the pen.
3.4 Secure Documents
[0394] The netpage system supports the delivery of secure documents
such as tickets and coupons. The netpage printer includes a
facility to print watermarks, but will only do so on request from
publishers who are suitably authorized. The publisher indicates its
authority to print watermarks in its certificate, which the printer
is able to authenticate.
[0395] The "watermark" printing process uses an alternative dither
matrix in specified "watermark" regions of the page. Back-to-back
pages contain mirror-image watermark regions which coincide when
printed. The dither matrices used in odd and even pages' watermark
regions are designed to produce an interference effect when the
regions are viewed together, achieved by looking through the
printed sheet.
[0396] The effect is similar to a watermark in that it is not
visible when looking at only one side of the page, and is lost when
the page is copied by normal means.
[0397] Pages of secure documents cannot be copied using the
built-in netpage copy mechanism described in Section 1.9 above.
This extends to copying netpages on netpage-aware photocopiers.
[0398] Secure documents are typically generated as part of
e-commerce transactions. They can therefore include the user's
photograph which was captured when the user registered biometric
information with the netpage registration server, as described in
Section 2.
[0399] When presented with a secure netpage document, the recipient
can verify its authenticity by requesting its status in the usual
way. The unique ID of a secure document is only valid for the
lifetime of the document, and secure document IDs are allocated
non-contiguously to prevent their prediction by opportunistic
forgers. A secure document verification pen can be developed with
built-in feedback on verification failure, to support easy
point-of-presentation document verification.
[0400] Clearly neither the watermark nor the user's photograph are
secure in a cryptographic sense. They simply provide a significant
obstacle to casual forgery. Online document verification,
particularly using a verification pen, provides an added level of
security where it is needed, but is still not entirely immune to
forgeries.
3.5 Non-Repudiation
[0401] In the netpage system, forms submitted by users are
delivered reliably to forms handlers and are persistently archived
on netpage page servers. It is therefore impossible for recipients
to repudiate delivery.
[0402] E-commerce payments made through the system, as described in
Section 4, are also impossible for the payee to repudiate.
4 Electronic Commerce Model
4.1 Secure Electronic Transaction (Set)
[0403] The netpage system uses the Secure Electronic Transaction
(SET) system as one of its payment systems. SET, having been
developed by MasterCard and Visa, is organized around payment
cards, and this is reflected in the terminology. However, much of
the system is independent of the type of accounts being used.
[0404] In SET, cardholders and merchants register with a
certificate authority and are issued with certificates containing
their public signature keys. The certificate authority verifies a
cardholder's registration details with the card issuer as
appropriate, and verifies a merchant's registration details with
the acquirer as appropriate. Cardholders and merchants store their
respective private signature keys securely on their computers.
During the payment process, these certificates are used to mutually
authenticate a merchant and cardholder, and to authenticate them
both to the payment gateway.
[0405] SET has not yet been adopted widely, partly because
cardholder maintenance of keys and certificates is considered
burdensome. Interim solutions which maintain cardholder keys and
certificates on a server and give the cardholder access via a
password have met with some success.
4.2 Set Payments
[0406] In the netpage system the netpage registration server acts
as a proxy for the netpage user (i.e. the cardholder) in SET
payment transactions.
[0407] The netpage system uses biometrics to authenticate the user
and authorize SET payments. Because the system is pen-based, the
biometric used is the user's on-line signature, consisting of
time-varying pen position and pressure. A fingerprint biometric can
also be used by designing a fingerprint sensor into the pen,
although at a higher cost. The type of biometric used only affects
the capture of the biometric, not the authorization aspects of the
system.
[0408] The first step to being able to make SET payments is to
register the user's biometric with the netpage registration server.
This is done in a controlled environment, for example a bank, where
the biometric can be captured at the same time as the user's
identity is verified. The biometric is captured and stored in the
registration database, linked to the user's record. The user's
photograph is also optionally captured and linked to the record.
The SET cardholder registration process is completed, and the
resulting private signature key and certificate are stored in the
database. The user's payment card information is also stored,
giving the netpage registration server enough information to act as
the user's proxy in any SET payment transaction.
[0409] When the user eventually supplies the biometric to complete
a payment, for example by signing a netpage order form, the printer
securely transmits the order information, the pen ID and the
biometric data to the netpage registration server. The server
verifies the biometric with respect to the user identified by the
pen ID, and from then on acts as the user's proxy in completing the
SET payment transaction.
4.3 Micro-Payments
[0410] The netpage system includes a mechanism for micro-payments,
to allow the user to be conveniently charged for printing low-cost
documents on demand and for copying copyright documents, and
possibly also to allow the user to be reimbursed for expenses
incurred in printing advertising material. The latter depends on
the level of subsidy already provided to the user.
[0411] When the user registers for e-commerce, a network account is
established which aggregates micro-payments. The user receives a
statement on a regular basis, and can settle any outstanding debit
balance using the standard payment mechanism.
[0412] The network account can be extended to aggregate
subscription fees for periodicals, which would also otherwise be
presented to the user in the form of individual statements.
4.4 Transactions
[0413] When a user requests a netpage in a particular application
context, the application is able to embed a user-specific
transaction ID 55 in the page. Subsequent input through the page is
tagged with the transaction ID, and the application is thereby able
to establish an appropriate context for the user's input.
[0414] When input occurs through a page which is not user-specific,
however, the application must use the user's unique identity to
establish a context. A typical example involves adding items from a
pre-printed catalog page to the user's virtual "shopping cart". To
protect the user's privacy, however, the unique user ID 60 known to
the netpage system is not divulged to applications. This is to
prevent different application providers from easily correlating
independently accumulated behavioral data.
[0415] The netpage registration server instead maintains an
anonymous relationship between a user and an application via a
unique alias ID 65, as shown in FIG. 24. Whenever the user
activates a hyperlink tagged with the "registered" attribute, the
netpage page server asks the netpage registration server to
translate the associated application ID 64, together with the pen
ID 61, into an alias ID 65. The alias ID is then submitted to the
hyperlink's application.
[0416] The application maintains state information indexed by alias
ID, and is able to retrieve user-specific state information without
knowledge of the global identity of the user.
[0417] The system also maintains an independent certificate and
private signature key for each of a user's applications, to allow
it to sign application transactions on behalf of the user using
only application-specific information.
[0418] To assist the system in routing product bar code (UPC)
"hyperlink" activations, the system records a favorite application
on behalf of the user for any number of product types.
[0419] Each application is associated with an application provider,
and the system maintains an account on behalf of each application
provider, to allow it to credit and debit the provider for
click-through fees etc.
[0420] An application provider can be a publisher of periodical
subscribed content. The system records the user's willingness to
receive the subscribed publication, as well as the expected
frequency of publication.
4.5 Resource Descriptions and Copyright
[0421] A preferred embodiment of a resource description class
diagram is shown in FIG. 40.
[0422] Each document and content object may be described by one or
more resource descriptions 842. Resource descriptions use the
Dublin Core metadata element set, which is designed to facilitate
discovery of electronic resources. Dublin Core metadata conforms to
the World Wide Web Consortium (W3C) Resource Description Framework
(RDF).
[0423] A resource description may identify rights holders 920. The
netpage system automatically transfers copyright fees from users to
rights holders when users print copyright content.
5 Communications Protocols
[0424] A communications protocol defines an ordered exchange of
messages between entities. In the netpage system, entities such as
pens, printers and servers utilise a set of defined protocols to
cooperatively handle user interaction with the netpage system.
[0425] Each protocol is illustrated by way of a sequence diagram in
which the horizontal dimension is used to represent message flow
and the vertical dimension is used to represent time. Each entity
is represented by a rectangle containing the name of the entity and
a vertical column representing the lifeline of the entity. During
the time an entity exists, the lifeline is shown as a dashed line.
During the time an entity is active, the lifeline is shown as a
double line. Because the protocols considered here do not create or
destroy entities, lifelines are generally cut short as soon as an
entity ceases to participate in a protocol.
5.1 Subscription Delivery Protocol
[0426] A preferred embodiment of a subscription delivery protocol
is shown in FIG. 43.
[0427] A large number of users may subscribe to a periodical
publication. Each user's edition may be laid out differently, but
many users' editions will share common content such as text objects
and image objects. The subscription delivery protocol therefore
delivers document structures to individual printers via pointcast,
but delivers shared content objects via multicast.
[0428] The application (i.e. publisher) first obtains a document ID
51 for each document from an ID server 12. It then sends each
document structure, including its document ID and page
descriptions, to the page server 10 responsible for the document's
newly allocated ID. It includes its own application ID 64, the
subscriber's alias ID 65, and the relevant set of multicast channel
names. It signs the message using its private signature key.
[0429] The page server uses the application ID and alias ID to
obtain from the registration server the corresponding user ID 60,
the user's selected printer ID 62 (which may be explicitly selected
for the application, or may be the user's default printer), and the
application's certificate.
[0430] The application's certificate allows the page server to
verify the message signature. The page server's request to the
registration server fails if the application ID and alias ID don't
together identify a subscription 808.
[0431] The page server then allocates document and page instance
IDs and forwards the page descriptions, including page IDs 50, to
the printer. It includes the relevant set of multicast channel
names for the printer to listen to.
[0432] It then returns the newly allocated page IDs to the
application for future reference.
[0433] Once the application has distributed all of the document
structures to the subscribers' selected printers via the relevant
page servers, it multicasts the various subsets of the shared
objects on the previously selected multicast channels. Both page
servers and printers monitor the appropriate multicast channels and
receive their required content objects. They are then able to
populate the previously pointcast document structures. This allows
the page servers to add complete documents to their databases, and
it allows the printers to print the documents.
5.2 Hyperlink Activation Protocol
[0434] A preferred embodiment of a hyperlink activation protocol is
shown in FIG. 45.
[0435] When a user clicks on a netpage with a netpage pen, the pen
communicates the click to the nearest netpage printer 601. The
click identifies the page and a location on the page. The printer
already knows the ID 61 of the pen from the pen connection
protocol.
[0436] The printer determines, via the DNS, the network address of
the page server 10a handling the particular page ID 50. The address
may already be in its cache if the user has recently interacted
with the same page. The printer then forwards the pen ID, its own
printer ID 62, the page ID and click location to the page
server.
[0437] The page server loads the page description 5 identified by
the page ID and determines which input element's zone 58, if any,
the click lies in. Assuming the relevant input element is a
hyperlink element 844, the page server then obtains the associated
application ID 64 and link ID 54, and determines, via the DNS, the
network address of the application server hosting the application
71.
[0438] The page server uses the pen ID 61 to obtain the
corresponding user ID 60 from the registration server 11, and then
allocates a globally unique hyperlink request ID 52 and builds a
hyperlink request 934. The hyperlink request class diagram is shown
in FIG. 44. The hyperlink request records the IDs of the requesting
user and printer, and identifies the clicked hyperlink instance
862. The page server then sends its own server ID 53, the hyperlink
request ID, and the link ID to the application.
[0439] The application produces a response document according to
application-specific logic, and obtains a document ID 51 from an ID
server 12. It then sends the document to the page server 10b
responsible for the document's newly allocated ID, together with
the requesting page server's ID and the hyperlink request ID.
[0440] The second page server sends the hyperlink request ID and
application ID to the first page server to obtain the corresponding
user ID and printer ID 62. The first page server rejects the
request if the hyperlink request has expired or is for a different
application.
[0441] The second page server allocates document instance and page
IDs 50, returns the newly allocated page IDs to the application,
adds the complete document to its own database, and finally sends
the page descriptions to the requesting printer.
[0442] The hyperlink instance may include a meaningful transaction
ID 55, in which case the first page server includes the transaction
ID in the message sent to the application. This allows the
application to establish a transaction-specific context for the
hyperlink activation.
[0443] If the hyperlink requires a user alias, i.e. its "alias
required" attribute is set, then the first page server sends both
the pen ID 61 and the hyperlink's application ID 64 to the
registration server 11 to obtain not just the user ID corresponding
to the pen ID but also the alias ID 65 corresponding to the
application ID and the user ID. It includes the alias ID in the
message sent to the application, allowing the application to
establish a user-specific context for the hyperlink activation.
5.3 Handwriting Recognition Protocol
[0444] When a user draws a stroke on a netpage with a netpage pen,
the pen communicates the stroke to the nearest netpage printer. The
stroke identifies the page and a path on the page.
[0445] The printer forwards the pen ID 61, its own printer ID 62,
the page ID 50 and stroke path to the page server 10 in the usual
way.
[0446] The page server loads the page description 5 identified by
the page ID and determines which input element's zone 58, if any,
the stroke intersects. Assuming the relevant input element is a
text field 878, the page server appends the stroke to the text
field's digital ink.
[0447] After a period of inactivity in the zone of the text field,
the page server sends the pen ID and the pending strokes to the
registration server 11 for interpretation. The registration server
identifies the user corresponding to the pen, and uses the user's
accumulated handwriting model 822 to interpret the strokes as
handwritten text. Once it has converted the strokes to text, the
registration server returns the text to the requesting page server.
The page server appends the text to the text value of the text
field.
5.4 Signature Verification Protocol
[0448] Assuming the input element whose zone the stroke intersects
is a signature field 880, the page server 10 appends the stroke to
the signature field's digital ink.
[0449] After a period of inactivity in the zone of the signature
field, the page server sends the pen ID 61 and the pending strokes
to the registration server 11 for verification. It also sends the
application ID 64 associated with the form of which the signature
field is part, as well as the form ID 56 and the current data
content of the form. The registration server identifies the user
corresponding to the pen, and uses the user's dynamic signature
biometric 818 to verify the strokes as the user's signature. Once
it has verified the signature, the registration server uses the
application ID 64 and user ID 60 to identify the user's
application-specific private signature key. It then uses the key to
generate a digital signature of the form data, and returns the
digital signature to the requesting page server. The page server
assigns the digital signature to the signature field and sets the
associated form's status to frozen.
[0450] The digital signature includes the alias ID 65 of the
corresponding user. This allows a single form to capture multiple
users' signatures.
5.5 Form Submission Protocol
[0451] A preferred embodiment of a form submission protocol is
shown in FIG. 46.
[0452] Form submission occurs via a form hyperlink activation. It
thus follows the protocol defined in Section 5.2, with some
form-specific additions.
[0453] In the case of a form hyperlink, the hyperlink activation
message sent by the page server 10 to the application 71 also
contains the form ID 56 and the current data content of the form.
If the form contains any signature fields, then the application
verifies each one by extracting the alias ID 65 associated with the
corresponding digital signature and obtaining the corresponding
certificate from the registration server 11.
5.6 Commission Payment Protocol
[0454] A preferred embodiment of a commission payment protocol is
shown in FIG. 47.
[0455] In an e-commerce environment, fees and commissions may be
payable from an application provider to a publisher on
click-throughs, transactions and sales. Commissions on fees and
commissions on commissions may also be payable from the publisher
to the provider of the printer.
[0456] The hyperlink request ID 52 is used to route a fee or
commission credit from the target application provider 70a (e.g.
merchant) to the source application provider 70b (i.e. publisher),
and from the source application provider 70b to the printer
provider 72.
[0457] The target application receives the hyperlink request ID
from the page server 10 when the hyperlink is first activated, as
described in Section 5.2. When the target application needs to
credit the source application provider, it sends the application
provider credit to the original page server together with the
hyperlink request ID. The page server uses the hyperlink request ID
to identify the source application, and sends the credit on to the
relevant registration server 11 together with the source
application ID 64, its own server ID 53, and the hyperlink request
ID. The registration server credits the corresponding application
provider's account 827. It also notifies the application
provider.
[0458] If the application provider needs to credit the printer
provider, it sends the printer provider credit to the original page
server together with the hyperlink request ID. The page server uses
the hyperlink request ID to identify the printer, and sends the
credit on to the relevant registration server together with the
printer ID. The registration server credits the corresponding
printer provider account 814.
[0459] The source application provider is optionally notified of
the identity of the target application provider, and the printer
provider of the identity of the source application provider.
6. Netpage Pen Description
6.1 Pen Mechanics
[0460] Referring to FIGS. 8 and 9, the pen, generally designated by
reference numeral 101, includes a housing 102 in the form of a
plastics moulding having walls 103 defining an interior space 104
for mounting the pen components. The pen top 105 is in operation
rotatably mounted at one end 106 of the housing 102. A
semi-transparent cover 107 is secured to the opposite end 108 of
the housing 102. The cover 107 is also of moulded plastics, and is
formed from semi-transparent material in order to enable the user
to view the status of the LED mounted within the housing 102. The
cover 107 includes a main part 109 which substantially surrounds
the end 108 of the housing 102 and a projecting portion 110 which
projects back from the main part 109 and fits within a
corresponding slot 111 formed in the walls 103 of the housing 102.
A radio antenna 112 is mounted behind the projecting portion 110,
within the housing 102. Screw threads 113 surrounding an aperture
113A on the cover 107 are arranged to receive a metal end piece
114, including corresponding screw threads 115. The metal end piece
114 is removable to enable ink cartridge replacement.
[0461] Also mounted within the cover 107 is a tri-color status LED
116 on a flex PCB 117. The antenna 112 is also mounted on the flex
PCB 117. The status LED 116 is mounted at the top of the pen 101
for good all-around visibility.
[0462] The pen can operate both as a normal marking ink pen and as
a non-marking stylus. An ink pen cartridge 118 with nib 119 and a
stylus 120 with stylus nib 121 are mounted side by side within the
housing 102. Either the ink cartridge nib 119 or the stylus nib 121
can be brought forward through open end 122 of the metal end piece
114, by rotation of the pen top 105. Respective slider blocks 123
and 124 are mounted to the ink cartridge 118 and stylus 120,
respectively. A rotatable cam barrel 125 is secured to the pen top
105 in operation and arranged to rotate therewith. The cam barrel
125 includes a cam 126 in the form of a slot within the walls 181
of the cam barrel. Cam followers 127 and 128 projecting from slider
blocks 123 and 124 fit within the cam slot 126. On rotation of the
cam barrel 125, the slider blocks 123 or 124 move relative to each
other to project either the pen nib 119 or stylus nib 121 out
through the hole 122 in the metal end piece 114. The pen 101 has
three states of operation. By turning the top 105 through
90.degree. steps, the three states are:
[0463] Stylus 120 nib 121 out;
[0464] Ink cartridge 118 nib 119 out; and
[0465] Neither ink cartridge 118 nib 119 out nor stylus 120 nib 121
out.
[0466] A second flex PCB 129, is mounted on an electronics chassis
130 which sits within the housing 102. The second flex PCB 129
mounts an infrared LED 131 for providing infrared radiation for
projection onto the surface. An image sensor 132 is provided
mounted on the second flex PCB 129 for receiving reflected
radiation from the surface. The second flex PCB 129 also mounts a
radio frequency chip 133, which includes an RF transmitter and RF
receiver, and a controller chip 134 for controlling operation of
the pen 101. An optics block 135 (formed from moulded clear
plastics) sits within the cover 107 and projects an infrared beam
onto the surface and receives images onto the image sensor 132.
Power supply wires 136 connect the components on the second flex
PCB 129 to battery contacts 137 which are mounted within the cam
barrel 125. A terminal 138 connects to the battery contacts 137 and
the cam barrel 125. A three volt rechargeable battery 139 sits
within the cam barrel 125 in contact with the battery contacts. An
induction charging coil 140 is mounted about the second flex PCB
129 to enable recharging of the battery 139 via induction. The
second flex PCB 129 also mounts an infrared LED 143 and infrared
photodiode 144 for detecting displacement in the cam barrel 125
when either the stylus 120 or the ink cartridge 118 is used for
writing, in order to enable a determination of the force being
applied to the surface by the pen nib 119 or stylus nib 121. The IR
photodiode 144 detects light from the IR LED 143 via reflectors
(not shown) mounted on the slider blocks 123 and 124.
[0467] Rubber grip pads 141 and 142 are provided towards the end
108 of the housing 102 to assist gripping the pen 101, and top 105
also includes a clip 142 for clipping the pen 101 to a pocket.
6.2 Pen Controller
[0468] The pen 101 is arranged to determine the position of its nib
(stylus nib 121 or ink cartridge nib 119) by imaging, in the
infrared spectrum, an area of the surface in the vicinity of the
nib. It records the location data from the nearest location tag,
and is arranged to calculate the distance of the nib 121 or 119
from the location tab utilising optics 135 and controller chip 134.
The controller chip 134 calculates the orientation of the pen and
the nib-to-tag distance from the perspective distortion observed on
the imaged tag.
[0469] Utilising the RF chip 133 and antenna 112 the pen 101 can
transmit the digital ink data (which is encrypted for security and
packaged for efficient transmission) to the computing system.
[0470] When the pen is in range of a receiver, the digital ink data
is transmitted as it is formed. When the pen 101 moves out of
range, digital ink data is buffered within the pen 101 (the pen 101
circuitry includes a buffer arranged to store digital ink data for
approximately 12 minutes of the pen motion on the surface) and can
be transmitted later.
[0471] The controller chip 134 is mounted on the second flex PCB
129 in the pen 101. FIG. 10 is a block diagram illustrating in more
detail the architecture of the controller chip 134. FIG. 10 also
shows representations of the RF chip 133, the image sensor 132, the
tri-color status LED 116, the IR illumination LED 131, the IR force
sensor LED 143, and the force sensor photodiode 144.
[0472] The pen controller chip 134 includes a controlling processor
145. Bus 146 enables the exchange of data between components of the
controller chip 134. Flash memory 147 and a 512 KB DRAM 148 are
also included. An analog-to-digital converter 149 is arranged to
convert the analog signal from the force sensor photodiode 144 to a
digital signal.
[0473] An image sensor interface 152 interfaces with the image
sensor 132. A transceiver controller 153 and base band circuit 154
are also included to interface with the RF chip 133 which includes
an RF circuit 155 and RF resonators and inductors 156 connected to
the antenna 112.
[0474] The controlling processor 145 captures and decodes location
data from tags from the surface via the image sensor 132, monitors
the force sensor photodiode 144, controls the LEDs 116, 131 and
143, and handles short-range radio communication via the radio
transceiver 153. It is a medium-performance (.about.40 MHz)
general-purpose RISC processor.
[0475] The processor 145, digital transceiver components
(transceiver controller 153 and baseband circuit 154), image sensor
interface 152, flash memory 147 and 512 KB DRAM 148 are integrated
in a single controller ASIC. Analog RF components (RF circuit 155
and RF resonators and inductors 156) are provided in the separate
RF chip.
[0476] The image sensor is a 215.times.215 pixel CCD (such a sensor
is produced by Matsushita Electronic Corporation, and is described
in a paper by Itakura, K T Nobusada, N Okusenya, R Nagayoshi, and M
Ozaki, "A 1 mm 50 k-Pixel IT CCD Image Sensor for Miniature Camera
System", IEEE Transactions on Electronic Devices, Volt 47, number
1, January 2000, which is incorporated herein by reference) with an
IR filter.
[0477] The controller ASIC 134 enters a quiescent state after a
period of inactivity when the pen 101 is not in contact with a
surface. It incorporates a dedicated circuit 150 which monitors the
force sensor photodiode 144 and wakes up the controller 134 via the
power manager 151 on a pen-down event.
[0478] The radio transceiver communicates in the unlicensed 900 MHz
band normally used by cordless telephones, or alternatively in the
unlicensed 2.4 GHz industrial, scientific and medical (ISM) band,
and uses frequency hopping and collision detection to provide
interference-free communication.
[0479] In an alternative embodiment, the pen incorporates an
Infrared Data Association (IrDA) interface for short-range
communication with a base station or netpage printer.
[0480] In a further embodiment, the pen 101 includes a pair of
orthogonal accelerometers mounted in the normal plane of the pen
101 axis. The accelerometers 190 are shown in FIGS. 9 and 10 in
ghost outline.
[0481] The provision of the accelerometers enables this embodiment
of the pen 101 to sense motion without reference to surface
location tags, allowing the location tags to be sampled at a lower
rate. Each location tag ID can then identify an object of interest
rather than a position on the surface. For example, if the object
is a user interface input element (e.g. a command button), then the
tag ID of each location tag within the area of the input element
can directly identify the input element.
[0482] The acceleration measured by the accelerometers in each of
the x and y directions is integrated with respect to time to
produce an instantaneous velocity and position.
[0483] Since the starting position of the stroke is not known, only
relative positions within a stroke are calculated. Although
position integration accumulates errors in the sensed acceleration,
accelerometers typically have high resolution, and the time
duration of a stroke, over which errors accumulate, is short.
7. Netpage Printer Description
7.1 Printer Mechanics
[0484] The vertically-mounted netpage wallprinter 601 is shown
fully assembled in FIG. 11. It prints netpages on Letter/A4 sized
media using duplexed 81/2'' Memjet.TM. print engines 602 and 603,
as shown in FIGS. 12 and 12a. It uses a straight paper path with
the paper 604 passing through the duplexed print engines 602 and
603 which print both sides of a sheet simultaneously, in full color
and with full bleed.
[0485] An integral binding assembly 605 applies a strip of glue
along one edge of each printed sheet, allowing it to adhere to the
previous sheet when pressed against it. This creates a final bound
document 618 which can range in thickness from one sheet to several
hundred sheets.
[0486] The replaceable ink cartridge 627, shown in FIG. 13 coupled
with the duplexed print engines, has bladders or chambers for
storing fixative, adhesive, and cyan, magenta, yellow, black and
infrared inks. The cartridge also contains a micro air filter in a
base molding. The micro air filter interfaces with an air pump 638
inside the printer via a hose 639. This provides filtered air to
the printheads to prevent ingress of micro particles into the
Memjet.TM. printheads 350 which might otherwise clog the printhead
nozzles. By incorporating the air filter within the cartridge, the
operational life of the filter is effectively linked to the life of
the cartridge. The ink cartridge is a fully recyclable product with
a capacity for printing and gluing 3000 pages (1500 sheets).
[0487] Referring to FIG. 12, the motorized media pick-up roller
assembly 626 pushes the top sheet directly from the media tray past
a paper sensor on the first print engine 602 into the duplexed
Memjet.TM. printhead assembly. The two Memjet.TM. print engines 602
and 603 are mounted in an opposing in-line sequential configuration
along the straight paper path. The paper 604 is drawn into the
first print engine 602 by integral, powered pick-up rollers 626.
The position and size of the paper 604 is sensed and full bleed
printing commences. Fixative is printed simultaneously to aid
drying in the shortest possible time.
[0488] The paper exits the first Memjet.TM. print engine 602
through a set of powered exit spike wheels (aligned along the
straight paper path), which act against a rubberized roller. These
spike wheels contact the `wet` printed surface and continue to feed
the sheet 604 into the second Memjet.TM. print engine 603.
[0489] Referring to FIGS. 12 and 12a, the paper 604 passes from the
duplexed print engines 602 and 603 into the binder assembly 605.
The printed page passes between a powered spike wheel axle 670 with
a fibrous support roller and another movable axle with spike wheels
and a momentary action glue wheel. The movable axle/glue assembly
673 is mounted to a metal support bracket and it is transported
forward to interface with the powered axle 670 via gears by action
of a camshaft. A separate motor powers this camshaft.
[0490] The glue wheel assembly 673 consists of a partially hollow
axle 679 with a rotating coupling for the glue supply hose 641 from
the ink cartridge 627. This axle 679 connects to a glue wheel,
which absorbs adhesive by capillary action through radial holes. A
molded housing 682 surrounds the glue wheel, with an opening at the
front. Pivoting side moldings and sprung outer doors are attached
to the metal bracket and hinge out sideways when the rest of the
assembly 673 is thrust forward. This action exposes the glue wheel
through the front of the molded housing 682. Tension springs close
the assembly and effectively cap the glue wheel during periods of
inactivity.
[0491] As the sheet 604 passes into the glue wheel assembly 673,
adhesive is applied to one vertical edge on the front side (apart
from the first sheet of a document) as it is transported down into
the binding assembly 605.
7.2 Printer Controller Architecture
[0492] The netpage printer controller consists of a controlling
processor 750, a factory-installed or field-installed network
interface module 625, a radio transceiver (transceiver controller
753, baseband circuit 754, RF circuit 755, and RF resonators and
inductors 756), dual raster image processor (RIP) DSPs 757,
duplexed print engine controllers 760a and 760b, flash memory 658,
and 64 MB of DRAM 657, as illustrated in FIG. 14.
[0493] The controlling processor handles communication with the
network 19 and with local wireless netpage pens 101, senses the
help button 617, controls the user interface LEDs 613-616, and
feeds and synchronizes the RIP DSPs 757 and print engine
controllers 760. It consists of a medium-performance
general-purpose microprocessor. The controlling processor 750
communicates with the print engine controllers 760 via a high-speed
serial bus 659.
[0494] The RIP DSPs rasterize and compress page descriptions to the
netpage printer's compressed page format. Each print engine
controller expands, dithers and prints page images to its
associated Memjet.TM. printhead 350 in real time (i.e. at over 30
pages per minute). The duplexed print engine controllers print both
sides of a sheet simultaneously.
[0495] The master print engine controller 760a controls the paper
transport and monitors ink usage in conjunction with the master QA
chip 665 and the ink cartridge QA chip 761.
[0496] The printer controller's flash memory 658 holds the software
for both the processor 750 and the DSPs 757, as well as
configuration data. This is copied to main memory 657 at boot
time.
[0497] The processor 750, DSPs 757, and digital transceiver
components (transceiver controller 753 and baseband circuit 754)
are integrated in a single controller ASIC 656. Analog RF
components (RF circuit 755 and RF resonators and inductors 756) are
provided in a separate RF chip 762. The network interface module
625 is separate, since netpage printers allow the network
connection to be factory-selected or field-selected. Flash memory
658 and the 2.times.256 Mbit (64 MB) DRAM 657 is also off-chip. The
print engine controllers 760 are provided in separate ASICs.
[0498] A variety of network interface modules 625 are provided,
each providing a netpage network interface 751 and optionally a
local computer or network interface 752. Netpage network Internet
interfaces include POTS modems, Hybrid Fiber-Coax (HFC) cable
modems, ISDN modems, DSL modems, satellite transceivers, current
and next-generation cellular telephone transceivers, and wireless
local loop (WLL) transceivers. Local interfaces include IEEE 1284
(parallel port), 10Base-T and 100Base-T Ethernet, USB and USB 2.0,
IEEE 1394 (Firewire), and various emerging home networking
interfaces. If an Internet connection is available on the local
network, then the local network interface can be used as the
netpage network interface.
[0499] The radio transceiver 753 communicates in the unlicensed 900
MHz band normally used by cordless telephones, or alternatively in
the unlicensed 2.4 GHz industrial, scientific and medical (ISM)
band, and uses frequency hopping and collision detection to provide
interference-free communication.
[0500] The printer controller optionally incorporates an Infrared
Data Association (IrDA) interface for receiving data "squirted"
from devices such as netpage cameras. In an alternative embodiment,
the printer uses the IrDA interface for short-range communication
with suitably configured netpage pens.
7.2.1 Rasterization and Printing
[0501] Once the main processor 750 has received and verified the
document's page layouts and page objects, it runs the appropriate
RIP software on the DSPs 757.
[0502] The DSPs 757 rasterize each page description and compress
the rasterized page image. The main processor stores each
compressed page image in memory. The simplest way to load-balance
multiple DSPs is to let each DSP rasterize a separate page. The
DSPs can always be kept busy since an arbitrary number of
rasterized pages can, in general, be stored in memory. This
strategy only leads to potentially poor DSP utilization when
rasterizing short documents.
[0503] Watermark regions in the page description are rasterized to
a contone-resolution bi-level bitmap which is losslessly compressed
to negligible size and which forms part of the compressed page
image.
[0504] The infrared (IR) layer of the printed page contains coded
netpage tags at a density of about six per inch. Each tag encodes
the page ID, tag ID, and control bits, and the data content of each
tag is generated during rasterization and stored in the compressed
page image.
[0505] The main processor 750 passes back-to-back page images to
the duplexed print engine controllers 760. Each print engine
controller 760 stores the compressed page image in its local
memory, and starts the page expansion and printing pipeline. Page
expansion and printing is pipelined because it is impractical to
store an entire 114 MB bi-level CMYK+IR page image in memory.
7.2.2 Print Engine Controller
[0506] The page expansion and printing pipeline of the print engine
controller 760 consists of a high speed IEEE 1394 serial interface
659, a standard JPEG decoder 763, a standard Group 4 Fax decoder
764, a custom halftoner/compositor unit 765, a custom tag encoder
766, a line loader/formatter unit 767, and a custom interface 768
to the Memjet.TM. printhead 350.
[0507] The print engine controller 360 operates in a double
buffered manner. While one page is loaded into DRAM 769 via the
high speed serial interface 659, the previously loaded page is read
from DRAM 769 and passed through the print engine controller
pipeline. Once the page has finished printing, the page just loaded
is printed while another page is loaded.
[0508] The first stage of the pipeline expands (at 763) the
JPEG-compressed contone CMYK layer, expands (at 764) the Group 4
Fax-compressed bi-level black layer, and renders (at 766) the
bi-level netpage tag layer according to the tag format defined in
section 1.2, all in parallel. The second stage dithers (at 765) the
contone CMYK layer and composites (at 765) the bi-level black layer
over the resulting bi-level CMYK layer. The resultant bi-level
CMYK+IR dot data is buffered and formatted (at 767) for printing on
the Memjet.TM. printhead 350 via a set of line buffers. Most of
these line buffers are stored in the off-chip DRAM. The final stage
prints the six channels of bi-level dot data (including fixative)
to the Memjet.TM. printhead 350 via the printhead interface
768.
[0509] When several print engine controllers 760 are used in
unison, such as in a duplexed configuration, they are synchronized
via a shared line sync signal 770. Only one print engine 760,
selected via the external master/slave pin 771, generates the line
sync signal 770 onto the shared line.
[0510] The print engine controller 760 contains a low-speed
processor 772 for synchronizing the page expansion and rendering
pipeline, configuring the printhead 350 via a low-speed serial bus
773, and controlling the stepper motors 675, 676.
[0511] In the 81/2'' versions of the netpage printer, the two print
engines each prints 30 Letter pages per minute along the long
dimension of the page (11''), giving a line rate of 8.8 kHz at 1600
dpi. In the 12'' versions of the netpage printer, the two print
engines each prints 45 Letter pages per minute along the short
dimension of the page (81/2''), giving a line rate of 10.2 kHz.
These line rates are well within the operating frequency of the
Memjet.TM. printhead, which in the current design exceeds 30
kHz.
8 Netpage Tags
8.1 Tag Tiling
8.1.1 Planar Surface Tag Tiling
[0512] In order to support "single-click" interaction with a tagged
region via a sensing device, the sensing device must be able to see
at least one entire tag 4 in its field of view no matter where in
the region or at what orientation it is positioned. The required
diameter of the field of view of the sensing device is therefore a
function of the size and spacing of the tags 4.
[0513] In the case where the tag shape is circular, such as the
preferred tag 4 described earlier, the minimum diameter m of the
sensor field of view is obtained when the tags 500, of diameter k,
are tiled on an equilateral triangular grid, as shown in FIG. 52
and defined in EQ 1. This is achieved when the center-to-center tag
spacing is the same as the tag diameter k.
[0514] With a tag diameter k of 256 dots (.about.4 mm at 1600 dpi),
m is therefore 552 dots (.about.8.8 mm). With a quiet area of 16
dots, i.e. an effective tag diameter k of 272 dots (.about.4.3 mm),
m increases to 587 dots (.about.9.3 mm).
[0515] When the tags 4 are moved a distance s apart, where s is at
least as large as k, then the minimum field of view is given by EQ
2.
[0516] When no overlap is desired in the horizontal direction
between successive lines of tags 500, for example to make tag
rendering easier, the tags must be moved apart by a minimum amount
given by EQ 3. For a 256-dot diameter tag, u is therefore 40 dots
(.about.0.6 mm at 1600 dpi). Since this exceeds the quiet area
required for the tag, the quiet area can be ignored if tag lines
are rendered to not overlap.
[0517] Setting s=k+u in EQ 2 gives EQ 4. For a 256-dot diameter
tag, s is therefore 296 dots (.about.4.7 mm at 1600 dpi), and m is
598 dots (.about.9.5 mm).
8.1.2 Spherical Surface Tag Tiling
[0518] A regular icosahedron is often used as the basis for
generating an almost regular triangular tiling of a sphere. A
regular icosahedron, such as icosahedron 526 in FIG. 53, is
composed of twenty equal-sized equilateral triangular faces 528
sharing thirty edges 530 and twelve vertices 532, with five of the
edges 530 meeting at each of the vertices 532.
[0519] To achieve the required tiling, the icosahedron 526 is
inscribed in a target sphere, and each triangle 528 of the
icosahedron 526 is subdivided into an equal number of equal-sized
equilateral subdivision triangles to yield the desired total number
of triangles. If each edge 530 of the icosahedron is divided into v
equal intervals, defining a set of v-1 points along each edge, and
each pair of corresponding points along any two adjacent edges is
joined by a line parallel to the other shared adjacent edge, the
lines so drawn intersect at the vertices of the desired equal-sized
and equilateral subdivision triangles, resulting in the creation of
v.sup.2 triangles per triangular face 528 of the icosahedron 526,
or 20 v.sup.2 triangles in all. Of the resulting 10 v.sup.2+2
vertices, five triangular faces meet at each of the twelve original
vertices of the icosahedron 526, and six triangular faces meet at
the each of the remaining vertices. The twelve original vertices
532 already lie on the sphere, while the remaining vertices lie
inside the sphere. Each created vertex is therefore centrally
projected onto the sphere, giving the desired tiling.
[0520] A sphere approximated by a regular polyhedron in this way is
referred to as a geodesic, and the parameter v is referred to as
the frequency of the geodesic. FIG. 54 shows an icosahedral
geodesic 534 with v=3, i.e. with 180 faces 528.
[0521] The closer a subdivision triangle is to the center of a face
of the icosahedron 526, the further it is from the surface of the
sphere, and hence the larger it is when projected onto the sphere.
To minimise variation in the size of projected subdivision
triangles, subdivison vertices can systematically be displaced
prior to projection (Tegmark, M., "An Icosahedron-Based Method for
Pixelizing the Celestial Sphere", ApJ Letters, 470, L81, Oct. 14,
1996). If v=1 then no vertices are created and the angle subtended
by a triangular face at a vertex remains 60.degree.. As v
increases, however, the surface defined by the five triangular
faces surrounding each original vertex becomes increasingly flat,
and the vertex angle of each triangular face converges on
72.degree. (i.e. 360.degree./5). This defines the worst case for a
tag tiling of a sphere. In a 72.degree. isosceles triangle the base
length is 1.18 times the length of the two sides. The maximum tag
spacing s for the purposes of calculating the sensor field of view
is therefore close to 1.18 k. With a tag diameter of 256 dots and a
quiet area of 16 dots, i.e. an effective tag diameter k of 272 dots
(.about.4.3 mm), m is therefore 643 dots (.about.10.2 mm) according
to EQ 2.
[0522] The angle subtended by each edge of an icosahedron at the
center of the circumscribing sphere is given by EQ 5
[0523] For a sphere of radius r the arc length of each centrally
projected edge is r.theta.. Given a tag diameter of K in the same
units as r, the number of tags n required to cover the sphere is
given by EQ 6.
[0524] For a given n, r is limited by EQ 7.
[0525] If n is limited to 2.sup.16, to allow the use of a 16-bit
tag ID without requiring multiple regions to cover the sphere, and
K is taken to be 4.3 mm as above, then r is limited to .about.310
mm.
[0526] A typical globe has a radius of 160 mm. Its projected arc
length of .about.177 mm fits 41 evenly spaced tags with negligible
additional spacing. Such a globe uses 16812 tags in total.
8.1.3 Arbitrary Curved Surface Tag Tiling
[0527] A triangle mesh can approximate a surface of arbitrary
topography and topology without introducing discontinuities or
singularities, with the local scale of the mesh being dictated by
the local curvature of the surface and an error bound. Assuming the
existence of a triangle mesh for a particular surface, an effective
non-regular tiling of tags can be produced as long as each mesh
triangle respects a minimum vertex angle and a minimum edge length.
A tiling is considered effective with respect to a particular
sensing device if the field of view of the sensing device is
guaranteed to include at least one complete tag at any position of
the sensing device on the surface.
[0528] The tiling procedure starts by placing a tag at each vertex
of the mesh, so the minimum edge length is the same as the tag
diameter k. The tiling procedure proceeds by inserting a tag at the
midpoint of any edge whose length exceeds a maximum tag separation
s. As illustrated in FIG. 9, the maximum tag spacing s is
calculated so that if two adjacent tags 4a and 4b are a distance
s+.epsilon. apart, then there is room for another tag 4c between
them, i.e. EQ 8.
[0529] However, if the vertex angle between two edges of length
s+.epsilon. is less than 60.degree., then the inserted tags will
overlap.
[0530] To prevent inserted tags from overlapping, a minimum tag
separation t is introduced, where t.gtoreq.k. The minimum vertex
angle .alpha. then becomes a function of k and t, as shown in EQ
9.
[0531] Clearly, when t=k, .beta. is constrained to be 60.degree.,
i.e. the mesh is constrained to be equilateral. But as illustrated
in FIG. 56, when t>k, .beta. can be less than 60.degree. without
inserted tags overlapping.
[0532] The maximum tag separation s must be based on the new
minimum tag separation t, in accordance with EQ 10.
[0533] When considering a particular mesh triangle, there are four
distinct tag insertion scenarios. By assuming that the minimum
vertex angle is no less than 30.degree. (i.e. half of 60.degree.),
it can be shown that whenever a mesh triangle has at least one edge
less than or equal to s in length, the remaining two edges are less
than 2s in length. In practice the minimum vertex angle is
typically at least 45.degree..
[0534] In the first scenario (FIG. 57) no edges of a triangle 546
exceed s in length, so the tagging of the triangle is already
complete.
[0535] In the second scenario (FIG. 58) one edge 548 of a triangle
550 exceeds s in length. A tag 552 is inserted at the midpoint of
the edge 548 to complete the tagging of the triangle 550.
[0536] In the third scenario (FIG. 59) two edges 554, 556 of a
triangle 558 exceed s in length. Tags 560, 562 are inserted at the
midpoint of each of the two long edges 554, 556 and this may
complete the tagging of the triangle 558. Centers of the two
inserted tags 560, 562 together with the two vertices 564, 566 of
the short edge 568 of the original triangle 558 form a trapezoid.
If either diagonal of the trapezoid exceeds s in length then a
final tag 570 is inserted at the center of the trapezoid to
complete the tagging of the triangle.
[0537] In the fourth scenario (FIG. 60) all three edges 572 of a
triangle 573 exceed s in length. A tagged vertex 574 is inserted at
the midpoint of each edge 572 and the three new vertices 574 are
joined by edges 576. The tagging procedure is then recursively
applied to each of the four resultant triangles 577, 578, 579 and
580. Note that the new triangles respect the minimum vertex angle
because they have the same shape as the original triangle 573.
[0538] The tag tiling variables are summarized in Table 4.
TABLE-US-00005 TABLE 4 Tag tiling variables variable Meaning .beta.
minimum vertex angle K tag diameter M minimum diameter of sensor
field of view on surface S maximum center-to-center tag spacing T
minimum center-to-center tag spacing
8.2 Tag Sensing
8.2.1 Pen Orientation
[0539] To allow a pen-like sensing device to be used as a
comfortable writing instrument, a range of pen orientations must be
supported. Since the pen nib is constrained to be in contact with
the surface, the orientation of the pen can be characterized by the
yaw (z rotation), pitch (x rotation) and roll (y rotation) of the
pen, as illustrated in FIG. 61. While the yaw of the pen must be
unconstrained, it is reasonable to constrain the pitch and roll of
the pen as well as the overall tilt of the pen resulting from the
combination of pitch and roll.
[0540] Yaw is conventionally applied after pitch, such that, for
example, in the case of a pen device it would define a twist about
the physical axis rather than a direction in the surface plane. In
a pen with a marking nib, however, the image sensor is mounted off
the axis of the pen and the pen's image sensing ability (and hence
its yaw sensing ability) is therefore constrained unless the pen is
held almost vertically, as discussed below. Yaw is therefore
applied before pitch, allowing the full yaw range to be specified
by rotating the pen relative to the surface while keeping pitch and
roll constant.
[0541] Pitch and roll are conventionally defined as y and x
rotations, respectively. Here they are defined as x and y
rotations, respectively, because they are defined with respect to
the x-y coordinate system of the surface, where the y axis is the
natural longitudinal axis and the x axis is the natural lateral
axis when viewed by a user. In a right-handed 3D coordinate system,
roll is conventionally defined as positive when anticlockwise,
while pitch and yaw are conventionally defined as positive when
clockwise. Here all rotations are defined as positive when
anticlockwise.
[0542] The pen's overall tilt (.theta.) is related to its pitch
(.phi.) and the roll (.PSI.) in accordance with EQ 11.
[0543] The pen's tilt affects the scale at which surface features
are imaged at different points in the field of view, and therefore
affects the resolution of the image sensor. Since it is impractical
to sense the area directly under the pen nib, the pen's tilt also
affects the distance from the nib to the center of the imaged area.
This distance must be known to allow a precise nib position to be
derived from the position determined from the tag.
8.2.2 Image Sensing
[0544] The field of view can be modeled as a cone defined by a
solid half-angle .alpha. (giving an angular field of view of
2.alpha.), and an apex height of D above the surface when the
optical axis is vertical. Although the image sensor is typically
rectangular, only the largest elliptical subarea of the image
sensor is relevant to guaranteeing that a sufficiently large part
of the surface is imaged, as quantified earlier.
[0545] The intersection of the field of view cone with the surface
defines an elliptical window on the surface. This window is
circular when the optical axis is vertical.
[0546] FIG. 62 illustrates the geometric relationship, for a given
pitch-related tilt .theta. of the pen's optical axis, between the
pen's nib (point A), the pen's optical axis (CE), and the field of
view window (FH). The tilt is defined to be clockwise positive from
the vertical. The equations which follow apply to both positive and
negative tilt.
[0547] When the pen is not tilted, the window diameter (i.e. |BD|)
is given by EQ 12.
[0548] If, when the pen is not tilted, the distance from the nib to
edge of the window (i.e. |AB|) is T, then the distance S from the
nib to the center of the window (i.e. |AC|) is given by EQ 13.
[0549] When the pen is tilted by .theta., the distance from the
viewpoint to the surface along the optical axis is reduced to d
(i.e. |GE|), given by EQ 14.
[0550] The width of the window (i.e. |F|H) is then given by EQ
15.
[0551] D and .alpha. must be chosen so that an adequately large
area is imaged throughout the supported tilt range. The required
minimum diameter m of the area is given by EQ 4, while the width of
the actual imaged area is given by EQ 15. This then gives EQ
16.
[0552] Once D and .alpha. are determined, an image sensor
resolution must be chosen so that the imaged area is adequately
sampled, i.e. that the maximum feature frequency is sampled at its
Nyquist rate or higher.
[0553] When imaged, the scale of the surface decreases with
increasing distance from the viewpoint and with increasing
inclination relative to the viewing ray. Both factors have maximum
effect at point F for positive tilt and point H for negative tilt,
i.e. at the point in the window furthest from the viewpoint. Note
that references to F in the following discussion apply to H when
the tilt is negative.
[0554] The distance of point F from the viewpoint (i.e. |EF|) is
given by EQ 17.
[0555] Scaling due to the inclination of the surface relative to
the viewing ray through F (EF) is given by EQ 18.
[0556] If the surface feature frequency is f, then the angular
surface feature frequency .omega. at F (i.e. with respect to the
field of view) due to both factors is given by EQ 19.
[0557] When there is no object plane tilt (i.e. .theta.=0), this
reduces to EQ 20.
[0558] The image sensor is, by definition, required to image at
least the entire angular field of view. Since the pixel density of
the image sensor is uniform, it must image the entire field of view
at maximum frequency. Given an angular field of view in image space
of 2.alpha.', an image sensor tilt (i.e. image plane tilt) with
respect to the optical axis of .theta.', and a sampling rate of n
(where n.gtoreq.2 according to Nyquist's theorem), the minimum
image sensor resolution q is given by EQ 21 and EQ 22.
[0559] The cos-squared term in the numerator in EQ 22 results from
the same reasoning as the cos-squared term in the denominator in EQ
19.
[0560] When there is no image plane tilt (i.e. .theta.'=0), and the
image space and object space angular fields of view are equal (i.e.
.alpha.'=.alpha.), this reduces to EQ 23 and EQ 24.
[0561] When there is no object plane tilt (i.e. .theta.=0) this
reduces further to EQ 25.
[0562] When the image plane tilt and the object plane tilt are
equal (i.e. .theta.'=0), and the image space and object space
angular fields of view are equal (i.e. .alpha.'=.alpha.), EQ 22
reduces to EQ 26.
[0563] Matching the image plane tilt to the object plane tilt
therefore yields a smaller required image sensor size than when the
image sensor tilt is fixed at zero, and eliminates perspective
distortion from the captured image. Variable image sensor tilt is,
however, a relatively cotly option in practice, and also requires
greater depth of field.
[0564] FIG. 63 illustrates the geometric relationship, for a given
roll-related tilt .theta. of the pen's optical axis, between the
pen's nib (point A), the pen's optical axis (CE), and the field of
view window (FH). The tilt is again defined to be clockwise
positive from the vertical. With the exception of EQ 13, the
preceding equations apply equally to roll-induced tilt. For
roll-induced tilt the distance S from the nib to the center of the
window (i.e. |AC|) is zero rather than as defined by EQ 13.
[0565] For pitch-induced tilt, the magnitude of the tilt range is
maximised by choosing a minimum (negative) tilt and a maximum
(positive) tilt which have the same image sensor requirement.
Since, for pitch-induced tilt, the surface is more distant for
negative tilt than for positive tilt of the same magnitude, the
minimum has a smaller magnitude than the maximum. For roll-induced
tilt they have the same magnitude.
[0566] As described above, the smallest features of the tag 4 are
the structures which encode the data bits, and these have a minimum
diameter of 8 dots. This gives a maximum feature frequency f of
about 7.9 per mm at 1600 dpi.
[0567] As calculated according to EQ 4 above, an equilateral
triangular tiling of 256-dot diameter tags with no overlap between
successive lines of tags requires a minimum field of view window
diameter on the surface of 598 dots, or about 9.5 mm at 1600
dpi.
[0568] Most people hold a pen at about +30.degree. pitch and
0.degree. roll. The inking ball of a ball-point nib loses effective
contact with the surface beyond about +50.degree. pitch (i.e.
40.degree. from the horizontal). A reasonable target pitch range is
therefore -10.degree. to +50.degree., and a reasonable roll range
-30.degree. to +30.degree., bearing in mind greater limitations on
combined pitch and roll as given by EQ 11.
[0569] The highly compact (1.5 mm.sup.2) Matsushita CCD image
sensor (Matsushita Electronic Corporation, and is described in a
paper by Itakura, K T Nobusada, N Okusenya, R Nagayoshi, and M
Ozaki, "A 1 mm 50 k-Pixel IT CCD Image Sensor for Miniature Camera
System", IEEE Transactions on Electronic Devices, Volt 47, number
1, January 2000) is suitable for use in a compact device such as a
pen. It has an available resolution of 215.times.215 pixels.
Assuming equal image and object space angular fields of view, no
image plane tilt, and a nib-to-window distance T of 4 mm,
optimizing the geometry using EQ 16 and EQ 24 to achieve the
desired pitch and roll ranges stated above yields a pitch range of
-16.degree. to)+48.degree. (64.degree. and a roll range of
-28.degree. to)+28.degree. (56.degree. with a viewing distance D of
30 mm and an angular field of view of 18.8.degree.
(.alpha.=9.4.degree.). The available pitch range is actually
-21.degree. to +43.degree., and this is mapped to close to the
desired range by pitching the optical axis at -5.degree. relative
to the physical axis. Note that the tilt range can be expanded
slightly by optimizing a non-zero tilt of the image plane.
[0570] The overall pen tilt is thus confined to an elliptical cone
whose major angle is 64.degree. in the pitch plane and whose minor
angle is 56.degree. in the roll plane.
[0571] The image sensing variables are summarized in Table 5.
TABLE-US-00006 TABLE 5 Image sensing variables Variable meaning
.alpha. Object space field of view half-angle .alpha.' Image space
field of view half-angle .gamma. Pen yaw .theta. Object plane tilt
(i.e. overall pen tilt) .theta.' Image plane tilt .phi. Pen pitch
.psi. Pen roll .omega. Angular frequency in field of view D Normal
viewing distance D Tilted viewing distance F Surface feature
frequency N Sampling rate Q Image sensor resolution S Distance from
nib to center of field of view on surface (when .theta. = 0) T
Distance from nib to edge of field of view on surface (when .theta.
= 0)
8.3 Tag Decoding
8.3.1 Tag Image Processing and Decoding
[0572] Tag image processing is described earlier in Section 1.2.4.
It culminates in knowledge of the 2D perspective transform on the
tag, as well as the decoded tag data.
8.3.2 Inferring the Pen Transform
[0573] Once the 2D perspective transform is obtained which accounts
for the perspective distortion of the tag in the captured image, as
described earlier, the corresponding discrete 3D tag transform with
respect to the pen's optical axis can be inferred, as described
below in Section 8.4.
[0574] Once the discrete 3D tag transform is known, the
corresponding 3D pen transform can be inferred, i.e. the transform
of the pen's physical axis with respect to the surface. The pen's
physical axis is the axis which is embodied in the pen's shape and
which is experienced by the pen's user. It passes through the nib.
The relationship between the physical axis and the optical axis is
illustrated in FIG. 64.
[0575] It is convenient to define three coordinate spaces. In
sensor space the optical axis coincides with the z axis and the
viewpoint is at the origin. In pen space the physical axis
coincides with the z axis and the nib is at the origin. In tag
space the tag 4 lies in the x-y plane with its center at the
origin. The tag transform transforms the tag 4 from tag space to
sensor space.
[0576] Sensor space is illustrated in FIG. 64. The labelling of
points in FIG. 64 is consistent with the labelling in FIG. 62. The
viewpoint is at E, the sensed point is at G, and the nib is at A.
The intersection point G between the optical axis and the surface
is referred to as the sensed point. In contrast with the geometry
illustrated in FIG. 62 where the nib is considered as a point, here
the nib is considered as a small sphere. If the nib is curved, then
the tilt of the physical axis affects the offset between the sensed
point and the contact point between the nib and the surface. The
center point K of the spherical nib, about which the physical axis
pivots, is referred to as the pivot point.
[0577] The nib makes nominal contact with the surface at point A
when the optical axis is vertical. KA is defined to be parallel to
the optical axis. When the pen is tilted, however, contact is at
point L, as shown in FIG. 65. Given the radius R of the nib, the
distance of the pivot point K from the surface, e.g. at A or L, is
always R.
[0578] The discrete tag transform includes the translation of the
tag center from the sensed point, the 3D tag rotation, and the
translation of the sensed point from the viewpoint.
[0579] Given the translation d of the sensed point from the
viewpoint in the discrete tag transform, and according to EQ 14,
the sensed point is given by EQ 27.
[0580] Since the physical axis only differs from the optical axis
by a y translation and x rotation (i.e. pitch), the physical axis
lies in the y-z plane. With reference to FIG. 64, where |AC|=S and
|EC|=D (just as in FIG. 62), it is clear that in sensor space the
position of the pivot point is given by EQ 28.
[0581] The vector from the sensed point to the pivot point is
therefore given by EQ 29.
[0582] The vector from the pivot point to the contact point is by
definition a surface normal of length R. It is constructed by
applying the 3D tag rotation M to a tag space surface normal,
normalizing the result, and scaling by R, as shown in EQ 30 and EQ
31.
[0583] The vector from the sensed point to the contact point is
then obtained in accordance with EQ 32.
[0584] This is transformed into tag space by applying the inverse
of the tag transform 3D rotation, and is then added to the vector
from the tag center to the sensed point, to yield the vector from
the tag center to the contact point in tag space, i.e. on the
surface, in accordance with EQ 33.
[0585] This is finally added to the tag's absolute location, as
implied by its tag ID, to yield the nib's desired absolute location
in the tagged region: see EQ 34.
[0586] The final step is to infer the pen's 3D orientation from the
tag's 3D orientation. The pen's discrete rotations are simply the
inverses of the tag's discrete rotations, with the pen's pitch also
including the effect of the pitch (.phi..sub.sensor) of the optical
axis with respect to the pen's axis, as defined in EQ 35, EQ 36 and
EQ 37.
[0587] 8.4 Inferring the Tag Transform
[0588] The image of the tag 4 captured by the image sensor contains
perspective distortion due to the position and orientation of the
image sensor with respect to the tag. Once the perspective targets
of the tag are found in image space, an eight-degree-of-freedom
perspective transform is inferred based on solving the
well-understood equations relating the four tag-space and
image-space point pairs. The discrete transform steps which give
rise to the image of the tag are concatenated symbolically, and a
set of simultaneous non-linear equations is obtained by equating
corresponding terms in the concatenated transform and the
perspective transform. Solving these equations yields the discrete
transform steps, which include the desired tag offset from the nib,
3D tag rotation, and viewpoint offset from the surface.
8.4.1 Modeling the Tag Transform
[0589] The transform of the tag 4 from tag space to image space can
be modeled as a concatenation of the following transform steps:
[0590] x-y translate (by tag-to-viewpoint offset)
[0591] z rotate (by tag yaw)
[0592] x rotate (by tag pitch)
[0593] y rotate (by tag roll)
[0594] z translate (by tag-to-viewpoint offset)
[0595] perspective project (with specified focal length)
[0596] x-y scale (to viewport size)
[0597] These are concatenated symbolically to produce a single
transform matrix which effects the tag transform. Table 7
summarizes the discrete transform variables used in the following
sections, together with the range of each variable.
TABLE-US-00007 TABLE 7 Discrete transform variables and their
ranges Unit Variable Abbrev. Meaning transform Range .gamma. -- yaw
0 0 .ltoreq. .gamma. .ltoreq. 2.pi. .phi. -- pitch 0 -.pi./2 <
.phi. < .pi./2 .psi. -- roll 0 -.pi./4 < .psi. < .pi./4
t.sub.x A tag-to-viewpoint x 0 -- offset t.sub.y B tag-to-viewpoint
y 0 -- offset cos.gamma. C cosine of yaw 1 -1 .ltoreq. C .ltoreq. 1
sin.gamma. D sine of yaw 0 -1 .ltoreq. D .ltoreq. 1 cos.phi. E
cosine of pitch 1 0 < E .ltoreq. 1 sin.phi. F sine of pitch 0 -1
< F < 1 cos.psi. G cosine of roll 1 0 < G .ltoreq. 1
sin.psi. H sine of roll 0 -1 < H < 1 t.sub.z I
tag-to-viewpoint z -- I < 0 offset 1/.lamda. J inverse focal
length -- J > 0 S -- viewport scale -- S > 0
[0598] Translate in x-y plane by G and G according to EQ 42 (where
A=t.sub.x and B=t.sub.y).
[0599] Rotate about z by .gamma. according to EQ 43 (where
C=cos(.gamma.) and D=sin(.gamma.)), giving EQ 44.
[0600] Rotate about x by .phi. according to EQ 45 (where
E=cos(.phi.) and F=sin(.phi.)), giving EQ 46.
[0601] Rotate about y by .psi. according to EQ 47 (where
G=cos(.psi.) and H=sin(.psi.)), giving EQ 48, where K and L are
defined by EQ 49 and EQ 50.
[0602] Translate in z by t.sub.z according to EQ 51 (where
I=t.sub.z), giving EQ 52.
[0603] Perspective project with focal length X and projection plane
at z=0 according to EQ 53 (where J=1/.lamda.), giving EQ 54.
[0604] Scale to viewport by S according to EQ 55, giving EQ 56.
[0605] Transform a point in the x-y plane (z=0) according to EQ 57,
giving EQ 58.
[0606] Finally, expand K and L, giving EQ 59.
[0607] 8.4.2 2D Perspective Transform
[0608] Given an inferred eight-degree-of-freedom 2D perspective
transform matrix as defined in EQ 60, multiply by an unknown i to
obtain the general nine-degree-of-freedom form of the matrix, as
shown in EQ 61.
[0609] Transform a 2D point according to EQ 62, giving EQ 63.
8.4.3 Inferring the Tag Transform
8.4.3.1 Equating Coefficients
[0610] Equating the coefficients in EQ 59 with the coefficients in
EQ 63 results in EQ 64 to EQ 72, being nine non-linear equations in
11 unknowns.
[0611] These equations are augmented as required by the
trigonometric identity relating the sine and cosine of an angle
(i.e. the sine and cosine of any one of yaw, pitch and roll), as
shown in EQ 73.
[0612] Given the sine and cosine of an angle, the corresponding
angle is obtained using a two-argument arctan as shown in EQ
74.
8.4.3.2 Solving for X-Y Offset
[0613] EQ 66 can be simplified using EQ 64 and EQ 65 to give EQ 75
and then EQ 76.
[0614] EQ 69 can be simplified using EQ 67 and EQ 68 to give EQ 77
and then EQ 78.
[0615] EQ 72 can be simplified using EQ 70 and EQ 71 to give EQ 79
and then EQ 80.
[0616] EQ 76 can be re-written as EQ 81, and EQ 78 can be
re-written as EQ 82.
Equating EQ 81 and EQ 82 and solving for B yields EQ 83 through EQ
85 and finally EQ 86, which defines B. Substituting the value for B
into EQ 82 and simplifying yields EQ 87 through EQ 90 and finally
EQ 91, which defines A.
[0617] This therefore gives the x-y offset of the tag 4 from the
viewpoint, since A=t.sub.x and B=t.sub.y.
8.4.3.3 Solving for Pitch
[0618] From EQ 68, EQ 92 can be obtained.
[0619] From EQ 67, EQ 93 can be obtained.
[0620] From EQ 64, EQ 92 and EQ 93, EQ 94 can be obtained.
[0621] From EQ 65, EQ 92 and EQ 93, EQ 95 can be obtained.
[0622] From EQ 70, EQ 92 and EQ 93, EQ 96 can be obtained.
[0623] From EQ 71, EQ 92 and EQ 93, EQ 97 can be obtained.
[0624] From EQ 94, EQ 98 can be obtained.
[0625] From EQ 95, EQ 99 can be obtained.
[0626] From EQ 96, EQ 100 can be obtained.
[0627] From EQ 97, EQ 101 can be obtained.
[0628] From EQ 98 and EQ 99, EQ 102 and then EQ 103 can be
obtained.
[0629] From EQ 100 and EQ 101, EQ 104 and then EQ 105 can be
obtained.
[0630] From EQ 103 and EQ 105, EQ 106 and then EQ 107 can be
obtained.
[0631] EQ 107 only has a valid basis if G and H are both non-zero.
Since |.psi.|<.pi./2, the cosine (G) of the roll is always
positive and hence non-zero. The sine of the roll (H) is only
non-zero if the roll is non-zero. Specific handling for zero pitch
and roll is described in Section 6.7.3.10.
[0632] This therefore gives the magnitude of the sine of the pitch,
since F=sin(.phi.), and hence the cosine (E) of the pitch by EQ 73,
according to EQ 108.
[0633] Since |.phi.|<.pi./2, the cosine (E) of the pitch is
always positive, so there is no ambiguity when taking the square
root. The sign of the sine (F), however, must be determined by
other means, as described in Section 6.7.3.9.
[0634] Given E and F, the pitch is then obtained, according to EQ
109.
8.4.3.4 Solving for Roll
[0635] From EQ 103, EQ 110 can be obtained.
[0636] From EQ 73, EQ 111 and then EQ 112 can be obtained.
[0637] This therefore gives the magnitude of the sine of the roll,
since H=sin(.psi.), and hence the cosine (G) of the roll by EQ 73,
according to EQ 113.
[0638] Since |.psi.|<.pi./4, the cosine (G) of the roll is
always positive, so there is no ambiguity when taking the square
root. The sign of the sine (H), however, must be determined by
other means, as described in Section 6.7.3.9.
[0639] Given G and H, the roll is then obtained according to EQ
114.
8.4.3.5 Solving for Yaw
[0640] From EQ 73, EQ 92 and EQ 93, EQ 115 and then EQ 116 can be
obtained.
[0641] From EQ 92 and EQ 116, EQ 117 and then EQ 118 can be
obtained.
[0642] From EQ 92 and EQ 116, EQ 119 and then EQ 120 can be
obtained.
[0643] In EQ 116, and hence EQ 118 and EQ 120, the sign of the
square root is determined by the sign of i, which can be determined
from EQ 80, giving EQ 121.
[0644] Since I (t.sub.z) is negative, J (1/.lamda.) is positive,
and IJ<-1 (because |t.sub.z|>.lamda.), then EQ 122 holds.
[0645] Given C and D, the yaw is then obtained according to EQ
123.
8.4.3.6 Solving for Viewport Scale
[0646] The cosine (C) and sine (D) of the yaw are by definition
never simultaneously zero. Since the cosine (E) of the pitch is
never zero, either EQ 67 or EQ 68 can therefore always be used to
determine the viewport scale (S).
[0647] If D is non-zero, then from EQ 67, EQ 124 can be
obtained.
[0648] Otherwise, if C is non-zero, then from EQ 68, EQ 125 can be
obtained.
8.4.3.7 Solving for Focal Length Similarly, since the cosine (G) of
the roll is never zero, either EQ 70 or EQ 71 can be used to
determine the inverse focal length (J), so long as either the pitch
or roll is non-zero. However, the signs of the sines (F and H) of
the pitch and roll may not be known. However, the sign of the
product (FH) of the sines of the pitch and roll is given by EQ 103,
as shown in EQ 126.
[0649] The sign can be assigned arbitrarily to F, since the sign of
J is known a priori. If gi is non-zero, then from EQ 70, EQ 127 can
be obtained.
[0650] If hi is non-zero, then from EQ 71, EQ 128 can be
obtained.
[0651] In practice, the choice between using EQ 127 and EQ 128 is
based on which of gi and hi has the larger magnitude. The inverse
focal length is unknown if gi and hi are both zero, i.e. if the
pitch and roll are both zero.
8.4.3.8 Solving for Z Offset
[0652] Once the inverse focal length (J) is known, the z offset (I)
is obtained from EQ 80, according to EQ 129.
[0653] Again, the z offset (I) is unknown if the inverse focal
length (J) is unknown, i.e. if the pitch and roll are both
zero.
8.4.3.9 Determining Direction of Pitch and Roll
[0654] The sign of the product (FH) of the sines of the pitch and
roll is given by EQ 126. Since -.pi./4<.psi.<.pi./4, a roll
adjustment of +.pi./4 can be introduced to ensure the roll is
always positive, without invalidating any other assumptions. Once
the roll adjustment is introduced, EQ 126 gives the sign of the
sine (F) of the pitch alone.
[0655] The roll adjustment is introduced as follows. The viewport
scale (S), inverse focal length (J), and z offset (I) are all
computed as described. A 3D transform matrix is created from the 2D
perspective transform matrix. The inverses of the viewport scale,
focal length projection and z translation are applied to the 3D
matrix in reverse order. The roll adjustment is then applied by
pre-multiplying the matrix by a .pi./4 y rotation matrix. The roll,
pitch and yaw are computed as described. Since the roll is
positive, the pitch direction is now known. The .pi./4 roll
adjustment is finally subtracted from the roll to give the actual
roll.
[0656] When the roll and pitch are both zero, the focal length and
z offset are both unknown as described above. However, in this case
there is no need to adjust the roll since the pitch and roll are
already known.
8.4.3.10 Handling Zero Pitch and Roll
[0657] When either the pitch or roll is zero, the general solution
based on EQ 107 becomes invalid. The table of FIG. 85 shows the 12
degenerate forms of EQ 64 through EQ 71 which result when the yaw
is variously zero (or .pi.), .pi./2 (or 3.pi./2), and non-zero, and
the pitch and roll are variously zero and non-zero. The table of
FIGS. 86 and 87 sets out the required logic for detecting and
handling cases where the pitch and/or roll are zero, with each case
motivated by zeros appearing in the table of FIG. 85. The cases in
the table of FIG. 85 are labelled with the case numbers from the
table of FIGS. 86 and 87.
CONCLUSION
[0658] The present invention has been described with reference to a
preferred embodiment and number of specific alternative
embodiments. However, it will be appreciated by those skilled in
the relevant fields that a number of other embodiments, differing
from those specifically described, will also fall within the spirit
and scope of the present invention. Accordingly, it will be
understood that the invention is not intended to be limited to the
specific embodiments described in the present specification,
including documents incorporated by cross-reference as appropriate.
The scope of the invention is only limited by the attached
claims.
* * * * *