U.S. patent application number 14/176783 was filed with the patent office on 2014-06-05 for assumed demographics, predicted behavior, and targeted incentives.
This patent application is currently assigned to CATALINA MARKETING CORPORATION. The applicant listed for this patent is Catalina Marketing Corporation. Invention is credited to Ryan CARR, Mark DAVIS, Gary M. KATZ.
Application Number | 20140156401 14/176783 |
Document ID | / |
Family ID | 34434867 |
Filed Date | 2014-06-05 |
United States Patent
Application |
20140156401 |
Kind Code |
A1 |
CARR; Ryan ; et al. |
June 5, 2014 |
ASSUMED DEMOGRAPHICS, PREDICTED BEHAVIOR, AND TARGETED
INCENTIVES
Abstract
A system and method for anticipating consumer behavior and
determining transaction incentives for influencing consumer
behavior comprises a computer system and associated database for
determining cross time correlations between transaction behavior,
for applying the function derived from the correlations to consumer
records to predict future consumer behavior, and for deciding on
transaction incentives to offer the consumers based upon their
predicted behavior.
Inventors: |
CARR; Ryan; (South Elgin,
IL) ; DAVIS; Mark; (Clearwater, FL) ; KATZ;
Gary M.; (Northbrook, IL) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
Catalina Marketing Corporation |
St. Petersburg |
FL |
US |
|
|
Assignee: |
CATALINA MARKETING
CORPORATION
St. Petersburg
FL
|
Family ID: |
34434867 |
Appl. No.: |
14/176783 |
Filed: |
February 10, 2014 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
12870764 |
Aug 27, 2010 |
8650065 |
|
|
14176783 |
|
|
|
|
10559937 |
Dec 8, 2005 |
|
|
|
12870764 |
|
|
|
|
PCT/US04/26736 |
Sep 13, 2004 |
|
|
|
10559937 |
|
|
|
|
60503877 |
Sep 22, 2003 |
|
|
|
Current U.S.
Class: |
705/14.53 |
Current CPC
Class: |
G06Q 30/0255 20130101;
G06Q 30/0235 20130101; G06Q 30/02 20130101; G06Q 30/0204 20130101;
G06Q 30/0202 20130101 |
Class at
Publication: |
705/14.53 |
International
Class: |
G06Q 30/02 20060101
G06Q030/02 |
Claims
1. A computer implemented method of predicting demographic traits
of consumers based on purchase histories and identifying the
predicted demographic traits that are most likely to be accurate,
the method being implemented by a computer system having one or
more physical processors programmed with computer program
instructions that, when executed by the one or more physical
processors, cause the computer system to perform the method, the
method comprising: obtaining, by the computer system, a first set
of consumer records related to a plurality of first consumers,
wherein each of the first set of consumer records includes: a first
set of known demographic traits for a corresponding first consumer,
and first purchase information that indicates one or more first
purchases made by the corresponding first consumer; obtaining, by
the computer system, second purchase information that indicates one
or more second purchases made by a second consumer; comparing, by
the computer system, the first purchase information and the second
purchase information to determine one or more correlations between
the first purchase information and the second purchase information;
determining, by the computer system, a set of assumed demographic
traits of the second consumer based on the comparison and the first
set of known demographic traits, wherein each assumed demographic
trait: (i) predicts a demographic trait of the second consumer, and
(ii) is associated with a probability that the assumed demographic
trait is accurate; selecting, by the computer system, a subset of
the set of assumed demographic traits based on the their respective
probabilities such that at least a first assumed demographic trait
is selected over at least a second assumed demographic trait based
on a first probability that the first assumed demographic trait is
accurate and a second probability that the second assumed
demographic trait is accurate; and identifying, by the computer
system, an incentive to provide to the second consumer based on the
subset.
2. The method of claim 1, the method further comprising: providing
the incentive to the second consumer through at least a first
communication channel from among a plurality of communication
channels.
3. The method of claim 2, the method further comprising:
determining a redemption probability that indicates a likelihood of
a redemption of the incentive by the second consumer if the
incentive is provided through the first communication channel; and
determining whether to provide the incentive through the first
communication channel based on the redemption probability.
4. The method of claim 1, wherein the first assumed demographic
trait relates to a first value for a given type of assumed
demographic trait and the second assumed demographic trait relates
to a second value for the given type of assumed demographic
trait.
5. The method of claim 1, wherein the first assumed demographic
trait relates to a first type of assumed demographic trait and the
second assumed demographic trait relates to a second type of
assumed demographic trait.
6. The method of claim 1, wherein selecting the subset comprises:
filtering the set of assumed demographic traits based on: (i)
respective values of each of the set of assumed demographic traits,
(ii) respective types of each of the set of assumed demographic
traits, and (iii) the respective probabilities that each of the set
of assumed demographic traits is accurate.
7. The method of claim 6, wherein filtering the set of assumed
demographic traits comprises: filtering at least a first type of
assumed demographic trait based on a first probability threshold
and filtering at least a second type of assumed demographic trait
based on a second probability threshold different from the first
probability threshold.
8. The method of claim 1, wherein a given probability that a given
assumed demographic trait is accurate is based on a level of
correlation between the first purchase information and the second
purchase information.
9. The method of claim 1, wherein a given probability that a given
assumed demographic trait is accurate is based on a level of
reliability of correlations between the first purchase information
and the second purchase information in predicting the given assumed
demographic trait.
10. The method of claim 1, wherein selecting the subset comprises:
ranking the set of assumed demographic traits with respect to each
other based on the respective probabilities, such that the first
demographic trait is ranked with respect to the second demographic
trait, wherein the selected subset is based on the ranking.
11. The method of claim 1, wherein the second consumer is
associated with an assumed or known demographic trait that
indicates a consumer location related to the second consumer, the
method further comprising: determining a retail location associated
with a retailer; and correlating the consumer location with the
retail location to determine whether the consumer location is
within a certain region of the retail location, wherein the
incentive is identified based further on the correlation.
12. The method of claim 11, the method further comprising:
determining a retail location associated with a retailer; and
performing demographic analysis based on the retail location and at
least some of the set of assumed demographic traits.
13. The method of claim 12, wherein the second consumer comprises a
plurality of second consumers, and wherein the subset of the set of
assumed demographic traits relates to a respective consumer
location for each of the plurality of second consumers, wherein
performing demographic analysis comprises: associating a set of the
plurality of second consumers that made a purchase at the retailer
with one or more regions within a certain proximity to the retail
location based on the second purchase information and the
respective consumer location for each of the plurality of second
consumers; and determining a number of consumers of the retailer in
the one or more regions based on the association between the set of
the plurality of second consumers and the one or more regions.
14. The method of claim 13, the method further comprising:
determining a total number of consumers that made a purchase at the
retailer based on the second purchase information; and calculating
a fraction of consumers in the one or more regions relative to the
total number of consumers based on the number and the total number;
and determining whether to provide the incentive based on the
fraction.
15. The method of claim 12, wherein the second consumer comprises a
plurality of second consumers, and wherein the subset of the set of
assumed demographic traits relate to a respective consumer location
for each of the plurality of second consumers, wherein performing
demographic analysis comprises: associating a set of the plurality
of second consumers that made a purchase at the retailer with one
or more regions within a certain proximity to the retail location
based on the second purchase information and the respective
consumer location for each of the plurality of second consumers;
and determining a value of purchases that the set of the plurality
of second consumers in the one or more regions made at the retail
location based on the association between the set of the plurality
of second consumers and the one or more regions.
16. The method of claim 15, the method further comprising:
determining a total value of purchases made at the retailer based
on the second purchase information; and calculating a fraction of
the value of purchases made in the one or more regions relative to
the total value based on the value and the total value; and
determining whether to provide the incentive based on the
fraction.
17. A system of predicting demographic traits of consumers based on
purchase histories and identifying the predicted demographic traits
that are most likely to be accurate, the system comprising: a
computer system comprising one or more physical processors
programmed with computer program instructions to: obtain a first
set of consumer records related to a plurality of first consumers,
wherein each of the first set of consumer records includes: a first
set of known demographic traits for a corresponding first consumer,
and first purchase information that indicates one or more first
purchases made by the corresponding first consumer; obtain second
purchase information that indicates one or more second purchases
made by a second consumer; compare the first purchase information
and the second purchase information to determine one or more
correlations between the first purchase information and the second
purchase information; determine a set of assumed demographic traits
of the second consumer based on the comparison and the first set of
known demographic traits, wherein each assumed demographic trait:
(i) predicts a demographic trait of the second consumer, and (ii)
is associated with a probability that the assumed demographic trait
is accurate; select a subset of the set of assumed demographic
traits based on the their respective probabilities such that at
least a first assumed demographic trait is selected over at least a
second assumed demographic trait based on a first probability that
the first assumed demographic trait is accurate and a second
probability that the second assumed demographic trait is accurate;
and identify an incentive to provide to the second consumer based
on the subset.
18. The system of claim 17, wherein the computer system is further
programmed to: provide the incentive to the second consumer through
at least a first communication channel from among a plurality of
communication channels.
19. The system of claim 18, wherein the computer system is further
programmed to: determine a redemption probability that indicates a
likelihood of a redemption of the incentive by the second consumer
if the incentive is provided through the first communication
channel; and determine whether to provide the incentive through the
first communication channel based on the redemption
probability.
20. The system of claim 17, wherein the first assumed demographic
trait relates to a first value for a given type of assumed
demographic trait and the second assumed demographic trait relates
to a second value for the given type of assumed demographic
trait.
21. The system of claim 17, wherein the first assumed demographic
trait relates to a first type of assumed demographic trait and the
second assumed demographic trait relates to a second type of
assumed demographic trait.
22. The system of claim 1, wherein the computer system is further
programmed to: filter the set of assumed demographic traits based
on: (i) respective values of each of the set of assumed demographic
traits, (ii) respective types of each of the set of assumed
demographic traits, and (iii) the respective probabilities that
each of the set of assumed demographic traits is accurate, wherein
the subset is selected based on the filtered set of assumed
demographic traits.
23. The system of claim 22, wherein the computer system is further
programmed to: filter at least a first type of assumed demographic
trait based on a first probability threshold and filtering at least
a second type of assumed demographic trait based on a second
probability threshold different from the first probability
threshold.
24. The system of claim 17, wherein a given probability that a
given assumed demographic trait is accurate is based on a level of
correlation between the first purchase information and the second
purchase information.
25. The system of claim 17, wherein a given probability that a
given assumed demographic trait is accurate is based on a level of
reliability of correlations between the first purchase information
and the second purchase information in predicting the given assumed
demographic trait.
26. The system of claim 17, wherein the computer system is further
programmed to: rank the set of assumed demographic traits with
respect to each other based on the respective probabilities, such
that the first demographic trait is ranked with respect to the
second demographic trait, wherein the selected subset is based on
the rank.
27. The system of claim 17, wherein the second consumer is
associated with an assumed or known demographic trait that
indicates a consumer location related to the second consumer,
wherein the computer system is further programmed to: determine a
retail location associated with a retailer; and correlate the
consumer location with the retail location to determine whether the
consumer location is within a certain region of the retail
location, wherein the incentive is identified based further on the
correlation.
28. The system of claim 27, wherein the computer system is further
programmed to: determine a retail location associated with a
retailer; and perform demographic analysis based on the retail
location and at least some of the set of assumed demographic
traits.
29. The system of claim 28, wherein the second consumer comprises a
plurality of second consumers, and wherein the subset of the set of
assumed demographic traits relates to a respective consumer
location for each of the plurality of second consumers, wherein the
computer system is further programmed to: associate a set of the
plurality of second consumers that made a purchase at the retailer
with one or more regions within a certain proximity to the retail
location based on the second purchase information and the
respective consumer location for each of the plurality of second
consumers; and determine a number of consumers of the retailer in
the one or more regions based on the association between the set of
the plurality of second consumers and the one or more regions.
30. The system of claim 29, wherein the computer system is further
programmed to: determine a total number of consumers that made a
purchase at the retailer based on the second purchase information;
and calculate a fraction of consumers in the one or more regions
relative to the total number of consumers based on the number and
the total number; and determine whether to provide the incentive
based on the fraction.
31. The system of claim 28, wherein the second consumer comprises a
plurality of second consumers, and wherein the subset of the set of
assumed demographic traits relate to a respective consumer location
for each of the plurality of second consumers, wherein the computer
system is further programmed to: associate a set of the plurality
of second consumers that made a purchase at the retailer with one
or more regions within a certain proximity to the retail location
based on the second purchase information and the respective
consumer location for each of the plurality of second consumers;
and determine a value of purchases that the set of the plurality of
second consumers in the one or more regions made at the retail
location based on the association between the set of the plurality
of second consumers and the one or more regions.
32. The system of claim 31, wherein the computer system is further
programmed to: determine a total value of purchases made at the
retailer based on the second purchase information; and calculate a
fraction of the value of purchases made in the one or more regions
relative to the total value based on the value and the total value;
and determine whether to provide the incentive based on the
fraction.
Description
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuation of U.S. patent
application Ser. No. 12/870,764, filed Aug. 27, 2010, which is a
division of U.S. patent application Ser. No. 10/559,937, filed Dec.
8, 2005, which was a National Stage entry of PCT/US04/26736, filed
Sep. 13, 2004, which claims priority to U.S. provisional
application Ser. No. 60/503,877, filed Sep. 22, 2003, and the
entire contents of each of these applications are incorporated
herein by reference.
BACKGROUND OF THE INVENTION
[0002] 1. Field of the Invention
[0003] The invention relates to the field of marketing. More
particularly, the invention relates to data mining and targeted
marketing.
[0004] 2. Discussion of the Background
[0005] Marketing means communicating information regarding at least
one of products and services to consumers.
[0006] Targeted marketing means selectively marketing to a limited
number of consumers, such as an individual, members of a family,
persons with the same residence as one another, or persons having
some other piece of relevant information in common with one
another. U.S. Pat. No. 5,832,457 to O'Brien discloses targeted
marketing.
[0007] Value means a good, a service, or a pecuniary interest
including cash, check, credit, and conditional credit.
[0008] Transaction, in this application, means an exchange
involving at least two parties. A purchase is a transaction.
Receipt of an incentive offer (such as the act of downloading an
incentive offer from a web site), redemption of an incentive offer,
and acceptance of participation in a consumer survey are
transactions.
[0009] Purchase, in this application, means a transaction in which
cash, check, charge, or credit is exchanged for one or more goods
and services.
[0010] Transaction incentive, in this application, means an offer
for a certain value in exchange for entering into a specified
transaction.
[0011] Purchase incentive, in this application, means an incentive
wherein the specified transaction for which the incentive is
offered is a purchase.
[0012] Incentive, in this application, means the certain value
associated with entering into a transaction specified in a
transaction incentive.
[0013] Incentive terms and transaction incentive terms, in this
application, each means the specifications of the transaction into
which someone must enter to be entitled to the incentive associated
with the transaction incentive.
[0014] Predictive modeling is disclosed in U.S. Pat. No. 4,961,089
to Jamzadeh. Predictive modeling in this application means using
data corresponding to prior events to determine a probability of
occurrence of an event during some time period in the future. An
example of a predictive model would be a formula whose inputs are
the number of sun spots recorded over each of the last several
years and whose output was a probability of a number of sunspots in
the next year falling within a specified range.
[0015] A prediction, in this application, means an assumption that
one or more events, such as a transaction or a purchase, either are
probable to occur or are not probable to occur.
[0016] A class, in this application, means a set of things that
share a common attribute. An example of a class is the class of all
things that are goods. Goods are items of merchandise for sale.
Another example of a class is the class of goods that are packaged
goods. Another example of a class is the class of packaged goods
that are canned goods. Another example of a class is the class of
all products that are sold under the same brand name or trademark.
Another example of a class is the class of all products that are
manufactured by the same manufacturer. Another example of a class
is the class of all goods that are diet related. Another example of
a class is the class of all goods that are prescription drugs.
Another example of a class is the class of all products that are
breakfast products. Another example of a class is the class of all
service reservations that are rooming reservations, such as
reservations at hotels and motels. Another example of a class is
the class of all services reservations that are transportation
reservations, including car, bus, train, and airline reservations.
A class may consist of items of a single product, such as items
having the same Universal Product Code ("UPC"). For example, all
product items having the UPC code associated with 12 ounce glass
bottles of Coca Cola define a class.
[0017] A database is a collection of data. Typically a database is
organized in some fashion so that information contained in the data
may be readily accessed. In this application the term database
means data organized in some format in a computer memory that can
be accessed by an associated computer system. Such a concept is
also referred to as a database management system. A database or
database management system includes commercial database products
implemented in a computer system, such as the Microsoft Access and
SQL Server line of products as well as any set of files stored in
computer memory that can be accessed by an associated computer
system.
[0018] Designated, as used in the foregoing example, refers to an
association of the data defining the good or service with data
defining an attribute. All data defining products and services
associated with the same attribute are deemed to be in the same
class defined by that attribute.
[0019] Data defining a good or a service may be stored in a
database. Such data may be designated as a member of a class.
[0020] Transaction data, in this application, means data defining a
transaction. Transaction data includes price of transaction,
description of items purchased, price of items purchased, quantity
of each type of item purchased, date of transaction, time of
transaction, location of transaction, identity of purchaser,
identify of seller, tender type of transaction, etc. Transaction
data does not include description of purchaser beyond identity.
[0021] Non-transaction demographic data, in this application, means
characteristics of people that are not part of transaction data.
For example, non-transaction demographics data includes age
bracket, gender, race, ethnicity, pet ownership. In addition,
non-transaction demographics data includes income range, type of
job or profession, education level, marital status, existence of
children in an age bracket, and hobbies.
[0022] Block data, in this application, means data specifying
averages, medians, or other statistical measures, by detailed
product category and demographic, for a group of people living in a
geographic region of a size on the order of a city block or
subdivision. Typically, the dimension of a geographic region
associated with a block, which is sometimes referred to as a block
region, is between about 100 meters and about 1000 meters.
[0023] Survey demographic data is data provided by people in
response to questions posed to them. Survey demographic data is not
transaction data, as transaction data is defined herein.
[0024] Survey derived demographic data is data measuring a
demographic of a person that is derived from the person's response
to questions posed to the person.
[0025] Demographic data is often survey or survey derived
demographic data, which is data obtained by peoples responses to
questions posed to them. For example, people may be associated with
a demographic named `Active Nutritionalists` in response to either
response to a question whether they are actively practice good
nutritional habits or responses to a series of questions relating
to the type of foods they purchase indicating that they have
healthy diets.
[0026] Mixed derived demographic data, in this application, means
demographic data derived from both non-transaction demographic data
and transaction data. For example, mixed derived demographic data
may be a value of a demographic variable inferred for a customer
based upon the customer identifying `Active Nutritionalist` in a
survey question and regularly purchasing milk. For example, a mixed
derived demographic value for this example could be "mother
practicing good nutritional habits" since regular purchase of milk
may indicate mother of small children demographic.
[0027] "Demographic" herein often is used as a noun, referring to a
value of demographic data.
[0028] In this application, assumed means predicted.
[0029] In this application, dis-associated with means not
associated with.
[0030] In this application, assumed data means a prediction that a
property associated with the value of the data accurately
characterizes a thing, such as a person or people associated with a
CID (consumer identification).
[0031] In this application, assumed demographic data associated
with a person or a CID means a prediction that the corresponding
demographic accurately characterizes the person or people
associated with the CID.
[0032] In this application, assumed non-transaction demographic
associated with a person or a CID means a prediction that the
corresponding non-transaction demographic accurately characterizes
the person or people associated with the CID.
DISCLOSURE OF THE INVENTION
Objects of the Invention
[0033] It is an object of the invention to provide to a consumer
targeted incentives based upon a prediction whether the consumer
will enter into a transaction.
[0034] Generally speaking, it is an object of the invention to
identify transaction patterns of a set of consumers in an earlier
time period that correlate to specified purchases by those
consumers in later time periods.
[0035] More specifically, it is an object of the invention to
identify transaction patterns of a set of consumers in one or more
earlier first time periods that correlate to purchases by those
consumers of one or more products and services within specified
product and service classes in one or more later second time
periods after the one or more earlier first time periods.
[0036] Generally speaking, it is an object of the invention to
identify transaction patterns of consumers in a first time period
that correlate to changes in rate or amount of purchases within a
specific product or service class between the earlier first time
period and the later second time period.
[0037] More specifically, it is an object of the invention to
identify transaction patterns of a set of consumers in one or more
earlier first time periods that correlate to changes in purchases
by those consumers of one or more products and services within
specified product and service classes between the one or more
earlier first time periods and one or more later second time
periods after the one or more earlier first time periods.
[0038] It is a further object of the invention to base a decision
whether to offer a consumer a transaction incentive, such as a
purchase incentive for a specified purchase within a certain
product or service class, upon the existence of the foregoing
transaction patterns in the transaction history records of the
consumer that correlate to that certain product or service
class.
[0039] It is a still further object of this invention to determine
from the foregoing correlations a probability or a prediction that
the specified consumer will purchase an item within a class of
products or services.
[0040] It is still a further object of the invention to use the
foregoing probability or prediction that the specified consumer
will or will not enter into a purchase transaction for an item
within a class of products or services as a factor in deciding what
if any purchase incentives to offer to that specified consumer.
[0041] It is a still further object of this invention to determine
from the foregoing correlations relative likelihoods, for each
member of a set of consumers, that the consumer will purchase an
item within a class of products or services, and to use that
ranking in deciding which consumers to target market.
[0042] It is another object of this invention to perform targeted
marketing by depending offering to a consumer a transaction
incentive upon a probability that offering the incentive will
change the consumer's purchase behavior for purchase of an item
specified in the purchase incentive.
[0043] It is another object of this invention to perform targeted
marketing by depending the value of a purchase incentive, the
preconditions for the consumer to obtain the purchase incentive, or
both, upon a quantitative determined probability or prediction of
the consumer entering into a purchase transaction.
[0044] It is another object of the invention to perform targeted
marketing so as maximize the expected return (either gross profit
or net profit) to the seller or manufacturer of an item of a
product or service based upon the cost of the incentive to the
seller or manufacturer and a predicted probability of changing the
consumer's purchase behavior.
[0045] It is also a general object of the invention to minimize the
cost associated with targeted marketing campaigns by increasing the
redemption rate of incentive offers.
[0046] It is another object of the invention to associate at least
one demographic data with a Consumer Identification ("CID")
associated with a transaction history for a consumer, to thereby
define an assumed non-transaction demographic data for the
consumer.
[0047] It is another object of the invention to assign a
probability that an assumed non-transaction demographic data for a
consumer is correct.
[0048] It is another object of the invention to identify from a
group of assumed non-transaction demographic data for a consumer,
that non-transaction demographic data which has the highest
probability of being correct.
[0049] It is another object of the invention to rank a group of
assumed non-transaction demographic data for a consumer according
to the probability of the assumed non-transaction demographic data
being correct.
[0050] It is another object of the invention to disassociate at
least one non-transaction demographic data with a CID associated
with a transaction history for a consumer, to thereby define a
dis-assumed non-transaction demographic data, that is, a
non-transaction demographic data assumed to not be correct for a
consumer.
[0051] It is another object of the invention to assign a
probability that the dis-assumed non-transaction demographic data
for the consumer is correct, and to rank a set of dis-assumed
non-transaction demographic data for a consumer by their
probability of being correct.
[0052] It is another object of the invention to determine whether
to offer a specified purchase incentive to a consumer based upon at
least in part upon one of assumed non-transaction demographic,
mixed derived demographic, dis-assumed non-transaction demographic,
most probable assumed non-transaction demographic, most probable
dis-assumed non-transaction demographic, probability of assumed
non-transaction demographic, probability of dis-assumed
non-transaction demographic, ranking of assumed non-transaction
demographic, or ranking of dis-assumed non-transaction demographic
associated with that consumer.
[0053] It is another object of the invention to define a purchase
incentive or select from alternative specified terms for purchase
incentives a purchase incentive to offer to a consumer based upon
at least in part upon one of assumed non-transaction demographic,
dis-assumed non-transaction demographic, most probable assumed
non-transaction demographic, most probable dis-assumed
non-transaction demographic, probability of assumed non-transaction
demographic, probability of dis-assumed non-transaction
demographic, ranking of assumed non-transaction demographic, or
ranking of dis-assumed non-transaction demographic associated with
that consumer.
[0054] It is another object of the invention to define a
demographic data assumption function that, when applied to a
transaction history record for a consumer, will assume either that
the demographic data is associated with the consumer or that the
demographic data is not associated with the consumer.
[0055] It is another object of the invention to define a
demographic data probability function that, when applied to a
transaction history record for a consumer, will generate either a
probability that the demographic data is associated with the
consumer or that the demographic data is not associated with
(dis-associated with) the consumer.
[0056] It is another object of the invention to determine from
probability of each one of a set of demographics associated with a
consumer and prior product incentive redemption history for that
consumer which of a set of incentive offers the consumer is most
likely to redeem.
[0057] It is another object of the invention to determine from
probability of each one of a set of assumed non-transaction
demographics associated with a consumer, a set of incentive offers
for purchase of specified products available to be offered to the
consumer, and probability of the consumer redeeming each one of the
set of incentive offers, offering of which incentive is likely to
result in the greatest profit.
[0058] It is another object of the invention to combine assumed
non-transaction demographic data for consumers having transactions
with a specified retail store location with block data for the same
non-transaction demographic data, for at least one block region
geographically close to the retail store location, to estimate the
fraction of people having a specified demographic and living in the
at least one specified block region that shop in the retail
store.
[0059] It is another object of the invention to use the estimate
the fraction of people having a specified demographic and living in
the at least one specified block region that shop in the retail
store location as a basis for deciding whether to target market
consumers associated with that non-transaction demographic that
live in that specified at least one block region.
SUMMARY OF THE INVENTION
[0060] These and other objects of the invention are provided by a
novel computer system programmed or designed to analyze consumer
specific data including historical consumer transaction data to
identify statistical correlations (correlation data) between
transactions in classes in different time periods.
[0061] The same or a different computer system may use the
correlation data in combination with consumer transaction data for
a consumer to rank or estimate probability of that consumer
entering into a transaction in a specified class in a specified
future time period.
[0062] The same or a different computer system may also be
programmed to apply the rankings or probabilities to transaction
incentive criteria to determine whether to offer a transaction
incentive associated with the criteria to a consumer having the
ranking or probability. The computer system may determine the
mechanism for making the offer of a transaction incentive, the
timing of making the offer of a transaction incentive, the value
associated with the offer of a transaction incentive, the class or
classes of transactions into which the consumer must enter in order
to receive the incentive.
[0063] The method of the invention provides predictions of future
transactions for a consumer that are based upon applying a
specified consumer's transaction to one or more mathematical
formula that define a likelihood of a consumer entering into one or
more transactions, and wherein the mathematical formula
incorporates correlation data obtained by statistical analysis of
transaction data for a set of consumers over a time period.
[0064] The correlations resulting from consumer transaction data
may be combined with correlations to consumer purchase preferences
from other sources, such as correlations from demographic data and
customer loyalty quotients. The correlations of a particular
customer to these types of data may be incorporated into a ranking
or probability function in predicting the likelihood of a consumer
entering into a specified transaction in a specified future time
period.
[0065] Some of the types of consumer behavior that may be predicted
are changes in transaction behavior. For example the predictions
may be that certain consumers will be purchasing more or less in a
specified class, switching brands in the same product class,
purchase larger or smaller values or quantities in a particular
class, purchase a particular brand in a class when the consumer had
not recently purchased in that class, or not purchase in a class
the consumer previously purchased in (referred to herein as
lapsing).
[0066] The particular class may be products and services associated
with any one or more designations. Some examples of classes of
goods are fresh meat, canned goods, baking goods, diet foods,
breads, cereals, coffees, dairy products, alcoholic beverages,
beers, wines, and liquors. More examples of classes are a specified
brand of a type of product, such as Kleenex brand tissues, Welch's
brand grape juice. More examples of classes are a brand, per se,
such as Kelloggs, and Green Giant. A class also includes a group of
brands, such as Keebler brands. Examples of classes include
products, such as Green Giant eight ounce canned corn and Aunt
Jemima one pound pancake mix. The particular class may be a group
of products having a feature in common, such as lower fat products,
organic products, or products of one or more brands known to be
relatively low or relatively high priced.
[0067] The consumer transaction data may include data that are
measures of any of the following: purchases of a consumer
(including transaction class data and transaction date data),
incentive redemptions by a consumer, reservations of a consumer,
modality of offering transaction incentives that were redeemed by a
consumer, frequency of transaction, dollar volume of transaction,
item count of purchase, frequency of redemption, value of
redeemable incentive, and relative amount of incentives redeemed in
different classes. Preferably, all transaction data includes a date
or a date range associated with each transaction.
[0068] The non-transactional data may include demographics, such as
information involving family size, number and age of children, age,
and household income bracket. The non-transactional data may also
include personal preferences data provided by the consumer.
[0069] Customer loyalty quotient data is data indicating the
fraction a customer's expenses in classes sold at a specified
retail store or chain of retail stores that the customer actually
purchases at that retail store or in the same chain of retail
stores.
[0070] The method of the invention preferably uses transaction data
from a plurality of distinct classes in determining correlations to
a ranking or a probability of a consumer transaction in a specified
class.
[0071] In certain instances, formula providing predictions of a
consumer entering into a transaction in a specified class do not
depend upon consumer transaction data from that class, but only
from consumer transaction data for transactions from other distinct
classes. This is particularly true when the formula is for a
prediction that a consumer's purchases in a specified class will
lapse or that the consumer will purchase in a class in which the
consumer either has not previously purchased or has not purchased
in a very long period of time, such as 2 months, 4 months, or 6
months.
[0072] The results of the ranking, probabilities, and predictions
may be used as the basis of generating two generically different
types of marketing. One type of marketing provides a transaction
incentive that does not attempt to change the consumer's predicted
transaction behavior. For example, a transaction incentive for
purchasing in a class offered in response to a prediction that the
consumer's purchasing in that class will lapse attempts to change
predicted behavior. A second type of marketing does not provide an
incentive to modify the consumer's predicted transaction behavior.
For example, a transaction incentive for purchasing a brand in a
class when there is a prediction that the consumer will purchase in
that class does not attempt to change predicted behavior. In
addition, a subgeneric type of marketing does not attempt to change
the consumer's predicted purchase behavior as to a genus class but
does provide an incentive for the consumer to change predicted
transaction behavior in a species class of the genus. For example,
a transaction incentive to purchase a second product brand in a
specified class when there is a prediction that the consumer will
purchase a first product brand in the specified class attempt to
change predicted behavior as to the species class but not the
genus.
[0073] Data as to probability of a consumer redeeming a transaction
incentive depending upon the modality of the offer may be used to
determine whether to provide a transaction offer derived by using
the method of the invention via a point of sale (POS) coupon,
email, postal mail, set top box, or personal web site, and data as
to the correlated time period in which the prediction indicates
consumer transaction behavior may be used to time the delivery of
the transaction offer to the consumer.
[0074] In one aspect, the invention provides a system and method
for anticipating consumer behavior and determining transaction
incentives for influencing consumer behavior comprising a computer
system and associated database for determining cross time
correlations between transaction behavior, for applying a function
derived from the correlations to consumer records to predict future
consumer behavior, and for deciding which transaction incentives to
offer the consumers based upon their predicted behavior.
[0075] In another aspect, the invention provides a system and
method of its use comprising a database, a computer system having
read and write access to said database; and wherein the database
stores a plurality of consumer records including a first consumer
record for a first consumer, wherein said first consumer record
stores (1) CID data (consumer identification data) indicating a
first consumer CID for said first consumer; in association with
said first consumer CID, at least (2) transaction data in a first
transaction class field indicating items transacted by said first
consumer in a first transaction class during a first prior time
period and (3) predictive data in a first predictive field
indicating at least one of a ranking, a probability, and a
prediction that said first consumer will transact in a first
correlated class during a correlated time period, and wherein said
correlated time period is subsequent in time to said prior time
period. In additional aspects, said first class and said second
class define the same class; said first class defines a different
class than said second class; said first class is distinct from
said second class; said first class is a genus and said second
class is a species of said genus; said first class defines a
species of a genus and said second class defines said genus; said
database stores, in association with said first consumer CID,
transaction data in a second transaction class field indicating
items transacted by said consumer in a second transaction class
during said first prior time period; said database stores, in
association with said first consumer CID, transaction data in a
third transaction class field indicating items transacted by said
consumer in a third transaction class during said first prior time
period; said database stores, in association with said first
consumer CID, transaction data in a fourth transaction class field
indicating items transacted by said consumer in a fourth
transaction class during said first prior time period; said
database stores, in association with said first consumer CID,
transaction data in a fifth transaction class field indicating
items transacted by said consumer in a fifth transaction class
during said second prior time period; said database stores, in
association with said first consumer CID, transaction data in a
sixth transaction class field indicating items transacted by said
consumer in a sixth transaction class during said second prior time
period; said database stores, in association with said first
consumer CID, transaction data in a seventh transaction class field
indicating items transacted by said consumer in a seventh
transaction class during said second prior time period; said
database stores, in association with said first consumer CID,
transaction data in an eighth transaction class field indicating
items transacted by said consumer in an eighth transaction class
during said second prior time period; the system is programmed to
decide whether to offer a transaction incentive to said first
consumer based upon said predictive data in said first predictive
field for said first consumer; a term of said transaction incentive
is purchase in said first correlated class; a term of said
transaction incentive is purchase of a specified quantity in said
first correlated class; a term of said transaction incentive is
purchase of a specified brand in said first correlated class;
wherein terms of said transaction incentive are purchase in said
first correlated class during said correlated time period; wherein
a term of said transaction incentive is purchase in a class other
than said first correlated class; wherein a term of said
transaction incentive is purchase of a specified brand not in said
first correlated class; wherein terms of said transaction incentive
are purchases in a class other than said first correlated class
during said correlated time period; a terminal for presenting said
transaction incentive to said first consumer; a printer for
printing said terms of transaction incentive; said printer is at a
POS terminal; said printer is at a Kiosk; said printer is at a
consumer's computer; deciding whether to offer a transaction
incentive to said first consumer based upon said predictive data in
said first predictive field for said first consumer, printing said
transaction incentive; and printing in the presence of said
consumer.
[0076] In another aspect, the invention comprises a system and
method for, in a set of customer records containing transaction
data, correlating transactions in a first set of input classes in a
first time period to transactions in a first correlated class in a
second time period, wherein the second time period is subsequent to
said first time period, thereby defining correlation data for said
first set of classes; and deciding whether to issue a transaction
incentive to a customer based at least in part upon transaction
data for said customer in said first set of classes and said
correlation data for said first set of classes. Additional aspects
of the invention comprise correlating transactions in a second set
of classes in said first time period to transactions in a second
correlated class in said second time period, thereby defining
correlation data for said second set of classes; and deciding
whether to issue a transaction incentive to a customer based at
least in part upon (1) transaction data for said customer in said
first set of classes and said correlation data for said first set
of classes and (2) transaction data for said customer in said
second set of classes and said correlation data for said second set
of classes; correlating transactions in a second set of classes in
a second time period to transactions in a second correlated class
in said second time period, thereby defining correlation data for
said second set of classes; and deciding whether to issue a
transaction incentive to a customer based at least in part upon (1)
transaction data for said customer in said first set of classes and
said correlation data for said first set of classes and (2)
transaction data for said customer in said second set of classes
and said correlation data for said second set of classes; a term of
said transaction incentive is purchasing in said first correlated
class; a term of said transaction incentive is purchasing a
specified brand in said second correlated class; said transaction
incentive is a transaction incentive for transacting in that one of
said first correlated class and said second correlated class having
correlated data with higher values; said deciding comprising
determining whether said transaction data for said customer
contains data indicating said customer transacted in at least one
class of said first set having a relatively high correlation values
compared to other classes of said first set; said at least one
class comprises at least two classes; said at least one class
comprises at least three classes; said at least one class comprises
at least four classes; said deciding occurs while said customer is
participating in a transaction at a POS terminal; offering said
transaction incentive to said customer while said customer is
participating in said transaction and at said POS terminal; said
deciding occurs between transactions of said customer; mailing or
e-mailing said transaction incentive to said customer; offering
said transaction incentive to said customer when said customer is
identified at a terminal; said terminal is a POS terminal; said
terminal is a kiosk; said deciding whether to issue a transaction
incentive is based at least in part upon non transaction data for
said consumer; said non transaction data comprises demographic
data; said non transaction data comprises loyalty quotient data;
and said deciding whether to issue a transaction incentive is based
at least in part upon block data. CID block data is statistical
data associated with customers in a local neighborhood region.
[0077] In yet another aspect, the invention provides a system and
method comprising means for, in a set of customer records
containing transaction data, correlating transactions in a first
set of input classes in a first time period to transactions in a
first correlated class in a second time period, wherein the second
time period is subsequent to said first time period, thereby
defining correlation data for said first set of classes; and means
for deciding whether to issue a transaction incentive to a customer
based at least in part upon transaction data for said customer in
said first set of classes and said correlation data for said first
set of classes.
[0078] In yet another aspect, the invention provides a system and
method comprising means for, in a set of customer records
containing transaction data, correlating transactions in a first
set of input classes in a first time period to a change in
transactions in a first correlated class between said first time
period and a second time period, wherein the second time period is
subsequent to said first time period, thereby defining correlation
data for said first set of classes; and deciding whether to issue a
transaction incentive to a customer based at least in part upon
transaction data for said customer in said first set of classes and
said correlation data for said first set of classes. Additional
aspects of the invention include correlating transactions in a
second set of classes in said first time period to a change in
transactions in a second correlated class between said first time
period and said second time period, thereby defining correlation
data for said second set of classes; and deciding whether to issue
a transaction incentive to a customer based at least in part upon
(1) transaction data for said customer in said first set of classes
and said correlation data for said first set of classes and (2)
transaction data for said customer in said second set of classes
and said correlation data for said second set of classes.
[0079] In yet another aspect, the invention provides a system and
method comprising means for (1) anticipating a consumer's behavior
for purchasing a first product based at least in part upon at least
a portion of that part of said consumer's transaction history
including said consumer's transactions for products other than for
purchase of said first product; and (2) basing an incentive
determination upon said anticipating.
[0080] In yet another aspect, the invention provides a system and
method comprising means for (1) anticipating a consumer's behavior
for purchasing a first product based at least in part upon at least
a portion of that part of said consumer's transaction history
including said consumer's transactions for purchases of at least
one product other than said first product; and (2) basing an
incentive determination upon said anticipating. Additional aspects
of the invention comprise that said incentive determination is
based at least in part upon differences between said consumer's
anticipated and prior purchase behavior for said product; said
differences are based at least in part upon variations in frequency
of purchase; said differences are based at least in part upon
variations in quantity of purchase; said differences are based at
least in part upon variations in utilization of prior incentive for
purchase; said differences are based at least in part upon
variations in brand loyalty; said anticipating is based at least in
part upon analysis of data in consumer records; and said
anticipating is based at least in part upon a model of consumer
transaction behavior.
[0081] In yet another aspect, the invention provides a system and
method comprising [0082] receiving transaction data at a POS
terminal in a retailer computer system; transmitting said
transaction data over a communications network to a central
computer system; storing said transaction data in a database to
which said central computer system has read and write access;
[0082] (2) analyzing consumer transaction data stored in said
database; (3) determining transaction incentives to offer to
consumers associated with said transaction data; wherein said
central computer system is programmed to employ predictive modeling
on said transaction data to determine a predictive modeling
function, to determine at least one of rankings, probabilities, and
predictions of future consumer transactions by applying consumer
transaction data to said predictive modeling function, and
determining transaction incentives based upon said at least one of
rankings, probabilities, and predictions. Additional aspects of the
invention comprise determining timing and method of transmission of
said transaction incentives that will most likely result in said
transaction incentives being redeemed; and transmitting said
transaction incentive over said communication network to said
retailer computer system.
[0083] In another aspect, the invention provides a system and
method for associating at least one assumed demographic data with a
Consumer Identification ("CID") associated with a transaction
history for a consumer, comprising: a database; a computer system
having read and write access to said database; and wherein said
database stores a plurality of consumer records including a first
consumer record for a first consumer, wherein said first consumer
record stores: (1) CID data (consumer identification data)
indicating a first consumer CID for said first consumer;
[0084] in association with said first consumer CID, at least the
following: (2) transaction data in a first transaction class field
indicating items transacted by said first consumer in a first
transaction class during a first prior time period; and (3) a first
assumed non-transaction demographic data field for storing assumed
non-transaction demographic data. Additional aspects of this
invention include wherein said computer system comprises means or
code for assuming a value for said first assumed non-transaction
demographic data field of said first consumer record based upon
data in other fields of said first consumer record; wherein said
computer system comprises means or code for deriving a value for a
derived data field of said first consumer record based upon data in
other fields of said first consumer record; wherein said value for
said assumed non-transaction demographic data field is a
probability or an expectation; wherein said value for said derived
data field is a probability or an expectation; means or code for
identifying, from a group of assumed non-transaction demographic
data fields in said first consumer record, that non-transaction
demographic data field which has the highest probability of being
correct; means or code for identifying, from a group including at
least one assumed non-transaction demographic data field and one
mixed derived demographic data field in said first consumer record,
that non-transaction demographic data field which has the highest
probability of being correct for the corresponding consumer; means
or code for ranking a group of assumed non-transaction demographic
data in said first consumer record according to the probability of
the assumed non-transaction demographic data being correct for a
consumer corresponding to said first consumer CID; and means or
code for identifying, from a group including at least one assumed
non-transaction demographic data field and one mixed derived
demographic data field in said first consumer record, that
non-transaction or mixed derived demographic data field which has
the highest probability of being correct for the corresponding
consumer; means or code for determining whether to offer to said
first consumer an incentive based upon data stored in said first
consumer record for at least one of an assumed non-transaction
demographic, a dis-assumed non-transaction demographic, a mixed
derived demographic, a most probable assumed non-transaction
demographic, a most probable dis-assumed non-transaction
demographic, a probability of assumed non-transaction demographic,
a probability of dis-assumed non-transaction demographic, a ranking
of assumed non-transaction demographic, and a ranking of
dis-assumed non-transaction demographic associated with that
consumer.
[0085] In another aspect, the invention provides a system and
method for defining or selecting terms of at least one purchase
incentive offer to associate with a Consumer Identification ("CID")
associated with a transaction history for a consumer, comprising: a
database; a computer system having read and write access to said
database; and wherein said database stores a plurality of consumer
records including a first consumer record for a first consumer,
wherein said first consumer record stores: (1) CID data (consumer
identification data) indicating a first consumer CID for said first
consumer; in association with said first consumer CID, at least the
following: (2) transaction data in a first transaction class field
indicating items transacted by said first consumer in a first
transaction class during a first prior time period; (3) a first
assumed non-transaction demographic data field for storing assumed
non-transaction demographic data; and means or code for defining or
selecting terms of a purchase incentive offer to associate with
said CID wherein said defining or selecting depends at least in
part upon one of an assumed non-transaction demographic,
dis-assumed non-transaction demographic, a most probable assumed
non-transaction demographic, a most probable dis-assumed
non-transaction demographic, a probability of an assumed
non-transaction demographic, a probability of dis-assumed
non-transaction demographic, a ranking of an assumed
non-transaction demographic, and a ranking of dis-assumed
non-transaction demographic associated with that consumer.
Additional aspects of this invention include defining or selecting
depends at least in part upon assumed non-transaction demographic
data.
[0086] In another aspect, the invention provides a system and
method for defining a probability value that a demographic data
value either is associated with a consumer or is not associated
with the consumer, comprising: a database; a computer system having
read and write access to said database; and wherein said database
stores a plurality of consumer records; wherein each one of said
plurality of consumer record stores: (1) CID data (consumer
identification data) indicating a CID, (2) transaction data in a
first transaction class field indicating items transacted during a
first prior time period, and (3) a first actual demographic data
field for storing actual, non-assumed, demographic data for a first
demographic data variable; means or code for determining
correlations between transaction data and data stored in said
actual demographic data field from said plurality of consumer
records and using at least one of said correlations to define a
probability function, wherein said probability function is a
function at least of transaction data, and said probability
function applied to a consumer record generates a probability value
that the consumer associated with said consumer record has a
specified value for said first demographic variable; means or code
for storing said probability value in a data field for said
specified value for said first demographic value and said consumer
record. Additional aspects of this invention include wherein said
means or code for determining correlations uses data from at least
two transaction data fields in said plurality of consumer records;
wherein said means or code for determining correlations uses data
in at least three transaction data fields in said plurality of
consumer records; wherein said probability value is one or zero;
means or code for determining, by applying a function to
demographics associated with said consumer and prior product
incentive redemption history for that consumer, which of a set of
incentive offers said consumer is most likely to redeem; and means
or code for determining, from probability of each one of a set of
assumed non-transaction demographics associated with said consumer,
a set of incentive offers for purchase of specified products
available to be offered to said consumer, and probability of said
consumer redeeming each one of said set of incentive offers, which
incentive offer, if actually offered to said consumer, is likely to
result in the greatest profit.
[0087] In another aspect, the invention provides a system and
method for estimating a fraction of people having a specified
demographic value that both shop in a specified retail store and
live in a specified block region comprising: a transaction data
database; a block data database; a computer system having read and
write access to said transaction data database and said block data
database; and wherein said transaction data database stores a
plurality of consumer records each including at least a CID data
(consumer identification data) indicating a consumer CID and a
first assumed non-transaction demographic data field for storing
assumed non-transaction demographic data for customers of said
retail store; wherein said block data database stores at least one
block data record for a geographic region near the location of the
retail store, each block data record storing, for its corresponding
geographic region, a number of people or consumers residing in that
region, and a number of people or consumers having said specified
demographic value; and means or code for using data stored in said
both said transaction data database and said block data database to
estimate said fraction of people having said specified demographic
value that both shop in said specified retail store and live in
said specified block region. Additional aspects of this invention
include means or code for generating a decision whether to target
market to consumers associated with said specified demographic
value that live in said geographic region; means or code for
determining a number of consumers associated with said specified
demographic value that live in said geographic region; means or
code for determining whether to target market to consumers in said
geographic region based upon either said number of consumers
associated with said specified demographic value that live in said
geographic region or an estimate of a ratio of number of consumers
associated with said specified demographic value that live in said
geographic region to total number of consumers that live in said
geographic region; means or code for determining an estimate of
total value of goods purchased from said retail store by consumers
associated with said specified demographic value that live in said
geographic region; and means or code for determining whether to
target market to consumers residing in said geographic region based
upon an estimate of either said total value of goods purchased from
said retail store by consumers associated with said specified
demographic value that live in said geographic region to total
value of goods purchased in said block region from consumers having
said specified demographic value.
[0088] In another aspect, the invention provides a system and
method for determining whether to target market to consumers
residing in a block regions, comprising: a transaction data
database; a block data database; a computer system having read and
write access to said transaction data database and said block data
database; wherein said transaction data database stores a plurality
of consumer records each including at least a CID data (consumer
identification data) indicating a consumer CID for a customer of
said retail store; wherein said block data database stores at least
one block data record for a geographic region near the location of
the retail store, each block data record storing, for its
corresponding geographic region, a number of people or consumers
residing in that region, a number of people or consumers residing
in said region that have a specified value or range of values for a
specified transaction history variable; means or code for
estimating either a fraction or an absolute number of consumers
residing in said block region having transaction data having either
said specified value or said range of values for said specified
transaction history variable to define an estimate of said fraction
or said absolute number; and means or code for determining whether
to target market to either said block region or to at least one
consumer residing in said block region based upon at least said
estimate of said fraction or said absolute number. Additional
aspects of this invention include means or code for estimating
estimates absolute number of consumers residing in said block
region having transaction data having either said specified value
or said range of values for said specified transaction history
variable; means or code for estimating estimates a fraction of
consumers residing in said block region having transaction data
having either said specified value or said range of values for said
specified transaction history variable; wherein said specified
transaction history variable is a quantity of spending in a prior
time period; wherein said specified transaction history variable is
quantity of purchase of a specified product in a prior time period;
wherein said specified transaction history variable is quantity
purchase in a specified class of products in a prior time period;
wherein said specified transaction history variable is a measure of
redemption of transaction incentives in a prior time period;
wherein said specified transaction history variable is a measure of
redemption of transaction incentive in a specified class of
transaction incentives in a prior time period; wherein said means
or code for determining whether to target market to either said
block region or to at least one customer residing in said block
region also depends upon at least demographic data variable;
wherein said means or code for determining wether to target market
to either said block region or to at least one customer residing in
said block region also depends upon at least one derived data
variable; and wherein said means or code for determining wether to
target market to either said block region or to at least one
customer residing in said block region also depends upon at least
one mixed data variable.
BRIEF DESCRIPTION OF THE DRAWINGS
[0089] The invention is better illustrated in connection with the
following figures.
[0090] FIG. 1 is a schematic of a computer network system;
[0091] FIG. 2 is high level flow chart;
[0092] FIG. 3A is an intermediate level flow chart of step 210 in
FIG. 2;
[0093] FIG. 3B is an intermediate level flow chart of step 210 in
FIG. 2;
[0094] FIG. 3C is an intermediate level flow chart of step 210 in
FIG. 2;
[0095] FIG. 4 is an intermediate level flow chart of step 220 in
FIG. 2;
[0096] FIG. 5 is another intermediate level flow chart of step 220
in FIG. 2;
[0097] FIG. 6 is a graph relating to probability outputs of the
model;
[0098] FIG. 7 is a table illustrating a consumer record;
[0099] FIG. 8 is a table illustrating a consumer record with
purchases accumulated by month;
[0100] FIG. 9 is a table illustrating cells containing fractions
representing the average for that cell in a set of records;
[0101] FIG. 10 is a table illustrating a cells containing fractions
representing the averages for that cell in a set of records wherein
all records have a certain value in the cell in row 5/00 and column
P12; and
[0102] FIG. 11 is a table illustrating the difference between
tables 9 and 10, which represents correlation data of each cell to
the value of the cell in row 5/00 and column P12.
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0103] FIG. 1 shows a computer network system including network 10
(preferably the Internet), analysis computer system 20 including
server system 20a, database 30, manufacturer computer system 40,
retailer computer system 50, consumer computer 60, databases 70,
80, 90, point of sale (POS) terminal 100, and kiosk 110. The lines
connecting elements in FIG. 1 indicate a means for data
transmission including wire and wireless transmission hardware,
data format, and transmission protocols. Each computer system
includes at least one digital computer including associated central
processing unit, memory, input, and output devices.
[0104] Data stored preferably in database 30, but which may be
stored in any one of databases 30, 70, and 80 includes at least one
of purchase transactions data, redemption transactions data,
reservations transactions data, demographics data, and loyalty
quotient data in association with a consumer identification.
[0105] Preferably, the transactions data for each transaction is
stored in association with a date of the corresponding transaction.
Redemptions transaction data may be stored with more than one date:
the date of the transaction and the date or dates of the
reservation. Redemption transactions data may be stored in
association with more than one date including the date of the
redemption transaction and the data upon which the incentive
transaction resulting in the redemption was offered. Demographics
data and loyalty quotient data may also be stored in association
with at least one date, such as the date upon which the
demographics data was received or the date on which the loyalty
quotient was calculated. Demographics data and loyalty quotient
data may also be stored in association with another date or date
range representing the date range over with data resulting in the
demographic data or loyalty quotient data was received.
[0106] All data received in step 20 relating to a particular
consumer identification (CID) is stored in the database in
association with that CID. The database may be formatted as flat
files or as one or more relational database files and may include
tables, forms, queries, relations, reports, modules, and other
objects useful in database management and programming. For
convenience, the data analysis will be described below in the
context of consumer records in which each record includes a field
for CID (or CID in combination with store ID) and a large number of
associated fields. However, it will be apparent to those skilled in
database programming that the relations between the data may be
otherwise than as specifically described, for example, by having
data stored in third normal form.
[0107] The exemplary data format comprises a table with transaction
records in which each record includes fields for CID, store ID,
dates of purchase, identification of products, services, and
reservations purchased in association with the date of purchase,
identification of transaction incentives offered including terms of
the offering in association with the date of offering and dates of
redemption of each transaction incentive. Each record may also
include time of day of transaction, day of week of transaction, day
of month of transaction, form of payment (such as credit card,
cash, check). Each transaction record may include a currency amount
of each product item, transaction total currency amount, currency
amount of redemptions, incentive amount of offered transaction
incentives, incentive amount associated with product item, frequent
shopper ID, amount of purchase less discounts and incentives,
manufacturer identifications associated with products for which
each incentive was offered or redeemed. Each transaction record may
use a universal product code (UPC) as the identifier of a product,
service, reservation.
[0108] Preferably, there is one record per customer either per
transaction, per transaction date, or per date range.
[0109] The database may also store in association with a CID
demographic data including age, gender, income, household income,
location of residence, location of work, postal code, family size,
number of children, marital status, gender, type of commuting
vehicle, length of commute, type of job.
[0110] The database may also store in association with a CID block
data identification. The database may also store in association
with the CID or the block data identification block data, which is
data indicating statistical averages for transactions and other
data for consumers that live in a specified block group.
[0111] The database may also store, in association with a CID,
fields derived from data, such as fields derived from demographic
data for the customer, block data for the block to which the
customer belongs, and loyalty quotient data for the customer's
loyalty to a store, brand, or any class of transactions. These
derived fields may contain data representing likelihood of a
consumer purchasing in a specified class, at a specified store, or
in a store of a specified retail chain.
[0112] Derived fields also include fields for frequency of
specified classes of transactions, including purchases, redemption,
and reservations, such as for specified products, over a specified
time period.
[0113] Preferably, analysis system 20 is either hard wired or
programmed to perform the algorithms disclosed herein for
predictive modeling, and the applications of models to rank,
determine probabilities, or make predictions. However, retailer
computer system 50 or manufacturer computer system 40 may be
programmed to perform those functions.
[0114] FIG. 2 shows step 200 of storing data, step 210 of
predictive modeling, step 220 of determining incentive data and
step 230 of offering incentives.
[0115] In step 200 a database, such as database 30, receives and
stores consumer related data. Alternatively, any one of databases
70 and 80 may receive and stores consumer related data for
performing the following steps.
[0116] In step 210, system 20 runs software or implements in
hardware a form of predictive modeling to analyze a set of data for
multiple CID records in database 30. The predictive modeling
identifies correlations between the existence of transactions in
one or more classes of transactions in one or more time period and
the existence of transactions in one or more transaction classes
(hereinafter the correlated class) in at least one subsequent time
period (hereinafter the correlated time period). The predictive
modeling may also include other correlations, such as correlations
based upon a consumer's demographic, block data, and loyalty
quotients.
[0117] Generally, speaking, the predictive modeling applies the
statistical correlations to any consumer's transaction data to
predict what transaction behavior that consumer is likely to enter
into in the correlated time period. The model may generate a
probability, a prediction, and it may rank the relative likelihoods
of each member of a set of consumers entering into a class of
transactions.
[0118] For example, in step 210, system 20 may determine the
probability that a CID (that is, people or households associated
with the CID) that had no purchases of coffee associated with it
will purchase coffee in the next month. In step 210, system 20 may
rank CIDs not previously associated with purchase of coffee
according to their probability of purchasing coffee in the next
month. As another example, system 20 may rank households that had
previously purchased coffee by their probability of not purchasing
coffee in the next month (lapsing). As another example, in step
210, system 20 may determine the probability of a CID not
purchasing at a retail store in the next month.
[0119] In step 220, system 20 determines incentive data for CIDs,
in which the determination is based at least in part upon the
rankings generated in step 210.
[0120] In step 230, system 20 takes action that will ultimately
provide the transaction incentive to the consumer. This includes
saving the data in a file for displaying or printing in human
readable form, transmitting the data to a POS computer system, an
Internet web site, an email address associated with the CID, or a
postal mailing facility. Data defining a transaction incentive for
a consumer may be provided to the consumer via email, at Kiosk 110
upon the consumer being identified by the Kiosk, at a POS terminal
when the consumer's CID is associated with a transaction at that
terminal, or postal mailed to the consumer's postal address
associated with the CID.
[0121] FIG. 3A shows steps relating to step 210 that system 20 may
implement in determining a probability of a particular consumer
changing a particular class of transactions behavior in a specified
class. The specified class and the correlated time period are
represented by a specified data field.
[0122] In step 310a, system 20 receives input filter criteria.
Input filter criteria are criteria that are values for or depend
upon statistical correlations of input fields for data associated
with consumers to the data for the specified data field. The input
fields are fields that store data for transactions occurring in one
or more time periods prior to the correlated time period.
[0123] Examples of filter criteria depend upon the specific
predictive model. However, an example of filter criteria is
criteria specifying the 1, 3, 5, or 10 fields most highly
correlated to the specified data field. Another example of filter
criteria is all fields having a value for correlation to the
specified data field of greater than one of 0.6, 0.7, 0.8, or 0.9.
Another example of filter criteria is all fields having a
correlation to the specified data field of greater than 0.6 and
which are fields associated with transactions from a different
class than the specified class.
[0124] In step 320a, system 20 applies the input filter criteria to
a set of records in order to determine which fields meet the
criteria. The data fields that meet the criteria are the data
fields used in subsequent steps involving the transaction data.
System 20 applies the input filter function using the input filter
criteria to a set of data records from database 30 to define a
subset of data fields in the records. Preferably, the set of
records is large enough so that the results of the predictive
modeling are statistically significant. This means that at least
about 50 records, preferably at least 100 records, and typically
500 to 20,000 records are included in each predictive modeling.
[0125] The input correlation filter criteria preferably includes at
least one cross-correlation value between a field other than the
specified data field and the specified data field. A
cross-correlation means a value for correlation between the
specified data field and other data fields in the CID record, for a
set of CID records. Number of cross-correlations means a number,
examples being 2, 4, 10, 20, 50, and 100 of cross-correlations
defined by different sets of two data fields, one of which is
preferably the specified data field. The input filter function
ranks the cross-correlations to the specified data field of other
data fields, and limits the input fields for step 330a to those
data fields most highly statistically correlated to the specified
data field.
[0126] The input filter function is applied only because current
digital computer systems are in practice not currently powerful
enough to implement the remaining steps of FIG. 2, given a very
large number of input fields, such as the several thousand product
specific transaction classes defined by individual UPC codes. The
filtering steps are therefore optional and may not be practiced
when sufficiently powerful digital computers become available.
Moreover, instead of automated filtering, a user may select which
fields to use for correlation in the following step.
[0127] The specified data field is preferably a data field whose
data indicates quantity of transactions in the specified class in
the correlated time period. However, the specified data field may
also be a field whose data indicates a ratio of quantity of
transactions in the specified class in the later time period
(correlated time period) to quantity of transactions in the
specified class in one of the earlier time periods. Quantity in
this sense may be measured by number of items, volume of items,
cost paid for items, or any other means to measure quantity.
[0128] In step 330a, system 20 applies a predictive modeling
algorithm to the input data fields defined in the filtering step.
The predictive modeling algorithm is designed to result in a
function that can be applied to a consumer record to generate a
value. The value obtained by applying the predictive modeling
algorithm to a record for a consumer is a rank, probability, or
prediction that the actions of the consumer in the correlated time
period will result in data in the specified field. For example, the
predictive model algorithm may assume a predetermined functional
form, then determine cross correlations of the input data fields to
the correlated data field, and then set coefficients in the
predetermined functional form to certain values based upon the
values of the correlations that were obtained between the input
data fields and the specified data field.
[0129] The covariance, C, of two variables, A and B, is defined as
the expected value of their product E{AB} minus the product of
their expected values E{A}E{B}. C=E{AB}-E{A}E{B}. The correlation
coefficient r of the variables A and B is by definition:
r=C/sigma(A)sigma(B). Two variables A and B are uncorrelated if
their covariance is zero. The correlation coefficient or the
covariance may be used as the measure of the correlation between
the target product or class and the input data fields.
[0130] The predictive model function is a function of the values of
a consumer's transactions in specified classes corresponding to the
input data fields. Those classes are the classes which were the
inputs to the predictive modeling used to derive the correlations
used in defining the predictive model function for the specified
class.
[0131] Preferably, the predictive model algorithm's predetermined
functional form also includes as input variables certain
demographic variables, loyalty quotient variables, and block data
variables. In this alternative, the resulting predictive model
function also depends upon the data values in a consumer record for
the corresponding certain demographic fields, loyalty quotient
fields, and block data fields. In this alternative, the value of
the predictive model function reflects the likelihood of the
consumer performing the specified transaction in the correlated
time period, including the impact on that likelihood due to
demographic data, loyalty quotient date, and block data in the
consumer's record.
[0132] In step 340a, system 20 applies the predictive model
function defined in step 330a to a set of consumer data records.
The predictive model function generates data stored in association
with each CID. The generated data is preferably a ranking
probability value, or prediction, Ps (predicting data), for the
consumer transaction in class S for that CID in the correlated time
period. The predicting data may be stored to indicate probability
or prediction of purchase in the specified claims by the consumer
having the specified CID in the correlated future time period. The
correlated time period is typically the next week, next month, next
six weeks, or next three month period compared to the most recent
time period associated with data in the input data fields. However,
the correlated time period may be a discontinuous time period from
the time period representing the transaction dates for data in the
database, such the following week, month, or quarter. The
predicting data may store data indicating a probability of
reduction or increase in quantity of transaction in the specified
class in the correlated time period, such as the next week, month,
or quarter.
[0133] Typically, one set of data records is used in steps 310a to
330a in order to define the predictive model function. Then the
predictive model function is applied to a different set of records
in step 340a to generate rankings, probabilities, and predictions
for a correlated time period corresponding to a future time period
for use in making decisions regarding transaction incentives to
offer to the corresponding consumers.
[0134] Alternatively, the predictive model function may also be
applied to the different set of consumer records in step 340a to
generate rankings, probabilities, and predictions for a correlated
time period, but wherein the correlated time period is for a
historical time period in which there is actual transaction data in
the set of records. In this alternative, the rankings,
probabilities, or predictions for the correlated time period may be
compared to the actual transaction data for the correlated time
period to determine how closely the predictions match actual data.
This comparison may be used to generate a predictive model function
validation value for the specific predictive model function.
[0135] In step 350a, system 20 may automatically compare the
predictive model function validation value to a predetermined value
(such as the results of prior predictive model function validation
values for models predicting the same specified data field) to
determine whether to return to step 310a. If the system 30 returns
to step 310a, it automatically obtains a different set of input
filter criteria based upon predetermined programmed parameters and
then repeats steps 320a to 350a. Alternatively, the decisional step
350a may be user controlled, in which case system 20 waits for user
instructions once completing step 340a.
[0136] When decisional step 350a does not loop back to step 310a,
processing proceeds to step 220 of FIG. 2. Typically, processing
will only proceed to 220 when the correlated time period is a
future time period.
[0137] Steps 340a and 350a enable feedback on the effect of the
form or the coefficients of the predictive model function. Thus,
system 20 may store in code alternative functional forms which in
addition to filter criteria may change between iterations of cycle
310a, 320a, 330a, 340a, and 350a. System 20 may store a plurality
of validation values in association with their predictive model
functions for the specified data field, and may store validations
values in association with their predictive model functions for a
plurality of specified data fields. Preferably, prior to proceeding
to step 220, system 20 has determined the predictive model function
providing the largest validation value for each specified data
field and calculated and stored the corresponding ranking,
probability or prediction data for each specified data field.
[0138] Preferably, a predictive model is generated using a
relatively small subset of all CID records stored in a database,
for computational efficiency. However, the predictive model
function or functions derived from the algorithm of FIG. 3a may
subsequently be applied to any subset or all CID records stored in
database 30. Preferably, the predictive values, Ps, are determined
for a large number of CID records in database 30 and of course each
Ps is stored in a field associated with the corresponding CID.
[0139] FIGS. 3B and 3C also show slightly different algorithms
corresponding to step 210.
[0140] In step 310b, system 20 (or a user) selects a target product
or product class. The specified product or class is one for which a
prediction will be generated as to the probability a consumer will
change purchase behavior. For example, the target could be a
product class, such as breakfast foods; a product type, such as
cereal; a product brand, such as Kellogg's cereal; a specific
product, such as Kellogg's Cons Flakes; or a specific product size,
such as a 16 ounce box of Kellogg's Corn Flakes.
[0141] As in FIG. 3A, the transactions data could be any measure of
the consumer's purchase behavior, such as frequency of purchase,
dollar value of purchase, item count of purchase, frequency of
redemption of incentives, value of redeemable incentive, relative
amount of incentives redeemed in different classes, price
elasticity, or a measure of responsiveness to advertisements or
marketing such as measure of redemption of coupons, use of
vouchers, or request for rebates.
[0142] In step 320b, system 20 determines whether a correlation
exists between purchase data for a target product in a correlated
later time period and the other data in fields in a set of consumer
transaction records in database 30 corresponding to transactions in
an earlier time period.
[0143] System 20 may determine a correlation coefficient for a
variable, A, that is a measure of behavior in a later time period
for a specified or target product with purchase in an earlier time
period of a non-specified product (or other recorded transaction
activity).
[0144] System 20 may determine correlations of A with a set of
variables Bi for i=1 to n where each Bi is a measure of the number
of a non-target product items or volume of purchase of a non-target
product item in the earlier time period.
[0145] In step 330b, system 20 selects a subset of variables, B,
providing relatively high correlations with the targeted product
based on the results of step 320b. For example, assume system 20 is
predicting which households will be purchasing cereal during the
following month. System 20 identified a correlation between the
target product of cereal and five variables: B1, B2, B3, B4, and
B5, which are measures of monthly purchases of more than three
gallons of milk, monthly purchases of at least three loaves of
bread, the presence of two or more children under the age of 15 in
the household, a monthly grocery bill exceeding 350 dollars, and
the absence of purchase of Kellogg's Pop Tarts during a one-month
period, respectively. In step 330b, system 20 selects this subset
of 5 variables to include in a predictive model. In step 330b,
system 20 defines a predictive model based upon the number of
variables selected and the correlations of each one of those
variables with the target product. For example, the predictive
model may set the probability Ps that a consumer will purchase the
target product in a future time period to a normalized set of
values for the sum a1*B1+a2*B2+a3*B3+a4*B4+a5*B5 in which the
weighting coefficients a1 to a5 in the linear combination are
proportional to the corresponding correlation coefficients between
the B data fields and the data for the specified product in the
correlated time period. Given this predictive model function, for
example, if the target is the class of cereal, system 20 may
predict a high probability of reduced purchase of cereal, or a low
probability of purchase of cereal, for a CID record which indicates
purchases of more than three gallons of milk in the last month,
purchases of more than three loaves of bread in the last month,
existence of three children in the household, household grocery
expenditures of at least $350 in the last month, and information
that the household did not purchase Kellogg's Pop Tarts during the
one-month period. Depending upon the correlations and the resulting
probability function, the high probability of reduced purchase of
cereal, or a low probability of purchase of cereal could both be
inverted.
[0146] In step 340b, system 20 applies the probability function to
the subset of variables included in step 330b in a probability
function to obtain a value that will indicate either the
probability that a consumer will purchase the target product in a
specified future time period or the probability that the consumer
will change purchase behavior for the target product.
[0147] In step 310c, the target product or purchasing behavior that
the model is to predict is entered into system 20. That is, either
a data field, set of data fields, or function of one or a set of
data fields in CID records is specified to system 20.
[0148] In step 320c, system 20 examines a certain set of CID
records in database 30.
[0149] In step 330c, system 20 checks each record in the set to
ascertain which consumer records show the presence of the predicted
behavior. If the predicted behavior is present in the customer
record, system 20 performs step 340c. If the consumer record does
not meet the predicted behavior, system 20 performs step 360c.
[0150] In step 340c, if the consumer record meets the predicted
behavior, system 20 adds the consumer record to a target matching
database table.
[0151] In step 350c, system 20 checks to see if there are
additional consumer records to access. If there are additional
consumer records to access, system 20 performs step 320c. If there
are no more consumer records to access, system 20 performs step
365c.
[0152] In step 360c, the consumer record does not meet the
predicted behavior and system 20 adds the consumer record to a
non-target matching table in database 30.
[0153] In step 365c, system 20 determines correlations between the
targeted purchasing behavior and the non-targeted data fields. The
correlations are discernable by comparing the mean value for each
non-targeted product in the target matching database to the mean
value for that non-targeted product in the non-target matching
database. A significant difference in the mean values indicates
that a correlation exists between non-targeted data field and the
targeted purchasing behavior. The larger the difference in mean
values, the larger the correlation. System 20 can define a
predictive model as discussed for FIGS. 3A and 3B based upon the
values of the correlations between the non-targeted products and
targeted purchasing behavior.
[0154] In step 370c, system 20 reads a customer record from
database 30 and applies the predictive model to determine
probability of the targeted purchasing behavior occurring.
[0155] In one embodiment, the generation and storing of ranking,
probability, or prediction criteria is bypassed. In this
embodiment, incentive decisions are made by determining whether
certain data exists in the customer's transaction record, wherein
the certain data are in fields found to be highly correlated to the
specified class or specified transaction data. In this embodiment,
a decision whether to offer a transaction incentive for a
transaction in the correlated time period in the specified class
depends upon the existence of the predetermined pattern of
transactions found to be correlated to the specified data field.
This pattern matching based decisional process has the advantage of
avoiding the complexity of storing intermediate data, namely the
ranking, probability, or prediction data for each customer.
[0156] FIG. 4 shows steps relating to one example of step 220,
determining transaction incentives from predictions obtained from a
predictive model function. In step 410, system 20 determines
whether the probability that a consumer (i.e., a consumer or
household associated with a CID) who has purchased W will cease
purchasing W (indicated by the subscript "-W") and decides based
upon that probability whether to offer an incentive to that
consumer for purchase of W. As shown, if the probability of the
consumer changing his or her purchase behavior to stop purchasing W
is greater than or equal to 0.8, system 20 decides to implement
step 420 and provide an incentive on purchase of W to the consumer.
The value of the incentive may depend upon the value of the
probability. If the probability is less than 0.8, system 20
implements step 430 and decides not to offer the consumer an
incentive for purchase of W.
[0157] FIG. 5 also shows steps relating to step 220. In step 510,
system 20 determines the probability that a consumer a consumer or
household associated with a CID) who has purchased W will cease
purchasing W, and decides based upon that probability whether to
offer an incentive to that consumer for purchase on a product
correlated to W. As shown, if the probability of the consumer
changing his or her purchase behavior to stop purchasing W is
greater than or equal to 0.8, system 20 decides to implement step
520 and provide an incentive for purchase on a product correlated
to W to the consumer. The value of the incentive may depend upon
the value of the probability. If the probability is less than 0.8,
system 20 implements step 530 and decides not to offer the consumer
an incentive for the purchase of W.
[0158] FIG. 6 shows CID records ordered by probability, from
highest to lowest probability extending along the X axis. FIG. 6
shows the percent of CIDs (from 0%, 10%, 50%, 100%) on the X axis
having the probability shown on the Y axis. Here, probability means
the probability for a CID having data in the specified data
field.
[0159] In optional step 350a, filter criteria are changed, and the
process of steps 310a-340a are repeated based upon the new
criteria. The change may be programmatically implemented based upon
programmed criteria or manually implemented by commands entered by
a user via an input/output device.
[0160] FIG. 6 shows a ranking of consumers by probability of
consumers changing from previously purchasing product W to not
purchasing product W (indicated by "-W"). As shown, 10% of CIDs
have a probability of greater than 0.7. These consumers may be
ideal candidates for an incentive. The consumers whose CIDs are
represented on the X axis by the 50% to 100% range are shown having
probabilities of changing purchase behavior on W of less than about
0.4. This means that one half of all consumers having CID records
in the system are determined to have less than about 0.4
probability of ceasing their purchases of W.
[0161] FIGS. 4-6 relate to predicting the probability of a consumer
changing behavior from purchasing a product in one time period to
not purchasing that product in a subsequent time period. In those
cases, one marketing possibility is to target market consumers
having a high such probability. Alternatively, the system and
method disclosed herein may be used to determine consumers that
have not previously purchased W who will start purchasing W.
Decisions on incentives to those consumers for W and for products
correlated to purchase of W may be made depending upon this
probability.
[0162] System 20 could decide whether to issue an incentive offer
based solely on a score generated by the model for a target
product. The system could also decide to issue different incentive
offers based on score. Examples of different incentive offers
include free target products; discounts on the target product,
offers on a different brand in the same class as the target
product, offers on a product complimentary to the target product;
and discounts on purchases of multiple packages the target product.
Complementary products are those products often used on conjunction
with one another, such as milk and cereal, or cheese and crackers,
or reservations for flights and reservations for hotels.
[0163] In addition, an incentive program could be part of a
marketing campaign or as a method of generating further information
on the purchasing habits of consumers to help improve accuracy of
system 20's predictive modeling.
[0164] In addition to recognizing the presence or absence of a
product, system 20 may be programmed to account for the consumer's
changes in purchase level for specific products. For example, a
consumer purchased eight gallons of milk in February/March but only
four gallons in April/May. System 20 could use milk purchase data
for both prior time periods to account for tendencies that are
common to the population as a whole. For example, people may not
drink as much milk when it is hot outside. System 20 could use milk
purchase data to account for tendencies that are specific to the
consumer. For example, system 20 may determine from records based
on at least one of the consumer's airline tickets, hotel records,
and grocery purchase records from a location distant from the
customer's normal place of purchase that the consumer had been out
of town for a certain time period.
[0165] Detailed examples of algorithms for performing the invention
follow in order to explain aspects of the invention and identify
types of transaction incentives applicable to the results of the
predictive modeling.
[0166] First, obtain a consumer transaction database containing
transaction data for a first set of consumers. This transaction
database may also include demographic data. In this example, the
database contains transactions for classes P1, . . . P12. P1, . . .
P12 may each represent the products having the same Universal
Product Code (UPC), or any class of products sharing a common
attribute as previously discussed.
[0167] Second, cumulate the transactions records for each consumer
by time period, such as by month. That is, identify the number of
item or services purchased in association with a consumer
identification within each one of classes P1, . . . P12, in each
month. Tables 1 and 2 illustrate this step.
[0168] In FIG. 7, table 1 shows a customer's transactions in
association with dates of transaction for transaction classes P1 to
P12. Each row in table 1 represents a transaction and specifies the
date of the transaction and the quantity of items transacted in
each class.
[0169] In FIG. 8, table 2 shows the cumulation of quantity of items
transacted in classes P1 to P12 by time period. More specifically,
table 2 shows the cumulation of transaction data for customer 1 in
each one of months of January through May of the year 2000 (1/00 to
5/00). Alternatively, place a 1 in the cells in table 2 for any
month and Pi in which the consumer has purchased at least one
item.
[0170] Second, for each class Pi and each one of the time periods,
calculate the fraction of the consumer records of the first set of
consumers having a non-zero value.
[0171] In FIG. 9, table 3 graphically illustrates the result of the
second step.
[0172] Third, identify a first subset of cumulated transaction
history records for consumers of the first set that purchased at
least 1 item in class P12 in time period 5/00. That is, filter from
the larger set of records a sub set in which the fraction for P12
is 1. That is, include in the first subset only customer
transaction records that have a non-zero value for transactions of
class P12 in time period 5/00.
[0173] Fourth, repeat the second step on the first subset. That is,
for each class Pi and each one of the time periods in the first
subset, calculate the fraction of consumer records having a
non-zero value.
[0174] In FIG. 10, table 4 graphically illustrates the result of
the fourth step.
[0175] Fifth, subtract the values in cells in Table 3 from the
values in the corresponding cell in Table 4.
[0176] In FIG. 11, table 5 illustrates the result of the fifth
step. Table 5 shows values in each cell that are representative of
correlations to the existence of transactions in P12 in time period
5/00. A positive value in table 5 indicates purchase in the
corresponding class correlates to purchase of P12 in time 5/00,
with the magnitude of the value indicating the degree of
correlation. A negative value in Table 5 indicates failure to
purchase in the corresponding class predicts purchase of P12 in
time 5/00, with magnitude of the value indicating the degree of
correlation.
[0177] Alternatively, instead of correlating to a fraction
representing the existence or lack of existence of a transaction in
each cell in table 5, steps 1 to 4 could have been modified so that
table 5 correlated to the average number of items purchased. That
is, tables 3 and 4 could have been determined by calculating the
average number of items transacted instead of the fraction of the
consumer records having non-zero values. In that case, the cells in
table 5 would contain numbers indicating an average difference in
number of items purchased for those in the first subset compared to
the average number of items purchased for those in the first set.
Use of data in this alternative would require subtracting the
average values determined for the first set of consumers from the
actual transaction data for a specific consumer, and then
determining whether the result of that subtraction correlated to
the alternative table 5 data. This additional step is not necessary
when using the correlation data shown in table 5, as explained
below.
[0178] The correlation data shown in table 5 may be used in many
ways to estimate the likelihood or relative likelihood of a
consumer transacting in class P12 in the future. Preferably, the
correlation data is used by operating with it on an individual
consumer transaction record to determine a likelihood of that
consumer subsequently purchasing in P12 in the correlated time
period, or a relative likelihood relative to other consumers, of
that consumer subsequently purchasing P12 in the correlated time
period.
[0179] A consumer having non-zero transactions in a class and a
time period that is correlated to a future transaction in class P12
is a predictor that the consumer will subsequently conduct a
transaction in P12. That prediction may be used as one factor in
deciding whether and when to provide to that particular consumer a
transaction incentive offer to purchase in class P12 in the
correlated time period. A consumer having no transactions in any
class and a time period that is correlated to a future transaction
in class P12 is a predictor that the consumer will not subsequently
conduct a transaction in P12 in the correlated time period. That
prediction may be one factor used in deciding whether and when to
offer that particular consumer an incentive to purchase in class
P12. Moreover, correlations or lack thereof may be used to
determine the value of any incentive transaction to offer to the
consumer.
[0180] The identification of correlations and their magnitude
indicated by non-zero values in table 5 may be used to combine the
significance of some or all of the correlations. One method
involves determine the difference in the fraction of consumers that
purchase in two or more of the correlated classes identified in
table 5 and subsequently purchase in class P12 in time period 5/00
(the correlated time period) to the fraction of consumers that
purchase those two correlated products without regard to purchase
of P12 in time period 5/00. Similarly, fractional difference
correlations may easily be calculated for any combination of cells
of table 5 showing a correlation. More than one such correlation
value for two or more of the cells shown in table 5 may be
calculated. The resulting correlations may be used individually or
combined in a ranking function as discussed in the next
paragraph.
[0181] Alternatively, we can derive a ranking function by defining
a multi variable correlation formula applicable to the data for a
consumer's transactions that cumulates the significance of each of
the single class correlations shown in table 5. Applying the
derived multi variable correlation formula to a second set of
consumer transaction records results in values that rank each
consumer record by the likelihood of the corresponding consumer
purchasing in P12 in the correlated time period. An expression for
an exemplary ranking function follows. Define a ranking function
R(P12, n, Sk)=R, which defines a rank R for consumer record Sk's
likelihood of transacting in class P12 in time period n. We define
R(P12, n, Sk)=S(NCij if Cij is positive -Cij if Cij is negative),
where Nij represents the number of items transacted in class i in
time period j, S represents summation over values of i and j, Cij
are the correlation, values derived as indicated for table 5, and j
runs from 1 to 12, or 1 to 11. In the example represented by table
5, non-zero values of Cij only exist for certain i values wherein
j=n-1 (time period 4/00) or j=n-2 (time period 3/00). However, we
recognize that correlations between more than three time periods,
and non-consecutive time periods may exist, and the general form
for R(P12, n, Sk) accounts for these possibilities.
[0182] Any of the multi variable correlation functions applied to a
set of consumer transaction records will result in a distribution
of values for consumer records Sk. The records of some consumers
will have relatively high R values and the records of other
consumers will have relatively low R values. Relatively high values
indicate a relative likelihood that the corresponding consumers
will purchase in category P12 in the predicted time period. The
system of the invention may use both relatively high values and
anticipated time period of purchase and relatively low values as
inputs in determining whether to make available to a consumer an
incentive offer and when to make the incentive offer available.
[0183] Statistical analysis, heuristic, or ad hoc rules may be
applied to the R function to map R values to probabilities in the
range 0 to 1. The simplest method of mapping the R values for a set
of consumers to a probability range is to normalize the R values by
dividing all of the R values for a set of consumers by the largest
R value in the set. The results of the ranking or probability of
purchase of P12 in time period 5/00, the correlated time period,
may be used in decisions regarding offering of incentives in a
variety of ways.
[0184] In the case of a relatively high likelihood of a consumer
purchasing in class P12 in a correlated time period, one of the
manufacturers competing for sales in class P12 may decide to offer
the consumer a discount on their brand product in class P12, and at
a time, such as a day a few days, or a week prior to the
anticipated purchase time period, or during the anticipated
purchase time period. If the consumer's prior purchase data shows
that the consumer preferentially purchases that manufacturer's
products in that class P12, the manufacturer may decide to offer no
incentive or to offer only a low value incentive. If the consumer's
prior purchase data shows that the consumer preferentially
purchases a different manufacturer's product in class P12, the
manufacturer may decide to offer a large value incentive in an
attempt to induce the consumer to try its product. If the
consumer's prior purchase data shows that the consumer has
purchased from a variety of different manufacturer's product in
class P12, the manufacturer may decide to offer an intermediate
value purchase incentive in an attempt to induce the consumer to
try its product. The determinations based upon consumer's
preference to a different manufacturer in class P12 is an example
of a class and subclass determination. The class is P12. The
subclass is the brand of the manufacturer of products in P12 to
which the customer's prior sales are associated.
[0185] One benefit of this invention is that it enables
manufacturers and retailers to make determinations regarding
incentive determinations based upon a ranking or likelihood of a
consumer purchase in a class and additional data regarding the
consumer's preferences or purchase history in that class or a
subclass of that class. Thus, the invention provides for making
decisions regarding purchase incentive offers based upon a
customer's prior purchase history in (1) a class and (2) a subclass
of that class and correlations to either the class or the subclass
indicating likelihood of the consumer purchasing in either the
class or the subclass in a correlation time period.
[0186] In the case of a relatively low likelihood of a consumer
purchasing in class P12 in a correlated time period, a manufacturer
may decide either to offer the consumer a relatively high value
purchase incentive for purchasing in P12 or to forego offering the
consumer a purchase incentive in P12.
[0187] The foregoing references to manufacturers in this example
apply equally to retailers. Especially when those retailers (1)
desire to increase sales by offering purchase incentives on
products a consumer is unlikely to otherwise purchase, (2)
introduce their own brands in their stores (house label brands) in
competition with the other brands they sell in their stores, and
(3) desire to motivate consumers to perform subsequent shopping
transactions in stores of the same retailer rather than in stores
of different retailer, such as a competing retailer.
[0188] One advantage of the invention is that it enables a
determination of a correlation time period when a consumer is
unlikely to purchase in a specified class, even when the consumer's
purchase history shows transactions in that class in the past. This
enables manufacturers and retailers to avoid the expense of
generating and transmitting to consumers incentive offers that have
little likelihood of being used.
[0189] The ranking and probability determinations identified above
for P12 can be repeated for each one of classes P1 to P11.
Manufacturers and retailers may make decisions regarding incentive
offers for a consumer based upon all of that information. For
example, a manufacturer may decide to offer an incentive to a
consumer only for the one class in which it is most likely that the
consumer will purchase in the class, and will likely purchase from
another manufacturer. Alternatively, a manufacturer may decide to
offer a specified number of transaction incentives to each
consumer, and provide to each consumer incentives in those classes
in which it is most likely that the consumer will purchase in the
correlated time period, and depend the value of the incentive to
each consumer upon the consumer's likelihood of purchasing from the
manufacturer as determined by the consumer's prior purchase
history. Alternatively, the manufacturer may decide to provide
incentives only to consumers that previously have consistently
purchased from that manufacturer in a class but appear unlikely to
purchase in that class in the correlated time period.
[0190] Demographic data provides additional information which can
be correlated to consumer's purchases in specified classes. For
example, consumer demographic data may show that consumers having
at least one child under the age of 12 in their household are 70
percent likely to purchase a dairy product in each shopping
transaction at a supermarket, whereas consumers without at least
one child in their household are only 25 percent likely to purchase
at least one diary product in each shopping transaction at a
supermarket. This time independent correlation data may be combined
into the foregoing ranking functions to account for this known
demographic effect upon a consumers anticipated transactions in the
correlated time period. For example, the R function defined above
may be modified by adding a fraction corresponding to the
demographic based statistical likelihood of a purchase in the
target class. In that alternative, the function R(P12, n,
Sk)=S(NCij if Cij is positive -Cij if Cij is negative) could be
modified by adding a term Dk where D represents the correlation
value (0.70 for a consumer who has a child under 12 and 0.25 for a
consumer who does not have a child under age 12) and k represents
the specific consumer. Similarly, any other data providing a
statistical correlation to certain consumer transactions may be
included in a ranking or probability function indicating likelihood
or relative likelihood of a specific consumer transacting in a
class in a time correlated period. For example, a retailer loyalty
quotient, Q, which is an estimate of the fraction of a consumer's
total grocery shopping dollars spent at a certain retailer, may be
included in a ranking function. For example the value of Qk for
consumer k can be added to the definition of the ranking function
such that R(P12, n, Sk)=S(NCij if Cij is positive -Cij if Cij is
negative)+Dk+Qk.
[0191] Additional time independent consumer specific data which may
be included in the ranking of consumers includes a measure of the
consumer's likelihood to redeem transaction incentives that have
been provided to them in various ways, such as via direct mail,
email, at the point of sale, at kiosks not at the point of sale
both in a retail store and out of the retail store, and transaction
incentives over the Internet at web sites. The time independent
data relating to the consumer's likelihood of redeeming based upon
the modality of transmission of the transaction incentive to the
consumer may be used in determining which modality to use in
providing the transaction incentive to the consumer.
[0192] An example of a predictive model is Ps(x1, x2, x3) where Ps
is a statistical probability of purchase of canned peas in a later
time period, x1, x2, and x3 are measures of purchase of milk,
cereal, and paper goods in a prior time period. For example, the
functional form of Ps may be Ps=f1(x1)+f2(x2)+f3(x3) where 11 is a
function ranging from 0 to 1/2, and f2 and f3 are both functions
ranging from 0 to 1/4. For example of the functional form of f1, f1
may be zero if x1 indicates no purchase of milk in the prior time
period and ranging up to 1/2 with increasing quantity of milk
purchased in the prior time period up to 2 gallons of milk. For
example of the functional form of f2, f2 may be 1/4 if x2 is zero,
and x2 may decrease with increasing x2 until x2 reflects purchase
of at least 5 dollars of cereal in the prior time period. For
example of the functional form of f3, f3 may be zero when the value
of x indicates no purchase of paper goods in the prior time period,
and f3 may range up to 1/4 as the value of x3 may increase
polynomially to indicate purchase of $20.00 in paper goods.
[0193] Another example of a predictive model is Ps(x1, x2, x3) in
which Ps is the probability of a change in purchase behavior of the
specified product in the earlier and later time periods. For
example, Ps may be the probability that quantity of the specified
product will decrease, remain constant, or increase. Ps may or may
not be a function of a measure of purchase of the specified
quantity in the prior time period. As a specific example, Ps may be
the probability of the quantity of purchase of canned peas in the
second time period being no more than 50 percent the quantity of
canned peas purchased in the first time period. Again, one possible
predictive model is Ps=f1(x1)+f2(x2)+f3(x3). Obviously, the number
of variables, x1, x2, and x3 may vary. Preferably, the model
includes at least two variables, more preferably three or more
variables. The number of variables, x, in a predictive model of the
invention may range from 2 to over 100, but preferably, in view of
current digital processing limitations, is in the range of 3 to 15,
more preferably 5 to 10. The functional form of Ps, the predictive
model, may also vary greatly. However, the functional form of Ps
should reflect the underlying correlation of the variables x1, x2,
etc with the change in purchase quantity in the earlier and later
time periods of the specified product. The functional form of Ps
may be predetermined by the user or automatically determined by
computer code generating the predictive models applicable to each
specified product.
[0194] This invention will allow a company with a database
containing detailed shopping records for numerous consumers to help
a cold cereal manufacturer protect its sales by providing an
incentive to a consumer who has purchased a specific product of
that manufacturer's cereal in the recent past but might stop
purchasing the cereal product (hereinafter in this example referred
to as "cereal") in the future. In order to accomplish this goal,
all of the consumers' purchasing records are read into a computer
analysis and modeling program which uses one or a combination of
regression analysis, neural network analysis, and decision tree
analysis to identify characteristics in the data records that
correlate with the subsequent purchase and non-purchase of cereal
and to formulate a predictive model based upon the identified
correlations. In formulating a prediction of a specific consumer's
purchases, the modeling program relies on the predictive model
generated by an analysis of statistical population of data records
(CID records) applied to actual data in that specific consumer's
purchasing records. The purpose of the modeling program is to
examine leading indicators and determine which consumers are going
to change their spending habits and in what way. For example, there
may be 30 or 40 products that are highly correlated to the future
purchase or non-purchase of cereal. Some of the products could be
complimentary products, such as milk, bananas, or sugar. Other
products might be competing products, such as oatmeal, instant
breakfast drinks, muffins, or a competitors brand of cereal. Still
other products might be considered which would appear to have no a
prior or readily discernable rational reason for correlation to the
sale of cold cereal, such as Pez candy or nail polish. The prior
purchase of cereal, the target class, may or may not be one of the
variables that comprises the model.
[0195] In addition to products, modeling may use a wide variety of
demographic data to develop correlations based on indication of a
change in lifestyle. Some demographic reasons a consumer would
alter the consumer's consumption of cereal include: an only-child
graduating high school and leaving the household (decreased demand
compared to the same household) or the presence of a large number
of children under the age of 15 in a household (increased demand
for cereal compared to the average household). Note that the
preceding two examples that the model can accommodate are
comparisons between the household at two different times (the child
going away to college) and a comparison between the household and
other households generally (the household with a large number of
children under the age of 15). Lifestyle factors may also be
included in the analysis, such as a job change necessitating a
longer commute which did not allow the consumer time to sit down
and eat breakfast, the consumer may have started a fad diet that
consisted only of meat (eliminating cereal from the foods consumers
would eat), or the consumer's refrigerator may have broken and the
consumer was unable to keep cold milk in the home. Other factors
that may indicate that the consumer might stop purchasing cereal
are related to the consumer's shopping habits. If the consumer
constantly changed the type of cereal the consumer purchased, it
might indicate the consumer did not find a type of cereal to the
consumer's liking and would make the consumer more likely to stop
purchasing cereal than a consumer that habitually bought a certain
brand and product. Such a non-loyal consumer is a prime target for
an incentive for cereal. A decline in the consumer's frequency of
purchasing cereal or a decrease in the size of the cereal box
purchased, or an increase in the price of cereal might forestall a
future decline in the consumer's cereal purchases.
[0196] In addition to identifying products which conflate to the
non-purchase of cereal, system 20's analysis formulates the
relative weighting of each variable and the relevant time period or
time periods that provide improved relatively larger correlations.
System 20 may be programmed to evaluate purchases from two or more
prior time periods and from time periods of different durations in
generating a predictive model. Alternatively, system 20 may be
limited to a program based on only two time periods of equal
duration. For example, system 20 might predictively model consumer
behavior from each one of a subsequent 6 months based upon consumer
transaction information for the month of September.
[0197] As stated above, all of the variables can be derived by the
computer program based on a combination of regression analysis,
neural network analysis, and decision tree analysis. However, if
the user chooses, any variable may be established by the user.
[0198] In a preferred embodiment, after system 20 defined the
predictive model function, it applies the predictive model function
to a CID record for each consumer in a set of consumers resulting
in a score for each CID record for the targeted behavior. The score
is preferably then compared to a value to determine if the consumer
associated with the CID will receive an incentive. For example, the
model for the cereal incentive program introduced above might
contain the following product variables: the purchase of milk,
napkins, limes, 35 millimeter film, bananas, frozen pizza, oatmeal,
sugar, coffee, tissues, and canned corn; as well as the following
demographic data: household income and number of children under the
age of 16. Each variable is evaluated to a number. The values of
the numbers may be discrete (for example, a "one" for the purchase
of the product during the applicable period and a "zero" for the
absence of a purchase of the product during the relevant time) or
continuous. These values may be summed to produce a score for each
CID. The score need not be an estimate of probability, per se. The
model could equate a percentile to a score (for example, a score of
seven indicates that seventy percent of the consumers will purchase
cereal during the relevant period and a score of eight might
indicate that eighty-five percent of the consumers will purchase
cereal during the relevant period). This would enhance decision
making regarding which consumers to provide an incentive to, for
which product, and how enticing the incentive will be. The focus of
the scoring could also be to indicate that the consumer was about
to stop purchasing cereal, that the consumer was about to purchase
less cereal, the consumer was about to purchase a different brand
of cereal, and that the consumer was going to purchase a hot cereal
instead of a cold cereal.
[0199] Alternative embodiments of the invention relating to
non-transaction assumed demographic data are described below.
[0200] Data stored preferably in database 30, but which may be
stored in any one of databases 30, 70, and 80 includes may include
fields for actual demographics data, assumed demographics data, or
both types of data. For example, that data fields may include one
or more fields for household income level. For example a first
field may store a first value which indicates annual household or
consumer income under $50,000 per year or a second value which
indicates annual household income over $50,000 per year. That first
field may be for non-assumed demographic data, such as data
provided by the consumer. In addition, in the same database, there
may be a second field for assumed income level wherein the same
first and second values indicate the same assumed income levels as
in the non-assumed income level field. The demographic data may
stored in various forms, such as one field for each demographic
value or range, and there may be multiple demographic data values
or ranges per demographic variable. For example, there may be one
field storing age of consumer wherein the numeric value in that
field represents the consumer's age in years. Alternatively, there
may be one field for range of each age in years (0-9, 10-19, 20-29,
30-59, 60-100 for example) wherein each such field stores a boolean
value, and only one field stores a boolean positive value
indicating the age of the consumer. Where, in the case of assumed
demographic data, age is represented by a statistical probability
of the consumer being have a specified age, one field may store a
value associated with a set of statistical probabilities for one or
more ranges or values of the demographic variable. Alternatively,
the database may contain a plurality of fields each of which stores
the statistical probability of the consumer having a value of the
demographic variable in a particular range or of a particular
value. An example of the last alternative would be a series of
fields each of which stores a value from zero to one representing
the probability that the consumer is of a specified age range, such
one field indicating age between 18 and 19 years, a next field
indicating age between 19 and 20 years, a next field indicating age
between 29 and 21 years, and so on. In this last alternative,
preferably, the sum of the values stored in all of the age fields
essentially equals 1.0. However, the sum of the values stored in
all of the age fields may essentially equal a number other than
one, so long as the same normalization is applied to all records
for all consumers to which this record is to be related by
subsequent processing.
[0201] Moreover, the definition of fields in the database may not
distinguish between actual non-transaction demographic data and
assumed non-transaction demographic data such that the data in the
non transaction demographic fields associated with any CID is not
indicated, for example in field definitions, to be actual or
assumed data.
[0202] The database may also store in association with a CID
assumed block data identification. Assumed block data
identification is identification of a bock (sometimes referred to
as a block group) with a CID assumed based upon correlation of
assumed data in the customers' record matching data for the block
group in a geographic region relatively close to the location of
the retail store. Relatively close to the retail store means close
enough to the retail store's location that it is likely that some
substantial fraction of the people living in the block group will
shop in the retail store. In addition, the database may not
distinguish between assumed block data identification and actual
block data identification associated with any CID.
[0203] The database may also store, in association with a CID,
fields derived from non-transaction assumed demographic data
including block data for the block to which the customer belongs,
loyalty quotient data for the customer's loyalty to a store, brand,
or any class of transactions. These assumed derived fields may
contain data representing likelihood of a consumer purchasing in a
specified class, at a specified store, or in a store of a specified
retail chain.
[0204] The correlations resulting from consumer transaction data
may be combined with correlations to consumer purchase preferences
from other sources, such as correlations from actual demographic
data, assumed non-transaction demographic data, and customer
loyalty quotients. The correlations of a particular customer to
these types of data may be incorporated into a ranking or
probability function in predicting the likelihood of a consumer
entering into a specified transaction in a specified future time
period.
[0205] Preferably, analysis system 20 is either hard wired or
programmed to perform the algorithms disclosed herein for
predictive modeling, and the applications of models to rank assumed
non-transaction demographic data, determine probabilities of
assumed non-transaction demographic data, or make predictions based
at least part upon the assumed data. However, retailer computer
system 50 or manufacturer computer system 40 may be programmed to
perform those functions.
[0206] In step 210, system 20 runs software or implements in
hardware a form of predictive modeling to analyze a set of data for
multiple CID records in database 30. The predictive modeling
identifies correlations between the existence of transactions in
one or more classes of transactions in one or more time periods and
known demographic data. The predictive modeling may also include
other correlations, such as correlations based upon a consumer's,
block data and loyalty quotients to the known demographic data. The
predictive modeling of course does that on a record by record
basis, wherein each record is associated with one CID. The
correlations identified above are then used to analyze CID data
records wherein the value of the relevant demographic variable is
unknown to determine either an assumed value for that variable or
probabilities or expectations that the value of the demographic
variable has certain values. That is, one or more correlations are
used to define a function in the manner already described to
generate a probability, prediction, or ranking relative to other
records for a demographic variable to assume for a CID.
[0207] The assumed non-transaction demographic data may be used in
a variety of ways.
[0208] Demographic data, pre se, has a substantial value.
Accordingly, a set of CID records including at least one
demographic field storing data based upon assumed non-transactional
demographic data may be sold for value.
[0209] The assumed demographic data, if in the form of
probabilities of values of demographic values, may be filtered to
identify, for each CID, the demographic variable having the highest
value. For example, a CID's record having a highest probability
value of 0.1 for any age group and a highest value of 0.7 for
income greater than $50,000.00 may be filtered to define a set of
records associated with the particular demographic of income range
and the value of that variable being greater than $50,000.00.
Similar sets of records may be associated with each demographic
variable so that each set of records for each demographic variable
contains only those records from the original group of records in
which the value of that particular demographic variable was the
highest of any of the demographic probability values. Each such set
of records associated with a particular demographic may be
associated with a transaction incentive designed to target the
demographic group associated with that set. For example, an
incentive targeted to be effective for household incomes over
$50,000.00, such as a discount on a luxury item, since an income
over $50,000.00 is currently well above the average household
income, may be associated with the CIDs associated demographic of
household income greater than $50,000.00. The transaction incentive
may then be offered to the consumer or consumers associated with
the CID in any manner.
[0210] Given a set of CID records containing at least one such
field, that data may be sorted by a region field, such as town,
state, or postal code, and a subset of those records having only
records for one or a few towns, states, or postal cods, may be
offered for sale to retail stores having locations correlated with
the towns, states, or postal codes of that subset of records. This
is particularly useful because CID locations can be correlated with
retail store geographic locations so that a retail store can use
data for consumers in the geographic region around the locations of
the retail store, since that is the region from which the store is
likely to draw purchasers. For example, the CID records may be
sorted to identify all records associated with zip code 22202, and
then sold to retail stores located in or adjacent to that zip
code.
[0211] Furthermore, the computer system may then use the assumed
non-transaction demographic data, by comparison to demographics
associated with the block regions in the local region of a
particular retail store, to infer a specific consumer block region
for each consumer. This may be done by correlating the values in
the CIDs record, of variables including the assumed non-transaction
demographic data, to demographic data of the block data for those
blocks in the local geographic region surrounding the retail store
from which the retail store draws customers. Then, the system
assumes that the customer resides in the block region to which the
CID's record is most highly correlated and stores that data in the
record associated with the CID. Of course, if the consumer's actual
residence location is stored in the CID's record, no such
processing is needed for that consumer. The computer analysis
system may determine the number of consumers assumed to be in each
block region. This data may be used to identify relative numbers of
a retails store's customers in each block region local to the
retail store. The computer analysis system may also identify the
fraction of the number of the store's customers assumed to be in
each block region to the total number of consumers in that block
region. The computer analysis system may also identify the fraction
of the number of the store's customers assumed to be in each block
region to the total dollar value spent in each block region. The
computer system or the retails store's computer system may then
preferentially target market based at least in part upon blocks
regions where the identified fraction is relatively low, i.e.,
where the retail store has below average market share. For example,
if the consumer is assumed to have demographic variables of age
under 30, income between $20,000.00 and $30,000.00, and gender
male, and correlation that age, income and gender for the 20
closest block groups surrounding the location of the retail store
identifies a block group 13 as the most correlated, the computer
system may assume that the consumer resides in block group 13.
Alternatively, the computer system may define probability or
expectation of the consumer residing in each one of the foregoing
block groups, and then cumulate the probabilities for records for
all consumers to assume the number of its customers in each block
group region.
[0212] One way to obtain mixed derived demographic data is for the
computer system to identify a set of records for consumers that do
actually purchase a specified product or class of products or
services, and then solicit survey information from consumers
corresponding to that set of records. One way to solicit response
to such a survey is to offer consumers corresponding to that set an
incentive for participating in the survey. The incentive can be
targeted in one of two ways. First, the incentive offered may
depend upon the common product or service from which the class was
specified in such a manner that the incentive is likely to be
useful to purchasers of the common product or service. The computer
system would in this methodology associated an incentive relating
to the specified product or class of products or services used to
define the set of consumers. Second, once the set of consumer's
records was defined, the computer system could determine based upon
criteria applied to record data, what incentive to offer each
consumer whose record was contained in the set of records. Notably,
the record data includes transaction data and product purchase
data, and it may also include actual, assumed non-transaction,
derived, or mixed derived demographic data.
[0213] The algorithmic portions of the invention are preferably
implemented in software, which is also referred to as code.
However, they may be implemented in hardware, for example, by
burning an EPROM type of semiconductor chip or an ASICS type of
semiconductor chip, and including the chip in a digital computer
system.
[0214] Incentive offers may be generated via email, postal mail,
target market advertising, POS terminal printing, and the like.
[0215] Our invention is not limited to the specific example above.
It is more properly defined by the scope of protection we claim in
the following claims.
* * * * *