Measuring large‐scale market responses and forecasting aggregated sales: Regression for sparse high‐dimensional data

Published date: 01 August 2019
Date: 01 August 2019
Author: Nobuhiko Terui, Yinxing Li
DOI: http://doi.org/10.1002/for.2574
RESEARCH ARTICLE
Measuring large-scale market responses and forecasting
aggregated sales: Regression for sparse high-dimensional data
Nobuhiko Terui | Yinxing Li
Graduate School of Economics and
Management, Tohoku University, Sendai,
Japan
Correspondence
Nobuhiko Terui, Graduate School of
Economics and Management, Tohoku
University, Kawauchi Aoba-ku, Sendai
980-8756, Japan.
Email: terui@tohoku.ac.jp
Abstract
In this article, we propose a regression model for sparse high-dimensional data
from aggregated store-level sales data. The modeling procedure includes two
sub-models: a topic model and a hierarchical factor regression. These are
applied in sequence to accommodate high dimensionality and sparseness and
to facilitate managerial interpretation.
First, the topic model is applied to aggregated data to decompose the daily
aggregated sales volume of a product into sub-sales for several topics by
allocating each unit sale ("word" in text analysis) in a day ("document") into a topic
based on joint-purchase information. This stage reduces the dimensionality of
data inside topics because the topic distribution is nonuniform and product
sales are mostly allocated into smaller numbers of topics. Next, the market
response regression model for the topic is estimated from information about
items in the same topic. The hierarchical factor regression model we introduce,
based on canonical correlation analysis for original high-dimensional sample
spaces, further reduces the dimensionality within topics. Feature selection is
then performed on the basis of the credible interval of the parameters' posterior
density.
Empirical results show that (i) our model allows managerial implications from
topic-wise market responses according to the particular context, and (ii) it
performs better than do conventional category regressions in both in-sample and
out-of-sample forecasts.
KEYWORDS
dimension reduction, feature selection, hierarchical factor regression, high-dimensional sparse data,
topic model
1 | INTRODUCTION
Disaggregated scanner panel records from stores have
been analyzed for various purposes using a variety of
models. To model consumer heterogeneity as predicted
by microeconomic theory, heterogeneous choice models
with hierarchical Bayes modeling were proposed by
Rossi, McCulloch, and Allenby (1996). These have been
widely and successfully applied to understand individual
customers and explore targeted or one-to-one marketing
strategies, as is discussed in Rossi, Allenby, and
McCulloch (2005) and associated references. Terui, Ban,
and Allenby (2011) discussed the effectiveness of TV
advertising in relation to the formation of consideration
sets using the single source of disaggregated purchases
with individual TV exposure records. Hasegawa, Terui,
and Allenby (2012) explored the mechanisms of consumer
satiation with products by modeling product attributes
using dynamic heterogeneous-choice models.
Received: 20 November 2017 Revised: 14 September 2018 Accepted: 3 January 2019
DOI: 10.1002/for.2574
© 2019 John Wiley & Sons, Ltd. Journal of Forecasting. 2019;38:440-458. wileyonlinelibrary.com/journal/for
On the other hand, sales data are automatically accumulated
at customer checkout points by the point-of-sale
(POS) terminals in most retail locations. These data are
extremely important for developing promotional programs,
even if the store does not use a customer loyalty
program. Most traditional methods of analyzing the
aggregated store data specify the range of products by
category, that is, category regression. Category regression
as discussed by Hanssens, Parsons, and Schultz (2001)
assumes that the number of products is smaller than the
number of days for which sales data are available. This
approach is useful when applied to products from
well-recognized categories with high frequency of sales.
However, the approach cannot be applied to all products in a
store, particularly to products that are purchased
infrequently over the period of observation.
By contrast, recent advances in network technology
have made clear that scanning entire databases can
uncover unexpected hidden patterns of joint purchases.
This big-data approach could offer insights for marketing
managers that help them to understand the shopping
contexts of their customers. The aggregated scanner data
from POS terminals contain records of a huge number of
sales, prices, and promotions, as the store in the case
study below has about 8,000 products. These variables
cannot be used directly as covariates because the size of
the covariate matrix in the market response function is
intractably large. Even when a model can be estimated,
overfitting occurs because of the so-called N < P problem,
where N and P are the numbers of samples and covariates,
respectively. Thus we need to generate smaller datasets by
decomposing the larger dataset or reducing the
dimension of the data.
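The overfitting that arises when N < P can be illustrated with a minimal sketch. It is not taken from the paper: the sizes, the random data, and the variable names below are assumptions chosen only for illustration. With more covariates than samples, ordinary least squares fits pure noise exactly in sample, while the same coefficients fail on fresh data.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes with fewer samples than covariates (N < P).
n_samples, n_covariates = 20, 50
X = rng.normal(size=(n_samples, n_covariates))
y = rng.normal(size=n_samples)  # pure noise: y has no true relationship to X

# Minimum-norm least-squares solution. X has rank N, so the in-sample fit is exact.
beta, _, rank, _ = np.linalg.lstsq(X, y, rcond=None)
in_sample_rss = float(np.sum((y - X @ beta) ** 2))

# Fresh data from the same null process exposes the overfit.
X_new = rng.normal(size=(n_samples, n_covariates))
y_new = rng.normal(size=n_samples)
out_of_sample_rss = float(np.sum((y_new - X_new @ beta) ** 2))

print(f"rank={rank}, in-sample RSS={in_sample_rss:.2e}, "
      f"out-of-sample RSS={out_of_sample_rss:.2f}")
```

The in-sample residual sum of squares is zero up to rounding even though there is nothing to learn, which is why the covariate space has to be reduced before a market response function is estimated.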
These high-dimensional datasets are also sparse, that is,
they contain many zeros, as is the case with the aggregated
scanner data in our study. The number of product items
purchased on a given day is considerably smaller than the
number of items displayed in the store, so the sales data
for many items are zero. Information about the price and
promotional variables of these unsold items is not recorded,
which produces yet another data entry with zero value.
In this study, we include all products in our analysis
without assuming categories. The proposed
model is composed of two sub-models. The first model
reduces the dimension of the original data space by
decomposing it into several sub-datasets. Hidden structure
within the aggregated scanner data is uncovered by
applying a topic model (Blei, Ng, & Jordan, 2003) to
aggregated data. The second model solves the N < P
problem by reducing the dimensionality of the covariate
space. A hierarchical factor regression model is proposed
for this purpose, to estimate market structure in the
lower-dimensional space between the dependent variable and
covariates. The market response functions defined by
regression can then be estimated in the usual way, since the
data in the reduced-dimensional space do not include many zeros.
Finally, the market response in terms of regression coefficients
in the high-dimensional original data space is
recovered by converting the estimated structure in the
reduced-dimensional space into the original space. An
overview of the proposed model is shown in Figure 1.
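The reduce, regress, and recover sequence described above can be sketched as follows. This is not the hierarchical factor regression proposed in the paper. It substitutes plain principal components regression as a simple stand-in, and all sizes, data, and names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical sizes: fewer days than products (N < P), a few latent factors.
n_days, n_products, n_factors = 30, 100, 3

# Toy covariates driven by a few latent factors plus small noise.
F = rng.normal(size=(n_days, n_factors))
loadings = rng.normal(size=(n_factors, n_products))
X = F @ loadings + 0.1 * rng.normal(size=(n_days, n_products))
y = F @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=n_days)

# Step 1 (reduce): project centered covariates onto the leading
# right singular vectors of the covariate matrix.
x_mean, y_mean = X.mean(axis=0), y.mean()
Xc, yc = X - x_mean, y - y_mean
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
V_k = Vt[:n_factors].T          # shape (n_products, n_factors)
Z = Xc @ V_k                    # shape (n_days, n_factors)

# Step 2 (regress): ordinary least squares in the low-dimensional space,
# where the number of regressors is now smaller than the number of days.
gamma, *_ = np.linalg.lstsq(Z, yc, rcond=None)

# Step 3 (recover): map the low-dimensional coefficients back to one
# coefficient per original covariate.
beta = V_k @ gamma              # shape (n_products,)

fitted = y_mean + Xc @ beta
r2 = 1.0 - float(np.sum((y - fitted) ** 2) / np.sum(yc ** 2))
print(beta.shape, round(r2, 3))
```

Because `Xc @ beta` equals `Z @ gamma` by construction, the recovered coefficients reproduce the low-dimensional fit exactly while remaining interpretable per product.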
Our paper contributes to the theory of marketing
modeling and forecasting by the Big Data approach. More
specifically, we apply a topic model to aggregated sales
data in a new manner to reduce dimensionality and
uncover shopping contexts. This approach is next
combined with a regression model for sparse
high-dimensional data by proposing hierarchical factor
regression to find effective covariates and measure their market
responses. These models are verified with an empirical
study that unveils actionable insights for retail managers.
We derive unusual findings from these response
functions by scanning the whole database, which contributes
to managerial practice usually based on category-specific
analysis.
In the next section, we apply the topic model to
aggregated sales data and generate sub-datasets for topics that
can be interpreted as shopping contexts. These
sub-datasets already have a lower dimension than
the original dataset, since product sales are not likely to
be allocated to the topics evenly. In Section 3, we apply a
second model to the sub-dataset of each topic to further
reduce the data dimensionality and to estimate
topic-wise market response regressions. In
Section 4, we report an empirical study of aggregated
scanner data. The concluding remarks are given in
Section 5.
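The way topic-wise sub-datasets shrink the product dimension can be shown with a toy example. It assumes that each unit sale has already been assigned a topic by a fitted topic model; the records, product names, and topic labels are invented for illustration and are not data from the study.

```python
from collections import defaultdict

# Hypothetical unit-sale records: (day, product, topic assigned to that unit).
unit_sales = [
    (1, "chocolate", "gift"), (1, "card", "gift"), (1, "chocolate", "gift"),
    (1, "flour", "baking"), (1, "chocolate", "baking"),
    (2, "chocolate", "snack"), (2, "cola", "snack"),
    (2, "card", "gift"), (2, "chocolate", "gift"),
]

# sub_sales[topic][(day, product)] = units of the product sold that day under the topic.
sub_sales = defaultdict(lambda: defaultdict(int))
for day, product, topic in unit_sales:
    sub_sales[topic][(day, product)] += 1

# Products that appear at all, and products that appear within each topic.
all_products = {product for _, product, _ in unit_sales}
products_per_topic = {
    topic: {product for (_, product) in cells}
    for topic, cells in sub_sales.items()
}

for topic, products in sorted(products_per_topic.items()):
    print(topic, sorted(products), f"{len(products)} of {len(all_products)} products")
```

Each topic's sub-dataset involves only the products that were allocated to it, so a regression fitted within a topic faces fewer covariates than one fitted on the whole store.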
2 | DIMENSION REDUCTION USING TOPIC MODEL
2.1 | Decomposing aggregated sales into sub-sales by shopping context
Consumer motivations for making purchases of products
are hidden in aggregated sales data. For example, out of
50 sales of a chocolate, 15 could be purchased for
consumption, 25 to be given away, as indicated by a joint
purchase with a card, and 10 for cooking, as indicated by a
simultaneous purchase of flour. The decomposition of total
sales into several
topics allows a better understanding of the market and