Design of a shopbot and recommender system for ... - Semantic Scholar

16 downloads 176534 Views 2MB Size Report
Jul 3, 2006 - The increasing proliferation of online shopping and purchasing has naturally led to a growth in the ..... Adobe® Photoshop® Elements 2.0.
Decision Support Systems 42 (2006) 1974 – 1986 www.elsevier.com/locate/dss

Design of a shopbot and recommender system for bundle purchases Robert Garfinkel a,1 , Ram Gopal a,2 , Arvind Tripathi b,3 , Fang Yin a,⁎ a

School of Business, University of Connecticut, Storrs, CT 066269-1041, USA b University of Washington Business School, Seattle, WA 98195-3200, USA

Received 29 June 2005; received in revised form 10 May 2006; accepted 14 May 2006 Available online 3 July 2006

Abstract The increasing proliferation of online shopping and purchasing has naturally led to a growth in the popularity of comparisonshopping search engines, popularly known as “shopbots”. We extend the one-product-at-a-time search approach used in current shopbot implementations to consider purchasing plans for a bundle of items. Our approach leverages bundle-based pricing and promotional deals frequently offered by online merchants to extract substantial savings. Interestingly, our approach can also identify “freebies” that consumers can obtain at no extra cost. We also develop a model to extend the capability of the current recommendation algorithms that are mainly based on collaborative filtering and item-to-item similarity techniques, to incorporate product price and savings as an additional important factor in making recommendations to shoppers. We develop a practical algorithm that can be employed when the number of items is large or when the real-time nature of shopbot applications dictates quick response rates to consumer queries. A detailed experimental analysis with real-world data from major retailers suggests that the proposed models can provide significant savings for bundle purchasing consumers, and frequently identify freebies for consumers. Together the results underscore the potential benefits that can accrue by incorporating our models into current shopbot systems. © 2006 Elsevier B.V. All rights reserved. Keywords: Shopbot; Bundle pricing; Recommender system; Integer programming

1. Introduction The Internet has dramatically reduced buyer search costs by providing easy information retrieval [13]. On the other hand researchers have found significant price ⁎ Corresponding author. Tel.: +1 860 486 6182; fax: +1 860 486 4839. E-mail addresses: [email protected] (R. Garfinkel), [email protected] (R. Gopal), [email protected] (A. Tripathi), [email protected] (F. Yin). 1 Tel.: +1 860 486 1289; fax: +1 860 486 4839. 2 Tel.: +1 860 486 2408; fax: +1 860 486 4839. 3 Tel.: +1 206 221 5369; fax: +1 206 543 3968. 0167-9236/$ - see front matter © 2006 Elsevier B.V. All rights reserved. doi:10.1016/j.dss.2006.05.005

variation on the Internet even for identical commodities such as books and CDs, to name a few [3,4]. This variation and the large number of vendors have made it difficult for a user to find the best price for an item or items. In response, a number of comparison-shopping search engines, widely known as “shopbots”, have become popular (e.g., mySimon.com, Froogle.com, and PriceGrabber.com). At these websites a shopper can enter the product name and specification, and the shopbot will search a large number of vendors and return the prices offered by retailers, as well as other information such as shipping cost and availability. Since the first shopbot, BargainFinder, launched in 1995, a large number of shopbots have emerged and are

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

increasingly being used by online shoppers. Shopbots are claimed to be able to collect, process, and present product related information at little cost, and therefore greatly affect the efficiency and behavior of markets [16]. Most current shopbots are geared towards oneproduct-at-a-time search. We refer to a “product” as something that can be purchased separately from any other item (see Section 2.1). Thus, using these shopbots, a shopper who wants to find the best price for a group (bundle) of products would have to initiate a search for each individual item and then combine the

1975

results on her own. A few shopbots do allow a shopper to compare shopping for multiple items as a whole by displaying the total purchasing price of these items from a single vendor and/or from multiple vendors (e.g., PriceGrabber.com's “Shopping List” feature, BooksPrice.com's multiple book price comparison). However, none of these shopbots can incorporate the variety of bundling and pricing alternatives that are frequently offered by online retailers [19]. To illustrate the disadvantage of using one-productat-a-time shopbots to purchase a bundle, consider Fig.

Fig. 1. Comparison between shopbot results and bundle promotion.

1976

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

1. The lowest individual prices obtained from the shopbot MySimon.com for Apple iPod ($244), and the software packages McAfee VirusScan 8.0 ($32), Roxio Easy CD and DVD Creator ($43.44), and Quicken 2004 Basic ($26.31) are listed. Best Buy, on the other hand, offers a bundle promotion where a consumer who purchases Apple iPod for $249.99 would then qualify to purchase any three software packages for a total of $49.98. While the offer by Best Buy is not the cheapest for any individual product, it is a cheaper option for a consumer interested in purchasing Apple iPod, along with two or more of the listed software packages. In this situation, the shopbot fails to provide the best possible purchase plan for the consumers. The use of bundle pricing and promotions has been a common marketing practice for a long time. Recently, They have been used extensively and more frequently in online retailing to boost sales since the menu cost of a digital storefront is much lower than that of its bricks-and-mortar counterpart. Another reason for increased use of bundle pricing and promotions could be that retailers are trying to avoid direct price competition pressure caused by shopbots. Further, online shoppers are increasingly purchasing multiple items in a single order because of factors such as the convenience of online shopping, site search, non-linear shipping costs, and recommendations of related products from various recommendation systems [5]. Therefore, it would be desirable to design a shopbot that is able to take advantage of promotions based on bundle pricing and promotions. Such a bundle shopbot would benefit consumers who desire to purchase multiple items by providing optimal purchase plans. It would also benefit merchants who offer bundle pricing and promotions to increase their market exposure. The focus of our work is on the development of models for shopbots that leverage bundle pricing and promotional deals offered by online merchants to extract price savings. It is intended to operate in the presence of the demand for multiple items by a user. Recent academic work has addressed issues surrounding the impact of shopbots on consumer behavior and retailer pricing strategies [8,16], and operational design improvements to enhance a consumer's overall utility with the shopbots [11]. To the best of our knowledge, this is the first research incorporating bundle pricing and promotion into shopbot design. The main contribution of this research is to extend the capability of the current shopbots to take advantage of the plethora of bundle

pricing and promotions so that shopbots can provide shoppers more effective and value-added services. This research also extends the capability of the current recommendation algorithms that are mainly based on collaborative filtering and item-to-item similarity techniques, to incorporate product price and savings as an additional important factor in making recommendations to shoppers. We also provide an empirical demonstration of the significant savings that could result from our extension to the current shopbots. The rest of the paper is organized as follows. Section 2 presents a general classification of bundles, as well as a model for purchasing the desired products at minimum cost in the presence of bundle pricing and promotions. An efficient algorithm for solution of that model is also given. A recommender system based on the model of Section 2 is the subject of Section 3. Computational results using real data are given in Section 4 and indicate that substantial savings can result from use of the models and algorithms of Sections 2 and 3. Conclusions and future research directions are highlighted in Section 5. 2. Bundle purchasing models In this section, we first discuss various types of bundle pricing and promotion practices that are commonly used by the retail industry. We then formally develop a general bundle purchasing model that leverages the bundle pricing and promotions to deliver optimal purchasing plans for the customers. Since a real shopbot needs to be able to respond to requests of a large number of shoppers in a timely way, we present a practical algorithm to solve the general bundle purchasing problem quickly and with excellent performance. 2.1. General classifications of bundles There is considerable literature on the benefits of bundling to the sellers. Beginning with the work of Stigler [18] on how bundling can increase seller profits, this stream of research has dealt with the seller's bundling decision problem. Work in this stream has focused on construction and pricing of bundles to maximize profit (see [19] for a comprehensive review). Adam and Yellen [1] classified three modes of bundling structures, namely pure bundling, mixed bundling, and component selling (pure unbundling). In pure bundling, individual items are not offered. Mixed

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

1977

Fig. 2. Bundle example — tie-in.

bundling is a combination of pure bundling and component selling. On the other hand pure bundling is not a concept of interest from the buyer's point of view

since one can simply consider “components” to be minimal sets of goods that can be purchased individually.

Fig. 3. Bundle example — add-on.

1978

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

The various general bundling strategies that have been implemented by retailers [15] are listed below: Deterministic bundling: Exactly one set of predetermined items is included in the bundle. Non deterministic bundling: This includes, for instance, the following: Tie-in bundling: The buyer is required to buy one major product (e.g., digital camera or mp3 player) to qualify for discounted prices on other products (e.g., three software products for $48). Usually there are comprehensive lists of both the major and tied-in products (see Fig. 2 for an illustration). Add-on bundling: The buyer is required to buy one product (e.g., wireless router) to get a free product (e.g., wireless card). This is illustrated in Fig. 3. Cross promotion: The buyer is required to purchase one product to qualify for a discount on another product. However, the buyer has the option of not purchasing the additional product. Total value discount: If the total amount of an order is above a certain threshold, the order gets an extra discount (e.g., 10% or $15 off any order above $100) (see Fig. 4). We incorporate all the above varieties of bundle pricing and promotion practices in the following model.

2.2. The general model Consider a buyer with demand for one of each of a set of items S0. A set of bundles B ≔ {B1, B2,…} is offered by the retailers, where each bundle in B contains at least one element of S0. Included in B are the |S0| degenerate bundles {i} for all i ∈ S0. Each item i has an original (unbundled) P cost given by ci > 0. The unbundled cost cðT Þw iaT ci of a set T of items represents the benchmark for price savings. Also let Sj be the set of items in Bj, and the set of all items in B is denoted by Sw [jaB Sj . The cost of Bj is fj > 0, so that fj = ci if Sj = {i}. Let B(i) be the set of bundles that contain item i. Then the problem (CB) of choosing the cheapest set of bundles that satisfies the demand for S0, can be modeled as the following set covering problem: X min f j xj ð1Þ jaB

s.t. X

xj z1; iaS0

ð2Þ

jaBðiÞ

xj binary; jaB

ð3Þ

Note that constraint (2) allows the buyer to receive additional goods beyond the desired set S0. If B⁎ is an

Fig. 4. Bundle example — value discount.

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

optimal set of bundles from CB, denote by S ⁎ w [jaB⁎ Sj the corresponding optimal set of goods. Then, if B⁎ is purchased, the buyer may receive the set F⁎ of additional goods (“freebies”) beyond single units of the elements of S0. That is, F⁎ contains at least one unit of every element of S⁎\S0, as well as possible extra units of elements of S0. In Section 4 we show that our shopbot design frequently identifies freebies for shoppers, which results in further savings. This feature is not available in any current shopbots and can provide additional value and a richer shopping experience for shoppers. Given the bundle set B, along with the vector of bundle prices, the problem of choosing an optimal subset of B is set covering, which is shown to be NPHard (e.g. [7]). There are also other features of the current problem that make it computationally difficult. Interested readers can refer to Appendix A for a detailed discussion.

1979

have not been chosen at iteration t. The GRAB algorithm is formally presented below. Algorithm GRAB Initialization Step: Set M1 = S0, N1 = B, and t = 1. Bundle Addition Step: Select a Bundle ( ) P t t j aN to min fj = ci . i2 Sj \ M t

Iteration Step: Set Nt+1 = Nt \{jt}, Mt+1 = Mt\{Sjt}, and t = t + 1. If Mt ≠ ϕ go to Bundle Addition Step. Bundle Generation Step: The solution is xj = 1 for j ∉ Nt+1 and xj = 0 otherwise.

2.3. A GRAB algorithm For consumer purchases that involve only a few items, an optimal solution of CB can easily be found by generic integer programming or specialized set covering algorithms. However, if we want to use the same model to help organizations decide on an optimal purchase plan for a large number of items, even specialized algorithms might take a long time to reach optimality. On the other hand, even if the number of items is small as in the case for individual consumers, the huge number of simultaneous requests that could be received by the shopbots can still pose a challenge. For example, pricegrabber.com, a popular shopbot, claims to provide price comparison services to 18 million active shoppers. Even if only a small portion of these shoppers send in bundle search requests at any point of time, it would take at least quite a few seconds to solve for optimality for all the requests. Usability research shows that delay of more than 10 s results in loss of user attention [12]. So waiting time even in seconds could lead to user attrition. Therefore, a practical algorithm that can solve the problem quickly will be useful if the number of items to be purchased is large and/or there are potential benefits from shaving off valuable seconds in responding to a large number of shopper requests. Therefore, we present the algorithm, which we name “Greedy Addition of Bundles” (GRAB). Let t index the iterations; Mt be the items in S0 not yet purchased at iteration t; Nt be the bundles of B that

Note that at each step the algorithm chooses a bundle that includes at least one desired item not purchased at previous steps. The chosen bundle has the lowest ratio of cost to the individual costs of these desired items in the bundle. This algorithm is a variation of that of Fisher and Wolsey [6]. GRAB takes advantage of the fact that there are prices for each of the individual items, a condition that is not true in general set covering problems. Our computational experiments with real-world data suggest that the timesavings of GRAB are on the order of 90% compared to generic integer programming packages such as CPLEX. In terms of objective function value, the performance of GRAB is consistent and is approximately 1% greater than that of an optimal algorithm. 3. Recommender system The general bundle purchasing model described in the previous section can also be utilized to extend the capabilities of current recommendation systems. Recommendation has become an important tool for target marketing in the online world. Given the huge number of items available and vast amount of information that can be easily collected from online shoppers, almost every major online retailer employs recommendation systems to help shoppers identify interesting items, thereby boosting sales and enhancing the shopping experience. There are numerous recommendation algorithms that have been adopted, including traditional collaborative filtering, cluster models, search-based methods, and item-to-item collaborative filtering,

1980

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

among others [9]. These algorithms commonly use the shopper's purchasing activity data as well as browsing history, rating data, and demographic data to infer the shopper's preferences and give recommendations based on the similarity and/or relatedness amongst shoppers and/or items. However, none of the current recommendation algorithms consider possible savings that can be obtained from bundle pricing and promotions in making recommendations to consumers [2]. Consider a situation where a shopper wants to buy a digital camera. Current recommendation systems may recommend one or more photo printers to accompany the camera, based on the similarity among shoppers and/or items. However, if there is a bundle promotion involving this camera and a certain model of printer, current recommendation algorithms are not capable of identifying this specific printer as the top pick, which could then generate the highest utility option for the shopper. Here we use the general model CB of the previous section to develop a model for recommending products that the consumer can obtain at discounts if combined with the original bundle of items. Note that this is in addition to any freebies identified when solving the original problem CB. Recall from Section 2.2 that S ⁎ w [jaB⁎ Sj . Then, one-at-a-time, for all i′ ∈ S\S⁎ solve the following set covering problem, which we denote by RS: X min f j xj ð4Þ jaB

s.t. X

xj z1; iafi Vg [ S ⁎

ð5Þ

jaBðiÞ

xj binary; jaB

ð6Þ

with optimal objective value z(i′). Then product recommendation can be displayed in non increasing order of extra savings ci′ − z(i′) or alternatively of ci zði ÞV V ci V . The same GRAB algorithm can be used to solve RS. Note that our model assumes a given set of items S\S⁎ to consider for recommendation. This set of items could come from the current recommendation algorithms. Then, based on the possible savings from bundle pricing and promotions, these items are sorted in order of non-decreasing amount of savings and presented in that order. We believe the incorporation of bundle pricing and promotion information into recommendation algorithms will greatly enhance the

possibility that shopper purchase the recommended items and benefit both shoppers and merchants. 4. Experimental verification To ascertain the potential practical benefits from deploying our models, we conducted an experimental verification using real-world data on computer-related products. Recent reports indicate that this product category, which includes software, wireless gadgets, printers, memories, and digital cameras is one of the most commonly purchased categories online [14]. When selecting retailers from whom to collect data, we tried to avoid the brand effect that might complicate our experimental results. The literature has shown that the reputation of a seller, especially in the online world, does have a significant impact on the price of a product and the behavior of shoppers [17]. Therefore, to limit any confounding retailer brand effects that may influence a shopper, we carefully selected the set of retailers for consideration as follows. We initially started with Best Buy, and then identified its main competitors in the category of computer products. This information was derived from Hoovers (www.hoovers.com), which provides in-depth information on 40,000 of the world's top business entities as well as comprehensive industry and market information. This process led to the selection of the following 14 retailers: Best Buy; Circuit City, Buy.com, CDW.com, Staples, Kmart, Wal-Mart, OfficeMax, Office Depot, Sears, Gateway, Radio Shack, CompUSA, and Amazon.com. We only included these 14 well-established national retail chains or online retailers so that the reputation of sellers is identical across different bundle solutions. This way we can show clearly that the savings are really from purchasing in bundles instead of possibly purchasing from a cheap, but perhaps non brand-name seller. If we were to include more sellers, it will surely not reduce the total savings, but these savings could be compromised by the lack of reputation of certain sellers. We then monitored these retailers for a period of 10 days in early 2004, and captured their bundle-based pricing and promotional offers for computer-related products. During these 10 days, we identified various formats of bundle offers including: special discounts/ rebates on bundled software; free memory cards with purchase of digital cameras; and any three software products for a fixed price with the purchase of a given set of products. From these we generated a total of 194 bundles involving 36 products. These products naturally fell into seven groups and are listed in Table 1, along with the lowest price for each product among these 14

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

1981

Table 1 List of products Group

Product

Lowest price

Camera

Photosmart 635 2.1 megapixel digital still camera Canon powershot A80 digital still camera (4.0 megapixel) Olympus 5.0-Megapixel 3× Optical/4× Digital Zoom Digital Camera Brand/Model: OLM C5000ZOOM Sony Cyber-shot® 5.0-Megapixel 4× Optical/4× Digital Zoom Digital Camera Brand/Model: SON DSCV1 Kodak 5.0-Megapixel 3× Optical/3.3× Digital Zoom Digital Camera Brand/Model: EKC DX4530 Lexar 128 MB compact flash card SanDisk 128MB Secure Digital™ Memory Card Brand/Model: SDK SDSDB128781 PhotoSmart 7350 Color photo printer (inkjet printer) Black inkjet cartridge (Mfr: Epson) T007201 Color inkjet cartridge (Mfr: Epson) T009201 Adobe® Photoshop® Elements 2.0 Adobe Photoshop Album 2.0 DVD_PictureShow_2/Photo_Explorer_8_-_Windows_ PhotoImpact_8_-_Windows_ VideoStudio_7_-_Windows_ Pinnacle_Studio_Version_8_-_Digital_Imaging_-_Windows_ Flip_Album_Suite_BE4.2_-_Windows_ Pinnacle_Studio_Version_7_-_Windows_ Norton Antivirus 2004 Professional edition Norton Antispam 2004 McAfee Internet Security Suite 6.0 Tax cut deluxe 2003 McAfee SpamKiller V5.0 McAfee Personal Firewall plus 5.0 Norton Systemworks 2004 Norton Personal Firewall 2004 McAfee_VirusScan_2004_Ver._8.0_-_Windows_ Netgear WG511 802.11g 54 Mbps Wireless PC Card D-Link AirPlus Xtreme G DWL G650 Belkin 802.11b Wireless Notebook Adapter Brand/Model: BLK F5D6020 Netgear WGR614 v2 802.11g Wireless 4-port Router D-Link AirPlus Xtreme G DI-624 Belkin 802.11b Wireless Broadband Router with 4-Port Switch Brand/Model: BLK F5D62314 Belkin 802.11g Wireless Broadband Router with 4-Port Switch Brand/Model: BLK F5D72304

$149.99 $349.94 $329.94

Memory Printer

Software — picture-editing

Software — utility

WirelessCard

WirelessRouter

retailers in the monitoring period. Table 2 lists summary statistics on the individual items and the bundles. We evaluated the bundle shopping models by varying the size of the bundle from 2 to 20. For each bundle size, we randomly generated the products in the bundle by Table 2 Summary of the sample Items Mean Max Min S.D. n

Bundles $84.64 $459.94 $5.86 $105.05 36

Mean Max Min S.D. n

$376.80 $549.97 $0.01 $142.19 194

$459.94 $269.99 $44.74 $54.99 $89.98 $18.93 $18.80 $58.62 $39.22 $49.99 $55.00 $86.99 $57.95 $49.95 $49.99 $57.94 $24.12 $39.99 $24.95 $24.96 $15.86 $59.72 $44.82 $30.00 $38.20 $43.99 $29.99 $67.98 $55.18 $34.99 $49.99

randomly drawing from the seven product groups without replacement. The reason for drawing by group is that, realistically, a shopper is less likely to buy products in the same category, for example, two digital cameras, in one order than to buy products from different categories, for example, a digital camera and a photo printer. One thousand bundles for each bundle size were generated, resulting in a total of 19,000 simulation runs. For each bundle we solved the problem using both CPLEX and our algorithm and compared the resulting savings using the sum of the individually-cheapest prices of items in the bundle as a benchmark. Fig. 5 highlights the cumulative percentage savings over the

1982

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

Fig. 5. Distribution of savings.

benchmark using the GRAB algorithm. Positive savings were observed in over 85% of the cases. The median savings rate was 10%, and in nearly 10% of the cases the percentage savings were over 20%. Table 3 lists the performance of CPLEX and the GRAB algorithm for various bundle sizes. As expected, the savings increase as the number of items in the bundle increases. The GRAB algorithm performs extremely well. Both the optimal and GRAB algorithms were able to frequently identify freebies, with more freebies identified for larger bundles. Utilizing the lowest individual price of the

freebies as the baseline, the net savings with the freebies included are 4% to 10% higher than without the freebies. Interestingly, the GRAB algorithm, in a number of instances, was able to identify higher-valued freebies than the optimal algorithm. Results from the evaluation of the recommender system are shown in Table 4. These results were derived from the optimal algorithm and reflect the product recommendation that results in the largest savings. In general, the buyer need only spend less than 5% of the original total cost to purchase the recommended item.

Table 3 Performance analysis results Number Avg Optimal of items benchmark Avg cost Avg S.D. Avg # Avg value of needed cost of saving (%) of extra extra items needed (%) items items 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

$166.47 $257.41 $323.21 $414.57 $497.73 $588.50 $666.88 $757.90 $844.94 $912.59 $1000.71 $1087.77 $1168.34 $1248.80 $1351.52 $1441.32 $1535.85 $1613.24 $1711.97

$158.51 $246.04 $306.18 $390.26 $466.50 $545.42 $611.69 $695.09 $772.20 $826.50 $900.85 $974.91 $1042.02 $1104.26 $1192.22 $1262.71 $1341.38 $1402.93 $1478.32

6.8 5.9 6.3 6.5 6.7 7.7 8.8 8.8 9 9.9 10.3 10.7 11.1 11.9 12.1 12.7 12.9 13.3 13.8

16.5 11.8 9.6 8.3 7.1 7 7.1 6.8 6.1 5.9 5.5 5.2 4.9 4.9 4.8 4.7 4.4 4.3 4.1

0.3 0.4 0.5 0.6 0.7 0.8 1.0 1.1 1.1 1.3 1.3 1.4 1.5 1.5 1.5 1.6 1.6 1.7 1.7

$13.38 $17.26 $24.73 $31.35 $32.03 $37.52 $42.05 $47.00 $46.30 $51.82 $54.82 $58.10 $58.58 $58.76 $59.90 $63.62 $63.85 $68.56 $69.20

GRAB algorithm % of Avg cost Avg S.D. Avg # Avg value % of benchmark saving (%) of extra of extra benchmark cost (%) items items cost 10.9 9.5 10 9.3 7.7 7.3 7 6.8 5.9 6 5.7 5.5 5.1 4.8 4.5 4.5 4.2 4.3 4.1

$158.51 $246.04 $306.23 $390.41 $466.68 $545.75 $612.28 $695.70 $773.05 $827.44 $902.26 $976.53 $1043.92 $1106.82 $1195.06 $1266.77 $1345.72 $1409.23 $1486.10

6.8 5.9 6.2 6.5 6.6 7.6 8.7 8.7 8.9 9.8 10.2 10.5 10.9 11.7 11.9 12.4 12.6 12.8 13.4

16.5 11.8 9.6 8.3 7.1 7 7.1 6.8 6.1 5.9 5.6 5.2 4.9 4.8 4.7 4.6 4.2 4.1 3.8

0.3 0.4 0.5 0.7 0.7 0.8 1.0 1.1 1.1 1.3 1.4 1.5 1.6 1.6 1.6 1.7 1.8 1.9 1.9

$13.38 $17.26 $24.85 $31.58 $32.34 $38.00 $42.63 $47.87 $47.24 $53.19 $56.71 $60.17 $61.38 $62.39 $63.43 $67.01 $67.74 $72.97 $73.52

10.9 9.5 10.1 9.4 7.8 7.4 7.1 6.9 6 6.2 5.9 5.7 5.3 5.1 4.8 4.7 4.5 4.6 4.3

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

1983

Table 4 Recommendation analysis results No. of items needed

Benchmark cost of needed items a

Extra cost to buy recommended item a

Extra cost % of original cost a

Benchmark value of recommended item a

% saving of benchmark value a

2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

$164.41 $255.94 $331.80 $421.17 $505.77 $583.44 $664.67 $750.58 $840.99 $929.82 $1005.48 $1085.41 $1180.28 $1260.15 $1350.48 $1441.52 $1527.57 $1614.40 $1706.26

$6.05 $7.71 $10.15 $12.86 $14.99 $15.77 $20.15 $19.62 $22.49 $28.52 $28.90 $27.81 $34.94 $34.44 $34.69 $35.21 $36.63 $38.52 $40.24

4.2 3.6 3.6 3.5 3.0 2.5 3.0 2.7 2.9 3.4 3.1 2.8 3.2 3.1 3.0 2.8 2.8 2.7 2.7

$68.86 $71.34 $75.65 $80.57 $83.94 $85.44 $92.47 $91.58 $95.82 $104.34 $104.74 $102.85 $111.28 $109.83 $107.57 $106.55 $106.06 $105.22 $105.68

95.9 94.7 93.1 91.3 89.9 89.3 86.5 86.8 85.3 82.1 81.2 82.2 78.8 79.4 78.8 79.2 78.0 78.1 77.5

a

All are means over sample of 1000.

The potential savings from purchasing the recommended product are extremely large (in excess of 75%). These savings are the highest for smaller sized bundles, which underscores the viability of the proposed recommender system for consumer-oriented shopbots where a consumer's original shopping list may be comprised of only one or a few items. 5. Concluding remarks We have provided a model for computing an optimal purchase plan to procure a bundle of items from competing sellers who offer bundle pricing and promotions. We have also developed a recommender system that suggests additions to the original bundle. Detailed analysis of the models with real-world data on computer-related products from major retailers reveals significant savings for bundle purchases, frequent opportunities to obtain freebies, and additional substantial savings that can result from the recommender system. These findings point to the viability of the proposed models for implementation consideration in current shopbot systems. Since we are not trying to do an extensive empirical test of the effect of bundle purchasing on the saving for consumers, we only collected a small number of bundle promotions during a short period of 10 days for this paper. Savings resulting from our small experimentation should not be used as a general indicator of the true savings that could be realized by consumers in

the real shopbot environment. In addition, before the current bundle searching algorithm can be implemented in the real world it must be further refined. It will require not only the refinements of the algorithm to generate the bundle set B given all formats of bundle promotions, but also the joint efforts between shopbots and retailers with respect to how bundle promotion information should be presented, updated, and made available for shopbots. In general a shopbot needs a large amount of information on various types of bundle promotions in order to be able to make recommendations to shoppers. Currently this information is not available in a way that enables automatic retrieval through current web-crawling technology, nor is it provided by any retailers through direct data feed. In our experimental analysis, the data was collected through a manual process of visiting and extensively browsing retailer websites daily. However, there are secondary sources through which the bundle promotion information could be collected, integrated, and presented in a manner facilitating automatic retrieval. One example of such a secondary source is online deal forums such as www.fatwallet.com and www.dealcoupon.com. These forums possess a large amount of information on retailer promotions and pricing posted by their members. Some forums process the raw input from their members and present it in organized and searchable formats. Bundle purchase shopbots could potentially procure relevant data from such forums as the basis for optimization and recommendation. In

1984

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

addition, if an appropriate mechanism can be developed to provide incentives for retailers to directly feed the deal information into the shopbots, as some retailers are currently doing with traditional shopbots, the issue of data availability can be satisfactorily resolved. Our work can be extended in a number of ways. Our finding that the savings increase with the number of items in the bundle suggests potential benefits from aggregating various consumer purchase plans. Consideration of retailer reputation and substitutable products is also worthy of further study. Finally, while mechanisms to automate retrieval of individual item prices from a retailer exist and are in use by current shopbots, no such mechanisms exist to automate the procurement of bundle prices and promotions. While the wide variety of bundling and bundle promotion strategies used by online retailers pose significant challenges, this can contribute immensely to the popularity and usage of shopbots. Appendix A. Complexity analysis Given the bundle set B, along with the vector of bundle prices, the problem of choosing an optimal subset of B is set covering, which is shown (e.g., [7]) to be NP-Hard. Of course the complexity of CB is a function of the size of the problem in terms of the number of constraints and variables. The former is determined by the cardinality of S0, i.e. the number of items specified by the user. The latter can be bounded by 2|S0 | − 1, since if two bundles contain identical elements of S0, the shopbot will simply choose the cheaper of the two. A.1. Complexity from value discounts Consider a seller who offers a total value discount of α if the buyer spends more than a floor g. That is, the price of a set T of items is 

cðT Þ; if cðT ÞVg ð1  aÞcðT Þ; otherwise

As before let S denote the set of all items offered by the seller and let T0 ≔ S ∩ S0 be those items in the bundle desired by the buyer. Any items purchased in S\T0 have no intrinsic value except for the possible discount. If c (T0) ≥ g the buyer will purchase only the items in T0, so assume that c(T0) < g. Then let X ⊆ S\T0 be the additional items purchased. To get the discount it must be true that

c(X) ≥ g − c(T0). It is also easy to see that X must satisfy αc(T0) ≥ (1 − α)c(X) or else the discount does not outweigh the additional cost of X. Then the resulting problem is min

X

ð7Þ

ci yi

s.t. X

ci yi zg  cðT0 Þ

ð8Þ

iaSqT0

X iaSqT0

ci yi V

a c ð T0 Þ 1a

yi z0 and integer; iaSqT0

ð9Þ ð10Þ

But even as α → 1, so that the constraint (9) becomes inoperative, the remaining problem is a minimization variation of the standard Integer Knapsack Problem, with the special property that the cost coefficients and the constraint coefficients are the same. That special case is known to be NP-Hard (see e.g. [10]). In practice, the hard problem of finding the optimal price for a certain bundle under value discount can be solved satisfactorily by using a “filler” good to make up the gap between the current total price c(T)and the floor g. A “filler” good is an item that is priced at a sufficiently low amount, such as a book mark at 25 cents, and can be used to increase the total price to barely exceed the floor g. There are numerous online bargain forums where information of such filler goods is posted for most major online merchants. These filler goods can also be programmatically retrieved by constructing customized query for the individual merchant. A.2. Complexity from generating the bundle set Determination of the bundle set B to be used in solving CB, and also for the recommender system, is clearly an important issue. In general terms it follows that as the set B is expanded the solutions to CB will improve. Yet, there are two downsides to making |B| large. In general CB becomes more difficult to solve as |B| increases and, in addition, there is the cost of searching the Web to find additional promotions. Finally, the size and structure of B will also be determined by the particular bundle promotions as indicated in Section 2.1. For non-deterministic bundling

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986

there may be multiple elements of B for a given promotion and set S0. The shopbot may simply choose to add only one element to B in these circumstances but, as shown above, choosing the cheapest such element, for at least one type of promotion, is itself an NP-Hard problem. Thus it may be useful to develop heuristic rules for the generation of B. References [1] W.J. Adams, J.L. Yellen, Commodity bundling and the burden of monopoly, Quarterly Journal of Economics 90 (3) (1976) 475–498. [2] G. Adomavicius, A. Tuzhilin, Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions, IEEE Transactions on Knowledge and Data Engineering 17 (6) (2005) 734–749. [3] J. Bailey, Electronic Commerce: Prices and Consumer Issues for Three Products: Books, Compact Discs, and Software. 1998, Organisation for Economic Co-Operation and Development. [4] E. Brynjolfsson, M.D. Smith, Frictionless commerce? A comparison of internet and conventional retailers, Management Science 46 (4) (2000) 563–585. [5] DoubleClick, E-Commerce Site Trend Report Q3 2004. 2004. [6] M.L. Fisher, L.A. Wolsey, On the greedy heuristic for covering and packing problems, SIAM Journal on Algebraic and Discrete Methods (1982) 584–591. [7] M.R. Garey, D.S. Johnson, Computers and Interactability, Freeman, 1979. [8] J.O. Kephart, A.R. Greenwald, Shopbot economics, Autonomous Agents and Multi-Agent Systems 5 (2002) 255–287. [9] G. Linden, B. Smith, J. York, Amazon.com recommendations: item-to-item collaborative filtering, IEEE Internet Computing 7 (1) (2003) 76–80. [10] S. Martello, P. Toth, Knapsack Problems: Algorithms and Computer Implementations, John Wiley & Sons, New York, 1990. [11] A.L. Montgomery, et al., Designing a better shopbot, Management Science 50 (2) (2004) 189–206. [12] J. Nielsen, Designing Web Usability, New Riders Publishing, Indianapolis, IN, 2000. [13] P. Pereira, Do lower search costs reduce prices and price dispersion? Information Economics and Policy 17 (1) (2005) 61–72. [14] Shop.org, Online Sales Skyrocket as Profitability Jumps, According to Shop.org/Forrester Research Study. 2004, Shop. org. [15] H. Simon, G. Wuebker, Bundling—a powerful method to better exploit profit potential, in: G. Wuebker (Ed.), Optimal Bundling: Marketing Strategies for Improving Economic Performance, Springer-Verlag, Berlin, 1999, pp. 7–28. [16] M.D. Smith, The impact of shopbots on electronic markets, Journal of the Academy of Marketing Science 30 (4) (2002) 442–450. [17] M.D. Smith, E. Brynjolfsson, Consumer decision making at an internet shopbot: brand still matters, The Journal of Industrial Economics 49 (4) (2001) 541–558. [18] G.J. Stigler, United States vs. Loew's Inc.: a note on block booking, Supreme Court Review (1963) 152–157.

1985

[19] S. Stremersch, G.J. Tellis, Strategic bundling of products and prices: a new synthesis for marketing, Journal of Marketing 66 (1) (2002) 55–72. Dr. Robert Garfinkel is a professor in the Operations and Information Management Department of the School of Business at the University of Connecticut. His work on a variety of problems in operations research, mainly involving combinatorial optimization, has appeared in such journals as: Operations Research; Management Science; Informs Journal on Computing; Decision Support Systems, and Mathematical Programming. His current research has focused heavily on the problem of optimally balancing valid security concerns against the desire to provide users of a database with valuable information. Other ongoing research streams include: improving efficiency in hospital settings; design of markets for grid computing; construction of recommender systems for shopbots; optimization problems in microfluidic systems; and optimization in vehicle routing. He is also coauthor of the book Integer Programming with George Nemhauser.

Dr. Ram D. Gopal is currently the GE Endowed Professor of Business and an Ackerman Research Scholar. His current research interests are in the areas of data security, privacy and valuation, database management, intellectual property rights and economics of software and music piracy, online market design and performance evaluation, economics of online advertising, technology integration, and business impacts of technology. His research has appeared in Management Science, Operations Research, INFORMS Journal on Computing, Information Systems Research, Journal of Business, Journal of Law and Economics, Communications of the ACM, IEEE Transactions on Knowledge and Data Engineering, Journal of Management Information Systems, Decision Support Systems, and other journals and conference proceedings. He currently serves as the Ph.D. director for the department and is on the editorial board of Information Systems Research, Journal of Database Management, Information Systems Frontiers, and Journal of Management Sciences.

Arvind K Tripathi is an Assistant Professor at the Management Science Department, University of Washington Business School, Seattle. He holds a Ph.D. from the University of Connecticut. His research interests are in online advertising/ marketing, open source software development, and electronic markets. His research has been published or forthcoming in marketing and information systems journals such as Journal of Retailing, Decision Support Systems, Communications of the ACM and IEEE Transactions on Knowledge and Data Engineering. He serves on the editorial board of Journal of Database Management.

1986

R. Garfinkel et al. / Decision Support Systems 42 (2006) 1974–1986 Dr. Fang Yin is an Assistant Professor in the Operations and Information Management Department of the School of Business at the University of Connecticut. His research interests are in business value of IT investment, online sales promotion, shopbots design, and online recommender systems. He has published in MIS Quarterly, Journal of Retailing, Sloan Management Review, and several other journals. He holds a Ph.D. from the University of Texas at Austin.