Store layout optimization using indoor positioning system

5 downloads 0 Views 2MB Size Report
Indoor positioning system, store layout optimization, fingerprinting, wireless localization, positioning .... robot-based or tracking-assistive wireless localization.
Research Article

Store layout optimization using indoor positioning system

International Journal of Distributed Sensor Networks 2017, Vol. 13(2) Ó The Author(s) 2017 DOI: 10.1177/1550147717692585 journals.sagepub.com/home/ijdsn

Hyunwoo Hwangbo1, Jonghyuk Kim1, Zoonky Lee1 and Soyean Kim2

Abstract Indoor positioning systems have attracted considerable attention from practitioners and firms seeking to optimize the consumer shopping experience with the goal of attaining increased revenue and profitability. Acknowledging the importance of indoor positioning systems in store layout optimization, we conducted a field experiment for 11 months in order to develop algorithms for connecting indoor positioning data with customer transaction data. Using fingerprinting as a primary data collection technique, we compared positioning and transaction data before and after critical store layout optimization decisions in order to identify which customer movement patterns generated the highest sales. In contrast to previous works on indoor positioning systems, which focused solely on developing algorithms or techniques to increase accuracy rates, our algorithms in principle integrate computing and marketing perspectives. Our findings can be applied to store layout optimization and personalized marketing. Keywords Indoor positioning system, store layout optimization, fingerprinting, wireless localization, positioning algorithm

Date received: 7 October 2016; accepted: 12 January 2017 Academic Editor: Seong-eun Yoo

Introduction Store layout optimization is intended to optimize consumers’ shopping experiences with the goal of increasing revenue and profitability. By acquiring detailed knowledge in order to develop an understanding of customer movement patterns inside a store, firms can design a store layout that increases shopper traffic, average transaction values, and profitability. Adoption of various Internet of Things (IoT) technologies in brick-and-mortar stores has prompted such changes in the retail industry. In particular, indoor positioning systems (IPSs) enable real-time data collection and customer behavior analysis and allow practitioners to create customer-centric layouts that enhance merchandise and space performance. In this study, we focus on incorporating real-time customer movement pattern and transaction data. Specifically, we identify algorithms that can be used to transform positioning and movement pattern data from an IPS into processed data containing customer insights.

An IPS uses radio waves, magnetic fields, acoustic signals, or other sensory signals relayed through mobile devices in order to assess the locations of objects and people within buildings.1 Existing location-based services, which are based on the satellite-based global positioning system (GPS) or cellular triangulation technology, face limitations in roofed venues due to microwave signal attenuation and scattering by construction materials used for roofs and walls.2,3 However, the locations of users in indoor environments can now be tracked through the increased use of more accurate WiFi, Bluetooth, radio-frequency identification (RFID), 1

Graduate School of Information, Yonsei University, Seoul, Republic of Korea 2 Department of International Studies, College of International Studies, Kyung Hee University, Yongin-si, Republic of Korea Corresponding author: Soyean Kim, Department of International Studies, College of International Studies, Kyung Hee University, Yongin-si 17104, Republic of Korea. Email: [email protected]

Creative Commons CC-BY-NC-ND: This article is distributed under the terms of the Creative Commons Attribution 3.0 License (http://www.creativecommons.org/licenses/by/3.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (http://www.uk.sagepub.com/aboutus/ openaccess.htm).

2 near-field communication, and other close-range wireless technologies. Smartphone’s sensors such as proximity sensors, accelerometers, ambient light sensors, moisture sensors, gyroscopes, and compasses can play a significant role in providing detailed data on the actions of users. Finally, the development of microelectromechanical systems (MEMS) has marked the opening of an Internet of Things era in which sensors are minimized and all objects are placed on the network. The indoor location-based service market is expected to grow from US$935 million in 2014 to US$4424 million by 2019, at an estimated compound annual growth rate of 36.5% over this period.4 The number of active beacons in the United States is expected to increase to 4.5 million by the end of 2018, up from approximately 30,000 in 2015. This anticipated growth is based on the effect of promotions involving beacon technology that are expected to engender a 20-fold increase in customer purchase intent. Based on studies such as a survey on the degree of technology adoption in retailers in which it was determined that 71% of retailers were planning to count people and 68% were considering Wi-Fi in their stores user analysis based on indoor positioning is expected to become more widespread5 and the minimization of trade-offs between localization accuracy and infrastructure costs will become more important.6 In this study, we conducted a field experiment and identified algorithms that both researchers and practitioners can employ in connecting indoor positioning data with customer transaction data to optimize store layouts. Methodologically, we collected positioning and movement pattern data through a wireless sensor network both inside and outside of a store and utilized these data in order to optimize the store’s layout to maximize sales. We used fingerprinting as our main data collection technique because it does not require line-of-sight measurement of access points (APs) and enables highly accurate calculations in complex indoor environments. The main contributions of this research are summarized as follows. (1) While previous works on the IPS focus solely on developing algorithms or techniques to increase the accuracy rate, we examine the processes in which customer positioning can be linked with transaction data. In other words, we examine the algorithms after positioning data collection in order to enable the use of combined data sets for store layout optimization. Although such algorithms can benefit business practitioners, previous work on this topic is surprisingly limited. (2) Furthermore, our research is particularly meaningful because we conducted a longitudinal study based on a field experiment in a real-world store setting. (3) In the field experiment, which was conducted over 11 months, we compared positioning and transaction data before and after a critical store layout optimization decision in order to obtain customer insights.

International Journal of Distributed Sensor Networks The remainder of this article is organized as follows. In section ‘‘Background knowledge and related works,’’ we review related work on the IPS and customer behavior analysis in order to gain insights for application to store layout optimization. In section ‘‘Design details,’’ we describe our research design, which includes experimental settings, a data collection process, and a methodology. Section ‘‘Experimental evaluation’’ details our data analysis process, store layout optimization decision, and experimental results both before and after the optimization decision. In section ‘‘Conclusion,’’ we discuss theoretical and practical implications of our findings as well as future research plans.

Background knowledge and related works In this section, we review related works on the IPS from two different perspectives: radio fingerprinting as a positioning algorithm and wireless local area networks (WLANs) as a radio-frequency (RF) technique. In addition, we examine previous literature that connects insights from the IPS with customer behavior analysis.

Radio fingerprinting as positioning algorithms RF-based localization algorithms are used for the implementation of the IPSs. Such algorithms can be categorized as triangulation, proximity, radio fingerprinting, or dead-reckoning techniques.7,8 Here, we explain radio fingerprinting, which is used in section ‘‘Design details.’’ Radio fingerprinting, a signal distribution technique, involves site surveying to construct fingerprint maps under the assumption that each location has a specific signal characteristic. Various algorithms are then used to map the signals received from the monitoring device onto a fingerprint map.9 Surveys are conducted from an offline training phase and queries are sent from an online position determination phase to carry out signal collection and indoor location association.10 Site surveys from the offline phase are conducted to collect the Wi-Fi signal vectors of the received signal strength indicators (RSSIs), which are collected from multiple APs located at reference points (RPs) with known locations.11 In the online position determination phase, data related to target objects are measured and compared against pre-measured data collected during the offline phase. Fingerprinting uses deterministic methods and statistical methods to identify targets at the closest fingerprint location in non-line-of-sight conditions. Deterministic methods include Euclidean distance, cosine similarity, Tanimoto similarity, k-nearest neighbor, support vector machine, linear discriminant analysis, and so on, while the statistical methods include

Hwangbo et al. Bayesian networks, expectation–maximization, Gaussian processes, smallest M-vertex polygons, and so on.11,12 Fingerprinting is advantageous because it does not require line-of-sight measurement of APs and provides highly accurate calculations in complex indoor environments.13 Temporal and spatial signal patterns, collaborative localization, and motion-assisted localization can be used to correct location errors that result from signal fluctuations in RFs. Examples of temporal patterns include peak-based Wi-Fi fingerprinting and Walkie-Markie, while examples of spatial pattern include UnLoc, HALLWAY, and Wi-Fi Signal Coverage Intersection and Division. Motion-assisted localization techniques that use inertial sensors in various devices include Zee, XINS, Graph-Fusion, HMM Fusion, Moloc, and MapCraft.11

WLAN as RF-based IPSs In object detection, a customer’s position and movements are tracked to obtain indoor location analytics. This involves two techniques: RF positioning and video analysis. Video analysis is implemented through highly specialized cameras that monitor movement. In this research, RF positioning was implemented using mobile apps or Wi-Fi-enabled devices that interact with sensors.14 RF-based indoor localization devices can use GPS, WLAN, ultra-wide band (UWB), RFID, or Bluetooth. We explain WLAN used in this study in section ‘‘Design details.’’ WLAN based on the IEEE 802.11 network operates in 2.4-GHz industrial, scientific, and medical bands and is among the most commonly used IPS techniques. Also called Wi-Fi, WLAN uses gross bit rates of 11, 54, or 108 Mbps and has a 50- to 100-m measurement range. Because it uses APs that are already installed in many buildings to enable Internet services and there is consequently no need to install extra infrastructures such as sensors for identifying locations, Wi-Fi is costeffective and as such is most commonly used in indoor localization systems.15 Wi-Fi-based positioning systems use the media access control (MAC) addresses of wireless service set identifiers and wireless APs for indoor positioning. Localization using WLAN is often based on received signal strength (RSS) and radio fingerprinting techniques. Because the accuracy of WLAN positioning system using RSS is approximately 3–30 m and updating occurs in cycles of several seconds, they have the disadvantages of low accuracy against dynamic variations in indoor environments and time delays caused by the time consumed in providing positioning data.16 To resolve this issue, many researchers have studied robot-based or tracking-assistive wireless localization based on the IEEE 802.11 infrastructures.17 Microsoft RADAR,18 Horus,19 DIT,20 MultiLoc,21 TIX,22

3 OntoNav,23 NAVIO,24 Ekahasu, and COMPASS9 are examples of IPS applications using WLAN. Khaoampai et al.15 and Marques et al.25 used a selflearning algorithm to create Wi-Fi fingerprints on separate floors as part of a floor localization system that measures positions within an indoor environment with at least two floors (applications of their method include FloorLoc-SL).

Customer behavior analysis and store layout optimization Understanding and leveraging the various touchpoints of a ‘‘moment of truth’’ during a customer’s shopping experience is an important task in the retail industry. Previous studies on this subject in various industries have emphasized the importance of indoor location analysis.2,4,8,26–29 Indoor location analysis involves analyzing a customer’s behavior in brick-and-mortar stores with the goal of enhancing proximity marketing. Similar to the manner in which companies analyze online web-browsing on e-commerce sites, indoor location analysis involves the tracking and analysis of in-store movements in brick-and-mortar stores with the goal of enabling customer behavior analysis. In this process, businesses conduct path analyses that can be applied to store layout design. Using the above processes, retail stores can view layout designs in which the company’s most strategic products are directly displayed in customers’ ranges of interest. Store managers can also determine the times and locations with the highest customer traffic to enable efficient store staff allocation. Furthermore, companies can increase sales rates through duration time analyses. The trend in which a positive correlation exists between visit durations and actual purchasing30 can be even more apparent for particular store zones and products within those zones. Based on this knowledge, store managers can identify methods to increase the sales rates by responding to customers after acquiring information on customer interest based on visit duration instead of directly engaging to the customer at the start of a visit. This study acknowledges the importance of indoor positioning analysis in store layout optimization and thereby connects customer positioning data with transaction data in order to identify customer movement patterns that generate the highest sales. Here, we have strived to integrate the computing perspective with a marketing perspective because we believe that the use of indoor location analysis in conjunction with sales analysis can effectively optimize store layout and personalized marketing.31 Prior research on store layout optimization using the IPS is surprisingly limited. The extant literature focuses

4

International Journal of Distributed Sensor Networks

Table 1. Studies on indoor positioning using floor plan and customer behavior analysis. Authors

IPS

Localization technique

Wireless technology

Floor plan

Customer behavior analysis

Store layout optimization

Wu et al.32 Sen et al.33 Ledlie et al.34 Zhang et al.35 De Marco et al.36 Kholod et al.37 Takai and Yada30 Pierdicca et al.38 Xiaoling et al.39 Popa et al.40 Our study

LiFS Zee Mole´ GROPING – – – Estimote – – –

Fingerprinting Fingerprinting Fingerprinting Fingerprinting Proximity Proximity Proximity Proximity Computer vision Computer vision Fingerprinting

WLAN RSS WLAN RSS WLAN RSS WLAN RSS RFID RFID RFID Bluetooth beacon CCTV CCTV WLAN RSS

Yes Yes No No No No No No No No Yes

No No No No Yes Yes Yes Yes Yes Yes Yes

No No No No No No No No No No Yes

IPS: indoor positioning system; WLAN: wireless local area network; RSS: received signal strength; RFID: radio-frequency identification; CCTV: closed-circuit television.

on increasing the accuracy rate of positioning by developing new algorithms or tools. However, the IPS has been significantly improved through the use of sensors (e.g. LiFS, Zee, Mole´, GROPING) in smartphones, and it is difficult to find research that covers both location positioning techniques (e.g. Wi-Fi fingerprinting) and store layout optimization. Several researchers have attempted to compare different positioning techniques and applied their findings to floor planning; however, to date, no one has examined the use of the IPS for store layout optimization. Table 1 presents the previous studies that implemented floor plans or performed customer behavior analysis using the localization technique. Specifically, a number of studies combined Wi-Fi RSS fingerprinting and floor plans established through site surveys to build IPSs. Recent studies such as LiFS,32 Zee,33 Mole´,34 and GROPING35 introduced IPSs that overcome the shortcomings of time-consuming and labor-intensive site surveys through crowdsourcing using sensors embedded in smartphones such as accelerometers, gyroscopes, and compasses. In addition, some studies have analyzed customer movement paths or performed customer behavior analysis based on collected sensor data. De Marco et al.36 analyzed the store operations records of Italian apparel retailers using RFID technology, and Kholod et al.37 examined the effects of shopping path length in grocery stores on purchase behavior using RFID data. Takai and Yada30 used RFID data in a Japanese supermarket to derive the relationship between stay-time and purchase probability. Pierdicca et al.38 suggested a persuasive retail environment by installing Bluetooth beacons in showrooms of Italian fashion retailers and using collected data for customer behavior analysis. Meanwhile, Xiaoling et al.39 examined video tracking data using path analysis and a hierarchical Bayesian model and studied the effects of

interactive social influence (salesperson contact, whining-group discussion) and shopper density (crowding) on purchase. Popa et al.40 collected data through closed-circuit television systems in shopping malls using computer vision technology applications such as motion detection, background modeling, object detection, face detection, and object tracking, and applied the results to shopping behavior analysis.

Design details Experimental settings and data collection Our field experiment tracked movements of customers who made direct purchases. In the field experiment, the store was divided into five sections (right-enter (RE), right-inner (RI), left-enter (LE), left-inner (LI), and counter-AP). We assigned the MAC addresses of each customer to one of these five sections and examined instore movements for 3 months starting from August 2015 (Round 1), and for 9 months starting from November 2015 (Round 2). The right side of the store included women’s classics, casual clothing, bags, and accessories, while the left side primarily contained men’s clothing. If the Wi-Fi of a visiting customer’s mobile device was activated, the MAC address of the mobile device was detected. We could track movements of 3647 visitors, which was about half the number of visitors estimated to have visited the store during the given period. The total number of visitors was estimated based on the historical average of daily visitors (approximately 100 visitors/day). Under the assumption that most visitors had a mobile device when they entered the store, we found that about 50% of them activated Wi-Fi. This finding aligns with the Wi-Fi activation index ratio in a recent study related to mobile devices.

Hwangbo et al.

5 For the analysis, however, we used data from only 3151 customers by excluding customers who left the store after visiting only one section. As shown in Figure 1, customers passed the RE section of the store and stopped at the final counter-AP to make purchases. They then exited through the RE section, where the door was located. Therefore, we could connect the movement pattern data with sales data for each customer because we matched the transaction time of the sales data with the time data of the counter-AP positioning data. However, we could not assume that the customer would always be detected at the RE section because they could pass the entrance and RE section too quickly not allowing the sensor to detect them.

Figure 1. Store layout with five different APs.

Methodology Among the 3647 visitors, we excluded customers who left the store after visiting only one section, and we only tracked customers who exhibited movement patterns between at least two sections, as our objective was to identify and compare movement patterns. Thus, we conducted our final analysis with the remaining 1050 visitors (Round 1). Based on our analysis on positioning and transaction data collected for the initial 3 months (August to October 2015), we rearranged the store layout for 9 months starting from 18 November 2015 (Round 2) using the same method. During Round 2, we collected data from 11,455 visitors whose Wi-Fi was activated.

One of the two main objectives of this research was to develop algorithms that convert frequency sensor data from in-store fingerprint measurements into customer movement pattern data (Algorithm 1). The goal was to identify the algorithms that connect customer movement pattern data with transaction data (Algorithms 2 and 3). We performed this step to derive the amount of sales for each movement pattern. The second objective was to identify algorithms often used in marketing to convert transaction data into binary notations in order to effectively reduce the number of association rules for measuring support and confidence values to facilitate

Algorithm 1. Transformation of sensing raw data to processed data. add the TIME_GROUP column to the sensing raw data table, IFC_RAW. create the IFC_GROUP table with grouping by the TIME_GROUP and the DEVICE_ID from IFC_RAW table add the strongest RSSI, SQUARE_MAC_ID columns to each record. sort by DEVICE_ID, TIME_GROUP from IFC_GROUP table create PROCESSED_DATA table from IFC_GROUP table by for each DEVICE_ID TIME_SQUENCE = 1 for each TIME_GROUP if PREVIOUS_SQUARE_MAC_ID is null then DWEL_TIME = 15 else if PREVIOUS_SQUARE_MAC_ID = CURRENT_SQUARE_MAC_ID DWELL_TIME + = 15 else insert DEVICE_ID, PREVIOUS_SQUARE_MAC_ID, DWELL_TIME, TIME_SQUENCE TIME_SQUENCE + = 1 DWELL_TIME = 15 end if PREVIOUS_SQUARE_MAC_ID = CURRENT_SQUARE_MAC_ID next TIME_GROUP insert DEVICE_ID, PREVIOUS_SQUARE_MAC_ID, DWELL_TIME, TIME_SQUENCE next DEVICE_ID

6

International Journal of Distributed Sensor Networks

Algorithm 2. Add sales data on processed data. create SALES_MOVEMENT table with inner join SALES_DT table and MOVEMENT_DT table on SALES_DATETIME column between MOVEMENT_DATETIME and MOVEMENT_DATETIME + TIME_INTERVAL(180 s) remove the records except only the maximum SALES_DATETIME records with grouping by MOVEMENT_DT from SALES_MOVEMENT table remove the records except only the maximum MOVEMENT_DATETIME records with grouping by SALES_DATETIME from SALES_MOVEMENT table

Algorithm 3. Create a paired movement pattern data set. sort by DEVICE_ID, SQUARE_MAC_ID of PROCESS_DATA table create PAIRED_TRAFFIC_LINE_DATA table with inner join PROCESSED_DATA table as P1 and PROCESSED_DATA table as P2 on P1.DEVICE_ID = P2.DEVICE_ID and P1.TIME_SQUENCE + 1 = P2.TIME_SEQUENCE add column P1.SQUARE_MAC_ID as CURRENT_MAC_ID add column P2.SQUARE_MAC_ID as REAR_MAC_ID

Algorithm 4. Measuring value of support and confidence. confidence(X ! Y) = count (CURRENT_MAC_ID = X and REAR_MAC_ID = Y)/count (CURRENT_MAC_ID = X) directional support (X ! Y) = count (CURRENT_MAC_ID = X and REAR_MAC_ID = Y)/count (all records) directional support (Y ! X) = count (CURRENT_MAC_ID = Y and REAR_MAC_ID = X)/count (all records)

the comparison of sales across customer movement patterns (Algorithm 4). To achieve the above objectives, we installed sensors that can detect mobile device IDs within each section of a particular clothing section in a major department store with high customer traffic. After the customer movement pattern was assessed and connected to sales data and the relationship between the movement pattern and sales was analyzed, the main circulation of the store was rearranged alongside a redisposition of the clothing display. In this research, we hypothesized that doing this would ultimately produce a critical improvement in sales. Although the actual data processing was implemented using an SAS algorithm, we use pseudo-code below to describe the process and to ensure code readability. Pseudo-code is an informal high-level description of the operating principle of a computer program or other algorithm. It uses the structural conventions of a normal programming language, but is designed for human rather than machine reading. Specific steps of our process are described in the following. Algorithm 1. First, we collect data through in-store signal sensor devices under the assumption that the sensor

devices in the store detected the activated mobile signals of customers. We then use the fingerprinting technique to create signal detection term units according to RSSI values repeatedly detected in each section. According to this technique, the value with the highest strength in each term is expressed in terms of distance from the device (Table 3). Following the equation below, we can infer that a minimum value indicates the strongest signal in the time zone and therefore serves as evidence that the customer is located in the given section at the given time. In the equation, j represents the signal strength of each section and i represents each time period (signal detection term unit). The distance from the section device to the customer position is derived from the gap between each section’s average signal strength and the signal strength of a specific time zone. In particular, the RSSI value represents the degree of separation from the centric device signal. Smaller negative and absolute values of the RSSI mean that the centric signal is close, and that the strength is high. The localization algorithm below shows that each value is multiplied and then squarerooted to offset any negative effect. The section with the minimum value is considered to be the location of the customer.

Hwangbo et al.

7

Table 2. Variable description.

Sensor data

Processed data

Variable name

Description

TS SQUARE_MAC_ID DEVICE_ID RSSI DEVICE_ID SQUARE_MAC_ID DWELL_TIME TIME SEQUENCE

Time stamp Sensing device location Mobile identification number Received signal strength indication Mobile identification number Sensing device location Dwell time Sequence

Table 3. Example of sensing raw data. TS

SQUARE_MAC_ID

DEVICE_ID

RSSI

2015-08-26 23:02:06.772 2015-08-26 23:02:09.247 2015-08-26 23:02:09.635 2015-08-26 23:02:10.099 2015-08-26 23:02:10.967

f4fd2b1052ce f4fd2b1052ce f4fd2b1052aa f4fd2b1052c6 f4fd2b1052c2

ffac2219d7289486a20db4cc6c8b2e31 ffac2219d7289486a20db4cc6c8b2e31 8bdc459bfb09125231f9e073caf233e1 8bdc459bfb09125231f9e073caf233e1 d3c1f691cb9239b03a6512383da6c8da

275 272 263 259 270

Table 4. Example of processed data. DEVICE_ID

SQUARE_MAC_ID

DWELL_TIME

TIME_SEQUENCE

ffac2219d7289486a20db4cc6c8b2e31 ffac2219d7289486a20db4cc6c8b2e31 8bdc459bfb09125231f9e073caf233e1 8bdc459bfb09125231f9e073caf233e1 8bdc459bfb09125231f9e073caf233e1 d3c1f691cb9239b03a6512383da6c8da

f4fd2b1052ce f4fd2b1052aa f4fd2b1052ce f4fd2b1052c6 f4fd2b1052c2 f4fd2b1052aa

00:30 00.45 01:15 01:30 01:45 00:30

1 2 1 2 3 1

2 Min4Di =

n  X

RSSIj  RSSIij

2

!1=2 3 5 (i = 1, 2, 3, . . . , m)

j=1

Next, we transform the collected sensor data into movement pattern data. The raw data in Table 3 show the RSSI values corresponding to a customer’s mobile signal (DEVICE_ID) during a specific time (TS) in a specific section (SQUARE_MAC_ID). The sensor data set is translated into the set in Table 4 using the query shown in Algorithm 1. The TIME_SEQUENCE is formed using the path through a specific section for each DEVICE_ID. To ensure that the customer remains in one section for a sufficient amount of time, the appropriate dwell time is set to 15 s.

Algorithm 2. Next, we connect the customer movement pattern for five sections of the processed data set with the store’s sales point of sale (POS) data. To connect these two data sets, the sales POS data occurring within

180 s of a customer’s dwell time in the counter-AP section are considered to be a customer’s sales record based on the assumption that the customers enter through RE, make purchases at counter-AP, and then exit through RE. Specifically, customer behavior is assumed to be consistent with a customer remaining for at least the minimum term at the counter-AP section followed by generation of sales POS data within 180 s thereafter (a key assumption of this research). Algorithm 2 therefore generates a new data set. By connecting the sales data produced in this manner, we found that during the first 3 months of our assessment (Round 1), only 265 out of 1050 customers with at least two pairs of movement patterns had a sales record. In other words, the remaining 785 customers could be assumed to have left the store after window shopping. This ratio greatly exceeds the customer sales ratio based on the store’s statistical floating population. However, the sales records of Wi-Fi deactivated customers were not available for this research (on average, about 40% of the population always activate their

8

International Journal of Distributed Sensor Networks

Table 5. Sales by movement: before the rearrangement event. Movement

Total sales ($)

Total sales ratio (%)

Unit sales ($)

Unit sales ratio (%)

Support (%)

Confidence (%)

LE ! LI LE ! RE LE ! RI LI ! LE LI ! RE LI ! RI RE ! LE RE ! LI RE ! RI RI ! LE RI ! LI RI ! RE Total

2,001 1,575 816 3,243 1,955 1,732 1,892 2,632 4,579 1,535 2,475 1,279 25,714

7.8 6.1 3.2 12.6 7.6 6.7 7.4 10.2 17.8 6.0 9.6 5.0 100.0

20 11 23 23 9 19 15 10 16 36 25 8 215

9.3 5.0 10.5 10.8 4.1 8.7 6.9 4.8 7.6 16.6 11.7 3.9 100.0

3.8 4.7 2.2 6.4 7.3 7.1 4.6 8.8 16.3 4.2 8.3 4.9

22.9 28.2 13.0 24.3 27.7 26.7 15.3 29.2 53.8 15.9 31.4 18.4

Wi-Fi), and the customers who travel less than two sections were excluded for research purposes. Thus, we can extrapolate our raw purchasing customer ratio to about 5%–7% of the overall traffic, which appears to be very realistic. Indeed, according to data collected in September and October 2015, 7% of all customers were shown to have made purchases. Algorithm 3. To convert the data sets formed based on per-person basis (DEVICE_ID basis) into data sets based on each customer’s movement pattern, we organize the data by grouping the two sections into one ordered pair. Because we have to include customers who have not gone through sales to make a comparison, we group the remaining four sections except counter-AP. Based on this, 12 movement patterns were generated (Table 5). Probabilistically, six combinations combine two out of four points is six; however, in this case RE ! LE and LE ! RE are different ordered pairs, that is, different combinations because they have directionality. For the 3 months before the store layout rearrangement (Round 1), the total number of paired movement patterns was 1753, and the number of movement patterns that resulted in sales (for 265 people) was 715. There were 1038 paired movement patterns for the remaining 785. For the 9 months after the store layout rearrangement (Round 2), there were 5256 total movement patterns, with 2146 movement patterns for 792 people who made purchases, and 3110 movement patterns that did not result in sales for the remaining 2359 people. Algorithm 4. We apply the sequential association rule, which is a technique often used in marketing, in order to compare sales and movement sequences for each movement pattern in a more scientific way. The association rule measures support and confidence according

to rules of association for each product purchased by a customer. It is ultimately used as a marketing technique to decide on a cross-selling or positioning policy. Sequential association is a data mining technique that explores hidden patterns in data, and it can be considered to be a method that mines relationships between events that occur alongside a phenomenon or determines how a prior event suggests another event. In finding an X ! Y type rule between items X and Y, it shows a correlation in which there is a high chance that Y will appear when item X with a previous attribute appears. Support and confidence are commonly used as the representative index that quantifies the degree of this association rule, and each index is defined as follows Support =

#T ½X \ Y = Pr(X \ Y) N

Confidence =

#T ½X [ Y = Pr(YjX) #T ½X

Directional support =

#T ½X ! Y or #T ½Y ! X N

where N is the number of total transactions, X is the number of transactions of section X, and Y is the number of transactions of section Y. Support represents the probability that items X and Y will simultaneously appear, as shown in the above formula. Support is mutually symmetrical, with support for Rule X ! Y and support for Rule Y ! X having the same value. Confidence is the conditional probability that Y will occur when X. 0  S  1 and 0  C  1 can be established, and the association rule X ! Y shows the probability that Y is established when condition X is established. This research expands on the practice of simply examining the correlation between items in a transaction. We have redefined the above association rule to

Hwangbo et al. analyze the spending pattern of a customer according to the passing of time and how this change affects sales. In other words, we did not merely use data composed of transactions, items, and sections; rather, we analyzed the purchasing behavior through a customer’s sequential path based on information regarding time sequence. We then rearranged the movement patterns of high sales routes for each path in the store for marketing purposes of creating a plan to increase sales. Another essential part of this research was analysis of spending behaviors exhibited by customers with the same movement pattern but with different directions. Thus, we redefined existing non-directional support using directional support. Algorithm 4 is a query for inferring directional support and confidence before and after the point of rearrangement. Its key point is that the target is the SECTION variable, and that a sequential path is deduced by forming a new TIME_SEQUENCE variable, which is the sequence of this variable. Our approach is of interest as only a few prior studies have connected sales data with movement pattern data deduced through a path algorithm. By applying the sequential association rule, we found that spending patterns change according to changes in customer movement sequences (details in section ‘‘Experimental evaluation’’). This finding is meaningful in marketing as well as in technical field studies involving the use of mobile signals rather than the common practice of evaluating simple traffic associations through volume.

Experimental evaluation Results before store layout arrangement (Round 1) Using the movement pattern and transaction data collected for 3 months starting in August 2015 (Round 1), we compared the unit sales of each movement sequence and calculated the total sales of each sequence. In addition, we estimated support and confidence values for each sequence. The results acquired through the methodology mentioned above are shown in Table 5. Prior to the store layout rearrangement, the RE to RI movement pattern, which had extremely high support and confidence values, had a unit sales ratio of barely 7.6%. On the other hand, routes that included the men’s casual or men’s accessories section, such as LE to LI, LE to RI, or RE to LE, did not have very high traffic, but had considerably high unit sales ratios. A process miner was used to track the reverse trajectory of a movement sequence that resulted in the most sales in order to determine the tendencies produced by movement patterns. To achieve this goal, we assumed a sequence for each section in which all customers entered the store through the RE section before the rearrangement. The sequence started from RE and flowed in

9

Figure 2. Process miner map: before the rearrangement event (Round 1).

Figure 3. Highest sales generating pattern: before the rearrangement event.

stages. Before the rearrangement, there were up to 11 stages in the sequence, with the fourth stage having the most sales. As shown in Figures 2 and 3, each traffic sequence ended when the customer left the store after a sale (CA) or when they left the store without a sale (OT). If not, the sequence would continue onto a lower stage. All customers entered the store through RE. The most common sequence, with the highest number of people (485), was to move to LI; the red path in the process miner map shows this route. In the sequence describing the next lower stage, only one person made a purchase and then immediately exited, while 126 people exited the store without showing any purchase behavior. The remaining customers navigated toward the third stage again, and the majority (43 people) navigated to RI. It is evident that before the rearrangement, most sales occurred during the fourth stage, and we therefore analyzed the stream of the reversed route. Figure 3 shows the sequence with the highest sales to traffic ratio before the rearrangement, which was RE to RI, that is, the women’s section.

Results after store layout arrangement (Round 2) Based on the results we obtained from the analysis in Round 1, we rearranged the store layout by opening an LE path, that is, to the men’s casual or men’s accessories section that had a high sales ratio despite having a lower traffic index of support and confidence. Furthermore, we moved the mannequins and the

10

International Journal of Distributed Sensor Networks

Table 6. Sales breakdowns by route after the rearrangement event. Movement

Total sales ($)

Total sales ratio (%)

Unit sales ($)

Unit sales ratio (%)

Support (%)

Confidence (%)

LE ! LI LE ! RE LE ! RI LI ! LE LI ! RE LI ! RI RE ! LE RE ! LI RE ! RI RI ! LE RI ! LI RI ! RE Total

8,107 4,525 9,613 15,928 6,878 8,445 4,982 8,622 11,740 7,580 11,545 3,232 101,197

8.0 4.5 9.5 15.7 6.8 8.3 4.9 8.5 11.6 7.5 11.4 3.2 100.0

24 14 33 27 10 22 17 12 16 34 28 9 247

9.5 5.6 13.5 11.1 4.1 9.1 6.9 4.9 6.7 13.9 11.3 3.5 100.0

8.3 3.9 6.2 8.1 6.3 7.5 4.0 8.5 12.8 6.0 9.3 4.1

34.2 20.0 32.0 29.8 23.3 27.8 16.0 33.8 80.9 21.2 33.0 14.7

Changes in customer movement patterns (Round 1 vs Round 2)

Figure 4. Highest sales generating movement pattern: after the rearrangement event.

displays in the LE path for path rearrangement and the store underwent construction for 3 days starting on 18 November 2015 to achieve the objective of increasing store sales. We then observed changes in customer movement patterns for 9 months (Round 2). We examined sales by movement sequences after rearrangement and calculated support and confidence for each sequence using the same method as in Round 1. The results are shown in Table 6, from which we observe an apparent increase in the overall sales ratio of the LE path. The Process Miner was used to track the reverse trajectory of the sequence with the highest sales in order to identify meaningful tendencies in the movement sequence. In this case, there were up to 12 stages, with the fifth stage having the most sales; this was one stage lower than prior to rearrangement. As shown in Figure 4, each movement sequence ended when the customer exited the store after a purchase (CA) or when the customer exited the store after only window shopping (OT). In other cases, the sequence continued to a lower stage. The analysis proves that after rearrangement, the movement sequence of customers who entered through LE was significantly higher.

Figure 5 shows the map created based on support and confidence to represent the customer movement frequency in order to show the difference between customer movement patterns before and after rearrangement. In Round 1, the RE section was the only entrance, which resulted in high traffic in the women’s apparel section. While women’s classics at RI (Women 1) and women’s casual at RE (Women 2) had the highest traffic, there was almost no traffic toward men’s clothing, especially men’s casual at LE (Men 2). Based on this, in Round 2, we decided to open the LE section as an additional entrance to the store. We then observed changes in customer movement patterns after the rearrangement event for approximately 9 months (Round 2) using the same method. It is evident that a new pattern developed around LE, as shown in Figure 5, in which the support and confidence, corresponding to how frequently customers moved to a given section, are represented by the thickness of each route. Changes in traffic share are also evident, as shown in Figure 6. In conclusion, we rearranged the store layout by identifying a movement pattern with a high unit sales ratio despite having lower support and confidence values. In other words, prior to store layout rearrangement, movements between men’s apparel sections were limited and were distributed unevenly. Thus, an optimization plan was devised to allow a balanced distribution of customer movements between sections. Unlike RE and RI, which had extremely high support and confidence values before the rearrangement event while having lower unit sales, LE showed low support and confidence values before store layout rearrangement. However, the opening of LE as an additional entrance resulted in a remarkable increase in sales for the men’s section, as shown in Figure 7.

Hwangbo et al.

11

Figure 5. Changes in customer movement patterns (Round 1 vs Round 2).

Figure 6. Changes in customer traffic composition (Round 1 vs Round 2).

Conclusion IPSs have attracted considerable attention from both researchers and business practitioners as a marketing technique that can be used to design store layouts that increase shopper traffic, average transaction values, and profitability. Acknowledging the importance of IPSs in store layout optimization, we conducted a field experiment for 11 months and then identified algorithms that connected indoor positioning data with customer transaction data. We also compared positioning and transaction data before and after a critical store layout optimization decision in order to identify customer movement patterns that generated the highest sales.

Our findings are meaningful from a theoretical perspective in that they integrate computing and marketing perspectives, which enhances previous work on IPSs that focused solely on developing algorithms or techniques to increase the accuracy rate. Unlike the existing research, this study examined the processes in which customer positioning can be linked with transaction data, thereby analyzing the relationship between customer movement patterns and sales. From a practical perspective, our findings can be used for store layout optimization and personalized marketing. Increased interaction with customers and analytical insights into their behavior patterns, such as movement patterns, will help retailers optimize their

12

International Journal of Distributed Sensor Networks

Figure 7. Sales breakdowns.

strategies and increase returns on investment. Furthermore, using an IPS can help retailers in developing real-time personalized offers and allocating in-store staff members more efficiently. From a consumer’s perspective, an IPS can inform them about the shortest routes needed to fulfill their shopping lists and can help them discover products in addition to shops and facilities in a mall. This work can therefore serve as a guideline for the development of a diversity of practical applications of IPSs in the future. Declaration of conflicting interests The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding The author(s) received no financial support for the research, authorship, and/or publication of this article.

References 1. Curran K, Furey E, Lunney T, et al. An evaluation of indoor location determination technologies. J Locat Based Serv 2011; 5: 61–78. 2. Yaeli A, Bak P, Feigenblat G, et al. Understanding customer behavior using indoor location analysis and visualization. IBM J Res Dev 2014; 58(3): 3.1–3.12. 3. Ilarri S, Hermoso R, Trillo-Lado R, et al. A review of the role of sensors in mobile context-aware recommendation systems. Int J Distrib Sens Netw 2015; 11: 1–30.

4. MarketsandMarkets. Indoor location market by solution (tag-based, RF-based, sensor-based), by application (indoor maps & navigation, indoor location-based analytics, tracking & tracing, monitoring & emergency management), by service, by vertical, & by region–global forecast upto 2019, Dallas, TX, 2014, http://www.market research.com/product/sample-8517826.pdf 5. Harkins H. Real analytics: whats’ in-store? Brickstream global retailer survey. Metro Manila, Philippines: Integrated Computer Systems Inc., 2014. 6. Higuchi T, Fujii S, Yamaguchi H, et al. Mobile node localization focusing on stop-and-go behavior of indoor pedestrians. IEEE T Mob Comput 2014; 13: 1564–1578. 7. Bejuri WM, Wan Y, Mohamad M, et al. Ubiquitous positioning: a taxonomy for location determination on mobile navigation system (arXiv preprint arXiv: 11035035), 2011, https://arxiv.org/abs/ 1103.5035 8. Fallah N, Apostolopoulos I, Bekris K, et al. Indoor human navigation systems: a survey. Interact Comput 2013; 25: 21–33. 9. King T, Kopf S, Haenselmann T, et al. COMPASS: a probabilistic indoor positioning system based on 802.11 and digital compasses. In: Proceedings of the 1st international workshop on wireless network testbeds, experimental evaluation & characterization, Los Angeles, CA, 29 September 2006. pp.34–40. New York: ACM. 10. Gu Y, Lo A and Niemegeers I. A survey of indoor positioning systems for wireless personal networks. IEEE Commun Surv Tutor 2009; 11: 13–32. 11. He S and Chan SHG. Wi-Fi fingerprint-based indoor positioning: recent advances and comparisons. IEEE Commun Surv Tutor 2016; 18: 466–490.

Hwangbo et al. 12. Xiao Z, Wen H, Markham A, et al. Non-line-of-sight identification and mitigation using received signal strength. IEEE T Wirel Commun 2015; 14: 1689–1702. 13. Li Q, Li W, Sun W, et al. Fingerprint and assistant nodes based Wi-Fi localization in complex indoor environment. IEEE Access 2016; 4: 2993–3004. 14. Contributor SB. The future of retail: 4 trends bringing the virtual experience into reality [Infographic]. Forbes, 2015, http://www.forbes.com/sites/samsungbusiness/2015/01/13/ the-future-of-retail-4-trends-bringing-the-virtual-experienceinto-reality-infographic/#16998161292e 15. Khaoampai K, Na Nakorn K and Rojviboonchai K. FloorLoc-SL: floor localization system with fingerprint self-learning mechanism. Int J Distrib Sens Netw 2015; 11: 1–20. 16. Li N, Chen J, Yuan Y, et al. A Wi-Fi indoor localization strategy using particle swarm optimization based artificial neural networks. Int J Distrib Sens Netw 2016; 12: 1–9. 17. Ladd AM, Bekris KE, Rudys AP, et al. On the feasibility of using wireless ethernet for indoor localization. IEEE T Robot Autom 2004; 20: 555–559. 18. Bahl P and Padmanabhan VN. RADAR: an in-building RF-based user location and tracking system. In: Proceedings of the INFOCOM 2000 nineteenth annual joint conference of the IEEE Computer and Communications Societies, Tel Aviv, Israel, 26–30 March 2000, pp.775– 784. New York: IEEE. 19. Youssef M and Agrawala A. Handling samples correlation in the Horus system. In: Proceedings of the INFOCOM 2004 twenty-third annual joint conference of the IEEE Computer and Communications Societies (vol. 2), Hong Kong, China, 7–11 March 2004, pp.1023–1031. New York: IEEE. 20. Brunato M and Battiti R. Statistical learning theory for location fingerprinting in wireless LANs. Comput Netw 2005; 47: 825–845. 21. Prasithsangaree P, Krishnamurthy P and Chrysanthis P. On indoor position location with wireless LANs. In: Proceedings of the 13th IEEE international symposium on personal, indoor and mobile radio communications (vol. 2), Lisboa, 18 September 2002, pp.720–724. New York: IEEE. 22. Gwon Y and Jain R. Error characteristics and calibration-free techniques for wireless LAN-based location estimation. In: Proceedings of the second international workshop on mobility management & wireless access protocols, Philadelphia, PA, 1 October 2004, pp.2–9. New York: ACM. 23. Tsetsos V, Anagnostopoulos C, Kikiras P, et al. Semantically enriched navigation for indoor environments. Int J Web Grid Serv 2006; 2: 453–478. 24. Retscher G and Thienelt M. NAVIO: a navigation and guidance service for pedestrians. J Glob Position Syst 2004; 3: 208–217. 25. Marques N, Meneses F and Moreira A. Combining similarity functions and majority rules for multi-building, multi-floor, WiFi positioning. In: Proceedings of the international conference on indoor positioning and indoor

13

26.

27. 28. 29.

30.

31.

32.

33.

34.

35.

36.

37.

38.

39.

40.

navigation (IPIN), Sydney, NSW, Australia, 13–15 November 2012, pp.1–9. New York: IEEE. Zheng Y, Zimu Z and Yunhao LIU. From RSSI to CSI: indoor localization via channel response. ACM Comput Surv 2013; 46: 25–32. Li X and Alsindi N. Recent advances in indoor geolocation techniques. Int J Wirel Inf Netw 2013; 20: 243–245. Yada K. String analysis technique for shopping path in a supermarket. J Intell Inf Syst 2011; 36: 385–402. Liu H, Darabi H, Banerjee P, et al. Survey of wireless indoor positioning techniques and systems. IEEE T Syst Man Cyb C: Appl Rev 2007; 37: 1067–1080. Takai K and Yada K. Relation between stay-time and purchase probability based on RFID data in a Japanese supermarket. In: Setchi R, Jordanov I and Robert J (eds) Knowledge-based and intelligent information and engineering systems. Berlin: Springer, 2010, pp.254–263. Ogino A, Kobayashi T, Iida Y, et al. Smart store understanding consumer’s preference through behavior logs. In: Aykin N (ed.) Internationalization, design and global development. Berlin: Springer, 2011, pp.385–392. Wu C, Yang Z and Liu Y. Smartphones based crowdsourcing for indoor localization. IEEE T Mob Comput 2015; 14: 444–457. Sen S, Radunovic B, Choudhury RR, et al. You are facing the Mona Lisa: spot localization using PHY layer information. In: Proceedings of the 10th international conference on mobile systems, applications, and services, Low Wood Bay, 25–29 June 2012, pp.183–196. New York: ACM. Ledlie J, Park J-g, Curtis D, et al. Mole´: a scalable, usergenerated WiFi positioning engine. J Locat Based Serv 2012; 6: 55–80. Zhang C, Subbu KP, Luo J, et al. GROPING: geomagnetism and cROwdsensing powered indoor naviGation. IEEE T Mob Comput 2015; 14: 387–400. De Marco A, Cagliano AC, Nervo ML, et al. Using system dynamics to assess the impact of RFID technology on retail operations. Int J Prod Econ 2012; 135: 333–344. Kholod M, Nakahara T, Azuma H, et al. The influence of shopping path length on purchase behavior in grocery store. In: Setchi R, Jordanov I and Robert J (eds) Knowledge-based and intelligent information and engineering systems. Berlin: Springer, 2010, pp.273–280. Pierdicca R, Liciotti D, Contigiani M, et al. Low cost embedded system for increasing retail environment intelligence. In: Proceedings of the IEEE international conference on multimedia & expo workshops (ICMEW), Torino, 29 June–3 July 2015, pp.1–6. New York: IEEE. Xiaoling Z, Shibo L, Burke RR, et al. An examination of social influence on shopper behavior using video tracking data. J Market 2014; 78: 24–41. Popa M, Rothkrantz L, Zhenke Y, et al. Analysis of shopping behavior based on surveillance system. In: Proceedings of the IEEE international conference on systems, man, and cybernetics (SMC), Istanbul, Turkey, 10–13 October 2010, pp.2512–2519. New York: IEEE.