2016 IEEE International Conference on Big Data (Big Data)
Leveraging Social Big Data for Performance Evaluation of E-Commerce Websites Eyad Makki, Lin-Ching Chang Department of Electrical Engineering and Computer Science The Catholic University of America, Washington, DC, 20064, USA
[email protected],
[email protected] Abstract— E-commerce plays a key role in business success nowadays. Therefore, the performance of E-commerce websites is critical. E-commerce websites generate a large amount of data that is often used for performance evaluation. Many website evaluation methods have been proposed, but the social media factor is usually not taken into consideration. In this paper, Twitter data is utilized for big data analytics and several Twitter performance indexes are proposed to assist the website performance evaluation. The result is compared with the performance evaluation result using several commercial tools for 13 selected E-commerce websites in Saudi Arabia. Keywords- E-Commerce, Social Media, Big Data Analysis, Website Evaluation, Saudi Arabia
I.
INTRODUCTION
E-commerce websites collect a huge amount of data from various sources such as system, network, customers, and social media. Big Data Analysis (BDA) is adopted by Ecommerce businesses handling both structured data, like demographic data, and unstructured data, like traffic, in order to be ahead of their competitors. One BDA application is the performance improvement for E-commerce business. For example, it was used to generate personalized recommendations, incentivize purchases with special offers, and streamline the checkout process for a faster, better, and more intimate shopping experience [1]. Social data has also been utilized in E-commerce businesses. Social media plays a critical role in E-commerce development; it increases the ability to reach a larger group of customers online. There are many benefits to be gained from social media supporting E-commerce. Social media helps in promoting brand awareness, generates more business and richer user experiences that was confirmed by many studies [2]–[6]. It also provides the ability to rapidly spread ads, and broaden targeting customers by the followers. It is an effective way to communicate with customers and receive their feedback. In fact, social media helps to drive recommendation that leads the hesitancy customers to purchase, thanks to the increased degree of confidence in the product or service, as well as the reliance on the trusted sources with respect of word of mouth. Feedbacks of customers and rating of products can be easily obtained through their comments, likes and reviews, which facilitate interactive ways and relationships development. From technical perspective, social media supports Search Engine Optimization due to the positive relationship between higher search rankings and wider markets and customers. In practical E-commerce, social media provides marketing insights reflected by the increased customer loyalty through building trust with products or services and mainly with the company brand itself [7].
978-1-4673-9005-7/16/$31.00 ©2016 IEEE
2525
There are many traditional, simple approaches used for E-commerce websites evaluation. These approaches are based on: checklist methods, knowledge experience, regression model, and analog approach. They can solve problems based on a set of systematic steps ignoring the associations between global decision making process factors. Therefore, some significant factors influencing the outcomes are the ability and the experience of the analyst [8]. The effectiveness of E-commerce websites is crucially important as it directly impacts the compatibility, usability, revenue and growth of the business. Qualitative approach using principals or criteria were often used to evaluate the overall quality of an E-commerce site. However, the qualitative evaluation performed by a domain expert can be subjective and time-consuming [9]. Moreover, the Ecommerce environment is not static and data is collected all the times, so an automatic quantitative approach to continuously evaluate the performance of a site would be highly desired. A number of websites evaluation methods using quantitative approach have been proposed; each method was designed with one or a few particular evaluation purposes. For examples, some methods are designed for general websites and some are customized for specific domains [10], while others are used for redesign decisions or for strategic decisions related to business or marketing [9], [10]. Researchers or field engineers usually extended an existing foundation model like WebQual [11] or ServQual [12] by adding new criteria and dimensions to meet their own evaluation purposes [13]. In addition, these tools usually use a set of pre-defined factors without providing a way to take other important factors into account. These existing tools often report only a set of parameters such as the speed, accessibility, link analysis and page ranking that may not be the most effective indictors for the growth of the business. Moreover, these existing approaches fail to adequately incorporate social media factor. The use of social media plays an important role in online purchase decision. For example, Makki and Chang reported that social media affects the purchase decision for female Saudis [14]. The use of social media helps in building a stronger customer relationship and in increasing the product awareness. The number of social media accounts and their massive usage in Saudi Arabia also indicate consumers’ readiness to employ social media on E-commerce. Twitter is one of the largest social media platforms in the world and is the most popular one in the Arabic world. Twitter Search API provides a way to evaluate E-commerce sites through measuring social engagement, translated in the post and reply rates; user interaction reflected in retweet rates, favorite rate, and direct messages; and response rate or
speed. At the time of writing, Twitter has 320 million users, which makes it the sixth largest social networking site in the world. As Jansen and Zhang pointed out [15], no other social media platform allows a direct and instant communication between consumers and businesses, and free APIs to streaming data. The importance of Twitter also emanates from the fact that tweets are not only displayed on a user’s profile page, but they can be delivered directly to followers via instant messaging, Short Message Service, Really Simple Syndication, email, or other social networking platforms. One discrepancy among countries regarding Twitter usage is clear if we calculate the Twitter penetration for each country. The penetration is defined as the number of monthly active tweeting users relative to the total amount of internet users in that country. Statistical reports and information graphics about social media usage in Saudi Arabia puts it in top of other countries in the region and the world with over 32% active twitter users [16]. Saudi Arabia’s Twitter growth rate is of 3,000% from 2011 to 2012, and is still growing. This growing rate is 10 times more compared to the average global rate. Statistics also showed that there was an average of 50 million tweets per month in 2012 and 150 million tweets in 2013 in the country [5]. The afore mentioned numbers made Saudi Arabia ranked number one in the world in Twitter growth and activity, which motivate us to use the social media data from Twitter to assist the evaluation of Ecommerce websites in Saudi Arabia. Many recent phenomena and new factors affecting Ecommerce performance are globally recognized which include but are not limited to social media, mobile usage, payment method, and government regulation. It is also a known fact that the factors influencing E-commerce adoption and implementation are highly related to the country or the region and its culture under investigation [17]–[20]. These new factors, region and culture differences needed to be considered in a modern E-commerce websites evaluation model [21]. Thus, a new tool based on such model that better tackles fore mentioned problems is indeed needed. Therefore, our ultimate goal is to develop an Online Presence Evaluation Model (OLPEM) for performance evaluation of E-commerce sites. In this paper, however, we do not intend to create such new tool but focus on social media and region factors and leverage them for performance evaluation. The region under study is Saudi Arabia and the social media channel is Twitter. From a previously collected data of over 160 E-commerce websites in Saudi Arabia [5], we chose the top 13 Twitter followers accounts that have web traffic data. Tweets data in a 28-day period was collected to analyze the performance of E-commerce sites through factors like popularity, engagement (response) and reach. II.
LITERATURE REVIEW
A. Evaluation of E-commerce Sites Evaluation of the performance of websites and assessment of the impact is important to the successfulness of any E-commerce business. It is also a way to define their strengths and weaknesses in order to enhance the former and address the latter. Website evaluation allows business owners
2526
and investors to measure the site’s success and deficiency. Consumers will also benefit from the evaluation results as it will provide them with the information they need regarding E-commerce services and product quality as well as the business’ overall reputation. In fact, consumers as well as Ecommerce owners have been using third party comparison websites to have a neutral, reliable evaluation of business websites. Comparison sites are considered as one of the most reliable methods for E-commerce evaluation today as they provide an impartial assessment based on a set of predefined criteria. For instance, BizRate.com, one of the very first comparison websites, established 10 criteria to evaluate Ecommerce websites [9]. Comparison sites bridge the gap between E-commerce websites and their consumers. So, customers have a way to navigate the growing landscape of online retailers to assess the quality of different online stores. Other methods that have been used to evaluate Ecommerce websites are online surveys, expert evaluation, or the combination of both. For instance, Forrester Research is an independent technology and market research company which combines online surveys and experts analysis to come up with a comprehensive evaluation. Gomez Advisors Company is famous for what its experts call “scorecard,” which encompasses criteria comprising 150 to 250 factors to measure the quality of E-commerce websites. Shihab et al [22] have taken safety as a new factor and created a new formula to measure safety when shopping online. They call it the “Trust Presence Evaluation” tool which would include a larger number of parameters that they obtained from other 10 similar studies. They reported that lack of trust in Ecommerce transactions may potentially hinder further growth of E-commerce. Their study was the first one that focused on the level of trust displayed by sellers online. They formulated a trust presence evaluation tool that includes 9 categories, 29 parameters, and 93 indicators to be used to assess the trust presence in E-commerce websites. Creating and maintaining a website is one of the most important branding tools to improve the brand’s visibility. Social media has a growing and an undeniable impact on any businesses nowadays. According to Weatman [23], building an online presence in social media is vital for a brand to thrive and to be acknowledged by consumers. She stressed on the fact that with their omnipresence, social media represents a huge opportunity for all kinds of E-commerce to get the publicity it needs and to reach hundreds of millions of customers. Weatman provided advice to business owners on how to build the online presence through various social media, including Facebook, Twitter, LinkedIn, Pinterest, and YouTube. Her paper suggested that businesses need to grow a positive online presence and ensure the development and promotion of a polished, well-represented brand image. To evaluate an E-commerce online presence, it is important, as the businesses’ fan-base grows, to use analytics and other tools to help understand how effective an online presence is, and how to make it even more effective, such as knowing where or what specific social media a business needs to focus its efforts on and what they might need to change. Talpau et al [24] have enumerated the benefits of Ecommerce online presence, namely; overcoming
geographical boundaries, increasing speed of communications, 24/7 access to information, low cost promotion, the possibility of establishing inter-personal relationships, the ability to segment the market very well, the ease in obtaining feedback, and high exposure. Their study, as they qualify it, is descriptive and exploratory. The data is collected from public sources and then analyzed to identify Romanian top brands behaviors in an online environment. Exploratory studies are used to understand a problem or a situation. Descriptive studies help building behavioral profiles of people, events, and situations and in this case the brands. The authors explained online presence in terms of the brand strength and weakness; they proved that the presence of brands in the social media and brand strength are interdependent. Social media helps brands to interact with customers, using humanizing strategies of the brand which can lead to the customers’ loyalty. Clarifying the difference between website and social media presence, the authors noted that the brand's experience in the online environment is the site and the contact with the brand through social media, while in the office environment, the brand is represented more by products attributes. More than with the website, social media has the potential to create relationships with customers, interact with them and retain them, which is why brands must be active in the social media in order to establish contact with the target audience. Companies must focus on trust and the first step is to create brand loyalty [24]. B. E-Commerce in Saudi Arabia AlGhamdi et al [17] presented a new conceptual framework aiming to promote trust in E-commerce environment in Saudi Arabia. The new model is a result of mixed methods conducted in four stages involved in the study to determine the reasons of the slow growth of Ecommerce in Saudi Arabia. The framework is a five-part model aiming to promote trust in E-commerce in Saudi Arabia. These five parts are: the provision of secure and trustworthy online payment alternatives, enforcing consumer protection, strengthening delivery systems, clarifying market restrictions, and establishing certification authority. The results of the study confirmed that the disbelief in Ecommerce environment in Saudi Arabia is the main reason of the slow growth. Alotaibi and Bach introduced an exploration of the major challenges to establish efficient facilities faced by the Ecommerce industry in Saudi Arabia [25]. These challenges of the successfulness of E-commerce implementation in Saudi Arabia are: the absence of government involvement, insecure online payment infrastructure, the lack of E-commerce law, and weakness of the postal delivery system. Therefore, some proposed solutions were presented to address challenges facing the process of E-commerce development facilitation in Saudi Arabia. Their solutions require the support of government to establish permanent home addresses and to provide secure online payment, as well as to establish regulations and policies. Makki and Chang [26] agreed on the challenges and solutions but reported several new factors. Other web evaluation studies [27] have reported that a few
2527
factors such as heuristic and usability are no longer affecting the performance of websites, which explains why these factors will be excluded in this research. C. E-Commerce Website and Social Media Evaluation in Saudi Arabia Alotaibi proposed an evaluation instrument aiming to evaluate the quality of E-commerce websites in Saudi Arabia and then highlight the potential improvement opportunities [10]. Consequently, six E-commerce websites have been chosen for analysis and evaluation, equally categorized as follows: regional, international, and domestic. These websites are: Logta, E-Mall, Souq, Amazon, E-Market, and Alibaba. Sixty participants have evaluated those six websites, following a number of evaluation instrument guidelines with good web design experience. Evaluation factors described by the evaluation instrument are: content, interaction, assurance, appearance, organization, and customer focus. The outcomes of the instrument have confirmed that the regional and domestic websites were of lower quality than international websites considering all web evaluation aspects. Therefore, the quality of domestic websites is less than the quality of regional websites in terms of customers’ focus, appearance, organization, assurance and content. Several hypothesis were formed by Guzzo et al [28] regarding how the consumer’s purchase decision is affected by other members of social network with the same social attraction and interests. Hence, a new model called Ecommerce Consumer Acceptance (ECA) was developed taking into account the additional components that affect consumer acceptance with respect to online social networks. Their new model can be considered as an extension to the Technology Acceptance Model formalized from the social network point of view to determine the influence of social media on consumer’s acceptance of E-commerce. The study added a better understanding of the online social effects on the E-commerce acceptance. Therefore, their model extends other variables associated with purchase decision affected by a set of characteristics of social networks. Alshehri and Meziane put their attention to the influence of social media as a third party on E-commerce development especially in building trust measures [3]. They described that close social relationships, mainly family members and friends, has stronger impact of recommendations on the purchase decision in Saudi Arabia. They also attempted to explore whether Saudi Arabia’s case is similar to the findings in other studies. More specifically, their study evaluated the effects of social endorsements on E-commerce transactions to determine the role of social impact in the increase of online shopping in the country. They performed a survey and collected data from 606 participants in Saudi Arabia. Based on their survey’s results, they reported four main factors associated with the social characteristics and relationships affecting online shopping. In details, a strong effective factor was a member of a famous group of companies that improves the trust in online shopping with 83.3%, followed by 81.5% of the participants trust in their family members. In addition, the percentage of the
participants that trust their friends’ recommendations was 73.8%. In general, 57.1% of the respondents are fine with supporting factors of social impact as a third party that enhances online shopping. The findings of the study have confirmed that the similarity between previous studies conducted in other countries and Saudi Arabia. Moreover, they reported that the social influence is a motivating factor, which supports and strengthens E-commerce development in Saudi Arabia. Alafeef identified the influence of social media on Ecommerce sales based on a sample selected E-stores from Saudi Arabia [7]. He reported a positive relationship between the use of social media and E-commerce sales and promotions in the conducted companies. Thus, he suggested the use of social media as a changing tool to the development of E-commerce in Saudi Arabia. He stated that E-commerce is one of the current beneficial methods to enhance business operations in terms of increasing purchase and marketing efficiency in Saudi Arabia. Makki and Chang also conducted a similar study related to E-Commerce and social media in Saudi Arabia [14]. D. Twitter for Analytic Purposes Twitter analytics has been extensively used among researches seeking to collect huge amount of data and find useful information on different subject fields. Jamison-Powel et al, used twitter data to conduct a research on insomnia [29]; others analyzed twitter data for marketing [30] or athletics purposes [31], just to name a few. Other studies using Twitter data for analytics cover a wide range of topics including but not limited to information propagation and diffusion [32], popularity prediction [33], event detection [34], and tweet classification [35]. Most of these studies focused on qualitative content analysis, linguistic analysis or network analysis; some studies focused on quantitative analysis by employing statistical method and algorithm development like ours. III.
METHODOLOGY
In this section, we describe the data used in this paper, how and where the data was collected, and what factors were extracted from Twitter data. We propose several Twitter performance indexes and explain how we use these indexes for E-commerce site performance evaluation. We then compare the twitter performance indexes with the performance evaluation result using several commercial tools for 13 selected E-commerce websites in Saudi Arabia. A. Data Collection Methods In this study, we used a quantitative approach to gather an in-depth understanding of Twitter usage from Ecommerce websites in Saudi Arabia. The top 13 Twitter followers accounts were selected from our previous study containing data of more than 160 Saudi E-commerce sites [5]. As pointed out [36], social media analytics has played an important role in E-commerce. In this study, we aim to leverage the Twitter data to assist the performance assessment for E-commerce systems. To accomplish our goal, we collected data from online Twitter analysis
2528
websites, TweetChup.com and Twitonomy.com. In addition to Twitter data, we also collected website traffic, ranking, regional related and services data for each selected website. The dynamic website created in our previous study (ecgulf.org) containing data of E-stores, like category, location, website link, social media accounts links, delivery methods, accepted payment methods, languages of website, currencies accepted, mobile compatibility etc. [5]. More than 160 major online stores in Saudi Arabia were included in ecgulf.org. Information of each E-store was manually added first by visiting its corresponding website and filling record form in our site. E-store owners can also visit the site and fill the form to be included in our database. The actual number of E-stores in our site can vary as we regularly add new E-stores or remove the out-of-business E-stores. At the time of this paper is written, the number of E-stores in our website is 164. We also collected the overall web traffic data the of 13 selected E-stores from SimilarWeb.com. In summary, Twitter data, website traffic and regional related data were collected to formulate their influences on performance, and to evaluate how they affect the overall E-store performance as part of our development of OLPEM. Twitter Data Collection: Tweets of each account from June 1st, 2016 to June 28th, 2016 are gathered using Tweetchup.com and our Twitter Search API web program. The total number of tweets collected during this 28-day period is 667,391. This Twitter dataset is analyzed using an in-house web-based program to extract useful information. The following information is extracted for each account: account creation date, number of followers, times the account is added to a list by followers and total lifetime tweets. Table 1 shows the selected accounts and summary of their data. Table 1: Twitter Accounts Data Summary Twitter Account
Created
Followers
Listed
Total Tweets
alhabib_shop
11/18/2012
76,480
135
7,918
alwaneshop
10/02/2012
60,548
110
41,871
axiom_ksa
07/05/2012
195,992
274
51,745
Carrefoursaudi
05/21/2011
295,958
349
3,701
dokkanafkar
08/08/2012
44,134
87
8,503
EMall_KSA
06/23/2010
86,664
261
28,105
eXtraStores
09/08/2009
616,234
778
63,296
jarirbookstore
04/13/2011
1,336,817
1,679
252,408
MarkaVIP
09/07/2010
54,895
135
19,786
matjarhk
07/15/2012
63,594
97
4,280
NamshiDotCom 11/17/2011
313,948
327
111,027
SouqKSA
01/18/2010
284,253
603
53,988
vanillaeshop
09/13/2011
77,099
229
20,763
ar-sa.namshi.com, 4: carrefourksa.com, 5: dokkanafkar.com, 6: e-mall.com.sa, 7: extrastores.com, 8: jarir.com, 9: ksa.axiomtelecom.com, 10: markavip.com, 11: matjarhk.com, 12: saudi.souq.com, 13: vanillaeshop.com.
We also extracted selected time period data: number of period tweets, average daily tweets, number of mentions, number of tweets containing links, number of replies, number of retweets, number of times tweets are favorited (liked) and number of hashtags. Table 2 shows retrieved data counts for each Twitter account during the 28-day period.
Region Related and Services Data Collection: We have collected regional related and services data that include payment methods, delivery methods, aftersales services, regional coverage, website languages and accepted currencies. For each factor, we applied a pre-defined weighting scheme based on our previous studies [14], [37]. For examples, credit card payment method is desired and weights more than other payment methods. The payment score would be ranged from 1 to 10 with the following weights: Credit Card=4, PayPal=3, Bank Transfer=1.5, Cash On Delivery=1 and CashU=0.5). CashU is a payment gateway service for the Arab world. For the delivery methods, the use of more reliable local and commercial services would be preferred, so the weights is 8 for Aramex, FedEx/SMSA and Saudi Postal, and 2 if own delivery systems were used. For regional coverage, if an E-store covers Saudi Arabia, an initial score of 7 is given, and the score will be reduced by 1 if it covers limited areas in the country. If the E-store also covers the neighboring Gulf countries, Arab countries or international countries, the score will be increased by 1 for each. The main website language is Arabic which is scored of 5, and any additional languages (either one or more) will make it 10. Similarly, the main accepted currency is Saudi Arabian Riyals (SAR) which is scored 9, and any additional currencies would be scored as 10. Finally, aftersales services are counted by 2 for each service (exchange, return, maintenance, guarantee and technical support).
Website Traffic and Ranks Data Collection: We also collected website traffic data of each E-store from SimilarWeb.com for the same period of time. The traffic data collected includes: total number of visits, number of average daily visits, average visitor duration, number of pages per visit, and bounce rate. SimilarWeb.com also provides other data like traffic source, display advertising, content and audience interests. However, such information is not always available for all selected websites. Therefore, we decided not to consider it until it is available in the future. Additional data extracted is Google Page Rank, Alexa Rank, Moz Rank and Moz Domain Authorization. These ranks provide a general view on the web pages in either search engines based on a set of criteria or in their own directory using their own algorithms for a certain purpose. For example, Page Rank works by counting the number and quality of links to a page according to keywords to determine a rough estimate of how important the website is. In addition, we also checked the following information for each site including Native Mobile App availability, Google Mobile Friendly (GMF) status, and Google Safe Browsing (GSB). Table 3 shows the traffic data of the 13 E-commerce websites retrieved from SimilarWeb.com. For space-saving purpose, we will use the E-store numbers instead of the full address of the E-commerce sites in tables. The mapping is 1: alhabibshop.com, 2: alwaneshop.com, 3:
Table 2: Twitter Accounts Tweets Counts from 1, June 2016 to 28, June 2016 Average Daily Mentions Tweets 2.8 41
Twitter Account
Tweets
alhabib_shop
76
alwaneshop
226
8.4
axiom_ksa
2,751
Carrefoursaudi dokkanafkar
Links
Replies
Retweets
Favorited
Hashtags
4
37
10,229
1,086
11
80
41
32
16,961
1,513
111
101.9
2,943
627
2,657
247
430
99
140
5.2
117
31
97
626
1,797
76
69
2.6
63
12
56
17
78
0
EMall_KSA
410
15.2
215
243
186
1,229
965
964
eXtraStores
2,853
105.7
2,684
1,061
2,696
1,777
2,259
124
jarirbookstore
4,935
176.3
2,980
1,072
4,708
19,481
8,892
290
MarkaVIP
422
15.6
413
62
381
661
1,159
95
matjarhk
350
13
223
215
158
40,826
2,397
337
NamshiDotCom
1,271
47.1
1,222
71
1,238
77
227
105
SouqKSA
2,082
77
1,807
401
1,805
506
1,406
215
vanillaeshop
144
5.3
98
44
94
559
153
34
2529
Table 3: E-commerce Websites Traffic Data from SimilarWeb.com from 1, June 2016 to 28, June 2016
2,444
Average Daily Visits 87
Average Visitor Duration 00:04:50
2
4,772
170
3
756,829
4
E-store No
Total Visits
1
Pages Bounce per Visit Rate 7.62
16.57%
00:06:03
3.58
29.91%
27,030
00:07:12
10.08
41.91%
240,623
8,594
00:03:34
5.77
34.15%
5
243,715
8,704
00:07:16
8.37
33.32%
6
107,681
3,846
00:04:43
6.24
28.84%
7
2,659,492
94,982
00:05:35
5.48
31.10%
8
2,517,003
89,893
00:05:58
5.16
29.97%
9
99,373
3,549
00:03:39
4.04
35.66%
10
651,633
23,273
00:06:33
8.73
40.97%
11
9,710
347
00:04:09
4.38
28.27%
00:04:30
4.30
51.90%
00:09:02
4.41
29.01%
12 13
11,977,050 427,752 10,746
384
, where Y is the age of the account in years, T is the total number of tweets, F is the number of followers, and L is the number of listed. Tweets Activity (TA): The TA score depends on the average daily tweets reflecting how often the account tweets. The TA score can be calculated according to the following formula also using the PRE function: 𝑇𝑇
𝑇𝑇𝑇𝑇 = 𝑃𝑃𝑃𝑃𝑃𝑃 � � × 100 𝑃𝑃
, where T is the number of tweets within a period and P is the number of days in the same period. Account Reach (AR): The AR score depends on the total number of tweets and the number of followers that reflects how many tweets have been sent to how many followers. We recognize that the number of retweets may contribute to the score of this index, but due to the difficulty to track the followers of each follower who retweeted, we have decided not to include it here but use it in our next index. The AR score can be calculated using the following formula using the PRE function: 𝐴𝐴𝐴𝐴 = 𝑃𝑃𝑃𝑃𝑃𝑃(𝑇𝑇 × 𝐹𝐹) × 100
The numbers of total lifetime tweets by each account illustrates that the level of tweets is not proportional to the number of followers, see Table 1. For example, Carrefour Saudi is ranked 4th in number of followers with nearly 300 thousand while it is ranked last (13th) with only 3,701 lifetime tweets. On the other hand, alwaneshop is ranked 11th in number of followers, over 60 thousand, but is ranked 6th with over 41 thousand tweets. More details on the results will be presented and discussed in the next section.
, where T is the number of tweets within a period and F is the number of followers.
B. Twitter Performance Indexes To measure the performance of Twitter accounts, as a part of social media factor in the OLPEM, the following indexes were proposed and computed using the collected data. The score of each index presented here is ranged from 0 to 100. In general, a higher score indicates a better performance.
4
Account Age (AA): The AA score depends on the account creation date, when the account was added to a user list (listed) per follower and the lifetime tweets. The score of this index reflects how long the account being opened, how many times the account being listed, and how many tweets the account have been posted during the account’s lifetime. A Microsoft’s Excel function PERCENTRANK.EXC (PRE), which returns the rank of a value in a data set as a percentage, is used to calculate the percentile of one account compared to the average of all accounts. The AA score was calculated according to the following formula: 𝐴𝐴𝐴𝐴 = �
𝑌𝑌
𝐹𝐹
𝑃𝑃𝑃𝑃𝑃𝑃�𝑇𝑇�+𝑃𝑃𝑃𝑃𝑃𝑃� 𝐿𝐿 � 2
� × 100
Account Popularity (AP): The AP score depends on the number of tweets, the retweets percentages, the favorited percentages and the number of followers. The AP score reflects how popular is the account, and can be calculated by the following formula: 𝐴𝐴𝐴𝐴 = �
𝑇𝑇 𝑅𝑅
𝑇𝑇 𝑉𝑉
𝑃𝑃𝑃𝑃𝑃𝑃(𝑇𝑇×𝐹𝐹)+ 𝑃𝑃𝑃𝑃𝑃𝑃� �+𝑃𝑃𝑃𝑃𝑃𝑃� �+𝑃𝑃𝑃𝑃𝑃𝑃(𝐹𝐹)
, where T is the period tweets, F is the number of followers, R is the retweet percentage and V is the favorited percentage. Account Engagement (AE): The AE score depends on the percentage of replies and mentions that reflects how many times out of tweets an account replied or mentioned a follower during the observed period. The AE score can be calculated according to the following formula: 𝐴𝐴𝐴𝐴 = �
𝑇𝑇
𝑇𝑇
𝑃𝑃𝑃𝑃𝑃𝑃�𝑌𝑌�+𝑃𝑃𝑃𝑃𝑃𝑃�𝑀𝑀�+𝑃𝑃𝑃𝑃𝑃𝑃(𝐹𝐹) 3
� × 100
, where T is the period tweets, Y is the number of replies, M is the number of mentions and F is the number of followers. Account Website Support (WS): The WS score depends on the percentage of tweets containing hyperlinks within a period. The WS score can be calculated according to the following formula: 𝑇𝑇
WS= 𝑃𝑃𝑃𝑃𝑃𝑃 � � × 100 𝑈𝑈
2530
� × 100
, where T is the period tweets and U is the number of tweets containing hyperlinks. C. Website Performance Indexes Three different website performance indexes will be used to represent the performance evaluation of Ecommerce website and compare against the Twitter performance indexes. As mentioned in section III.A, the data used here was collected from SimilarWeb.com (see Table 3). The score of these three indexes are also ranged from 0 to 100. Website Traffic (WT): The WT score depends on the total number of visits (V), the average daily visits (D), the average visitor duration in seconds (R), the number of pages per visit (P) and the bounce rate (B). Intuitively, these factors are not equally important. Based on our previous studies and our best knowledge, we have defined a relative weighting scheme for these factors. The WT score can then be calculated according to the following formula with the weights pre-defined: 𝑊𝑊𝑊𝑊 = �
(𝑃𝑃𝑃𝑃𝑃𝑃(𝑉𝑉)×6)+(𝑃𝑃𝑃𝑃𝑃𝑃(𝐷𝐷)×2)+(𝑃𝑃𝑃𝑃𝑃𝑃(𝑅𝑅)×0.75)+ 𝑃𝑃𝑃𝑃𝑃𝑃(𝑃𝑃)+(𝑃𝑃𝑃𝑃𝑃𝑃(𝐵𝐵)×0.25) � 10
× 100
Website Rankings and Mobile Friendly (RM): The RM score depends on the Google Page Rank (G), the Alexa rank (A), the Moz rank (M), the Moz domain authority rank (D), the native mobile app (P) and GMF (F). If there is a native mobile app, P equals to 1, otherwise 0. If the website passed the GMF test, F equals to 1, otherwise 0. The RM score can be computed as the average value of those 6 parameters according to the following formula: 𝑅𝑅𝑅𝑅 = �
𝑃𝑃𝑃𝑃𝑃𝑃(𝐺𝐺) + 𝑃𝑃𝑃𝑃𝑃𝑃(𝐴𝐴) + 𝑃𝑃𝑃𝑃𝑃𝑃(𝑀𝑀) + 𝑃𝑃𝑃𝑃𝑃𝑃(𝐷𝐷) + 𝑃𝑃 + 𝐹𝐹 � × 100 6
Regional and Services (RS): The RS score depends on the payment methods (P), the delivery methods (D), the
regional coverage (R), the aftersales services (S), the website languages (L), and the accepted currencies (C). Similarly, these factors won’t be equally important, and relative weight should be applied to each factor. Therefore, we also defined a weighting scheme for these factors. The RS index can then be computed using the following formula:
IV.
𝑅𝑅𝑅𝑅 = (𝑃𝑃 × 3.5) + (𝐷𝐷 × 2) + (𝑅𝑅 × 1) + (𝑆𝑆 × 2.5) + (𝐿𝐿 × 0.5) + (𝐶𝐶 × 0.5)
RESULTS AND DISCUSSION Table 4 lists the scores of computed performance indexes from the collected Twitter data. More investigation is needed to determine which index is more effective for the performance evaluation of E-commerce sites. We believe a linear combination of these indexes would be an optimal choice. For the purpose of simplicity, we compute an average Twitter performance score that includes all proposed indexes, i.e., a linear combination of these indexes with equal weights. In Table 5, we show the values of computed rank using various commercial evaluation tools. We manually checked each E-commerce site to determine the existence of a native mobile application. Google Page Rank indicates how a website is performing in their search engine using Google’s own algorithm. This rank refers to the order of the website showing in searching results and the value is out of 10. Moz Rank and Moz Domain Authority have values out of 10 and 100 respectively. Unlike Google Page Rank targeting keywords, Moz Rank targets link popularity score based on search queries. This also explains the difference in range between them and will lead us to normalize the values when combining results. Table 6 listed the information needed to compute the RS index, which is a score related to regional and services. Each factor was computed for each E-commerce site based on the data we have obtained from our dynamic webpage (ecgulf.org), and applied the predefined weighting schemes we have described in the method section.
Table 4: Scores of Twitter Accounts Performance Indexes Twitter Account
AA Score
TA Score
AR Score
AP Score
AE Score
WS Score
Average Score
alhabib_shop alwaneshop axiom_ksa Carrefoursaudi dokkanafkar EMall_KSA eXtraStores jarirbookstore MarkaVIP matjarhk NamshiDotCom SouqKSA
32.10 49.95 57.10 57.10 21.40 46.40 82.10 74.95 28.55 67.80 67.80 46.35
14.20 35.70 85.70 21.40 4.10 50.00 92.80 64.20 57.10 42.80 71.40 78.50
14.20 28.50 71.40 57.10 7.10 50.00 92.80 85.70 42.80 35.70 64.20 78.50
17.80 26.75 74.95 42.80 39.25 48.20 76.75 58.90 37.45 24.95 80.30 73.15
52.33 71.37 38.03 45.20 35.67 76.13 49.97 35.67 33.30 71.40 30.90 54.73
7.10 28.50 78.50 21.40 14.20 64.20 85.70 92.80 42.80 57.10 50.00 71.40
22.96 40.13 67.61 40.83 20.29 55.82 80.02 68.70 40.33 49.96 60.77 67.11
vanillaeshop
17.80
28.50
21.40
48.15
54.70
35.70
34.38
2531
Table 5: E-commerce Websites Ranks and Mobile FriendlyTest E- Google store Page No Rank
Alexa Rank
Moz Rank
Moz Native GMF Domain Mobile Pass Auth. App
1
0
1,407,359
0.00
7.43
No
TRUE
2
1
220,757
4.29
13.23
No
TRUE
3
3
12,428
4.90
46.39
Yes
TRUE
4
0
101,521
4.79
33.06
Yes
FALSE
5
3
59,225
5.05
25.11
No
TRUE
6
7
176,734
5.38
36.91
No
FALSE
7
4
6,994
5.02
37.02
Yes
TRUE
8
0
7,310
6.04
40.90
Yes
TRUE
9
0
68,488
5.07
37.66
No
TRUE
10
3
27,424
5.98
39.17
Yes
TRUE
11
0
548,359
5.10
21.04
Yes
TRUE
12
3
520
5.52
59.00
Yes
TRUE
13
3
5,873,703
4.92
25.83
No
TRUE
Table 6: Websites Regional and Services Estore No
Payment
Delivery
Reginal Coverage
Aftersales Service
Website Language
1
2.5
8
7
4
5
9
2
6.5
8
10
10
10
9
3
8
8
7
4
10
9
4
1
2
7
4
10
9
5
4.5
8
7
4
10
10
6
4
8
7
4
10
9
7
5
8
7
10
10
9
8
6.5
8
7
10
10
9
9
5
10
7
10
10
9
10
8
8
9
2
10
10
11
9.5
8
7
6
10
10
12
10
8
7
4
10
9
13
8.5
8
8
4
5
9
Accepted Currency
performance indexes. When comparing the performance indexes for each E-commerce site, however, we find website performance is not always coincident with Twitter performance index (i.e., E-stores dokkanafkar.com and markavip.com). This implies the social big data can play a significant role affecting the performance assessment. The overall activity of Twitter accounts showed in Table 2 indicates that accounts activity can vary regardless of how many followers it has or how old the account is. The number of retweets actually reflects the account activity. In Table 1, the relatively new accounts (i.e., alhabib_shop, alwaneshop and matjarhk) that have a lower number of followers and tweets but were ranked as the top 3 for the retweet with significantly high percentages, see Table 8. The account alhabib_shop has a very small number of followers and tweets performed the best in retweet activity. It has only 76 tweets in the 28-day period but 10,229 retweets. Despite the low number of followers compared to top retweeting accounts and the very low tweeting rates, a huge amount of people retweeted their tweets. This indicates that those accounts’ followers are more active and are more loyal followers who often retweet (share) their tweets with their family, friends and followers. On the other hand, some accounts have a lower retweet rates like axiom_ksa and NamshiDotCom with 247 (nearly 9%) retweets from 2,751 tweets and 506 (about 24%) retweets from 2,082 tweets respectively. Even though some E-stores Twitter accounts have significantly high retweets, they may not perform well in other factors. Table 7: Website Performance Factosr Scores
In Table 7, we summarize the scores of three website performance indexes. These three indexes covered most of the factors that were previously identified by many studies and can influence the performance of an E-commerce site. Those commercial website evaluation tools, however, do not take social media into consideration. When comparing the result in Tables 4 and 7 (see the last column), we can compute the correlation coefficient 0.7765 between the averaged Twitter performance index and the averaged web
2532
E-store No.
Traffic Scores
Ranking and Mobile Friendly Scores
Regional and Services Scores
Average Website Performance
1
18.35
22.58
48.75
29.89
2
18.49
33.28
83.25
45.01
3
72.64
72.60
70.50
71.91
4
47.14
35.68
34.00
38.94
5
61.03
47.60
58.75
55.79
6
45.30
41.65
56.50
47.82
7
78.56
78.55
75.00
77.37
8
72.79
76.15
80.25
76.40
9
31.76
46.40
79.00
52.39
10
65.82
78.55
68.00
70.79
11
23.72
52.35
81.25
52.44
12
78.70
85.68
77.50
80.63
13
35.12
38.08
70.75
47.98
V.
Table 8: Number and Percentage of Retweets and Replies E-store No. 1
Retweets (%)
Replies (%)
10,229 (13,459.21%)
37 (48.68%)
2
16,961 (7,504.87%)
32 (14.16%)
3
77 (6.06%)
1,238 (97.40%)
4
626 (447.14%)
97 (69.29%)
5
17 (24.64%)
56 (81.16%)
6
1,229 (299.76%)
186 (45.37%)
7
1,777 (62.29%)
2,696 (94.50%)
8
19,481 (394.75%)
4,708 (95.40%)
9
247 (8.98%)
2,657 (96.58%)
10
661 (156.64%)
381 (90.28%)
11
40,826 (11,664.57%)
158 (45.14%)
12
506 (24.30%)
1,805 (86.70%)
13
559 (388.19%)
94 (65.28%)
Replying to, or mentioning, the followers is essential in the E-commerce environment to engage with the followers and answer their questions. This factor is also used to calculate the AE index. Table 8 shows the number and percentage of account’s replies. It indicates that the top period tweets E-Stores are also top E-stores in replies except for SouqKSA. Most of their tweets are replies in an average percentage of 95% meaning they are actively engaging with their followers. These E-stores also tend to score higher in website performance factor. However, SouqKSA outperformed them in website factor, which may explain why SouqKSA does not focus on replying to followers. Based on the results of our preliminary evaluation model, extrastores.com is the best performing E-commerce website by a score of (77.37+80.02)/2=78.7. According to Darasha’s online article, Extra invested in their E-commerce website, which started on 2011 (claimed to be the first in the country) and is growing by 300% annually [38]. The next best performing E-stores are also well-known E-stores in the region. Most likely, the E-stores with high Twitter performance scores have similar high website performance, which indicates that they work to improve their E-commerce efforts to remain in competition. The rest of medium or low performance score E-stores need to work on enhancing their efforts, especially in regional and services factors as they are performing poorly even with good scores on other factors. In addition, they need to find solutions to increase traffic to their websites as it plays a key role in the overall performance of E-commerce. Perhaps a good way is by increasing the Twitter WS factor that focuses on utilizing Twitter to bring visitors to the E-store’s website via links in the tweets. WS factor for some E-stores are extremely low and need to be improved. Therefore, its weight might be recalculated. Some E-stores have significantly high retweets, but they are not enough to perform well if not accompanied with high scores in other factors.
2533
CONCLUSIONS
In this paper, we have shown that social big data can be utilized for performance evaluation of E-commerce websites. Twitter data was collected for social data analytic, and several Twitter performance indexes were proposed. We also evaluated the E-commerce website performance using several commercial evaluation tools. Based on our study, the correlation coefficient is 0.7765 between the averaged Twitter performance index and the web performance index. In the future, we plan to include more E-commerce sites, various periods, more factors like link validation and page speed, and add other social media channels such as Facebook for our social big data analysis. Feature selection techniques will be used to determine which Twitter performance index is more effective or feature extraction techniques like principle component analysis to create new performance indexes. The ultimate goal would be to design and develop a new tool that leverages the social media data to dynamically evaluate the performance of E-commerce sites. During our study, we found that the website traffic information is not always accessible to public. Other useful information such as abandoned shopping cart and successful sales are not available to us either. A marketing strategy would be needed to convince the website owners allowing us to obtain those currently unreachable data. Finally, although our study was conducted using the Ecommerce sites in Saudi Arabia, the proposed framework can be adopted in other countries and regions with a different set of regional related factors or even different domains other than E-commerce. REFERENCES [1]
S. Akter and S. F. Wamba, “Big data analytics in Ecommerce: a systematic review and agenda for future research,” Electron. Mark., vol. 26, no. 2, pp. 173–194, 2016.
[2]
A. Al Matar and Q. Gao, “Social Media’s Impact on E Commerce : How to Adopt Social Media for E-Commerce Marketing,” 2011, pp. 1–6.
[3]
H. Alshehri and F. Meziane, “The Impact of Social Influence and Third Party Endorsement on Online Shopping In Saudi Arabia,” J. Internet E-bus. Stud., vol. 2015, pp. 1–9, 2015. M. N. Hajli, “A study of the impact of social media on consumers,” Int. J. Mark. Res., vol. 56, no. 3, pp. 387–404, 2013.
[4] [5]
[6]
[7] [8]
E. Makki and L.-C. Chang, “The Impact of Mobile Usage and Social Media on E-Commerce Acceptance and Implementation in Saudi Arabia.,” in The 2015 International Conference on e-Learning, e-Business, Enterprise Information Systems, and e-Government (EEE’15), 2015, pp. 25–30. R. J. Assaf, N. A. Noormohamed, and M. A. Saouli, “Rethinking Marketing Communication – Using Social Media to Attract College Consumers in the Middle East,” Compet. Forum, vol. 10, no. 2, pp. 112–117, 2012. M. A. M. I. Alafeef, “The Impact of Social Media on Sales Promotion in E-commerce Companies ( Case Study – KSAAl Baha City ),” vol. 6, no. 16, pp. 197–205, 2014. H. Liu, V. V. Krasnoproshin, and S. Zhang, “Combined Method for E-Commerce Website Evaluation Based on Fuzzy
Neural Network,” Appl. Mech. Mater., vol. 380–384, pp. 2135–2138, 2013. [9] [10]
[11] [12] [13] [14]
[15] [16]
[17]
[18] [19]
[20]
[24]
Y. Wang, H. Ding, and G. G. Xu, “E-Commerce Site Evaluation Based on Neural Networks,” Appl. Mech. Mater., vol. 263–266, pp. 2678–2681, 2012. M. B. Alotaibi, “E-Commerce Adoption in Saudi Arabia: an Assessment of International, Regional and Domestic Web Presence,” Int. J. Inf. Technol. Comput. Sci., vol. 5, no. 2, pp. 42–56, Jan. 2013. J. A. Rodríguez-Pineda, “WebQual: A Web Site Quality Instrument,” University of Georgia, 2000. A. Parasuraman, V. A. Zeithaml, and L. L. Berry, “Servqual: A Multiple-Item Scale For Measuring Consumer Perc,” J. Retail., vol. 64, no. 1, p. 12, 1988. H. S. Al-Khalifa, “A framework for evaluating university mobile websites,” Online Inf. Rev., vol. 38, no. 2, pp. 166– 185, 2014. E. Makki and L.-C. Chang, “Understanding the Effects of Social Media and Mobile Usage on E-Commerce: An Exploratory Study in Saudi Arabia,” Int. Manag. Rev., vol. 11, no. 2, pp. 98–109, 2015. B. J. Jansen and M. Zhang, “Twitter Power:Tweets as ElectronicWord of Mouth,” J. Am. Soc. Inf. Sci. Technol., vol. 60, no. 11, pp. 2169–2188, 2009.
[26]
[27] [28] [29]
N. Schoonderwoerd, “4 Ways How Twitter Can Keep Growing - PeerReach Blog,” 2013. [Online]. Available: http://blog.peerreach.com/2013/11/4-ways-how-twitter-cankeep-growing/. [Accessed: 20-Jan-2015]. R. AlGhamdi, S. Drew, and T. Alhussain, “A Conceptual Framework for the Promotion of Trusted Online Retailing Environment in Saudi Arabia,” Int. J. Bus. Manag., vol. 7, no. 5, pp. 140–150, Feb. 2012. S. a Al-Hudhaif and A. Alkubeyyer, “E-Commerce Adoption Factors in Saudi Arabia,” Int. J. Bus. Manag., vol. 6, no. 9, pp. 122–133, Sep. 2011. R. Alghamdi, A. Nguyen, J. Nguyen, S. Drew, and R. Al Ghamdi, “Factors Influencing Saudi Customers’ Decisions to Purchase from Online Retailers in Saudi Arabia: A Quantitative Analysis,” in IADIS International Conference eCommerce, 2011, no. Citc 2007, pp. 153–161. R. Alghamdi, A. Nguyen, and V. Jones, “A Study of Influential Factors in the Adoption and Diffusion of B2C ECommerce,” Int. J. Adv. Comput. Sci. Appl., vol. 4, no. 1, pp. 89–94, 2013.
[21]
S. S. Abed, Y. K. Dwivedi, and M. D. Williams, “SMEs’ Adoption of E-commerce Using Social Media in a Saudi Arabian Context : a Systematic Literature Review,” Int. J. Bus. Inf. Syst., vol. 19, no. 2, pp. 159–179, 2015.
[22]
M. R. Shihab, S. Wahyuni, and a. N. Hidayanto, “Analysis of trust presence within e-commerce websites: A study of Indonesian e-commerce websites,” in Proceedings - ICACSIS 2014: 2014 International Conference on Advanced Computer Science and Information Systems, 2014, no. October, pp. 133–138. D. Weatman, “Use Social Media to Manage Your Online Reputation and Support Your Brand,” Career Plan. Adult Dev. J., vol. Summer, pp. 107–114, 2016.
[23]
[25]
2534
A. Talpäu, G. Brätucu, and T. Viera, “Top Romanian Brands Online Presence - behavior and success strategies,” Rom. J. Mark., vol. apr-iun201, no. 2, pp. 13–23, 2013. A. O. Alotaibi and C. Bach, “Consumer Awareness and Potential Market for e-Commerce in Saudi Arabia: Challenges and Solutions,” in ASEE 2014 Zone I Conference, 2014, pp. 1–5. E. Makki and L.-C. Chang, “E-Commerce in Saudi Arabia: Acceptance and Implementation Difficulties,” in The 2014 International Conference on e-Learning, e-Business, Enterprise Information Systems, and e-Government (EEE’14), 2014, pp. 114–120. D. I. Zahran, H. A. Al-nuaim, M. J. Rutter, and D. Benyon, “a Comparative Approach To Web Evaluation and Website Evaluation,” Int. J. Public Inf. Syst., vol. 1, pp. 21–39, 2014. T. Guzzo, F. Ferri, and P. Grifoni, “ECA: An E-commerce Consumer Acceptance Model,” Int. Bus. Res., vol. 8, no. 1, pp. 145–156, 2014. S. Jamison-Powell, C. Linehan, L. Daley, A. Garbett, and S. Lawson, “‘I can’t get no sleep’: discussing #insomnia on Twitter,” in Conference on Human Factors in Computing Systems - Proceedings, 2012, pp. 1501–1510.
[30]
M. J. Culnan, P. J. McHugh, and J. I. Zubillaga, “How Large U.S. Companies Can Use Twitter and Other Social Media To Gain Business Value.,” MIS Q. Exec., vol. 9, no. 4, pp. 243– 259, 2010.
[31]
M. E. Hambrick, J. M. Simmons, G. P. Greenhalgh, and T. C. Greenwell, “Understanding Professional Athletes’ Use of Twitter: A Content Analysis of Athlete Tweets.,” Int. J. Sport Commun., vol. 3, no. 4, pp. 454–471, 2010.
[32]
D. M. Romero, B. Meeder, and J. Kleinberg, “Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter,” in Proceedings of The 20th International Conference on World Wide Web, 2011, pp. 695–704.
[33]
J. Yang and S. Counts, “Predicting the Speed , Scale , and Range of Information Diffusion in Twitter,” in Fourth International AAAI Conference on Weblogs and Social Media, 2010, pp. 355–358. T. Sakaki, M. Okazaki, and Y. Matsuo, “Earthquake shakes Twitter users: real-time event detection by social sensors,” in Proceedings of The 19th International Conference on World Wide Web, 2010, pp. 851–860. D. Davidov, O. Tsur, and A. Rappoport, “Enhanced Sentiment Learning Using Twitter Hashtags and Smileys,” in Proceedings of the 23rd International Conference on Computational Linguistics: Posters, 2010, no. August, pp. 241–249.
[34]
[35]
[36] [37]
[38]
A. Bruns and S. Stieglitz, “Towards more systematic Twitter analysis : metrics for tweeting activities.,” Int. J. Soc. Res. Methodol., vol. 16, no. 2, pp. 91–108, 2013. E. Makki and L.-C. Chang, “E-Commerce Acceptance And Implementation In Saudi Arabia: Previous, Current And Future Factors,” Int. J. Manag. Res. Bus. Strateg., vol. 4, no. 3, pp. 29–44, 2015. Brinda Darasha, “Saudi retailers could miss e-commerce bandwagon,” 2016. [Online]. Available: http://www.argaam.com/en/article/articledetail/id/431607. [Accessed: 25-Aug-2016].