INTRODUCTION. The story of the Web began with an essay written by Vannevar Bush, «As We May Think» published in July, 1945 in the Atlantic Monthly (Bush ...
// International Journal of Web Portals // Volume 4, Issue 1, 2012, Pages 33-41
Web personalization based on fuzzy aggregation and recognition of user activity Alexander Alfimtsev Bauman Moscow State Technical University, Russia Sergey Sakulin Bauman Moscow State Technical University, Russia Vladimir Devyatkov Bauman Moscow State Technical University, Russia
ABSTRACT This article addresses Web personalization based on the analysis of individual user activity. However, human behavior is characterized by uncertainties that should be considered in the personalization algorithms. Fuzzy logic allows taking into account different types of uncertainty. Therefore, the paper presents a method for Web personalization based on fuzzy aggregation and recognition of user activity. The advantages of this approach are 1) the ability to use two types of fuzzy integrals without using the necessary expert set of fuzzy measures and 2) covering all stages of the personalization from aggregation of a single user's query parameters to aggregation of an individual user's profiles in a single parameter of group of users. Keywords: Web personalization, fuzzy aggregation, Choquet fuzzy integral, Sugeno fuzzy integral, fuzzy measure.
INTRODUCTION The story of the Web began with an essay written by Vannevar Bush, «As We May Think» published in July, 1945 in the Atlantic Monthly (Bush, 1945). In this essay, Bush described the theory of Memex a proto-hypertext system that was later developed into the World Wide Web. Today the Web is increasing exponentially and now this phenomenon is more than just a hypertext. The Web began to acquire elements of intelligence. One such element is the ability to adapt to a particular user. For example, the Google search engine allows one to personalize the search based on user search history (Sullivan, 2009). Online marketing and advertising are also becoming more individualized, aimed separately at each user (consumer) (Chiu, 2001;
Markellou, Rigou, & Sirmakessis, 2009). Flexible security personalization approaches will allow an Internet or Web service provider to negotiate with its clientele to an agreed-upon personalized security policy (Yee & Korba, 2008). Interface designers suggest using the personality concept, which helps build a personal user's model based on the interests of individuals (Casas, et al., 2008). (Verma, Verma & Bhatia, 2011) provides an overview of research articles in the field of Web personalization. According to (Verma, Verma & Bhatia, 2011), Web Personalization is defined as the process of customizing the content and structure of a web site to the specific and individual needs of each user taking advantage of the user’s navigation habits. Today, the main methods for Web personalization are data mining techniques which use clustering algorithms. Clustering is widely used in fields such as machine learning, information retrieval, etc. According to (Marmanis & Babenko, 2009) the problem of identifying groups of users naturally lends itself to the use of clustering methods. Data clustering can also be used to generate user profiles based on information about the activity of each user and then for the formation of groups of users based on their profiles. But current clustering methods do not take into account all the factors for successful application in many real tasks. Instead, extensive experimentation with the pattern recognition methods is combined with intensive automatic machine learning which yields both approach and knowledge for resolving a lot of sophisticated problems. Adapting to a particular user is a difficult task because it is necessary to take into account inherent human uncertainty since man is unstable and spontaneous by nature. Many uncertainties of different kinds are present in the global network, for example, Web pages appear and disappear, the context of search keywords is constantly changing. Therefore, a search on the network is indistinct by its nature. In order to work with this uncertainty using computers it should be formalized in any way. Fuzzy logic allows us to take into account different kinds of uncertainties. Fuzzy clustering of the users' profiles can be used to construct fuzzy rules and conclusions in order to modify queries. The results can be used for knowledge extraction from users’ profiles for marketing purposes (Loia et al., 2010). However, the simple application of fuzzy logic to personalize the web may not lead to desired results, because it simplifies the actual picture of the user's preference. One of these simplifications is the usage of the weighted arithmetic mean operator for the aggregation of several criteria when each criterion is assigned a weight yet interaction between the criteria is not taken into account (Marichal, 2000b). The use of the Sugeno and Choquet fuzzy integrals with respect to fuzzy measure allows us to consider such interactions within the model (Grabisch, 1996). But in this case there is a problem of identification of fuzzy measures. This problem is caused not only by exponentially increasing complexity of fuzzy measures based on the criteria number, but by lack of clear understanding of the essence of fuzzy measure concept as well (Grabisch, 1996). Usually this problem is solved by means of significant simplification of the model and applying various support tools aimed for the identification of fuzzy measures (Takahagi, 2000; Wu & Zhang, 2010) or by identifying fuzzy measures without expert's presence. For example, the Sugeno fuzzy integral is used to aggregate the search results received from the various search engines on the Web (Cui & Feng, 2005). In this paper, an approach to personalization in the network using a recognition method based on aggregation of data sources involving the Sugeno and the Choquet fuzzy integrals is suggested. The difference between this approach and existing ones lies in the ability to use the Choquet integral without the necessary expert set of fuzzy measures (which is almost impossible in real
REFERENCES Bush, V. (1945). As We May Think. The Atlantic Monthly, 1, 101-108. Retrieved November 13, 2011, from http://www.ps.uni-saarland.de/~duchier/pub/vbush/vbush-all.shtml. Casas, R., Blasco Mar In, R., Robinet, A., Delgado, A., Yarza, A., Mcginn, J., Picking, R., et al. (2008). User Modelling in Ambient Intelligence for Elderly and Disabled People. Intelligence, 5105(Computers Helping People with Special Needs), 114-122. Springer. Retrieved from http://www.springerlink.com/index/5085668312150127.pdf Chiu, W. (2001). Web site personalization. Retrieved November 13, 2011 from http://www.ibm.com/developerworks/websphere/library/techarticles/hipods/personalize.html Cui, S. & Feng, B. (2005). A Fuzzy Integral Method to Merge Search Engine Results on Web. Lecture Notes in Computer Science. Computational Intelligence and Security. Springer Berlin / Heidelberg, pp. 731-736. Retrieved November 13, 2011, from http://dx.doi.org/10.1007/11596981_107. Devyatkov, V., Alfimtsev, A. (2008). Optimal Fuzzy Aggregation of Secondary Attributes in Recognition Problems. Proceedings of 16-th Int. Conf. in Central Europe on Comp. Graphics, Visual. and Computer Vision, Plzen, pp. 78–85. Grabisch, M., Roubens, M. (2000). Application of the Choquet Integral in Multicriteria Decision Making. In Fuzzy Measures and Integrals, Physica Verlag, pp. 415-434. Grabisch, M. (1996). The application of fuzzy integrals in multicriteria decision making. European Journal of Operational Research, 89(3), 445-456. Retrieved November 13, 2011, from http://www.sciencedirect.com/science/article/pii/037722179500176X Larbani, M., Huang, C.-Y., Tzeng, G.-h. (2011). A Novel Method for Fuzzy Measure Identification. International Journal of Fuzzy Systems. 13 (1), pp. 24-34. Loia, V., Nikravesh, M., & Zadeh, L. A. (2010). Fuzzy logic and the Internet. (p. 337). Berlin, Springer-Verlag. Germany. Marichal J.-L. (2000a). On Choquet and Sugeno Integrals as Aggregation Functions. Fuzzy Measures and Integrals, 40, pp. 247-272. Marichal, J.-L. (2000b). An axiomatic approach of the discrete Choquet integral as a tool to aggregate interacting criteria. IEEE Transactions on Fuzzy Systems, 8, 800-807. Retrieved November 13, 2011, from http://ikojadin.perso.univ-pau.fr/kappalab/pub/MarITFS2000.pdf Markellou, P., Rigou, M., & Sirmakessis, S. (2009). Web Personalization for E-Marketing Intelligence. In C. Ang, & P. Zaphiris (Eds.), Human Computer Interaction: Concepts, Methodologies, Tools, and Applications (pp. 2164-2180).
Marmanis, H., & Babenko, D. (2009). Algorithms of the Intelligent Web (p. 368). Greenwich, Manning Publications Co. USA. Sugeno, M. (1977). Fuzzy measures and fuzzy integrals: A survey. Fuzzy Automata und Decision Processes. North Holland, Amsterdam, pp. 89-102. Sullivan, D (2009). Google Now Personalizes Everyone’s Search Results. Retrieved November 13, 2011, from http://searchengineland.com/google-now-personalizes-everyones-search-results Ramasso, E., & Jullien, S. (2011). Parameter identification in Choquet Integral by the KullbackLeibler divergence on continuous densities with application to classification fusion. European Society for Fuzzy Logic and Technology. 1(1), pp. 132 – 139. Takahagi, E. (2000). On Identification methods of λ-fuzzy measures using weights and λ. Japanese Journal of Fuzzy Sets and Systems, 12(5), 665-676. Retrieved November 13, 2011, from http://www.isc.senshu-u.ac.jp/~thc0456/Efuzzyweb/mant2/fz_sou.pdf Verma, V., Verma, A. K., Bhatia, S. S. (2011) Comprehensive Survey of Framework for Web Personalization using Web Mining. International Journal of Computer Applications, 35(3), 2328. Retrieved May 9, 2012, from http://research.ijcaonline.org/volume35/number3/pxc3976066.pdf Wu, J. & Zhang, Q. (2010). 2-order additive fuzzy measure identification method based on diamond pairwise comparison and maximum entropy principle. Fuzzy Optimization and Decision Making, 9(4) , 435-453. Retrieved November 13, 2011, from http://dx.doi.org/10.1007/s10700-010-9086-x Yee, G. O., & Korba, L. (2008). Security Personalization for Internet and Web Services. International Journal of Web Services Research (IJWSR), 5(1), 1-23. doi:10.4018/jwsr.2008010101