On-the-fly Collaboration in Distributed Systems through. Service Semantic .... services provided on the network (broker); (ii) it supplies a service, publishing it on ...
On-the-fly Collaboration in Distributed Systems through Service Semantic Overlay∗ Devis Bianchini,Valeria De Antonellis, Michele Melchiori,Denise Salvi Università di Brescia Dip. Elettronica per l’Automazione Via Branze, 38 25123 Brescia - Italy
{bianchin|deantone|melchior|salvi}@ing.unibs.it ABSTRACT In the recent years distributed architectures and P2P technology have been adopted to better support effective collaboration among networked organizations. According to the P2P paradigm, many autonomous peers need to cooperate under highly dynamic conditions by sharing their resources (such as data and services), without sharing a common semantic model or ontology and without having a-priori knowledge about each other. In such a scenario, for effective collaboration, new advanced tools are required to support semantic representation and discovery of distributed resources such as data and services. In this paper we focus on semantic collaboration and service discovery in P2P systems. We propose the construction of a service semantic overlay over the P2P network, that is, a dynamic conceptual map across peers that provide similar services and constitute synergic service centers.
Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval
Keywords Service Semantic Discovery, P2P, Ontology-based matchmaking
1. INTRODUCTION In the recent years, service-oriented paradigm evolved towards P2P technologies to better support organization collaboration in dynamic and open environments. Many au∗This work has been partially supported by the ESTEEM PRIN Project (http://www.dis.uniroma1.it/∼esteem/index.html) and TEKNE FIRB Project (http://www.tekne-project.it/), funded by the Italian Ministry of Education, University and Research.
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. iiWAS2008, November 24-26, Linz, Austria Copyright 2008 ACM 978-1-60558-349-5/08/0011 ...$5.00.
tonomous peers need to cooperate by sharing their resources (such as data and services), without sharing a common semantic model or ontology and without having a-priori knowledge about each other. Moreover, peers collaborate under highly dynamic conditions, joining and leaving the network at any moment and the computation to search for services on the network should be distributed to different peers to ensure scalability. According to this view, in peerbased collaborative systems, effective communication and exchange of data and services under highly dynamic conditions are challenging activities. From the service perspective, availability of semantic representation is a key issue to support effective discovery and integration/composition of services. Ontologies provide the benefits of formal specifications and inference capabilities apt to improve semantic matching. For service analysis and automated matchmaking, we have proposed a methodology and the Semantic Driven Service Discovery (SDSD) approach characterized by a novel hybrid matchmaking and discovery environment [4]. However, in a P2P scenario, new advanced tools are required to support semantic representation and discovery of distributed resources. In literature, P2P semantic-driven resource discovery has attracted much attention from Web services and Semantic Web area and relies on several efforts in related research fields, such as data integration and emergent semantics in P2P environments [3] to go beyond limitations of centralized architectures. Recent works involved structured [6] and unstructured P2P architectures [1]. The absence of any global ontology to overcome semantic heterogeneity has been faced recurring to centralized data structures to classify peer registries [7], to mediation-based architectures, where mediator super-peers store available services and have their own reference ontologies, to the adoption of semantic communities, intended as sets of peers that share similar interests and reach a semantic agreement on shared resources [2]. Our approach focuses on unstructured architectures without constraining to global ontologies/registries or mediationbased architectures. In particular, we propose the construction of a service semantic overlay over the logical network, that is a conceptual dynamic model of peers that provide similar services and constitute synergic service centres. In this paper we emphasize the role of the peer knowledge model in service discovery. The paper is organized as follows: Section 2 describes the service semantic overlay and how to apply semantic matchmaking techniques to establish it; Section 3 illustrates how to maintain the service semantic
overlay in a dynamic P2P-based environment; the service discovery process based on the semantic overlay is sketched in Section 4 with the help of some examples extracted from the application scenario, while Section 5 presents concluding remarks and future work.
1.1 Application Scenario Let consider an application scenario, where healthcare organizations (laboratories, drug-stores, research centres, hospitals) and doctors collaborate to provide useful services (i.e., drug ordering, diagnosis services, on-line consultation) in emergency situations, in which patient care services must be quickly found (for example, for anaphylactic shock treatment), but peers keep joining and leaving the network (for example, a doctor connects and disconnects from the network with his/her laptop). Organizations and doctors constitute the nodes of a P2P network, where each peer can play two roles: (i) it makes available discovery functionalities for services provided on the network (broker); (ii) it supplies a service, publishing it on a broker (provider). Brokers and providers can also act as requesters, searching for services on the network. Brokers are in charge of establishing and maintaining semantic links towards other brokers and performing service search.
2. PEER KNOWLEDGE AND THE SERVICE SEMANTIC OVERLAY In the P2P-SDSD framework independent peers cooperate in a common domain by sharing services without any a-priori reciprocal knowledge. In such a scenario, no centralized authorities are defined to provide a comprehensive view of the shared services in the collaborative network. Each peer has local knowledge about the services it handles and local P2P knowledge about peers in the network with similar services.
2.1 Peer knowledge conceptual model Each collaborative peer in the network has a local knowledge infrastructure constituted by: (i) UDDI Registry, where services are published together with their WSDL document and service providers register themselves; (ii) Peer Ontological Knowledge, that provides a conceptualization of abstract service operations and I/O parameters through a given domain ontology; a conceptualization of service categories through a Service Category Taxonomy (SCT), extracted from available standard taxonomies (e.g., UNSPSC, NAiCS) and a conceptualization of peer neighborness through semantic overlay. In the UDDI Registry, a Provider peer is registered as businessEntity, that provides businessServices, that are associated to corresponding WSDL document through the tModel mechanism. WSDL standard distinguishes among the abstract service interface (operations and I/O parameters) and concrete bindings. From the abstract service interface a semantic description (Semantic service interface class) is generated, where the operations and I/O parameters are concepts of the domain ontology. Mappings between the operations and I/O parameters in the WSDL interface and the corresponding concepts in the semantic service interface are maintained. Finally, each semantic service interface is classified into Service Categories defined in the SCT. Semantic service interfaces are related to each other according to semantic links between similar services. In particular, an Intra-peer semantic link relates a pair of similar
services published on the same peer, while an Inter-peer semantic link relates a locally published service with a similar service published on a Semantic neighbor of the peer. In both cases, semantic links are established by applying semantic matchmaking techniques [4] and are labeled with matching information, as explained in the next sections. The peer ontological knowledge is represented using OWL-DL language. The conceptual model of the peer knowledge infrastructure is shown in Figure 1.
Figure 1: Peer knowledge infrastructure conceptual model.
2.2
Matching techniques for semantic link definition
Each broker is endowed with an ontology-based matchmaker used to support service discovery and semantic link definition. In [4] we defined a hybrid matchmaking strategy. In the hybrid model, a deductive matchmaking model is combined with a similarity-based model to compare services on the basis of their semantic-enriched functional interface. The SCT and the domain ontology are exploited by the matchmaker to identify matching services. Moreover, the domain ontology is augmented by a thesaurus containing terms that are related by terminological relationships (as synonymy or hypernymy) to the names of concepts and categories. By means of the thesaurus, matching capabilities based on the domain ontology and categories are extended. The deductive matchmaking is used to qualify the kind of match MatchType(S1 ,S2 ) between two semantic service interfaces S1 and S2 . According to this matchmaking model, it is possible to state if S1 and S2 provide the same functionalities (S1 Exact S2 ), if S1 provides additional functionalities with respect to S2 (S1 Extends S2 ) or viceversa, if there is a non empty intersection between functionalities provided by S1 and S2 (S1 Intersects S2 ) or if S1 and S2 have nothing in common (S1 Mismatch S2 ). In case of partial overlapping among service functionalities (Extends or Intersects) the similarity-based matchmaking model is used to quantify service similarity Sim(S1 , S2 ) ∈ [0, 1] through coefficients properly defined to compare service interfaces. Otherwise, if S1 Exact S2 or S1 Mismatch S2 , Sim(S1 , S2 ) = 1.0 or Sim(S1 , S2 ) = 0.0, respectively. Two semantic service interfaces are similar (denoted with S1 ≈S2 ) when MatchType(S1 ,S2 ) is not Mismatch and Sim(S1 , S2 )≥δ, where δ is a similarity threshold experimen-
tally set. More details about the hybrid matchmaking model and the combined use of ontologies and thesaurus with experimental validation can be found in [4].
2.3 On-the-fly semantic collaboration and service semantic overlay definition The semantic service interfaces and the set of semantic links between them constitute the service semantic overlay built on the logical network overlay, that is, the P2P network. Each broker has its own local view (the links to semantic neighbors) on the semantic overlay. The service semantic overlay can be seen as a global collaboration view that dynamically emerges from local on-the-fly interactions between peers. In fact, we define a probe collaboration model in which a probe service query is submitted by a peer with the intention to find peers with semantically related services. Example 2.1. Figure 2 shows an example of service semantic overlay as viewed by a peer PX . Peer PX recognizes PY as its semantic neighbor through a sequence of local interactions between PX and PY . In fact, to find semantic neighbors, the peer PX sends a probe service request for each service SiX published on it to other peers connected at the logical network overlay (in the example, PY and PZ ). The probe service request contains the interface of SiX and the IP address of PX . The peer PY and PZ match the probe service request against their own semantic service interfaces by applying the matching techniques and obtain for each comparison the MatchType and the similarity value. If the MatchType is not Mismatch and the similarity value exceeds the threshold δ, an inter-peer semantic link can be established. In this example, only PY finds a semantic service interface matching the request (in this example, S2Y ) and replies to PX sending only S2Y together with the MatchType and the similarity value. PX establishes an inter-peer semantic link, denoted with islPX →PY (S3X , S2Y ), and PY is recognized as a semantic neighbor of PX .
Figure 2: A portion of the service semantic overlay.
overlay. Logical network overlay evolution. The logical network overlay collects all the peer participating to the collaborative network: each node represents a peer and each link is a logical connection between two peers. In order to guarantee connection between peers an Overlay Management Protocol (OMP) is used, which defines specific procedures to join, leave and modify the logical network overlay. In our approach a shuffling-based OMP is chosen in order to allow more effective information diffusion among peers [8]. As shown in the evolution statechart in Figure 3, when peer PX joins the collaborative network at the logical level for the first time (State 1-2), it receives a list of logical neighbors. Through the probe mechanism, PX sends a probe service request to each logical neighbor PY . If PX does not receive any reply, no inter-peer semantic links are established and PX remains connected only at the logical overlay (State 2). Otherwise, PX establishes inter-peer semantic links with its semantic neighbors, to which it is connected at the service semantic overlay (state 3). Probe service requests are sent according to a Time-To-Live (TTL) mechanism with a low TTL value to avoid network overload.
Figure 3: UML statechart describing the logical and semantic overlay evolution. Service semantic overlay evolution. The service semantic overlay has to be managed with respect to events that occur to the collaborative peers: acquisition of a new peer in the logical network overlay, disconnection of a semantic neighbor from the network, publication/cancellation of a service in a registry. The shuffling mechanism acts at the logical network overlay, but its effects are propagated at the service semantic overlay. If a peer PX , connected at the service semantic overlay (state 3), acquires new logical neighbors that are not yet among its semantic neighbors, it activates the probe mechanism to check if the new logical neighbors are also semantic neighbors. Depending on the replies to probe requests, new inter-peer semantic links are established for alignment. A peer PX recognizes that a semantic neighbor PY becomes unavailable if a given number of messages sent to PY are not answered. In this case the inter-peer semantic links toward semantic service interfaces published on PY are removed from PX . Publication/cancellation of a service are managed in a similarly way.
3. NETWORK EVOLUTION AND SERVICE SEMANTIC OVERLAY ALIGNMENT
4.
SERVICE DISCOVERY AND SERVICE SEMANTIC OVERLAY EXPLOITATION
The P2P network evolves through time with peers joining or leaving. Evolution occurs both at logical and semantic
A service request SR is formulated by specifying one or more service categories, the desired outputs and operations
and the inputs that the requester is able to provide for service execution. It is sent to a peer PX in the collaborative network and it is matched against semantic service interfaces published on PX (local search). After performing local search, PX starts the distributed search forwarding the request SR to its semantic neighbors with respect to S3X and S1X , that is, semantic service interfaces that match locally. If no matching service has been found locally, PX selects randomly a set of peers connected at the logical network overlay and forwards SR to them. Distributed search can be performed by applying different forwarding policies based on inter-peer semantic links; the request is propagated following inter-peer semantic links until the search stops according to a Time To Live mechanism. In Figure 4 a minimal policy is depicted. In this case, a request SR sent to peer PX partially matches S3X . Peer PX forwards SR to semantic neighbors (PY and PZ ) with respect to S3X . If no semantic neighbors exist for any matching service SiX on peer PX , the request is forwarded to one of the peers that are connected to PX in the logical network overlay (randomly chosen). In this example, SR is sent to PY and PZ . On peer PZ the service S1Z completely satisfies the request (by applying the matching model) and there is no need to further propagate SR . On peer PY the request is only partially satisfied by S2Y , but semantic neighbors of PY (in this case, PH ) do not add further capabilities with respect to S2Y and also in this case the request is not further propagated.
R
e
S
q
u
e
s
t
R
Y B
o
p
:
g
e
t
D
r
u
g
D
o
s
a
g
e
r
2
k
e
r
P
Y
n
:
A
n
a
p
h
y
l
a
c
t
i
c
S
h
o
c
k
;
D
r
u
g
;
o
o
o
;
S
i
u
t
:
D
o
s
a
g
e
;
I
n
t
e
r
a
c
t
i
o
n
;
S
i
d
e
E
f
f
e
c
t
s
p
:
g
e
t
P
h
a
r
m
a
c
e
u
t
i
c
a
l
P
r
e
p
a
r
a
t
i
o
n
;
Y S
P
o
s
o
l
o
g
i
a
s
1
;
Y S
i
3
n
:
o X S
u
D
t
i
:
s
e
D
a
o
s
s
e
a
;
g
P
e
h
;
a
S
r
i
m
d
a
e
E
c
f
f
e
e
u
c
t
i
t
c
s
a
l
P
r
e
p
a
r
a
t
i
o
n
;
;
R
3
S
S I
N
T
E
R
S
E
C
T
2
Y
E
X
T
E
N
D
S
S
3
X
S
I
S
(
i
m
=
0
.
9
1
n
t
e
r
a
c
t
i
o
n
;
)
X B
r
S
o
3
k
e
r
P
X
S
5
E
X
A
C
T
S
2
Y
H o
p
:
g
e
t
D
r
u
g
D
o
s
e
; X S
1
S
( i
i
m
=
1
.
0
)
n X :
o
D
u
t
i
:
D
s
e
o
a
s
s
a
e
g
;
e
D
;
r
I
u
n
t
g
;
e
r
S
a
c
t
i
o
n
2
;
H B
S
1
Z
N
I
T
E
R
S
E
C
T
S
S
3
r
i
m
=
0
.
7
4
o
k
e
r
P
X
S
S
(
5
H
)
S
o
p
:
g
e
t
D
r
u
g
D
o
s
a
g
e
1
H
;
S
i
2
H
n
:
D
i
s
e
a
s
e
;
D
r
u
g
;
S
3
H
Z B
r
o
k
e
r
P
S o
S
1
u
t
:
D
o
I
o
i
p
:
a
g
e
;
S
i
d
e
E
f
f
e
c
t
s
4
H
;
g
e
t
D
r
u
g
Q
u
a
n
t
ti
y
n
t
e
r
a
c
t
i
o
n
;
;
n
:
o
s
Z
u
D
t
i
:
s
e
D
a
r
s
u
e
g
;
;
D
o
s
a
g
e
;
I
n
t
e
r
a
c
t
i
o
n
;
T B
S
4
N
I
T
E
R
S
E
C
T
S
T
S
1
r
o
k
e
r
P
Z S
4
T
S S
(
i
m
=
0
.
7
8
1
T
)
o
p
:
g
e
t
D
r
u
g
Q
u
a
n
t
ti
y
;
S
i
2
T
n
:
D
i
s
e
a
s
e
;
D
r
u
g
;
e
r
S
o
i
n
t
e
r
D
p
e
e
r
s
e
m
a
n
t
i
c
l
i
n
u
t
:
D
o
s
a
g
e
;
I
n
t
a
c
t
i
o
n
3
T
;
k
Figure 4: Distributed search. According to the minimal policy, search over the semantic overlay stops when matching services which fully satisfy the request have been found. Exhaustive policies can be applied following the same rules, but the search does not stop when matching services that fully satisfy the request are found: the request SR is forwarded to semantic neighbors to find other services that could present, for example, better non functional features. In [5] a detailed presentation of different forwarding rules based on inter-peer semantic links is provided. Note that without the semantic overlay, the discovery process would rely on conventional P2P infrastructures and associated routing protocols for query propagation in the network (e.g., flooding). Exploiting the semantic overlay, it is possible to enforce query forwarding according to content similarities rather than to the mere network topology.
5.
CONCLUSION
In this paper, we proposed the P2P-based Semantic Driven Service Discovery (P2P-SDSD) framework to enable cooperation and communication based on a semantic overlay that organizes semantically the P2P-integrated knowledge space and emerges from local interactions between peers. The semantic overlay can be seen as a continuously evolving conceptual map across collaborative peers that provide similar services and constitute synergic service centres in a given domain. The semantic overlay enables effective similaritybased service search and optimization strategies are defined for request propagation over the P2P network keeping low the generated network overload. For a preliminary evaluation of our approach, experiments are being performed [5] to demonstrate the better recall results of our distributed search with respect to other approaches, both using floodingbased strategies and applying a service semantic overlay, and to confirm that the use of the P2P-SDSD request forwarding policy results in an improved scalability.
6.
REFERENCES
[1] F. Banaei-Kashani, C. Chen, and C. Shahabi. WSPDS: Web Services Peer-to-Peer Discovery Service. In Proc. of the 5th Int. Conference on Internet Computing (IC’04), pages 733–743, Las Vegas, Nevada, USA, 2004. [2] B. Benatallah, M.S. Hacid, H.Y. Paik, C. Rey, and F. Toumani. Towards semantic-driven, flexible and scalable framework for peering and querying e-catalog communities. Information Systems, 31(4-5):266–294, 2008. [3] P. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, and I. Zaihrayeu. Data Management for Peer-to-Peer Computing: A Vision. In Proc. of the 5th International Workshop on the Web and Databases (WebDB’02), pages 177–186, Madison, Wisconsin, 2002. [4] D. Bianchini, V. De Antonellis, and M. Melchiori. Flexible Semantic-based Service Matchmaking and Discovery. World Wide Web Journal (http://www.springerlink.com/content/ 4514473142836174/), 2008. [5] D. Bianchini, V. De Antonellis, M. Melchiori, and Denise Salvi. A Semantic Overlay for Service Discovery Across Web Information Systems. In Proc. of 9th Int. Conference on Web Information Systems Engineering WISE2008, Auckland, New Zealand, September 2008. [6] C. Schmidt and M. Parashar. A Peer-to-Peer Approach to Web Service Discovery. World Wide Web Journal, 7(2):211–229, 2004. [7] K. Verma, K. Sivashanmugam, A. Sheth, A. Patil, S. Oundhakar, and J. Miller. METEOR-S WSDI: A Scalable Infrastructure of Registries for Semantic Publication and Discovery of Web Services. Journal of Information Technology and Management, Special Issue on Universal Global Integration, 6(1):17–39, 2005. [8] S. Voulgaris, D. Gavidia, and M. van Steen. CYCLON: Inexpensive Membership Management for Unstructured P2P Overlays. Journal of Network and Systems Management, 13(2):197–217, 2005.