M. Younan, S. Khattab, and R. Bahgat, "WoTSF: A Framework for Searching in the Web of Things," in INFOS 2016 : The 10th International Conference on Informatics and Systems, ACM, Cairo , Egypt, May, 2016.
WoTSF: A Framework for Searching in the Web of Things
M. Younan, S. Khattab, and R. Bahgat
May 11th, 2016
Agenda
1 • Introduction
2 • Related Work
3 • The Architecture of the WoTSF
4 • The Implementation of the WoTSF
5 • Experimental Evaluation
6 • Conclusion and Future Work
INTRODUCTION
Introduction (WSN -> WoT)
• Wireless Sensor Network (WSN)
• Daily-life integration of embedded devices
  – Converts things into Smart Things (SThs)
  – The number of devices reaches the order of billions in 2020.
  – No. of users < no. of devices.
• The Internet of Things (IoT)
  – Huge sensory data (data stream).
• The Web of Things (WoT)
  – Current web tools and services.
• Contribution idea:
  – Searching in the WoT increases its popularity.
Introduction
• Features of the WoT (Searching)
  – Different formats.
  – Non-standardized naming.
  – Huge and dynamic sensory data.
  – Daily-life integration (normal user):
    • Simple query language
    • Real-time queries
    • Interested in high-level knowledge (summary)
[Figure labels: Crawling; Indexing and Searching]
• Main Points:
  – Crawling
    • LWoTSEs’ data –> following the features of the WoT dataset presented in [1][2].
  – Indexing
    • WoTSF: a Framework for searching in the WoT (high-level indices)
RELATED WORK
Related Work (Selection)
Traditional search engines
o Google optimizations [3]
o AJAX Crawler [4]
DiscoWoT (Mayer and Guinard, 2011) [5].
Shodan SE (www.shodanhq.com, 2015) [6].
Dyser SE (Ostermaier et al., 2010) [7].
Zhang et al. (2015) present a framework for distributed range-query search [8].
Dyser
• Dyser uses a periodic pattern (L, O, W, P) for each sensor, where
  – L: period length
  – O: offset
  – W: sensor output
  – P: probability value
  E.g., (7-Days, 2, empty, 0.5)
• The expansion of such a periodic pattern is (K*L + O).
  For example, if the crawling process starts on 20-4, then the index record is (Occ., ID_01, 25-4-2016, empty, 0.5).
Su Mo Tu We Th Fr Sa
17 18 19 20 21 22 23
24 25 26 27 28 29 30
The number of patterns recorded in the Dyser index will be >= the number of SThs.
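The expansion (K*L + O) can be sketched as a small date calculation. This is only an illustration, not Dyser's code: the function name, the `anchor` parameter, and the convention that offset 1 falls on Sunday 17-4-2016 are assumptions chosen so that the slide's example (crawl on 20-4, record dated 25-4-2016) comes out.

```python
from datetime import date, timedelta


def next_occurrence(crawl_start: date, period_days: int, offset: int, anchor: date) -> date:
    """First date on or after crawl_start of the form anchor + K*L + (O - 1) days.

    anchor is the assumed day that corresponds to offset 1 (a Sunday here).
    """
    first = anchor + timedelta(days=offset - 1)  # occurrence for K = 0
    if crawl_start <= first:
        return first
    elapsed = (crawl_start - first).days
    k = -(-elapsed // period_days)  # ceiling division: smallest K reaching crawl_start
    return first + timedelta(days=k * period_days)


# Pattern (7-Days, 2, empty, 0.5), crawling starts on 20-4-2016.
anchor = date(2016, 4, 17)  # assumed: Sunday, offset 1
record_date = next_occurrence(date(2016, 4, 20), 7, 2, anchor)
print(("Occ.", "ID_01", record_date.isoformat(), "empty", 0.5))
```

With these assumptions the record date is Monday 25-4-2016, matching the index record on the slide.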
Dyser Index -> WoTSF Index
Dyser index (one pattern per STh, offsets 1 to 7):
S01: (7-Days, 1, empty, 0.8)
S02: (7-Days, 1, empty, 0.8)
S03: (7-Days, 2, empty, 0.5)
S04: (7-Days, 2, empty, 0.9)
S05: (7-Days, 2, empty, 0.6)
S06: (7-Days, 3, empty, 0.7)
S07: (7-Days, 3, empty, 0.2)
S08: (7-Days, 3, empty, 0.4)
S09: (7-Days, 3, empty, 0.9)
S10: (7-Days, 4, empty, 0.3)
S11: (7-Days, 5, empty, 0.8)
S12: (7-Days, 6, empty, 0.6)
S13: (7-Days, 7, empty, 0.5)
S14: (7-Days, 7, empty, 0.7)
WoTSF: High Level Index (using aggregation function ‘Max.’):
WoT01: (7-Days, 1, empty, 0.8, S01)
WoT01: (7-Days, 1, empty, 0.8, S02)
WoT01: (7-Days, 2, empty, 0.9, S04)
WoT01: (7-Days, 3, empty, 0.9, S09)
WoT01: (7-Days, 4, empty, 0.3, S10)
WoT01: (7-Days, 5, empty, 0.8, S11)
WoT01: (7-Days, 6, empty, 0.6, S12)
WoT01: (7-Days, 7, empty, 0.7, S14)
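The step from the Dyser index to the high-level index can be reproduced with a short sketch. It assumes that 'Max' is applied per (period, offset, state) group and that ties are all kept, which is what the S01/S02 pair on the slide suggests; the function and variable names are illustrative only.

```python
from collections import defaultdict

# Dyser-style patterns from the slide: STh id -> (period, offset, state, probability)
dyser_index = {
    "S01": ("7-Days", 1, "empty", 0.8), "S02": ("7-Days", 1, "empty", 0.8),
    "S03": ("7-Days", 2, "empty", 0.5), "S04": ("7-Days", 2, "empty", 0.9),
    "S05": ("7-Days", 2, "empty", 0.6), "S06": ("7-Days", 3, "empty", 0.7),
    "S07": ("7-Days", 3, "empty", 0.2), "S08": ("7-Days", 3, "empty", 0.4),
    "S09": ("7-Days", 3, "empty", 0.9), "S10": ("7-Days", 4, "empty", 0.3),
    "S11": ("7-Days", 5, "empty", 0.8), "S12": ("7-Days", 6, "empty", 0.6),
    "S13": ("7-Days", 7, "empty", 0.5), "S14": ("7-Days", 7, "empty", 0.7),
}


def build_high_level_index(network_id, patterns):
    """Keep, per (period, offset, state) group, only the records with the maximum probability."""
    groups = defaultdict(list)
    for sth_id, (period, offset, state, prob) in patterns.items():
        groups[(period, offset, state)].append((prob, sth_id))
    index = []
    for (period, offset, state), members in sorted(groups.items()):
        best = max(prob for prob, _ in members)
        for prob, sth_id in sorted(members, key=lambda m: m[1]):
            if prob == best:  # ties are kept (assumption based on S01/S02)
                index.append((network_id, period, offset, state, prob, sth_id))
    return index


high_level = build_high_level_index("WoT01", dyser_index)
for record in high_level:
    print(record)
```

The 14 device-level patterns collapse to 8 network-level records, one per offset plus the tie at offset 1.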
THE PROPOSED WOTSF ARCHITECTURE
The WoTSF Architecture
• Crawling (WoT)
  – Google optimizations (server-root file)
  – Saves time:
    • crawls fewer pages
    • parses fewer pages (union format)
[Figure: Query life cycle of the WoTSF; label: (High Level)]
The WoTSF Architecture (Query Processing)
• Generate sub-queries
• Select WoTSEs APIs
• Send sub-queries
[Figure: Query -> sub-queries (q2) -> Master Indices (key: sensor type) -> WoT_1 M.Ds, WoT_2 M.Ds, WoT_3 M.Ds, … -> Ind. -> DS_1, DS_2, DS_3, …; steps labeled a, b, c, d]
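The three query-processing steps can be sketched as follows. Everything concrete here is hypothetical: the dictionary layout of the master index, the sample networks and SThs, and the `wotse_search` stand-in for a WoTSE API call are assumptions made for illustration, not the prototype's actual interfaces.

```python
# Hypothetical master index keyed by sensor type (the slide's "Key: sensor type").
MASTER_INDEX = {
    "occupancy": ["WoT_1", "WoT_2"],
    "temperature": ["WoT_3"],
}

# Hypothetical local data per network: STh id -> (sensor type, current state).
NETWORK_DATA = {
    "WoT_1": {"S01": ("occupancy", "empty"), "S02": ("temperature", "21")},
    "WoT_2": {"S03": ("occupancy", "busy")},
    "WoT_3": {"S04": ("temperature", "19")},
}


def wotse_search(network, sub_query):
    """Stand-in for a WoTSE API call; a real one would be a remote request."""
    return [
        (network, sth_id, state)
        for sth_id, (sensor_type, state) in NETWORK_DATA[network].items()
        if sensor_type == sub_query["sensor_type"]
    ]


def process_query(sensor_types):
    results = []
    for sensor_type in sensor_types:
        sub_query = {"sensor_type": sensor_type}             # generate sub-queries
        for network in MASTER_INDEX.get(sensor_type, []):     # select WoTSE APIs
            results.extend(wotse_search(network, sub_query))  # send sub-queries
    return results


print(process_query(["occupancy"]))
```

Only the networks listed in the master index for a sensor type are contacted, which is the point of keeping that index at the WoT level.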
The WoTSF Architecture (Index Structure)
• WoT-Level (WoTSF): Master Indices (MI)
• SThs-Level (individual WoTSEs): Secondary Indices (SI)
[Figure: index structure; labels: MI, SI, D, A, R, S, WWW, Dynamic Pages, Storage]
The WoTSF Implementation (Ranking)
• Sensor state probability
  $SS(i) = K \cdot CS(i) + (1 - K) \cdot P(i)$
  K -> constant (0:1), CS -> current state, P -> probability
• Entity state probability
  $ES(i) = \frac{1}{n} \sum_{j=1}^{n} SS(j)$
  n -> number of sensors
  $ES(i) = \sum_{j=1}^{n} D(j) \cdot SS(j)$
  D(j) -> impact factor of sensor j on ES(i)
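A direct transcription of the two ranking formulas, as a minimal sketch. The function names and the sample values are illustrative; the slide does not say how K or the impact factors D(j) are chosen, so they are plain parameters here.

```python
def sensor_state(k, current_state, probability):
    """SS(i) = K * CS(i) + (1 - K) * P(i), with K between 0 and 1."""
    return k * current_state + (1 - k) * probability


def entity_state(sensor_states, impact=None):
    """ES(i): plain mean of SS(j), or the D(j)-weighted sum when impact factors are given."""
    if impact is None:
        return sum(sensor_states) / len(sensor_states)
    return sum(d * ss for d, ss in zip(impact, sensor_states))


# Illustrative values: two sensors attached to one entity, K = 0.5.
ss = [sensor_state(0.5, 1, 0.8), sensor_state(0.5, 0, 0.6)]
print(ss)
print(entity_state(ss))
print(entity_state(ss, impact=[0.75, 0.25]))
```

With equal impact factors D(j) = 1/n the weighted sum reduces to the plain mean, so the two forms of ES(i) agree in that case.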
THE WOTSF IMPLEMENTATION
The WoTSF Implementation (Crawling)
Crawling levels:
– WoTSE
– WoTSF
[Flowchart (a) Sensor: Start -> Wait Period (T) -> Read Cur_Value -> Compare Cur_Value with previous value (range) -> Change? Yes -> Send Cur_Value]
[Flowchart (b) Gateway: Start -> Listen to Sensors -> Receive new_value; boxes: SrVal_Stability(Sr, new_value) = (cur_rec_time - last_rec_time(Sr)) / interval; Calculate Entity State; newChange(EoI, State); Add new_value; no. of records > N; Replace(oldest_value, new_value); no. of consecutive changes > M; Update(EoI, State); Update prediction model (Sr)]
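A few of the flowchart boxes can be sketched in code. This is an interpretation, not the authors' implementation: treating "(range)" as an absolute tolerance around the previous value, and using a fixed-size buffer for the "no. of records > N" replacement rule, are both assumptions; the function names are illustrative.

```python
from collections import deque


def has_changed(cur_value, prev_value, tolerance):
    """Sensor side: report only when the reading leaves the tolerated range (assumed meaning of 'range')."""
    return abs(cur_value - prev_value) > tolerance


def value_stability(cur_rec_time, last_rec_time, interval):
    """Gateway side: SrVal_Stability = (cur_rec_time - last_rec_time(Sr)) / interval."""
    return (cur_rec_time - last_rec_time) / interval


def add_reading(history, new_value):
    """Gateway side: once more than N records exist, the oldest value is replaced.

    history must be a deque created with maxlen=N, which drops the oldest item on append.
    """
    history.append(new_value)
    return history


history = deque(maxlen=3)  # N = 3, illustrative
for value in [20.0, 20.1, 21.5, 23.0]:
    add_reading(history, value)
print(list(history))
print(has_changed(21.5, 20.1, 0.5), value_stability(120, 60, 30))
```

Sending a value only on change keeps the sensor quiet while its reading is stable, and the time between two received values then tells the gateway how stable the previous value was.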
The WoTSF Implementation (Indexing)
• The WoTSF is evaluated using a real dataset:
  – We use the Kth percentile (historical data) -> a single record per time unit.
  – E.g., suppose a temperature sensor reads the values {5, 20, 21, 20, 20, 21, 23, 20, 21, 23}; then the average is 19.4 and the 50th percentile is 20.
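The example values can be checked with a short sketch. It assumes the nearest-rank definition of the percentile, which yields 20 for these readings; an interpolating median would give 20.5 instead, so the slide's figure appears to follow nearest-rank. The function name is illustrative.

```python
def kth_percentile(values, k):
    """Nearest-rank percentile: the value at rank ceil(k * n / 100) in sorted order (k is an integer)."""
    ordered = sorted(values)
    rank = max(1, -(-k * len(ordered) // 100))  # integer ceiling division
    return ordered[rank - 1]


readings = [5, 20, 21, 20, 20, 21, 23, 20, 21, 23]
average = sum(readings) / len(readings)
print(average, kth_percentile(readings, 50))
```

The outlier reading of 5 pulls the average down to 19.4, while the 50th percentile stays at a value the sensor actually reported most of the time, which is why a percentile is a reasonable single record per time unit.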
The WoTSF Implementation (Indexing)
• The WoTSF is evaluated using random values:
  – prediction model (‘7-Days’)
  – SThs of type ‘occupancy’ in 10 WoT networks -> 10,000 SThs
  – aggregation function ‘Max’ (empty room)
  – expanding the quadruple predictions
• The prediction model type -> periodical crawling processes.
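A rough simulation of this setup is sketched below. It is not the authors' experiment: the even split of 1,000 SThs per network, one pattern per STh, uniformly random offsets, probabilities rounded to one decimal, and the fixed seed are all assumptions made so the sketch is reproducible.

```python
import random


def simulate(num_networks=10, sths_per_network=1000, seed=0):
    """Return (Dyser index size, WoTSF index size) for randomly generated '7-Days' patterns."""
    rng = random.Random(seed)
    dyser_size = 0
    wotsf_size = 0
    for _ in range(num_networks):
        best = {}  # offset -> (max probability so far, number of SThs reaching it)
        for _ in range(sths_per_network):
            offset = rng.randint(1, 7)
            prob = round(rng.random(), 1)
            dyser_size += 1  # Dyser keeps one record per STh pattern
            top, count = best.get(offset, (-1.0, 0))
            if prob > top:
                best[offset] = (prob, 1)
            elif prob == top:
                best[offset] = (prob, count + 1)
        # WoTSF keeps only the 'Max' records per offset (ties included)
        wotsf_size += sum(count for _, count in best.values())
    return dyser_size, wotsf_size


dyser_size, wotsf_size = simulate()
print(dyser_size, wotsf_size)
```

Under these assumptions the device-level index grows with the number of SThs, while the high-level index is bounded by the number of networks, offsets, and ties at the maximum.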
EXPERIMENTAL EVALUATION
The WoTSF Implementation (Indexing)
• WoTSF index (b)
  – Count -> Max reading (WoT level)
• Dyser index (a)
  – Count -> Max reading (STh level)
• Case: distinct SThs values
[Figure panels: (a) Local search engine; (b) WoTSF]
The WoTSF Implementation (Searching)
[Figure panels: (a), (b), (c)]
• The WoTSF prototype:
  – implements autosuggestions like Dyser.
  – filters buildings (static part) -> WoTSEs’ APIs
  – Evaluation is done on the dynamic part of the query (WoTSEs) using:
    • Building-Level (Master Indices)
    • SThs-Level (Secondary Indices)
The WoTSF Evaluation (Index size)
Experimental Evaluation (WoTSF & Dyser)
• Index Size
Experimental Evaluation (WoTSF & Dyser)
• Processing Time
  – WoTSF indices (High Level)
  – Dyser indices
Experimental Evaluation (WoTSF & Dyser)
• Result Accuracy
  $\text{Accuracy} = \frac{\text{Correct Values (true positives)}}{\text{Total Retrieved (false and true positives)}}$
  $\text{Consistency} = \frac{\text{Intended Retrieved (true positives)}}{\text{Total Intended (true positives + false negatives)}}$
Each building represents a single WoT network and hosts an occupancy sensor in each room.

Building ID   Room ID   State   Prediction Value
100           101       Empty   0.8
100           102       Empty   0.4
100           103       Empty   0.7
200           201       Empty   0.3
200           202       Empty   0.5
200           203       Empty   0.4

Search Engine   Index Size
WoTSF           2
Dyser           6
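The accuracy and consistency ratios on this slide can be computed from two sets, as in the sketch below. The set-based formulation and the sample "intended" set are assumptions for illustration; the slides do not give ground-truth data for the six rooms.

```python
def accuracy(retrieved, intended):
    """Correct values (true positives) / total retrieved (true + false positives)."""
    retrieved, intended = set(retrieved), set(intended)
    return len(retrieved & intended) / len(retrieved)


def consistency(retrieved, intended):
    """Intended retrieved (true positives) / total intended (true positives + false negatives)."""
    retrieved, intended = set(retrieved), set(intended)
    return len(retrieved & intended) / len(intended)


# Hypothetical ground truth: rooms that really are empty at query time.
intended = {101, 103, 202}
retrieved = {101, 202}  # e.g., one top room per building
print(accuracy(retrieved, intended), consistency(retrieved, intended))
```

In this made-up case every returned room is correct (accuracy 1.0), but one intended room is missed, so consistency is 2/3. Both functions assume non-empty sets.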
The WoTSF Evaluation (Index size)
WoTSF and DSE search results (number of results) according to different query types.

Search Engine   all   First K-Result = 4   Probability >= 50%
WoTSF           2     2                    2
DSE             6     4                    3

WoTSF and DSE search results (list of rooms) according to different query types (all, First K-Result = 4, Probability >= 50%).

Search Engine   Result     Prediction Value (all)
WoTSF           Room 101   0.8
WoTSF           Room 202   0.5
DSE             Room 101   0.8
DSE             Room 103   0.7
DSE             Room 202   0.5
DSE             Room 102   0.4
DSE             Room 203   0.4
DSE             Room 201   0.3
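The result counts for the three query types can be reproduced from the six-room example. This is a sketch under assumptions: the function names are illustrative, the WoTSF index is taken to be the 'Max' room per building, and ties in the ranking keep their input order.

```python
# Prediction values from the example: building -> {room: probability of 'Empty'}
ROOMS = {
    100: {101: 0.8, 102: 0.4, 103: 0.7},
    200: {201: 0.3, 202: 0.5, 203: 0.4},
}


def dse_index(rooms):
    """Device-level index: one record per room (STh)."""
    return [(room, prob) for building in rooms.values() for room, prob in building.items()]


def wotsf_index(rooms):
    """Network-level index: the 'Max' room per building."""
    return [max(building.items(), key=lambda kv: kv[1]) for building in rooms.values()]


def run_queries(index, k=4, threshold=0.5):
    ranked = sorted(index, key=lambda record: record[1], reverse=True)  # stable sort
    return {
        "all": ranked,
        "first_k": ranked[:k],
        "threshold": [record for record in ranked if record[1] >= threshold],
    }


for name, index in (("WoTSF", wotsf_index(ROOMS)), ("DSE", dse_index(ROOMS))):
    results = run_queries(index)
    print(name, {query: len(rows) for query, rows in results.items()})
```

The counts come out as 2, 2, 2 for WoTSF and 6, 4, 3 for DSE, in line with the table above, and the DSE ranking lists the rooms in the same order as the slide.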
The WoTSF Evaluation (Summary)
Criteria compared for DSE and WoTSF:
• DSE
  – Granularity of master index: device (STh) level.
  – All available results. More time for ranking.
  – Indices and accuracy: based on prediction models. Less network overhead. More consistent results.
  – Cons: larger indices; harder to keep indices up to date; more time for crawling, parsing, and indexing.
• WoTSF
  – Granularity of master index: network (e.g., building) level.
  – Case: Speed Search: top values per WoT network. Faster search. Indices more up to date.
  – Case: Accurate Results: consumes more time. Indices: up to date. Accuracy: high.
  – Pros: small and semi-dynamic indices; less time for crawling, parsing, and indexing.
  – Cons: tradeoff between search speed and result accuracy.
CONCLUSION AND FUTURE WORK
Conclusion
• The WoTSF:
  – works on top of Dyser
  – supports a simple query language
  – saves time consumed for crawling, parsing, and indexing
  – builds high-level indices
  – supports two types of search:
    • Search in high-level indices (speed search).
    • Search in secondary indices using LWoTSEs’ APIs (accurate search):
      – increases accuracy
      – indices are up to date
      – individual LWoTSEs keep handling their own networks.
Future Work
• WoTSF has limitations such as:
  – It consumes more time in the accurate search type.
    • Network overhead
    • Merging and ranking results.
  – Scheduling the crawling processes
    • to balance network overhead and result accuracy
  – Using semantic technology: will be helpful for interoperability.
  – Considering other aggregation functions
• LWoTSE has limitations such as:
  – Extracting prediction models from SThs’ historical data
  – Dynamic discovery.
  – Solving the problem of using multiple formats partially (crawling) but not by meaning -> (ontologies)
References (Selected)
[1] M. Younan, S. Khattab, and R. Bahgat, "An Integrated Testbed Environment for the Web of Things," in ICNS 2015: The Eleventh International Conference on Networking and Services, ISBN: 978-1-61208-404-6, Rome, Italy, May 2015, pp. 69-78.
[2] M. Younan, S. Khattab, and R. Bahgat, "Evaluation of An Integrated Testbed Environment for the Web of Things," in IntSys15v8n34: International Journal On Advances in Intelligent Systems, v 8 n 3&4, December 2015.
[3] Google. (2010, Jan.) Search Engine Optimization (SEO) - Starter Guide.
[4] P. Suganthan G C, "AJAX Crawler," in Data Science & Engineering (ICDSE), International Conference on, IEEE, Cochin, Kerala, July 2012, pp. 27-30.
[5] S. Mayer and D. Guinard, "An Extensible Discovery Service for Smart Things," in Proceedings of the 2nd International Workshop on the Web of Things (WoT 2011), ACM, San Francisco, CA, USA, June 2011, pp. 7-12.
[6] (2015, Jan.) Shodan search engine. [Online]. www.shodanhq.com
[7] B. Ostermaier, K. Römer, F. Mattern, M. Fahrmair, and W. Kellerer, "A Real-Time Search Engine for the Web of Things," in The 2nd IEEE International Conference on the Internet of Things (IoT), Tokyo, Japan, November 2010, pp. 1-8.
[8] C. Zhang, T. Zhang, and M. Wang, "A Distributed Range Query Framework for the Internet of Things," in 18th International Conference on Intelligence in Next Generation Networks (ICIN 2015), IEEE, ISBN 978-1-4799-1866-9, Paris, France, February 2015, pp. 83-88.
THANKS
QUESTIONS?