Uncovering Performance Differences Among Backbone ISPs with Netdiff

12 downloads 0 Views 939KB Size Report
6092 (0.49). 229 (38.1) n/a. Global Crossing. 60. 143 (9.0). 19082 (0.64). 689 (19.4) n/a. Level 3. 62. 249 (12.7). 70907 (2.29). 1513 (40.0). 30 (0.8). NTT/Verio.
Uncovering Performance Differences Among Backbone ISPs with Netdiff

Ming Zhang (Microsoft Research) Joint work with Ratul Mahajan (Microsoft Research) Lindsey Poole, Vivek Pai (Princeton) 1

Benchmarking computer systems „ Benchmarking is common today for

databases, web servers, compilers, file systems, …. „ Systematic quantification of system performance „ Enables comparison across competing vendors Helps consumers make informed decisions „ Helps vendors make relevant optimizations „ The resulting transparency and competition helps the industry at large „

Today benchmarks do not exist for Internet Service Providers (ISPs) „ ISPs’ customers do not know which ISP is

best for their traffic „

Even though it has a big impact on application performance

„ ISPs may have little incentive to improve end-

to-end customer performance

Ming Zhang

Existing methods to compare ISPs „ Service-level agreements (SLAs)

Typically within an ISP’s network „ Not directly related to application performance „ Hard to compare across ISPs „

„ Keynote systems

1,200 nodes inside ISP PoPs „ Measure individual path „

„

Insufficient for systematic comparison

4

First step towards systematic ISP benchmarking „ Introduce the Netdiff system

An easily deployable system that does not require ISP cooperation „ Provides detailed differences between backbone ISPs „ Performance measure: path latency „

Ming Zhang

Requirements of ISP comparison „ Relevant to customers

Measure end-to-end path „ Target destinations of interest „ Compare based on workload „

„ Helpful to ISPs

Account for size and geographic presence „ Internal vs. external „ Bad Point-of-Presence (PoP) or destination „

6

Ming Zhang

Ideal architecture

7

Ming Zhang

Netdiff architecture Prober Prober

Prober

Prober Prober

Prober 8

Ming Zhang

Measurement process Prober Prober

Prober

Controller Prober

Prober 9

Ming Zhang

Mapping IP to PoP/location „ Router IPs „ DNS name contains PoP info „

„

sl-gw2-sea-4-0-0.sprintlink.net -> Sprint in Seattle

Parse names with undns

„ Destination IPs „ MaxMind geo-location database „ „

Based on user input Accuracy: 99% country-level, 80% city-level in US

10

Ming Zhang

Deal with scalability „ Map ISP topology at large time scale „ Probe to all prefixes „ Track topological changes „ Measure ISP performance at small time scale „ Probe a subset of prefixes „ „ „

„

Each PoP to destination network Each internal path between two PoPs Probing load below threshold

Formulate as a set covering/pack problem „

Solve using greedy algorithm 11

Ming Zhang

Overhead & coverage ISP

# cities

# probes (K) (Reduction factor)

# dst. paths (% of total)

# int. paths (% of total)

# int. paths seen by Keynote (% of total)

AOL Transit

27

5 (5.5)

658 (0.05)

230 (32.8)

n/a

AT&T

113

86 (13.5)

13364 (0.24)

838 (6.6)

72 (0.6)

AboveNet

20

40 (7.6)

17258 (1.73)

277 (72.8)

n/a

British Telecom

32

11 (13.7)

4898 (0.31)

440 (44.3)

2 (0.2)

Broadwing

23

28 (9.5)

7655 (0.67)

149 (29.3)

n/a

Cogent

72

161 (10.2)

42620 (1.18)

1799 (35.2)

12 (0.2)

Deutsche Telekom

67

29 (7.7)

2266 (0.07)

129 (2.9)

0 (0.0)

France Telecom

25

31 (12.1)

6092 (0.49)

229 (38.1)

n/a

Global Crossing

60

Savvis

41

56 (8.2)

5K to 23689K(19.4) 70907 (2.29) 1513 (40.0) probes per 28943 (1.26) 535 (25.8) ISP, 400x 20270 (0.78) 696 (26.2) reduction 10012 (0.49) 511 (31.1)

Sprint

55

136 (13.1)

36366 (1.32)

1208 (40.7)

20 (0.7)

Tiscali

36

30 (8.8)

5483 (0.30)

325 (25.7)

0 (0.0)

VSNL(Teleglobe)

43

30 (14.4)

5500 (0.26)

524 (30.0)

6 (0.3)

Verizon

161

120 (11.5)

20507 (0.26)

2098 (8.1)

306 (1.2)

XO

47

7 (23.5)

1415 (0.06)

522 (24.1)

2 (0.1)

Level 3 NTT/Verio Qwest

143 (9.0) 18 backbone 62 249 (12.7) ISPs,9820 46 (8.4) min 52 112 (10.9) per ISP

19082 (0.64)

n/a 30 (0.8)

10x more internal paths

6 (0.3) 30 (1.1) 20 (1.2)

12

Ming Zhang

Dealing with noise and errors „ Sanity checks to guard against possible

errors Inferred ISP topology „ IP-to-Geo mapping „ Path asymmetry „ Overloaded prober or access link „

13

Noise due to ICMP? „ UDP on both directions vs. UDP on forward &

ICMP on reverse 100

% of paths

80 60 40 20 0

0.0

0.5 1.0 1.5 2.0 Netdiff / UDP latency

Ming Zhang

Validation with Keynote „ Keynote double-ended vs. Netdiff single-

ended

15

Ming Zhang

Noise due to path asymmetry/load?

16

Relevance to application 300

HTTP difference (ms)

HTTP latency (ms)

1500

1000

500

0

0

100 200 300 400 500 Netdiff latency (ms)

200

100

0

0

20 40 60 80 100 Netdiff difference (ms)

Ming Zhang

Current status „ PlanetLab: 300+ sites worldwide „ 18 backbone ISPs: AT&T, Sprint, BT… „ Since Feb 2007: 20 minutes per ISP

18

Ming Zhang

Comparison methodology „ Path of interest „ Internal: PoP PoP „ Destination: PoP destination networks „ Path stretch „ Measured latency minus optimal latency „ Account for difference in path length „ Group paths based on length „ Short: < 20 ms „ Medium: 20 - 50 ms „ Long: > 50 ms 19

Ming Zhang

Overall comparison Destination paths

Internal paths

20

Ming Zhang

Detailed comparison „ Which ISP is best depends on workload

Distance „ Geographical region „ Direction „ Destination „… „

„ Data available at http://netdiff.org „

Customize your own comparison

21

Ming Zhang

Dependency on distance Medium paths

Long paths

22

Dependency on geographical region Medium paths to/from U.S.

Ming Zhang

Medium paths

23

Ming Zhang

Dependency on destination Medium paths to popular websites

Medium paths

24

Future directions „ Extend Netdiff to other measures

Loss „ Stability „ Reachability „ Network neutrality „

„ Explain differences in ISP networks

Ming Zhang

Conclusions „ Netdiff is the first system that provides

detailed comparison among backbone ISPs Easy to deploy, requires no ISP cooperation „ Currently measures 18 backbone ISPs „

„ ISP ranking depends on workloads „

Netdiff’s detailed comparison is essential

„ ISP benchmarking will help customers and

improve overall network infrastructure

26

Dependency on direction Medium paths from LA

Tiscali Above Deutsche

Medium paths to LA

Savvis Qwest British NTT

Cogent Qwest Sprint

Above GlobalX France

Level3 Verizon NTT Bwing VSNL ATT

Deutsche Level3 Cogent

GlobalX France Savvis

Verizon Sprint Tiscali 60

70

80

90

100 110

dst. path stretch (ms)

30

40

50

60

70

dst. path stretch (ms)

80

Suggest Documents