Internet2 NetFlow: Weekly Reports: Week of 20080609

  1. Introduction
  2. Bulk TCP
  3. Full Data Set

Introduction

You are looking at the weekly Abilene network usage report for the week of 20080609 produced from NetFlow records. The view of the whole network as a single traffic-relaying unit is presented. More formally, data from all interior circuits (those connecting two Abilene routers) were discarded while all the rest of the data were merged to create this view.

During this week, data for the following day(s) were missing: Monday. We multiplied all nominal quantities by 7/6 to estimate the amounts of various types of traffic. Percentages and distributions were not modified.

The data are split into two sections: bulk TCP data and the full data set. A "bulk TCP" flow is defined as a TCP flow that transferred more than 10MB of data. The first section only concerns these data. The second section studies the overall traffic composition.

All the numbers in this report are hyperlinked to plots that show their history (e.g., clicking on the percentage of octets of NNTP traffic will bring up a time-series plot that shows the history of this parameter).

Bulk TCP

During this week, bulk TCP traffic comprised 43.63% of octets and 23.59% of packets of the full data set traffic.

The distribution of bulk TCP throughputs is the most important piece of data in this report. Cumulative distribution function plots (1-CDF vs. throughput in bits/second) in semi-log and log-log scales are as follows:
[Bulk TCP throughputs (semi-log scale).] [Bulk TCP throughputs (log-log scale).]

Distribution of the amount of data transferred (in semi-log and log-log scale, 1-CDF vs. total trasfer size in octets) is presented below. It should be recognized that NetFlow collection mechanism is always configured so that flows (in the accounting sense) cannot last longer than a certain period of time. Therefore, the distribution of transfer sizes is to a certain extent skewed in the upper part.
[Bulk TCP transfer sizes (semi-log scale)] [Bulk TCP transfer sizes (log-log scale).]

The distribution of durations of bulk TCP flows (in seconds) is as follows (you may notice the cut-off phenomenon mentioned above):

[Bulk TCP durations distribution.]

The following table shows actual values from the above distribution plots that correspond to characteristic values (such as median, 90%, max, etc.).

Table 1. Selected Points from Distribution Graphs (Bulk TCPs)

Percentile Throughput (b/s) Durations (s) Size (octets)
1 1.391M 3 10.05M
5 1.493M 9 10.50M
10 1.620M 17 11.10M
50 3.585M 58 19.20M
90 13.60M 59 56.10M
95 20.72M 59 79.22M
99 52.20M 59 181.3M
99.9 127.8M 59 412.8M
99.99 898.3M 60 1.227G
99.999 1.016G 119 4.495G
100 4.447G 122 6.716G

We compute average packet size of each flow by dividing the number of octets in a flow by the number of packets. Distribution of average sizes of packets belonging to bulk TCP flows is as follows:

Table 2. Packet Sizes (Bulk TCP)

Packet Size Packets
Small (<100B)0.54% 1.137G
Medium (100-1400B)6.68% 13.96G
Large (1401-1500B)92.70% 193.8G
Jumbo (>1500B)0.08% 177.3M
Total100.00% 209.1G

We show what applications transfer large amounts of data in the following table. Note that this is bulk TCP traffic only; full data set usage is presented in the next section.

Table 3. Aggregated Application Types (Bulk TCP)

Traffic Type OctetsPacketsFlows
Data Transfers25.80% 78.97T 26.13% 54.65G 30.47% 3.106M
Encrypted Traffic8.00% 24.48T 8.36% 17.47G 6.05% 616.3k
Advanced Apps5.98% 18.29T 6.10% 12.75G 7.93% 808.0k
File Sharing2.63% 8.062T 2.63% 5.492G 2.00% 203.4k
Measurement1.12% 3.433T 1.18% 2.461G 0.19% 19.01k
Misc0.69% 2.105T 0.78% 1.622G 0.79% 80.60k
Games0.16% 480.9G 0.16% 337.8M 0.21% 21.36k
Audio/Video0.14% 432.3G 0.15% 316.0M 0.30% 30.75k
Unidentified55.48% 169.8T 54.52% 114.0G 52.07% 5.307M
Total100.00% 306.0T 100.00% 209.1G 100.00% 10.19M

The following are the fastest 10 measurement flows with unique source and destination AS numbers (i.e., for any given pair of source and destination AS numbers, no more than one fastest flow is shown).

Table 4. Fastest Bulk TCP Measurement Flows with Unique AS Source and Destination

Throughput (b/s)Packet size (bytes)Duration (s)Src ASDest ASApplication type
995.5M150019Brookhaven National Lab [43]Abilene [11537]Iperf
363.9M150036Unknown [0]Abilene [11537]Iperf
363.3M150021Merit [237]Abilene [11537]Iperf
221.0M150011NASA-ESDIS-NET [22767]Israeli Academic and Research Network [378]Iperf
183.8M150019Unknown [32361]SWITCH [559]Iperf
178.6M139829NASA-HPCC-ESS [7847]APAN-JP [7660]Iperf
164.1M150030NASA GSFC [1701]Unknown [25689]Iperf
163.1M150018NASA Internet [297]Unknown [25689]Iperf
158.9M150060IIJNET [2501]FR [2200]Iperf
155.4M150029NASA GSFC [1701]UT-Austin [18]Iperf

The following are the fastest 10 non-measurement flows with unique source and destination AS numbers (i.e., for any given pair of source and destination AS numbers, no more than one fastest flow is shown). When unable to determine the application type, we give the source and destination port numbers.

Table 5. Fastest Bulk TCP Non-measurement Flows with Unique AS Source and Destination

Throughput (b/s)Packet size (bytes)Duration (s)Src ASDest ASApplication type
1.049G900010NASA-HPCC-ESS [7847]Abilene [11537]54601 -> 5101
1.023G900010High Performance Computing Modernization Program [668]Abilene [11537]40831 -> 5101
522.4M900010Abilene [11537]High Performance Computing Modernization Program [668]54911 -> 5101
355.4M136811SWITCH [559]PSC [1207]Audiogalaxy
302.5M149513Network for Education and Research in Oregon [3701]Unknown [0]HTTP
292.3M900028NCSA [1224]TACCNET [32093]53552 -> 50001
291.9M150012UNIVHAWAII [6360]DFN-IP service G-WiN [680]Rsync
261.4M149827Virginia Tech [1312]U Kansas [2496]HTTP
259.8M150015Unknown [0]Unknown [36375]HTTP
249.7M142013Nat Lib Med [70]Unknown [40127]50289 -> 39816

We also compute the average concurrency of bulk TCP flows for the week (by adding durations of all captured flows and dividing the result by the by the duration of the week). This week's average number of concurrent bulk TCP flows: 820.0.

Full Data Set

In addition to bulk TCP flows data, we provide statistics that characterize the overall composition of the complete data set (everything that transited the Abilene network this week).

The following table describes what kinds of traffic went through the network (multiple applications are aggregated into classes):

Table 6. Aggregated Application Types (Full Data Set)

Type OctetsPackets
Data Transfers35.03% 245.6T 35.53% 315.0G
Encrypted Traffic6.12% 42.94T 6.65% 58.95G
Advanced Apps3.64% 25.53T 3.35% 29.72G
File Sharing3.00% 21.01T 3.53% 31.27G
Misc2.37% 16.63T 5.40% 47.84G
Audio/Video1.43% 10.02T 1.34% 11.90G
Measurement0.77% 5.391T 0.85% 7.561G
Games0.28% 1.932T 0.39% 3.474G
Unidentified47.37% 332.2T 42.96% 380.9G
Total100.00% 701.4T 100.00% 886.7G

This table is available additionally in the following more verbose version (no applications are aggregated into classes, but class composition is shown):

Table 7. Detailed Application Types (Full Data Set)

Traffic type OctetsPackets
Data Transfers
HTTP
Rsync
NNTP
FTP
---
30.26%
2.00%
1.63%
1.15%
---
212.2T
14.00T
11.40T
8.051T
---
31.58%
1.55%
1.38%
1.02%
---
280.0G
13.70G
12.27G
9.022G
Encrypted Traffic
SSH
HTTPS
IPsec ESP
IPsec AH
IPsec IKE
---
3.86%
1.95%
0.30%
0.01%
0.00%
---
27.04T
13.69T
2.137T
66.50G
5.492G
---
3.58%
2.61%
0.44%
0.02%
0.00%
---
31.78G
23.13G
3.866G
135.8M
29.04M
Advanced Apps
UNIDATA LDM
McIDAS
IBP
BBCP
GsiFTP
BBFTP
---
2.72%
0.53%
0.30%
0.09%
0.01%
0.00%
---
19.05T
3.712T
2.088T
622.3G
39.77G
18.59G
---
2.47%
0.44%
0.35%
0.07%
0.01%
0.01%
---
21.87G
3.917G
3.090G
632.9M
95.35M
116.2M
File Sharing
Shoutcast
Audiogalaxy
Hotline
BitTorrent
eDonkey2000
Gnutella
FastTrack
WinMX
Carracho
Blubster
Freenet
Neo-Modus
Direct Connect++
---
1.28%
0.73%
0.59%
0.26%
0.10%
0.02%
0.01%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
8.951T
5.147T
4.143T
1.809T
731.0G
141.4G
52.34G
23.08G
6.801G
4.255G
1.942G
446.2M
3.396M
---
2.06%
0.63%
0.43%
0.26%
0.09%
0.02%
0.01%
0.00%
0.00%
0.01%
0.00%
0.00%
0.00%
---
18.29G
5.611G
3.850G
2.311G
832.0M
207.2M
77.72M
28.13M
11.77M
49.13M
1.934M
1.163M
10.03k
Misc
Mail
DNS
Port 0
Squid
NFS
AFS
X11
Telnet
IRC
RTIP
NTP
MS Windows
IDENT
SOCKS
SNMP
AOL AIM
RPC Portmapper
---
1.35%
0.26%
0.24%
0.20%
0.13%
0.07%
0.06%
0.02%
0.01%
0.01%
0.01%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
9.483T
1.809T
1.682T
1.428T
931.4G
483.1G
413.7G
150.9G
66.49G
55.79G
47.92G
31.96G
14.02G
13.97G
11.78G
11.07G
184.3M
---
2.43%
1.88%
0.20%
0.33%
0.09%
0.10%
0.10%
0.05%
0.04%
0.06%
0.07%
0.03%
0.01%
0.00%
0.01%
0.00%
0.00%
---
21.54G
16.67G
1.734G
2.936G
794.7M
916.0M
852.4M
455.8M
376.3M
511.8M
626.7M
227.0M
57.60M
27.55M
91.88M
15.58M
1.722M
Audio/Video
Any-Source Multicast
Real Player
Windows Media
H.323 Signaling
Backbone Radio
StreamWorks
Camarades webcams
Subset of VoIP
Single-Source Multicast
---
0.95%
0.43%
0.03%
0.01%
0.01%
0.00%
0.00%
0.00%
0.00%
---
6.683T
3.034T
184.5G
55.60G
50.54G
13.28G
3.602G
2.310G
0.000
---
0.68%
0.61%
0.03%
0.01%
0.01%
0.00%
0.00%
0.00%
0.00%
---
6.057G
5.424G
261.0M
68.14M
63.84M
16.59M
6.223M
4.744M
0.000
Measurement
Iperf
ICMP
IPMP
---
0.60%
0.20%
0.00%
---
4.214T
1.373T
0.000
---
0.46%
0.46%
0.00%
---
4.100G
4.038G
0.000
Games
DirectX
Battlenet
Spy Arcade
Quake
Half-Life
Starsiege Tribes
Asheron
---
0.21%
0.02%
0.02%
0.01%
0.01%
0.00%
0.00%
---
1.439T
169.9G
125.0G
92.94G
90.88G
8.750G
5.350G
---
0.23%
0.05%
0.01%
0.02%
0.08%
0.00%
0.00%
---
2.057G
427.4M
127.4M
163.1M
671.7M
15.75M
11.33M
Unidentified
Unidentified
---
47.37%
---
332.2T
---
42.96%
---
380.9G
Total
Total
---
100.00%
---
701.4T
---
100.00%
---
886.7G

The following table summarizes use of most popular IPv4 protocols:

Table 8. IP Protocols Distribution (Full Data set)

Protocols OctetsPackets
ICMP[1]0.20% 1.373T 0.46% 4.038G
IGMP[2]0.00% 44.20M 0.00% 1.222M
IP-ENCAP[4]0.02% 137.5G 0.02% 186.0M
TCP[6]86.86% 609.2T 83.17% 737.5G
UDP[17]6.93% 48.59T 11.46% 101.6G
IPv6[41]0.00% 5.272G 0.00% 22.61M
GRE[47]5.71% 40.03T 4.45% 39.45G
ESP[50]0.30% 2.137T 0.44% 3.866G
AX.25[93]0.00% 30.80k 0.00% 466.0
PIM[103]0.00% 3.562G 0.00% 37.07M
IPMP[169]0.00% 0.000 0.00% 0.000
Other0.01% 103.6G 0.07% 600.5M
Total100.00% 701.4T 100.00% 886.7G

We compute average packet size of each flow by dividing the number of octets in a flow by the number of packets. Distribution of (average) packet sizes is as follows:

Table 9. Packet Sizes (Full Data Set)

Packet Size Packets
Small (<100B)38.43% 340.8G
Medium (100-1400B)19.71% 174.8G
Large (1401-1500B)39.64% 351.5G
Jumbo (>1500B)2.21% 19.63G
Total100.00% 886.7G

We only track DSCP values for which special treatment was defined by Internet2 QoS working group (and the default of DSCP=0):

Table 10. Important DSCP Values (Full Data Set)

Type OctetsPackets
Best effort [DSCP=0]96.82% 679.1T 97.41% 863.7G
Scavenger [DSCP=8]0.05% 317.7G 0.05% 482.4M
EF [DSCP=46]0.00% 28.40G 0.02% 150.4M
Other3.13% 21.94T 2.52% 22.36G
Total100.00% 701.4T 100.00% 886.7G

We collect statistics about ECN-capable traffic:

Table 11. ECN-Capable Traffic

Type OctetsPackets
ECN-Capable0.42% 2.950T 0.25% 2.216G

To facilitate detection of emerging applications, we present statistics about frequently encountered unidentified port numbers (no distinction is made in this table between TCP and UDP):

Table 12. Frequent Unidentified Ports

Port OctetsPackets
543211.19% 8.332T 0.63% 5.562G
19350.77% 5.395T 0.94% 8.363G
330010.65% 4.538T 0.35% 3.073G
200000.33% 2.334T 0.38% 3.389G
45000.29% 2.058T 0.29% 2.589G