WL#10703: scale to 50k connections

Affects: Server-8.0   —   Status: Complete

Motivation

Routers network core relies on a

1:1 thread-connection

design to forward connections. The design limits the number of connections the router can handle. OSes apply limits on:

  • number of threads per process
  • max memory used per process
  • number of open file-descriptors per process

which set the boundaries such a design can handle.

In WL#???? the limit of concurrent connections was raised from 500 to ~5000 connections by using poll() instead of select(), but the underlying design of 1-thread==1-connection was kept intact.

To go beyond 5k concurrent connections, the 1:1 design of the routing core needs to be replaced.

System Limits

MacOS X

number of threads per process is limited by the OS to based on installed memory:

RAM max_threads max_taskthreads
16G 20480 4096
32G 40960 8192

The values can be retrieved via sysctl:

sysctl hw.memsize
sysctl kern.num_taskthreads
sysctl kern.num_thread

Goal

Refactor the routing plugin into a event-driven + IO-threadpool design which:

  • uses non-blocking IO

    • no thread-stack issues
    • low memory usage
    • no thundering herd
  • uses a low number of threads (~num of cores)

    • no more max-threads-per-process limits