Depending on the configuration the forwarding and load balancing toward analyzing CPU cores or probes may be segmenting in various ways, e.g., the Control plane packets may be duplicated and isolated for some subscriber segments’ traffic, while both User place and Control Plane packets can be provided for other segments, thus allowing in depth analysis of segments of subscribers while a shallower approach may be employed for other segments. This can allow monitoring solutions to focus on the subscribers most important for the operator’s revenue, while keeping the analysis resources to the level needed for doing a fully covering, but targeted analysis.
Not all Control plane traffic of a subscriber needs to be available to perform the session distribution. The correlation can rely on either GTP-C or PFCP, or both, which is then correlated to the subscribers’ GTP-U traffic. This flexibility is important to meet the operator deployment scenarios where topology and colocation of nodes is not a given. In many cases the opposite is the case. Monitoring around the User plane nodes is essential for the correlation and for efficient data movement.
Distribution can go into regular distribution channels or into VIP channels. These channels are mapped to either RAM buffers, Tx Interfaces or VLANs on Tx interfaces for logical segmentation on the same physical distribution interface. VLAN channels can be used to send coherent traffic to external devices and these devices will be able to load balance based on the initial correlation done, simply by load balancing based on the channelized VLAN IDs.
This distribution can be exclusively into RAM buffers on the solutions deployment host, or via Tx network links to secondary devices or a combination, which ensures efficient utilization of the deployment HW by letting the correlating host also handling analysis workloads.
The single host’s resources can be optimally utilized by using the Session Based GTP Distribution to load balance the coherent data to multiple RAM buffers. This allows using multiple CPU cores with multiple instances of the analyzing process, which ensure optimal utilization of the available CPU resources.
Key features
- Independent of network infrastructure
- Supports 5G/4G CUPS/4G/3G/2G
- Coherent subscriber session load balancing
- Distribution of PFCP, GTP-C and GTP-U based on correlation with IMSI or generic ID
- Subscriber or session unique key
- Distribution channel derived from IMSI, for persistence of distribution between sessions
- Subscriber (IMSI) filtering for interception
- Subscriber segmentation and bulk IMSI filtering supporting 10 mil+ IMSIs
- Flexible and weighted distribution
- Standalone solution for network and monitoring system independence
- System integrated distribution solution for optimal flexibility and resource utilization
- RAM buffer load balancing, Direct distribution, Daisy chain distribution with fractional load balancing
- HW traffic filters features, with BCD style syntax
- True IP fragment handling in distribution
Scaling beyond the power of a single host
Performing distribution to RAM in a single Host has some limitations in the bandwidth across PCI, with about 110 Gbps per card and the individual server’s own ability to scale processing power with CPU and available RAM speeds and indeed also for handling analysis results.
To be truly scalable the distribution via Tx can be employed to bring more compute power to the task of analyzing the data.
The Silicom solution can utilize 2 concurrent 2x100GE for highest bandwidth performance. But many different line speeds and port counts can be utilized. For passing the load balancing on to secondary processing nodes, the Tx ports of the initial correlation FPGA PCIe card are connected to these secondary nodes.
The load balancing can be configured as either a daisy chain or as a direct distribution. With the daisy chain all correlation traffic is channeled via VLAN and sent via one or more Tx links to the next processing node. This node filters in a number of VLANs matching the load per channel and processing capacity.
The secondary node’s FPGA PCI card will duplicate and regenerate the received signal and transmit it out of its own Tx ports. Thus, passing on the entire channeled traffic to the next node in the daisy chain, which in turn filters in the next subset of VLAN channels it will process, and so on. Like this any number of nodes can be employed in the processing.
In the direct distribution each processing node is directly attached to the correlating and distributing node. This avoids the use of VLAN channels, but it can still be beneficial to use them, as it allows the processing node to load balancing across its CPUs simply by using VLAN IDs.
This allows for optimal utilization of the hardware platforms employed. Through dual level GTP session distribution, an unprecedented effectiveness and scalability can be achieved with new or existing analysis engines in multiple nodes, while preserving the session coherency for analyzing data for vertical and horizontal metrics.
Key benefits
- Fully offload session correlation tasks and eliminate typical performance bottlenecks
- Coherent user sessions
- Load balance up to 400 Gbps of User and control plane traffic
- Enables processing of large number of sessions
- Frees CPU cycles for analysis tasks
- Subscriber targeted monitoring, supporting 10’s of millions IMSIs
- More than 500M tunnel endpoints supported
- Easy integration into host system
- Scale performance of existing solution’s SW/probe architecture
- Zero packet loss
By utilizing the power and flexibility of FPGAs and the fbCAPTURE framework, Silicom Denmark’s Session Based GTP Distribution, the capacity of analysis systems can be raised to a new level using existing implementations and including even legacy equipment as well as cutting edge.
The Silicom Denmark GTP solution can ensure that monitoring systems can keep up with the growth in traffic volumes while providing the same full and rich analysis as before. This can be realized on standard commodity hardware for analysis rather than the ever more expensive hardware needed if trying to simply scale via pure processing power.
At the core of Silicom Denmark’s Session Based GTP Distribution are hardware integrated network traffic decoders and a GTP/PFCP tracking engine. All based on high performance hardware, harnessing the power and flexibility of FPGAs. The solution consists of one or more FPGA PCIe line cards for 1 – 100Gbps and port count form 2 – 16 ports per card, and the programming of this to ensure efficient traffic handling and correlation.
Additionally, configuration tools and control plane tracking SW to run on Host system is provided, requiring just a single CPU core. An API for low level control and alternate correlation mechanisms can be provided.