Next: Temporal-locality-aware poll Up: A Scalable Locality-Aware Event Previous: A Scalable Locality-Aware Event

Introduction

Scalability of network servers is important today. As the Internet continues to grow in popularity and size, network servers without good scalability will result in performance drop or live-lock[2]. One approach is to serve all connections in a single process, called single process event-driven (SPED). Server cannot block itself waiting for any read or write on a file descriptor. Nonblocking I/O is needed. However, calling read() or write() excessively on all file descriptors to see if they are ready for read or write is inefficient. We must use system call like poll() to inquire kernel which file descriptor is ready and only read or write on ready one. Although single process event driven (SPED) architecture is much more efficient than multiple process (MP) architecture, previous studies[1,3] have shown that poll() is not scalable; more than 30%of CPU time is spent on such a system call on a normal squid proxy server. poll() performs the amount of work in proportion to the number of file descriptors in event array rather than constant factor. The overhead of handling event detection for all connections severely limits scalability. Most solutions proposed for the scalability are based on efforts in the kernel mode[4] or even a new operating system architecture (such as Novell's Internet Caching System-ICS). The event dispatching mechanisms in Linux can be summarized in Table 1. In this paper, we focus on improving the performance of web service applications in the user mode. We compare different event dispatching mechanisms in Linux, and give a summary of them. Based on the observation that most web connections are idle, we present a temporal-locality aware library for event dispatching in the user mode. A scalable event dispatching mechanism may improve the performance because of less context switching between user mode and kernel mode, but also encourage better code portability for various web applications. We conducted the performance evaluation of the proposed library on a memory-based event-threading server. Our studies show that performance of the server can be significantly improved by 30% or more. Programming interface and implementation details of web server and library are discussed. Performance analysis is given based on two metric: event dispatching overhead and dispatching throughput.

Table 1: Comparison of various event dispatching mechanisms

features	scalable	event	dequeue	event	when	return	multiple
	to	col-	multiple	queue	queue	initial	interest
	large	lapsing	event	over-	overflow,	state	sets
	set of		per	flow	kernel	when	in
	file		call		fallback to	declare	kernel
	desc.				traditional	interest	per
methods					poll()	fd	process
select()	No	NA	Yes	NA	NA	NA	NA
poll()	No	NA	Yes	NA	NA	NA	NA
/dev/poll	Yes	NA	Yes	NA	NA	NA	Yes
RT signals	Yes	No	No	Yes	No	No	Yes
RT sig-per-fd	Yes	Yes	No	No	NA	No	Yes
declare_interest	Yes	Yes	Yes	Yes	Yes	Yes	No

Next: Temporal-locality-aware poll Up: A Scalable Locality-Aware Event Previous: A Scalable Locality-Aware Event

2003-03-23