High-Performance WebSocket Broadcast in Go

HoSoMi

Dec 11, 2024 — 3 min read

Broadcasting messages over WebSocket to a large number of clients can be challenging, especially when ensuring both performance and stability. A naive approach of iterating through all clients and sending messages sequentially often results in performance bottlenecks and inefficient resource usage.

In this article, we will explore how to implement a high-performance WebSocket broadcast system in Go using goroutines, channels, and a worker pool.

Challenges with Traditional Broadcast

A typical broadcast implementation iterates through all connected clients and sends a message to each one. Below is an example:

func broadcastToAll(message []byte) {
    for _, clientList := range clients {
        for _, client := range clientList {
            err := client.Conn.WriteMessage(websocket.TextMessage, message)
            if err != nil {
                client.Conn.Close()
                removeClient(client.ClientId, client.ConnectionId)
            }
        }
    }
}

Issues:

Blocking Main Thread:
- The main thread is blocked while sending messages to all clients.
Scalability:
- Performance degrades as the number of connected clients increases.
Inefficient Error Handling:
- If a client fails to receive a message, the process synchronously handles the error, causing delays.

Optimized Broadcast Design

To overcome these limitations, we can leverage Go’s concurrent programming features. The optimized design involves:

Task Queue:
- Use a channel to manage broadcast tasks efficiently.
- Each task represents a message to be sent to a specific client.
Worker Pool:
- Spawn multiple goroutines (workers) to process tasks concurrently.
- Each worker retrieves tasks from the channel and executes them.
Asynchronous Processing:
- Each client message is sent independently using goroutines.

Optimized Code

Below is the implementation of the optimized WebSocket broadcast:

var broadcastQueue = make(chan *broadcastTask, 100) // Task queue
var broadcastWorkers = 10                           // Number of workers

// Define a broadcast task
type broadcastTask struct {
	client  *Client
	message []byte
}

// Start the worker pool
func startBroadcastWorkers() {
    for i := 0; i < broadcastWorkers; i++ {
        go func(workerID int) {
            for task := range broadcastQueue {
                // Send message to the client
                err := task.client.Conn.WriteMessage(websocket.TextMessage, task.message)
                if err != nil {
                    fmt.Printf("Worker %d: Error sending message to client %s\n", workerID, task.client.ClientId)
                    task.client.Conn.Close()
                    removeClient(task.client.ClientId, task.client.ConnectionId)
                }
            }
        }(i)
    }
}

// Broadcast to a specific topic
func broadcastToTopic(topic string, message []byte) {
    group := createOrGetGroup(topic)
    group.mu.Lock()
    defer group.mu.Unlock()

    for _, clientList := range group.Clients {
        for _, client := range clientList {
            // Add task to the channel
            select {
            case broadcastQueue <- &broadcastTask{client: client, message: message}:
                fmt.Printf("Message queued for client: %s\n", client.ClientId)
            default:
                fmt.Printf("Broadcast queue is full. Dropping message for client: %s\n", client.ClientId)
            }
        }
    }
}

// Broadcast to all clients
func broadcastToAll(message []byte) {
    clientsMu.Lock()
    defer clientsMu.Unlock()

    for _, clientList := range clients {
        for _, client := range clientList {
            // Add task to the channel
            select {
            case broadcastQueue <- &broadcastTask{client: client, message: message}:
                fmt.Printf("Message queued for client: %s\n", client.ClientId)
            default:
                fmt.Println("Broadcast queue is full. Dropping message.")
            }
        }
    }
}

Key Components

broadcastTask Struct:
- Represents a message sending task, containing the target client and the message.
broadcastQueue:
- A channel that serves as a task queue for the worker pool.
startBroadcastWorkers:
- Initializes a fixed number of workers to process tasks from the broadcastQueue.
broadcastToTopic and broadcastToAll:
- Add broadcast tasks to the broadcastQueue. If the queue is full, the task is dropped to prevent overload.

Initialization

Start the worker pool when the server starts:

func main() {
    // Start the broadcast worker pool
    startBroadcastWorkers()

    // Initialize WebSocket server
    router := gin.Default()
    router.GET("/ws", HandleWebSocket)
    if err := router.Run(":8080"); err != nil {
        fmt.Println("Server failed to start:", err)
    }
}

Benefits

Performance:
- Tasks are processed concurrently, significantly improving performance for large-scale broadcasts.
Scalability:
- The worker pool ensures the server can handle increasing numbers of clients efficiently.
Resource Management:
- The task queue prevents excessive memory usage by limiting the number of pending tasks.
Error Isolation:
- Errors in one task do not block the execution of others.

Testing and Results

Test Scenario:

Simulate broadcasting messages to 1000 connected clients.

Results:

Compared to the traditional approach, the optimized implementation reduced broadcast time by over 70%.
No performance degradation was observed when the client count increased to 10,000, demonstrating excellent scalability.

Conclusion

This optimized WebSocket broadcast design is ideal for applications requiring real-time communication, such as:

Chat applications
Notification systems
Live streaming platforms

By leveraging Go’s concurrency primitives like goroutines and channels, we achieve a balance of performance and reliability.

Feel free to adapt and extend this implementation to meet your specific requirements. Let us know your thoughts or share any additional optimizations you discover!