Simulating a Cluster in Go



This content originally appeared on Level Up Coding - Medium and was authored by Mukkund Sunjii

Building a worker pool from scratch to execute map-reduce jobs

Implementing a multi-node cluster in Golang.

Over the course of practicing with Golang, I'd gotten interested in learning more about goroutines and concurrent programming in general. I figured that the best way to get some reps in is to build systems that we take for granted from the ground up.

In this article, I will take you through the process of building a web server that accepts job submissions, which are then processed across several workers in the system using the map-reduce paradigm.

Objective

The objective of the application is not to focus on the details of the map-reduce implementation, but on how a pub/sub system works in conjunction with the client, the workers, and the cluster as a whole.

In the process of building the application, some important concepts are revealed:

  • Queueing submitted jobs
  • Monitoring the queue
  • Asynchronous processing
  • Reduction of results received from different nodes

System Design

System design of the application

To quickly go through the system architecture, it is divided into 4 parts.

Web-Client

  • A web application that exposes a method to submit jobs.
  • Displays real time status of the workers.
  • Displays the job queue.
  • Displays the results of the jobs.

Web-server

A REST API server that exposes endpoints to submit jobs and fetch information about the current state of the cluster. Based on the type of the job, an appropriate message is published to the message queue, to be processed later by a worker.

Message queue

As the client submits different types of jobs, they are pushed (published) to a queue. The webserver also monitors the length and state of the queue.

Workers/nodes

The workers here are simply implemented as separate goroutines that subscribe to jobs/messages. Initially, four workers are created, and they start listening for jobs.

Map-Reduce Job

Briefly, map-reduce is a way of processing large amounts of data by distributing the workload among multiple workers. For more information, please use this reference.

Map-reduce workflow

As seen in the image, map-reduce generally consists of the following steps:

  • Dividing or partitioning the data into different chunks, usually after sorting.
  • Processing these chunks by applying some logic to them (Mapping).
  • Combining the results back using a predefined logic (Reducing).

In this context, in order to focus on our objective, the map-reduce looks something like this:

Map-reduce job in our context

In the application, there are 2 types of jobs (or mapping functions):

  • Short jobs: A job with a map function that takes a couple of seconds to complete.
  • Long jobs: A job with a map function that takes a longer amount of time to complete.

For each job, the user can specify the number of partitions into which the "data" is divided. Thereafter, based on the type of the job, the mapper function is run on every "partition" by the available workers.

Each “job” is represented by the `JobSpec` struct:

type JobSpec struct {
	Id        string
	Operation func() int
}

Essentially, a job is an object with a UUID and a message associated with a routine. In this case, the job spec directly contains the function to be executed. In real production systems, the job could also carry accompanying data or other specs needed to perform the map step of the process.

Mapper function

As an example, a short job mapper function would look something like this:

func JobShort() int {
	// Simulate work by sleeping for up to 5 seconds.
	time.Sleep(time.Duration(rand.Intn(5)) * time.Second)
	return 1
}

Keep in mind, the worker doesn’t really use a “data” partition. Rather, it simulates the processing by setting a timeout for the respective thread (or worker).

Reducer Function

Once the mapper function is finished, a separate routine monitors for completed jobs. The jobs are then collected and consolidated based on the job ID.

func CollectResults(receiver <-chan models.ResultSpec, results map[string][]int) {
	for r := range receiver {
		results[r.Id] = append(results[r.Id], r.Result)
	}
}

The reduce step is the same for both job types: it simply appends the results received from the workers, until the length of the results slice equals the number of partitions specified in the job.

Worker

As mentioned previously, the worker is implemented as a goroutine that listens for incoming jobs. In Golang terms, a worker can simply be defined by the following struct:

type Worker struct {
	Id    int
	Busy  *bool
	JobId *string
	r     <-chan JobSpec
	s     chan<- ResultSpec
}

Each worker is initialized with a receiver and a sender channel. Through the receiver, the worker receives a `JobSpec`. On the flip side, each worker also has a `sender` channel, which is used to send the calculated results of the map step.

The workers are created and initialized when the webserver starts, using the following loops:

// Initializing the workers
for i := range workers {
	workers[i] = models.CreateWorker(i, sender, receiver)
}

// Starting the workers
for i := range workers {
	go workers[i].StartListening()
}

Monitoring the queue

An explicit job queue implementation is not strictly required, since sending to or receiving from a channel is a blocking operation in Golang. However, in order to visualize the queueing of jobs, a separate array of job IDs is maintained and exposed through an endpoint.

func RunShortJobs(s chan models.JobSpec, jobs *[]string) func(c *gin.Context) {
	return func(c *gin.Context) {
		numPartitions, err := strconv.Atoi(c.Param("numPartitions"))
		if err != nil {
			return
		}
		reqId := utils.GenerateId("short")
		c.IndentedJSON(200, gin.H{
			"status": "Submitted",
			"job":    reqId,
		})
		for i := 0; i < numPartitions; i++ {
			*jobs = append(*jobs, reqId)
		}
		for i := 0; i < numPartitions; i++ {
			jobSpec := models.JobSpec{Id: reqId, Operation: utils.JobShort}
			s <- jobSpec
			*jobs = (*jobs)[:len(*jobs)-1]
		}
	}
}

In the above code snippet, the function is the controller of an endpoint exposed by the webserver. Here, the variable `jobs` contains the jobs that are yet to be picked up.

The moment a user submits a short job with n partitions, n short map jobs are added to the queue. As the jobs are consumed and completed by the workers, they are popped from the queue.

Web client

The web client is built using Vue.js. It is a simple interface that lets users accomplish the following activities:

  • Submit jobs
  • Visualize worker status
  • Visualize queues
  • Keep track of the results of the submitted jobs.

Job Submissions

The client uses HTTP endpoints exposed by the webserver to submit jobs.

For example:

### Check status of workers
GET http://host/status

### Submit a short job with 1 partition
POST http://host/short/1

### Submit a long job with 3 partitions
POST http://host/long/3

Visualization

The web client uses the following websockets to show real-time status of the map-reduce cluster:

- `ws://host/queue` — sends the current jobs in the queue

- `ws://host/status` — sends current status of the workers

- `ws://host/results` — sends the updated list of results

Demo

Let's see how it all works when put together:

Demo of the application.

So what is happening here?

  • The user submits two jobs where the partition parameter is set to 15.
  • Both of the jobs take place asynchronously.
  • The workers then pick up the published map jobs.
  • The results are collected and the progress of the job can be seen in the results section.

Want to Connect?

Thank you for reading my article. You can also find me on LinkedIn and my work on GitHub.





