jsreed5.org /misc/robs-filespooler-guide.gmi

Rob's Filespooler Guide

---

Filespooler is a Unix utility for managing queues in a decentralized and strictly ordered way. Filespooler takes commands from a source and writes them to jobs, which can be piped into a stream or written to a file system. A destination, not necessarily on the same machine, then processes the jobs and executes the commands in the exact order they were created, regardless of the order in which the jobs were sent or received.

Many tools already implement FIFO (first in, first out) queues, but Filespooler works differently from most them.

Filespooler does not use a client-server model. The same application both creates and processes jobs. Any node can act as a source or a destination.
Filespooler can process queues asynchronously by storing the job queue as files. No active connection between the source and destination is needed. In fact, the queue can be processed even if the source and destination never communicate with each other directly.
By default, Filespooler processes jobs strictly in the order the jobs are created, not when they're transmitted or received. If the source creates jobs 1 and 2 and sends job 2 to the destination before sending job 1, the destination will still process job 1 first. Filespooler can also order jobs by creation date.

Filespooler has a variety of uses, from applying patches to incremental backups to simply ensuring that a series of programs are run in a particular order. Filespooler pairs well with NNCP, wrapping remote execution tasks into a series of files and using a single call to 'nncp-file' to run them.

Like with NNCP, I've written a series of notes for myself on how Filespooler works and how to run it. This is mostly a mirror of those notes.

Concepts

Filespooler works by managing two sequence files: one at the source, which controls the sequence number to be assigned to the next job, and one at the destination, when controls which job number to process next.

To use a queue, Filespooler makes a queue directory. It contains the following:

jobs/

nextseq

nextseq.lock

'jobs/' contains job files that are to be processed by the destination.
'nextseq' contains the job sequence number that Filespooler with start with on the next processing run. This is actually just a plain text file. It starts at 1 by default, but it can be set to any value manually.
'nextseq.lock' is a lock file for the queue sequence number.

If 'jobs/' is synchronized across different locations, i.e. by using a symbolic link or copying with rsync, then 'nextseq' and 'nextseq.lock' only need to be present at the destination. This is actually the recommended configuration because it will prevent the source from processing the queue by accident.

The source also needs some supplementary files to write to the queue.

A sequence file contains the job sequence number that Filespooler will assign to the next job it creates. This is also a plain text file, and by default it starts at 1. The file can be named anything you like--I usually use 'seq' to keep it simple.
The sequence number uses a lock file, just like the queue. If the sequence file is named 'seq', the lock file will be named 'seq.lock'.

Workflow

At its core, Filespooler operates in a very straightforward way.

First, we prepare a job packet at the source. Filespooler does this in four steps:

Data is read from a file or from STDIN. This is the job payload
The payload is packaged into a job packet and is assigned a job sequence number by the source sequence file. The job packet also includes other information, such as the creation time of the job.
The number in the source sequence file increments by 1.
The job packet data is sent to STDOUT.

If we want to process jobs using a stream or a pipe instead of a file system, we can go directly to processing the job at the destination. This has some caveats that the official documentation covers thoroughly.

Otherwise, we send the job packet to the job queue. Two events happen here:

The job packet is written to a file with a specific name pattern: 'fspl-[string].fspl'. The string in the middle of the name can be anything you want; by default it is a randomly-generated UUID.
The job packet is stored in the 'jobs/' subdirectory of the queue.

Finally, we process queued jobs at destination. We do this in three steps:

We start with the job whose job sequence number matches the current queue sequence number. Earlier jobs are ignored. If no matching job is found, we exit immediately.
The job payload is processed by some command and the result written to STDOUT.
The queue sequence number increments by 1.

The above process continues until one of three stop conditions is met.

All the jobs in the queue have been processed.
We specified the '--maxjobs' parameter and Filespooler has processed the specified number of jobs.
Filespooler encounters an error, such as a missing job.

Installation

Depending on your platform, Filespooler might be available as a package. On Ubuntu, Filespooler can be installed from the Universe repository:

> sudo apt install filespooler

My main distro is Fedora, however, and there I needed to build Filespooler from source.

> sudo dnf install cargo
> cargo install filespooler

If you build Filespooler from source, be sure to add '~/.cargo/bin' to your $PATH.

Example Queue Process

This section walks through a complete example of setting up a queue, adding a job to it, and processing the job.

First, create sample source and destination directories.

> mkdir /tmp/source /tmp/destination

We then need to initialize the job sequence in the source. This can be done by hand, but it's more convenient to use Filespooler to do it.

> fspl prepare-get-next --seqfile /tmp/source/seq
1

Listing the source directory contents shows us the sequence file and lock:

> ls /tmp/source
seq
seq.lock

Now we need to create a queue directory.

> fspl queue-init --queuedir /tmp/queue

When we look inside the queue directory, we see queue sequence files and a 'jobs/' directory, but there are no jobs yet.

> ls -R /tmp/queue
/tmp/queue:
jobs
nextseq
nextseq.lock

/tmp/queue/jobs:

The queue sequence is initialized at 1. We can verify it:

> fspl queue-get-next --queuedir /tmp/queue
1

With our source sequence and queue directory initialized, we're ready to create our fist job. As an example, we'll simply write a job containing some text to the queue. This can be done in several ways, but I will outline two here.

The first method is by writing everything to intermediate files. Start by storing some text in a file.

> echo "test" > /tmp/source/file.txt

Next, create a job packet file out of the source file.

> fspl prepare --seqfile /tmp/source/seq --input /tmp/source/file.txt > /tmp/source/job1.fspl

Finally, move the job file to the queue and rename it to Filespooler's internal name format.

> mv /tmp/source/job1.fspl /tmp/queue/jobs/$(fspl gen-filename)

The second method is to pipe all the data from the moment it's created to the moment it's written to the queue. This is often how Filespooler is invoked, since it's much faster and more compact. Our example can be written as a one-liner like so:

> echo "test" | fspl prepare --seqfile /tmp/source/seq --input - | fspl queue-write --queuedir /tmp/queue

After creating the job file, the source sequence number has incremented. It actually increments when we invoke 'fspl prepare' and before we run 'fspl queue-write'.

> fspl prepare-get-next --seqfile /tmp/source/seq
2

The job is also visible in the 'jobs/' directory.

> ls -R /tmp/queue
/tmp/queue:
jobs
nextseq
nextseq.lock
	
/tmp/queue/jobs:
fspl-a7dd5186-c6a5-4b33-9546-90fb9b4c5c38.fspl

However, the job has not been processed yet, so the queue sequence number remains the same.

> fspl queue-get-next --queuedir /tmp/queue
1

In case we want more information about the jobs in the queue than a simple 'ls' can tell us, Filespooler has its own mechanism to list jobs.

> fspl queue-ls --queuedir /tmp/queue
ID                   creation timestamp          filename
1                    2025-01-31T16:07:20-06:00   fspl-a7dd5186-c6a5-4b33-9546-90fb9b4c5c38.fspl

We can also drill down into a specific job and print specific information.

> fspl queue-info --queuedir /tmp/queue --job 1
FSPL_SEQ=1
FSPL_CTIME_SECS=1738361240
FSPL_CTIME_NANOS=528569291
FSPL_CTIME_RFC3339_UTC=2025-01-31T22:07:20Z
FSPL_CTIME_RFC3339_LOCAL=2025-01-31T16:07:20-06:00
FSPL_JOB_FILENAME=fspl-a7dd5186-c6a5-4b33-9546-90fb9b4c5c38.fspl
FSPL_JOB_QUEUEDIR=/tmp/queue
FSPL_JOB_FULLPATH=/tmp/queue/jobs/fspl-a7dd5186-c6a5-4b33-9546-90fb9b4c5c38.fspl

At last, it's time to process the jobs in the queue. In this example, we'll use the default behaviors: process the entire queue, and process it in order of sequence number (as opposed to processing by creation date).

> fspl queue-process --queuedir /tmp/queue tee -- /tmp/destination/file.txt
test1

The example command we run on the data is 'tee', which outputs the job payload to STDOUT and also writes it to a file.

The act of processing the job increments our queue sequence number.

> fspl queue-get-next --queuedir /tmp/queue
2

The queue is now empty.

> fspl queue-ls --queuedir /tmp/queue
ID                   creation timestamp          filename

We can confirm this by checking the 'jobs/' directory in the queue. The job file is gone.

> ls -R /tmp/queue
/tmp/queue:
jobs
nextseq
nextseq.lock
	
/tmp/queue/jobs:

Finally, we see the file has reached its destination:

> ls /tmp/destination
file.txt

Congratulations, you've used your first Filespooler queue!

More Information

Filespooler has extensive documentation on each of its commands. You can find it online at salsa.debian.org.

(HTTPS) fspl.1.md

[This post was originally written on 2025-06-25.]

---

Up One Level

Home

[Last updated: 2026-01-28]