Scoped threads in Rust, and why its async counterpart would be unsound
Disclamer: This post contains a lot of oversimplification.
TL; DR: Scoped threads wait for threads to finish in a loop. Async version can't wait until tasks are finished because futures can be ignored/forgotten.
Fearless concurrency with scoped threads
No standard library API demonstrate Rust's Fearless Concurrency motto better than std::thread::scope
.
Take a look at the example code from the documentation.
let mut a = vec![1, 2, 3];
let mut x = 0;
thread::scope(|s| {
s.spawn(|| {
println!("hello from the first scoped thread");
// We can borrow `a` here.
dbg!(&a);
});
s.spawn(|| {
println!("hello from the second scoped thread");
// We can even mutably borrow `x` here,
// because no other threads are using it.
x += a[0] + a[2];
});
println!("hello from the main thread");
});
// After the scope, we can modify and access our variables again:
a.push(4);
We're spawning two scoped threads that borrow and modify the main thread's data.
In other languages, borrowing and mutating between threads requires great care, and produces memory races if done even subtly wrong. You don't have to worry about such issues in Rust; the Rust compiler prevents all memory race for you.
Scoped threads was introduced in 2015. Unfortunately, that first version was unsound and had to be removed before Rust reached 1.0. Recently, the API was redesigned and had since been added to the standard library of Rust 1.63 (released August 2022) onward.
Let's examine how it works...
Controlling threads' lifetimes
Borrows in Rust have lifetimes. thread::scope
allows your threads to borrow data from the main thread for a lifetime of 'life
.
In the example code shown earlier, 'life
would be a segment in the code. Like this
let mut a = vec![1, 2, 3];
let mut x = 0;
// ------------------+
// |
// this is 'life |
// |
// ------------------+
a.push(4);
To use scoped threads, you first create closures with lifetime not smaller than 'life
. Rust's usual lifetime rules apply here: within 'life
, your closures can't borrow things that someone else is mutating or mutate things that someone else is borrowing.
The job of thread::scope
is to spawn your closures into threads. The closures are only guaranteed to be valid within 'life
, so these threads must start and end within 'life
.
let mut a = vec![1, 2, 3];
let mut x = 0;
let my_closure_1 = || { /* ... */ };
let my_closure_2 = || { /* ... */ };
// start of 'life
// -------------------------------------
// -- spawn the closures here! --
//
// ...
//
// -- make sure they end by here --
// -------------------------------------
// end of 'life
a.push(4);
The start part is easy enough: just spawn the closures! The end part is harder: thread::scope
must ensure that the we don't progress beyond 'life
before all the threads have finished.
To accomplish this, thread::scope
gives you a Scope object called s. To spawn each closure, you do s.spawn(closure)
. This spawns the closure into a thread, and records the existence of that thread within s.
Once all the threads have been spawned, thread::scope
will go into an infinite loop. It will only break from this loop once all the threads spawned by s have completed.
At this point we know that all the spawned closures have been dropped. The work of thread::scope
is done. It returns. We exit the lifetime 'life
.
Overall, expanding thread::scope
would give something like this
let mut a = vec![1, 2, 3];
let mut x = 0;
let my_closure_1 = || { /* ... */ };
let my_closure_2 = || { /* ... */ };
// start of 'life
// --------------------------------------
let s = /* ... */;
s.spawn(my_closure_1);
s.spawn(my_closure_2);
while s.num_remaining_threads() > 0 {}
// --------------------------------------
// end of 'life
a.push(4);
Async version of scoped threads
What's a future?
A future is like a pausable closure. It gets polled, driving it to run some code. It then pauses to wait for something to happen. When that thing happens, the future gets polled again and runs more code...
What's a task?
Tasks are the async equivalent of threads. They are futures that have been placed under control of the executor. The executor will schedule and execute tasks, similar to how the OS schedule and execute threads.
Envisioning scoped tasks
An async version of thread::scope
would probably look something like this
let mut a = vec![1, 2, 3];
let mut x = 0;
scoped_tasks(|s| {
// spawn a task for the executor to run
s.spawn(async { // <-- a future instead of a closure
println!("hello from the first scoped task");
dbg!(&a);
});
s.spawn(async {
println!("hello from the second scoped task");
x += a[0] + a[2];
});
println!("hello from the main future");
}).await; // <-- note the await
a.push(4);
It spawns futures as tasks instead of closures as threads. And note that we need to await it at the end.
Those familiar with async Rust would notice that this looks very similar to joining futures. Internally, though, it is different. By spawning the futures to the executor, we can get asymptotically better performance compared to joining.
Unsoundness of scoped tasks
Unfortunately, scoped_tasks
as envisioned provides an unsound API. How so? Let's demonstrate by abusing the function step-by-step.
First, we create a future fut_1
that we want to spawn. Let's assume the lifetime of fut_1
ends at the same point as 'life
(i.e. fut_1
must be dropped before 'life
ends).
let fut_1 = async { /* ... */ };
Then, let the lifetime 'life
start...
We then call scoped_tasks
. This is an async function, so it returns a future. Let's call this future x.
let x = scoped_tasks(|s| {
s.spawn(fut_1);
});
Usually, we would now do x.await
to let the future be polled to completion automatically.
Instead, to demonstrate unsoundness, we will poll x manually.
// x.await; <-- what we usually do
x.poll(...);
This will cause the future x to do work. In this case, it spawns fut_1
.
The future x is now waiting for the spawned task of fut_1
to complete. In the sync version, we would loop until the threads complete.
In async Rust, running an idle infinite loop for a long time is a bad idea; it "blocks the executor". Instead of looping, our future x must pause and wait to be polled again once the spawned tasks have completed.
This is the critical step where the problem arises. Our future x pauses to wait, but there is nothing to guarantee that the rest of the program will respect x's decision to pause! While x is paused, other parts of the program could do all sorts of things. There is no guarantee that the rest of the program would even honor x's request to be polled again once the tasks have completed!
That's exactly how we'll bring out the unsoundness. Instead of waiting and polling x again at the appropriate time, let's just not care about x anymore. We're not going to wait for the spawned task to complete. We're not going to poll x again.
Our code simply progresses forward to the end of 'life
.
See the unsoundness? The future fut_1
is only guaranteed to last within 'life
. We have gone beyond the end of 'life
, yet the task of fut_1
is still running in the executor. We're running fut_1
beyond it's lifetime!
Dealing with the unsoundness
First, it's not so bad. As you can see, the process to bring out the unsoundness of scoped tasks is pretty contrived. The "manually poll x" step is not something you do in everyday code.
Still, unsound is unsound. We can't just expose scoped_tasks
as a safe function and expect people to use it the correct way.
Alternative solutions
The easiest fix is to simply expose the function as unsafe, and document that user must not ignore the future returned from the function.
Another solution is to make scoped tasks work more similarly to scoped threads, specifically by making it use a loop to wait. This is in essence the approach taken by the async-scoped crate.
But as mentioned, looping like this "blocks the executor" and removes many benefits of async. This can become a real issue if you need to use scoped tasks in many places, or if you are in an environment such as browser WASM, where blocking is totally unacceptable.
New approach: scoped_async_spawn
Async UI faces both of the aforementioned issues: it needs to spawn a task for every UI component, and also be able to run in the browser. To make Async UI work, I made scoped_async_spawn, which uses Pin and runtime checks to make sure that the spawning future (the x in our example) can't be ignored.
The mechanism of scoped_async_spawn is a blog-post worth of content in itself, so I'll make a separate post about it soon. Stay tuned!
Discuss this post on Reddit.