What is a "spark" in Haskell
I'm confused about the notion of "spark"
Is it a thread in Haskell? Or is the action of spawning a new thread ?
Thanks everybody:
So to summarize, sparks are not thread but more of unit of computation (tasks to put it in C#/Java terms). So it's the Haskell way of implementing the task parallelism.
Sparks are not threads. forkIO
introduces Haskell threads (which map down onto fewer real OS threads). Sparks create entries in the work queues for each thread, from which they'll take tasks to execute if the thread becomes idle.
As a result sparks are very cheap (you might have billions of them in a program, while you probably won't have more than a million Haskell threads, and less than a dozen OS threads on half a dozen cores).
Think of it like this:
See A Gentle Introduction to Glasgow Parallel Haskell.
Parallelism is introduced in GPH by the
par
combinator, which takes two arguments that are to be evaluated in parallel. The expressionp `par` e
(here we use Haskell's infix operator notation) has the same value ase
, and is not strict in its first argument, i.e.bottom `par` e
has the value ofe
. (bottom
denotes a non-terminating or failing computation.) Its dynamic behaviour is to indicate thatp
could be evaluated by a new parallel thread, with the parent thread continuing evaluation ofe
. We say thatp
has been sparked, and a thread may subsequently be created to evaluate it if a processor becomes idle. Since the thread is not necessarily created,p
is similar to a lazy future.
[Emphasis in original]