Does using Tasks (TPL) library make an application multithreaded?
Recently when being interviewed, I got this question.
Q: Have you written multithreaded applications?
A: Yes
Q: Care to explain more?
A: I used Tasks
(Task Parallel library) to carry out some tasks like waiting for some info from internet while loading UI
. This improves my application usability.
Q: But, just you have used TPL
means that you have written an multithreaded
application?
Me: (Not sure what to say1)
So, whats exactly a multi-threaded application? Is it different from using Tasks
?
Solution 1:
Tasks can be used to represent operations taking place on multiple threads, but they don't have to. One can write complex TPL applications that only ever execute in a single thread. When you have a task that, for example, represents a network request for some data, that task is not going to create additional threads to accomplish that goal. Such a program is (hopefully) asynchronous, but not necessarily mutlithreaded.
Parallelism is doing more than one thing at the same time. This may or may not be the result of multiple threads.
Let's go with an analogy here.
Here is how Bob cooks dinner:
- He fills a pot of water, and boils it.
- He then puts pasta in the water.
- He drains the pasta when its done.
- He prepares the ingredients for his sauce.
- He puts all of the ingredients for his sauce in a saucepan.
- He cooks his sauce.
- He puts his sauce on his pasta.
- He eats dinner.
Bob has cooked entirely synchronously with no multithreading, asynchrony, or parallelism when cooking his dinner.
Here is how Jane cooks dinner:
- She fills a pot of water and starts boiling it.
- She prepares the ingredients for her sauce.
- She puts the pasta in the boiling water.
- She puts the ingredients in the saucepan.
- She drains her pasta.
- She puts the sauce on her pasta.
- She eats her dinner.
Jane leveraged asynchronous cooking (without any multithreading) to achieve parallelism when cooking her dinner.
Here is how Servy cooks dinner:
- He tells Bob to boil a pot of water, put in the pasta when ready, and serve the pasta.
- He tells Jane to prepare the ingredients for the sauce, cook it, and then serve it over the pasta when done.
- He waits for Bob and Jane to finish.
- He eats his dinner.
Servy leveraged multiple threads (workers) who each individually did their work synchronously, but who worked asynchronously with respect to each other to achieve parallelism.
Of course, this becomes all the more interesting if we consider, for example, whether our stove has two burners or just one. If our stove has two burners then our two threads, Bob and Jane, are both able to do their work without getting in each others way, much. They might bump shoulders a bit, or each try to grab something from the same cabinet every now and then, so they'll each be slowed down a bit, but not much. If they each need to share a single stove burner though then they won't actually be able to get much done at all whenever the other person is doing work. In that case, the work won't actually get done any faster than just having one person doing the cooking entirely synchronously, like Bob does when he's on his own. In this case we are cooking with multiple threads, but our cooking isn't parallelized. Not all multithreaded work is actually parallel work. This is what happens when you are running multiple threads on a machine with one CPU. You don't actually get work done any faster than just using one thread, because each thread is just taking turns doing work. (That doesn't mean multithreaded programs are pointless on one cores CPUs, they're not, it's just that the reason for using them isn't to improve speed.)
We can even consider how these cooks would do their work using the Task Parallel Library, to see what uses of the TPL correspond to each of these types of cooks:
So first we have bob, just writing normal non-TPL code and doing everything synchronously:
public class Bob : ICook
{
public IMeal Cook()
{
Pasta pasta = PastaCookingOperations.MakePasta();
Sauce sauce = PastaCookingOperations.MakeSauce();
return PastaCookingOperations.Combine(pasta, sauce);
}
}
Then we have Jane, who starts two different asynchronous operations, then waits for both of them after starting each of them to compute her result.
public class Jane : ICook
{
public IMeal Cook()
{
Task<Pasta> pastaTask = PastaCookingOperations.MakePastaAsync();
Task<Sauce> sauceTask = PastaCookingOperations.MakeSauceAsync();
return PastaCookingOperations.Combine(pastaTask.Result, sauceTask.Result);
}
}
As a reminder here, Jane is using the TPL, and she's doing much of her work in parallel, but she's only using a single thread to do her work.
Then we have Servy, who uses Task.Run
to create a task that represents doing work in another thread. He starts two different workers, has them each both synchronously do some work, and then waits for both workers to finish.
public class Servy : ICook
{
public IMeal Cook()
{
var bobsWork = Task.Run(() => PastaCookingOperations.MakePasta());
var janesWork = Task.Run(() => PastaCookingOperations.MakeSauce());
return PastaCookingOperations.Combine(bobsWork.Result, janesWork.Result);
}
}
Solution 2:
A Task
is a promise for future work to be completed. When using it, you can use it for I/O based
work, which does not require you to be using multiple threads for code execution. A good example is using C# 5's feature of async/await
with HttpClient
which does network based I/O work.
However, you can take advantage of the TPL
to do multithreaded work. For example, when using Task.Run
or Task.Factory.Startnew
to start a new task, behind the scenes work gets queued for you on the ThreadPool
, which the TPL
abstracts away for us, allowing you to use multiple threads.
A common scenario for using multiple threads is when you have CPU bound work which can be done simultaneously (in parallel). Working with a multithreaded application comes with great responsibility.
So we see that working with the TPL
dosen't necasserly mean using multiple threads, but you definitely can leverage it to do multithreading.
Solution 3:
Q: But, just you have used TPL means that you have written an multithreaded application?
Smart question, Task != Multi-threaded, Using TaskCompletionSource you can create a Task
which may execute in single thread(may be UI thread only) itself.
Task
is just an abstraction over an operation which may complete in future. It doesn't mean code is multi-threaded. Usually Task
involves multi-threading, not necessarily always.
And keep in mind only with knowledge of TPL
you can't say that you know multi-threading. There are many concepts you need to cover.
- Thread
- Synchronization primitives
- Thread safety
- Asynchronous programming
- Parallel programming which is a part of TPL.
- and lot more ...
and of course Task parallel library too.
Note: this is not the full list, these are just from top of my head.
Resources to learn:
- Eric's old blog
- Eric's new blog
- Stephen toub's blog
For threading alone, I suggest http://www.albahari.com/threading/
For video tutorials I suggest Pluralsight. It is paid but worth the cost.
Last but not least: Yes of course, It is Stackoverflow.