{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"\n",
"Custom Workloads with Dask Delayed\n",
"==================================\n",
"\n",
"\n",
"\n",
"*Because not all problems are dataframes*\n",
"\n",
"This notebook shows using [dask.delayed](http://dask.pydata.org/en/latest/delayed.html) to parallelize generic Python code. \n",
"\n",
"Dask.delayed is a simple and powerful way to parallelize existing code. It allows users to delay function calls into a task graph with dependencies. Dask.delayed doesn't provide any fancy parallel algorithms like Dask.dataframe, but it does give the user complete control over what they want to build.\n",
"\n",
"Systems like Dask.dataframe are built with Dask.delayed. If you have a problem that is paralellizable, but isn't as simple as just a big array or a big dataframe, then dask.delayed may be the right choice for you."
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Start Dask Client for Dashboard\n",
"\n",
"Starting the Dask Client is optional. It will provide a dashboard which \n",
"is useful to gain insight on the computation. \n",
"\n",
"The link to the dashboard will become visible when you create the client below. We recommend having it open on one side of your screen while using your notebook on the other side. This can take some effort to arrange your windows, but seeing them both at the same is very useful when learning."
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"
\n",
"Client\n", "
| \n",
"\n",
"Cluster\n", "
| \n",
"