ray-project
diff --git a/‎_toc.yml‎
Lines changed: 1 addition & 0 deletions b/‎_toc.yml‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎courses/00_Developer_Intro_to_Ray/00_Intro_Ray_Core_Basics.ipynb‎
Lines changed: 3 additions & 3 deletions b/‎courses/00_Developer_Intro_to_Ray/00_Intro_Ray_Core_Basics.ipynb‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎courses/00_Developer_Intro_to_Ray/00a_Intro_Ray_Core_Advancement.ipynb‎
Lines changed: 20 additions & 11 deletions b/‎courses/00_Developer_Intro_to_Ray/00a_Intro_Ray_Core_Advancement.ipynb‎
Lines changed: 20 additions & 11 deletions
diff --git a/‎courses/00_Developer_Intro_to_Ray/01_Intro_Ray_AI_Libs_Overview.ipynb‎
Lines changed: 39 additions & 13 deletions b/‎courses/00_Developer_Intro_to_Ray/01_Intro_Ray_AI_Libs_Overview.ipynb‎
Lines changed: 39 additions & 13 deletions
@@ -68,6 +68,7 @@ parts:
   - file: courses/00_Developer_Intro_to_Ray/output/05_Intro_Ray_Serve_PyTorch_03.ipynb
   - file: courses/00_Developer_Intro_to_Ray/output/05_Intro_Ray_Serve_PyTorch_04.ipynb
   - file: courses/00_Developer_Intro_to_Ray/output/05_Intro_Ray_Serve_PyTorch_05.ipynb
+  - file: courses/00_Developer_Intro_to_Ray/output/README_01.ipynb
 - caption: Anyscale 101
   chapters:
   - file: courses/anyscale_101/output/101_anyscale_intro_jobs_01.ipynb
 
@@ -167,7 +167,7 @@
    "metadata": {},
    "source": [
     "<div class=\"alert alert-info\">\n",
-    "  <strong><a href=\"https://docs.ray.io/en/latest/ray-core/key-concepts.html#tasks\" target=\"_blank\">Tasks</a></strong> is a remote, stateless Python function invokation.\n",
+    "  <strong><a href=\"https://docs.ray.io/en/latest/ray-core/key-concepts.html#tasks\" target=\"_blank\">Tasks</a></strong> is a remote, stateless Python function invocation.\n",
     "</div>\n"
    ]
   },
@@ -466,9 +466,9 @@
  ],
  "metadata": {
   "kernelspec": {
-   "display_name": "Xing-ray-jupyter-3.11",
+   "display_name": "ray-jupyter",
    "language": "python",
-   "name": "xing-ray-jupyter"
+   "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
 
@@ -31,7 +31,7 @@
    "id": "fcd84432-84b0-426e-8546-42f33804f2fa",
    "metadata": {},
    "source": [
-    "This notebook provides a step-by-step introduction to Object store, Tasks, and Actors, which all are the fundamental building blocks of Ray that enables distributed computing.\n",
+    "This notebook provides a step-by-step introduction to Object store, Tasks, and Actors, which are all the fundamental building blocks of Ray that enables distributed computing.\n",
     "\n",
     "<div class=\"alert alert-block alert-info\">\n",
     "\n",
@@ -79,7 +79,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": null,
    "id": "98399ea9-933a-452f-be3f-bc1535006443",
    "metadata": {
     "tags": []
@@ -227,7 +227,7 @@
    "metadata": {},
    "source": [
     "<div class=\"alert alert-info\">\n",
-    "This patterns assumes that two conditions are satisftied:\n",
+    "This pattern assumes that two conditions are satisfied:\n",
     "<ol>\n",
     "<li> the object is large</li>\n",
     "<li> user wants to reuse the object multiple times</li>\n",
@@ -308,6 +308,15 @@
    "metadata": {},
    "outputs": [],
    "source": [
+    "@ray.remote\n",
+    "def remote_add(a, b):\n",
+    "    return a + b\n",
+    "\n",
+    "@ray.remote\n",
+    "def expensive_square(x):\n",
+    "    time.sleep(5)\n",
+    "    return x**2\n",
+    "\n",
     "# 1st task\n",
     "square_ref = expensive_square.remote(2)\n",
     "square_value = ray.get(square_ref)\n",
@@ -474,7 +483,7 @@
    "id": "09269628",
    "metadata": {},
    "source": [
-    "Note we did not have to re-define the remote function, instead we can an update version using `.options`"
+    "Note we did not have to re-define the remote function, instead we could have used `.options`"
    ]
   },
   {
@@ -547,9 +556,9 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "@ray.remote(runtime_env={\"env_vars\": {\"my_custom_env\": \"prod\"}})\n",
+    "@ray.remote(runtime_env={\"env_vars\": {\"MY_CUSTOM_ENV\": \"prod\"}})\n",
     "def f():\n",
-    "    env = os.environ[\"my_custom_env\"]\n",
+    "    env = os.environ[\"MY_CUSTOM_ENV\"]\n",
     "    return f\"My custom env is {env}\""
    ]
   },
@@ -624,7 +633,7 @@
    "source": [
     "However, these resource specifications are not enforced - i.e. they are entirely [logical and not physical](https://docs.ray.io/en/latest/ray-core/scheduling/resources.html#physical-resources-vs-logical-resources).\n",
     "\n",
-    "This means that you can for instance perform multiprocessing ormultithreading within a task and oversubscribe to resources."
+    "This means that you can for instance perform multiprocessing or multithreading within a task and oversubscribe to resources."
    ]
   },
   {
@@ -684,7 +693,7 @@
     "    <li><strong>Memory</strong></li>\n",
     "</ul>\n",
     "\n",
-    "<p>Ray's scheduler checks the <strong>resource specification</strong> (sometimes referred to as <strong>resource shape</strong>) to match tasks and actors with available resources in the cluster. If the exact resource combination is unavailable, Ray may autoscaler the cluster.</p>\n",
+    "<p>Ray's scheduler checks the <strong>resource specification</strong> (sometimes referred to as <strong>resource shape</strong>) to match tasks and actors with available resources in the cluster. If the exact resource combination is unavailable, Ray may autoscale the cluster.</p>\n",
     "\n",
     "<p>You can inspect the current resource availability using:</p>\n",
     "<pre><code>\n",
@@ -1142,7 +1151,7 @@
    "id": "9ad7a2da-0411-4e77-a371-3583a21c949e",
    "metadata": {},
    "source": [
-    "Define an actor with the `@ray.remote` decorator and then use `<class_name>.remote()` ask Ray to construct and instance of this actor somewhere in the cluster.\n",
+    "Define an actor with the `@ray.remote` decorator and then use `<class_name>.remote()` to ask Ray to construct an instance of this actor somewhere in the cluster.\n",
     "\n",
     "We get an actor handle which we can use to communicate with that actor, pass to other code, tasks, or actors, etc."
    ]
@@ -1324,9 +1333,9 @@
  ],
  "metadata": {
   "kernelspec": {
-   "display_name": "Xing-ray-jupyter-3.11",
+   "display_name": "ray-jupyter",
    "language": "python",
-   "name": "xing-ray-jupyter"
+   "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
 
@@ -17,7 +17,7 @@
    "source": [
     "💻 **Launch Locally**: You can run this notebook locally, but performance will be reduced.\n",
     "\n",
-    "🚀 **Launch on Cloud**: A Ray Cluster with 4 GPUs (Click [here](http://console.anyscale.com/register) to easily start a Ray cluster on Anyscale) is recommanded to run this notebook."
+    "🚀 **Launch on Cloud**: A Ray Cluster with 4 GPUs (Click [here](http://console.anyscale.com/register) to easily start a Ray cluster on Anyscale) is recommended to run this notebook."
    ]
   },
   {
@@ -51,15 +51,24 @@
   {
    "cell_type": "code",
    "execution_count": null,
-   "metadata": {
-    "tags": []
-   },
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# (Optional): If you get an XGBoostError at import, you might have to `brew install libomp` before importing xgboost again\n",
+    "!brew install libomp"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
    "outputs": [],
    "source": [
     "import asyncio\n",
     "import fastapi\n",
     "import pandas as pd\n",
     "import requests\n",
+    "# macos: If you get an XGBoostError at import, you might have to `brew install libomp` before importing xgboost again\n",
     "import xgboost\n",
     "from pydantic import BaseModel\n",
     "from sklearn.model_selection import train_test_split\n",
@@ -68,6 +77,7 @@
     "import ray.tune\n",
     "import ray.train\n",
     "from ray.train.xgboost import XGBoostTrainer as RayTrainXGBoostTrainer\n",
+    "from ray.train import RunConfig\n",
     "import ray.data\n",
     "import ray.serve"
    ]
@@ -86,7 +96,7 @@
     "\n",
     "|<img src=\"https://technical-training-assets.s3.us-west-2.amazonaws.com/Introduction_to_Ray_AIR/e2e_air.png\" width=\"100%\" loading=\"lazy\">|\n",
     "|:-:|\n",
-    "|Ray AI Libraries enable end-to-end ML development and provides multiple options for integrating with other tools and libraries form the MLOps ecosystem.|\n",
+    "|Ray AI Libraries enable end-to-end ML development and provides multiple options for integrating with other tools and libraries from the MLOps ecosystem.|\n",
     "\n"
    ]
   },
@@ -108,7 +118,7 @@
     "* **`fare_amount`**\n",
     "    * Float representing total price including tax, tip, fees, etc.\n",
     "* **`tolls_amount`**\n",
-    "    * Float represnting the total paid on tolls if any.\n",
+    "    * Float representing the total paid on tolls if any.\n",
     "\n",
     "**Target**\n",
     "* **`trip_amount`**\n",
@@ -177,7 +187,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "model_path = \"/mnt/cluster_storage/model.ubj\" # Modify this path to your local folder if it runs on your local environment"
+    "storage_folder = \"/mnt/cluster_storage/\" # Modify this path to your local folder if it runs on your local environment"
    ]
   },
   {
@@ -186,6 +196,9 @@
    "metadata": {},
    "outputs": [],
    "source": [
+    "from pathlib import Path\n",
+    "model_path = Path(storage_folder) / \"model.ubj\"\n",
+    "\n",
     "def my_xgboost_func(params):    \n",
     "    evals_result = {}\n",
     "    dtrain, dtest = load_data()\n",
@@ -196,6 +209,7 @@
     "        evals=[(dtest, \"eval\")], \n",
     "        evals_result=evals_result,\n",
     "    )\n",
+    "    # Use Path\n",
     "    bst.save_model(model_path)\n",
     "    print(f\"{evals_result['eval']}\")\n",
     "    return {\"eval-rmse\": evals_result[\"eval\"][\"rmse\"][-1]}\n",
@@ -234,6 +248,7 @@
     "        \"max_depth\": 6,\n",
     "        \"eta\": ray.tune.uniform(0.01, 0.3),\n",
     "    },\n",
+    "    run_config=RunConfig(storage_path=storage_folder),\n",
     "    tune_config=ray.tune.TuneConfig(  # Tell it which metric to tune\n",
     "        metric=\"eval-rmse\",\n",
     "        mode=\"min\",\n",
@@ -264,7 +279,7 @@
     "\n",
     "In case your training data is too large, your training might take a long time to complete.\n",
     "\n",
-    "To speed it up, shard the dataset across training workers and perform distributed XBoost training.\n",
+    "To speed it up, shard the dataset across training workers and perform distributed XGBoost training.\n",
     "\n",
     "Let's redefine `load_data` to now load a different slice of the data given the worker index/rank."
    ]
@@ -484,7 +499,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "prediction_pipeline.write_parquet(\"/mnt/cluster_storage/xgboost_predictions\") #update this to your local path if runs on your local"
+    "prediction_pipeline.write_parquet(\"./xgboost_predictions\") #update this to your local path if runs on your local"
    ]
   },
   {
@@ -500,22 +515,33 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "!ls /mnt/cluster_storage/xgboost_predictions/ #update this to your local path if runs on your local"
+    "!ls {storage_folder}/xgboost_predictions/"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### 2.6 Clean up"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
-   "source": []
+   "source": [
+    "# Run this cell for file cleanup \n",
+    "!rm -rf {storage_folder}/xgboost_predictions/\n",
+    "!rm {model_path}"
+   ]
   }
  ],
  "metadata": {
   "kernelspec": {
-   "display_name": "Xing-ray-jupyter-3.11",
+   "display_name": "ray-jupyter",
    "language": "python",
-   "name": "xing-ray-jupyter"
+   "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {