Commit 0d1fa0b

No public description

PiperOrigin-RevId: 758797293
1 parent c8e9d32 commit 0d1fa0b

11 files changed: +262 -69 lines changed

official/projects/waste_identification_ml/circularnet-docs/content/_index.md

Lines changed: 6 additions & 0 deletions

@@ -46,6 +46,12 @@
 * [Before you begin](/official/projects/waste_identification_ml/circularnet-docs/content/view-data/before-you-begin.md)
 * [Configure the dashboard](/official/projects/waste_identification_ml/circularnet-docs/content/view-data/configure-dashboard.md)
 
+**[Retrain CircularNet models](/official/projects/waste_identification_ml/circularnet-docs/content/retrain-models/_index.md)**
+
+* [Before you begin](/official/projects/waste_identification_ml/circularnet-docs/content/retrain-models/before-you-begin.md)
+* [Prepare the training data](/official/projects/waste_identification_ml/circularnet-docs/content/retrain-models/prepare-data.md)
+* [Launch the training job](/official/projects/waste_identification_ml/circularnet-docs/content/retrain-models/launch-job.md)
+
 ## CircularNet overview
 
 CircularNet is a free computer vision model developed by Google that utilizes
Lines changed: 9 additions & 9 deletions

@@ -1,15 +1,15 @@
-CircularNet provides two image-analysis models. The first detects _material
-types_, and the second detects _material forms_. These models utilize a Mask
-R-CNN algorithm for image training and implement ResNet or MobileNet as the
-convolutional neural networks for image classification tasks.
+CircularNet provides an image-analysis model. It detects _material types_, and
+_material forms_. The model utilizes a Mask R-CNN algorithm for image training
+and implements ResNet or MobileNet as the convolutional neural networks for
+image classification tasks.
 
-The models are loaded sequentially to achieve accurate predictions. When working
-with images, each image undergoes preprocessing before the models use them for
+The model is loaded sequentially to achieve accurate predictions. When working
+with images, each image undergoes preprocessing before the model uses them for
 prediction. In the case of video files, the video is split into individual
 frames at a given frame rate. These frames are then processed in the same
 sequential manner as images.
 
-The predictions from the two models result in two distinct outputs, which are
+The predictions from the model result in two distinct outputs, which are
 then post-processed and combined into a single comprehensive output. This output
 includes critical information such as the number of detected objects, their
 bounding boxes, class names, class IDs, and masks for each object. Further
@@ -24,8 +24,8 @@ flow and real-time updates. A [prediction pipeline](./learn-about-pipeline) for
 Google Cloud pushes the data directly to storage buckets and BigQuery tables,
 which you can connect to the dashboard for [visualization and analysis](/official/projects/waste_identification_ml/circularnet-docs/content/view-data/).
 
-On the other hand, direct data transfer to the cloud for edge device implementations needs a client-side configuration. A [prediction pipeline](/official/projects/waste_identification_ml/circularnet-docs/content/learn-about-pipeline) for devices lets you load models sequentially and store image analysis results locally.
+On the other hand, direct data transfer to the cloud for edge device implementations needs a client-side configuration. A [prediction pipeline](/official/projects/waste_identification_ml/circularnet-docs/content/learn-about-pipeline) for devices lets you load the model and store image analysis results locally.
 
-This section describes how to apply the two specialized CircularNet models using
+This section describes how to apply the specialized CircularNet model using
 a prediction pipeline on the client side to prepare and analyze the images you
 capture.
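The video handling described in the updated overview (splitting a video into frames at a given rate, then treating the frames as images) can be reproduced in isolation for quick testing. A minimal sketch, assuming `ffmpeg` is installed and using hypothetical file names; the pipeline itself performs this step internally:

```
# Extract frames at 15 fps into a local folder (paths illustrative).
mkdir -p frames
ffmpeg -i input_video.mp4 -vf fps=15 frames/frame_%05d.png
```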

official/projects/waste_identification_ml/circularnet-docs/content/analyze-data/prediction-pipeline-in-cloud.md

Lines changed: 41 additions & 23 deletions

@@ -96,11 +96,12 @@ on Google Cloud:
    **Important:** Run the previous command in the `server` folder, which
    contains the `triton_server.sh` script.
 
-1. Exit the `server` folder and open the client folder in the
+1. Exit the `server` folder and open the `client` folder in the
    `prediction_pipeline` directory:
 
    ```
-   cd .. cd client/
+   cd ..
+   cd client/
    ```
 
    This folder contains the `pipeline_images.py` and `pipeline_videos.py`
@@ -110,22 +111,21 @@ on Google Cloud:
 
 1. If you have to modify the scripts to provide your specific paths and values
    for the prediction pipeline, edit the corresponding parameter values on the
-   script. The following example modifies the video pipeline script:
+   script. The following example modifies the image pipeline script:
 
    ```
-   vim run_gcp_videos.sh
+   vim run_images.sh
   ```
 
   The Vim editor displays the following parameters:
 
   ```
-   --input_directoy=<path-to-input-bucket>
+   --input_directory=<path-to-input-bucket>
    --output_directory=<path-to-output-bucket>
    --fps=<frames-per-second>
    --height=<height>
    --width=<width>
-   --material_model=<material-model>
-   --material_form_model=<material-form-model>
+   --model=<circularnet-model>
    --score=<score>
    --search_range=<search-range>
    --memory=<memory>
@@ -136,19 +136,34 @@ on Google Cloud:
 
    Replace the following:
 
-   - `<path-to-input-bucket>`: The path to [the Cloud Storage input bucket you created](#create-the-cloud-storage-input-and-output-buckets), for example `gs://my-input-bucket/`.
-   - `<path-to-output-bucket>`: The path to [the Cloud Storage output bucket you created](#create-the-cloud-storage-input-and-output-buckets), for example `gs://my-output-bucket/`.
-   - `<frames-per-second>`: The rate at which you want to capture images from videos to split videos into frames, for example, 15.
-   - `<height>`: The height in pixels of the image or video frames that the model expects for prediction, for example, 512.
-   - `<width>`: The width in pixels of the image or video frames that the model expects for prediction, for example, 1024.
-   - `<material-model>`: The name of the material model in the Triton inference server that you want to call, for example, `material_resnet_v2_512_1024`.
-   - `<material-form-model>`: The name of the material form model in the Triton inference server that you want to call, for example, `material_form_resnet_v2_512_1024`.
-   - `<score>`: The threshold for model prediction, for example, 0.40.
-   - `<search-range>`: The pixels up to which you want to track an object for object tracking in consecutive frames, for example, 100.
-   - `<memory>`: The frames up to which you want to track an object, for example, 20.
-   - `<project-id>`: The ID of your Google Cloud project, for example, `my-project`.
-   - `<dataset-id>`: The ID that you want to assign to a BigQuery dataset to store prediction results, for example, `circularnet_dataset`.
-   - `<table-id>`: The ID that you want to assign to a BigQuery table to store prediction results, for example, `circularnet_table`. If the table already exists in your Google Cloud project, the pipeline appends results to that table.
+   - `<path-to-input-bucket>`: The path to [the Cloud Storage input bucket you
+     created](#create-the-cloud-storage-input-and-output-buckets), for example
+     `gs://my-input-bucket/`.
+   - `<path-to-output-bucket>`: The path to [the Cloud Storage output bucket
+     you created](#create-the-cloud-storage-input-and-output-buckets), for
+     example `gs://my-output-bucket/`.
+   - `<frames-per-second>`: The rate at which you want to capture images from
+     videos to split videos into frames, for example, `15`.
+   - `<height>`: The height in pixels of the image or video frames that the
+     model expects for prediction, for example, `512`.
+   - `<width>`: The width in pixels of the image or video frames that the
+     model expects for prediction, for example, `1024`.
+   - `<circularnet-model>`: The name of the CircularNet model in the Triton
+     inference server that you want to call, for example,
+     `Jan2025_ver2_merged_1024_1024`.
+   - `<score>`: The threshold for model prediction, for example, `0.70`.
+   - `<search-range>`: The pixels up to which you want to track an object for
+     object tracking in consecutive frames, for example, `100`.
+   - `<memory>`: The frames up to which you want to track an object, for
+     example, `20`.
+   - `<project-id>`: The ID of your Google Cloud project, for example,
+     `my-project`.
+   - `<dataset-id>`: The ID that you want to assign to a BigQuery dataset to
+     store prediction results, for example, `circularnet_dataset`.
+   - `<table-id>`: The ID that you want to assign to a BigQuery table to store
+     prediction results, for example, `circularnet_table`. If the table
+     already exists in your Google Cloud project, the pipeline appends results
+     to that table.
 
    **Note:** If your input files are not videos but images, replace
    `run_gcp_videos.sh` on the command with `run_gcp_images.sh` and remove the
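For reference, the edited parameter block might look as follows once the documentation's own example values are substituted in. This is illustrative only; the `<project-id>`, `<dataset-id>`, and `<table-id>` placeholders belong to flags that fall outside the hunk shown above:

```
--input_directory=gs://my-input-bucket/
--output_directory=gs://my-output-bucket/
--fps=15
--height=512
--width=1024
--model=Jan2025_ver2_merged_1024_1024
--score=0.70
--search_range=100
--memory=20
```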
@@ -160,11 +175,14 @@ on Google Cloud:
 1. Run the prediction pipeline:
 
    ```
-   bash run_gcp_videos.sh
+   bash run_images.sh
    ```
 
-   **Note:** If your input files are not videos but images, replace
-   `run_gcp_videos.sh` on the command with `run_gcp_images.sh`.
+   **Note:** If you have a large amount of input files, you can run the
+   pipeline in a `screen` session in the background without worrying about the
+   terminal closing down. First, you launch the `screen` session with the
+   `screen -R client` command. A new session shell launches. Then, run the
+   `bash run_images.sh` script in the new shell.
 
 The script also creates a `logs` folder inside the `client` folder that saves
 the logs with the troubleshooting results and records from the models.
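The `screen` workflow from the new note condenses to three steps; a minimal sketch using the session name given in the docs:

```
# Create or reattach to a session named "client".
screen -R client
# Inside the session shell, start the pipeline.
bash run_images.sh
# Detach without stopping the job: press Ctrl+A, then D.
```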

official/projects/waste_identification_ml/circularnet-docs/content/analyze-data/prediction-pipeline-in-edge.md

Lines changed: 15 additions & 14 deletions

@@ -65,10 +65,14 @@ in an edge device:
    The Vim editor displays the following parameters:
 
    ```
-   --input_directoy=<path-to-input-folder>
-   --output_directory=<path-to-output-folder> height=<height> width=<width>
-   --material_model=<material-model> material_form_model=<material-form-model>
-   --score=<score> search_range=<search-range> memory=<memory>
+   --input_directory=<path-to-input-folder>
+   --output_directory=<path-to-output-folder>
+   --height=<height>
+   --width=<width>
+   --model=<circularnet-model>
+   --score=<score>
+   --search_range=<search-range>
+   --memory=<memory>
    ```
 
    Replace the following:
@@ -78,20 +82,17 @@ in an edge device:
    - `<path-to-output-folder>`: The path to the local folder for output image
      results in the edge device, for example `/home/images/output_files/`.
    - `<height>`: The height in pixels of the image or video frames that the
-     model expects for prediction, for example, 512.
+     model expects for prediction, for example, `512`.
    - `<width>`: The width in pixels of the image or video frames that the
-     model expects for prediction, for example, 1024.
-   - `<material-model>`: The name of the material model in the Triton
+     model expects for prediction, for example, `1024`.
+   - `<circularnet-model>`: The name of the CircularNet model in the Triton
      inference server that you want to call, for example,
-     `material_resnet_v2_512_1024`.
-   - `<material-form-model>`: The name of the material form model in the
-     Triton inference server that you want to call, for example,
-     `material_form_resnet_v2_512_1024`.
-   - `<score>`: The threshold for model prediction, for example, 0.40.
+     `Jan2025_ver2_merged_1024_1024`.
+   - `<score>`: The threshold for model prediction, for example, `0.70`.
    - `<search-range>`: The pixels up to which you want to track an object for
-     object tracking in consecutive frames, for example, 100.
+     object tracking in consecutive frames, for example, `100`.
    - `<memory>`: The frames up to which you want to track an object, for
-     example, 20.
+     example, `20`.
 
    Save changes and exit the Vim editor. To do this, press the **Esc** key,
    type `:wq`, and then press **Enter**.
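As with the cloud pipeline, a filled-in version of the edge parameter block can serve as a reference. The input folder below is hypothetical (the docs only give an example for the output folder); the remaining values are the documentation's own examples:

```
--input_directory=/home/images/input_files/
--output_directory=/home/images/output_files/
--height=512
--width=1024
--model=Jan2025_ver2_merged_1024_1024
--score=0.70
--search_range=100
--memory=20
```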

official/projects/waste_identification_ml/circularnet-docs/content/deploy-cn/before-you-begin.md

Lines changed: 34 additions & 7 deletions

@@ -21,13 +21,40 @@ one of the following options: <br><br>
 </li>
 <li><p><a href="https://cloud.google.com/compute/docs/gpus/create-gpu-vm-general-purpose">Create a Compute Engine virtual machine (VM) that has attached an NVIDIA T4 GPU</a>. Use the following settings on your VM:</p><br>
 <ul>
-<li><strong>Machine configuration</strong>: GPUs</li>
-<li><strong>GPU type</strong>: NVIDIA T4</li>
-<li><strong>Number of GPUs</strong>: 1</li>
-<li><strong>Machine type</strong>: n1-standard-8 (8 vCPU, 4 core, 30 GB memory)</li>
-<li><strong>Boot disk</strong>: A size of 300 GB</li>
-<li><strong>Identity and API access</strong>: Allow full access to all Cloud APIs</li>
-<li><strong>Firewall</strong>: Allow HTTP and HTTPS traffic</li>
+<li><strong>Machine configuration</strong>:
+<ul>
+<li><strong>Type</strong>: GPUs</li>
+<li><strong>GPU type</strong>: NVIDIA T4</li>
+<li><strong>Number of GPUs</strong>: 1</li>
+<li><strong>Machine type</strong>: n1-standard-8 (8 vCPU, 4 core, 30 GB memory)</li>
+</ul>
+</li>
+<li><strong>OS and storage</strong>: Click
+<b>Change</b> and select the following:
+<ul>
+<li><strong>Operating system</strong>: Deep Learning on Linux</li>
+<li><strong>Version</strong>: Deep Learning VM with
+CUDA 11.3 preinstalled. Debian 11, Python 3.10. You can choose
+any <i>M</i> number with this configuration, for example, M126.</li>
+<li><strong>Boot disk type</strong>: Balanced persistent disk</li>
+<li><strong>Size (GB)</strong>: 300 GB</li>
+</ul>
+</li>
+<li><strong>Security</strong>: Navigate to the <b>Identity and API access</b>
+section and select the following:
+<ul>
+<li><strong>Service accounts</strong>: Compute Engine default service account</li>
+<li><strong>Access scopes</strong>: Allow full access
+to all Cloud APIs</li>
+</ul>
+</li>
+<li><strong>Networking</strong>: Navigate to the <b>Firewall</b>
+section and select the following:
+<ul>
+<li>Allow HTTP traffic</li>
+<li>Allow HTTPS traffic</li>
+</ul>
+</li>
 </ul>
 <p><strong>Note</strong>: Give your VM a name that is easy to remember and deploy in a region and a zone close to your physical location that allows GPUs.</p><br>
 </li>
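The console settings above should map to a single `gcloud` command along these lines. This is a sketch, not taken from the docs: the instance name, zone, and image family are assumptions, so verify them against your project before use:

```
# VM with one NVIDIA T4, n1-standard-8, a Deep Learning on Linux image,
# a 300 GB balanced boot disk, full API access, and HTTP/HTTPS allowed.
gcloud compute instances create circularnet-server \
    --zone=us-central1-a \
    --machine-type=n1-standard-8 \
    --accelerator=type=nvidia-tesla-t4,count=1 \
    --maintenance-policy=TERMINATE \
    --image-family=common-cu113 \
    --image-project=deeplearning-platform-release \
    --boot-disk-type=pd-balanced \
    --boot-disk-size=300GB \
    --scopes=cloud-platform \
    --tags=http-server,https-server
```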

official/projects/waste_identification_ml/circularnet-docs/content/deploy-cn/start-server.md

Lines changed: 9 additions & 15 deletions

@@ -21,24 +21,18 @@ tool open. For more information, see [Connect to VMs](https://cloud.google.com/c
 This script loads as many models as you want at the same time. Later, you can choose which model you want to send your request to from the client side. For more information, see [Prepare and analyze images](/official/projects/waste_identification_ml/circularnet-docs/content/analyze-data/).
 
 For example, when you start analyzing images, you can send them from the
-client to the following models in the Triton server you created:
-
-- `material_resnet_v2_512_1024`: shows the material and its subtype using
-  ResNet for classification on images of 512 x 1024 pixels.
-- `material_form_resnet_v2_512_1024`: shows the form of an object, for
-  example, can or bottle, using ResNet for classification on images of 512
-  x 1024 pixels.
-- `material_mobilenet_v2_512_1024`: shows the material and its subtype
-  using MobileNet for classification on images of 512 x 1024 pixels.
-- `material_form_mobilenet_v2_512_1024`: shows the form of an object, for
-  example, can or bottle, using MobileNet for classification on images of
-  512 x 1024 pixels.
+client to the following model in the Triton server you created:
+
+- `Jan2025_ver2_merged_1024_1024`: shows the material type and form using
+  ResNet for classification on images of 1024 x 1024 pixels.
 
 You have finished setting up the Triton inference server. The server keeps
 running on the backend and your terminal window lets you run new commands to
-interact with it.
+interact with it. It takes some time for the server to be up and running.
+Wait for the **Status ready** message from the server before launching the
+client.
 
-You can confirm the server is running by opening a screen session:
+You can confirm the server is running by opening a `screen` session:
 
 1. List the `screen` sessions:
 
@@ -59,7 +53,7 @@ You can confirm the server is running by opening a screen session:
    server. The models must show a `READY` status on the `screen` session when
   they are successfully deployed.
 
-1. If you want to exit the screen session without stopping the server, press
+1. If you want to exit the `screen` session without stopping the server, press
    **Ctrl + A + D** keys.
 
 ## What's next
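The verification steps above reduce to standard `screen` commands; a minimal sketch, with a hypothetical session name (use whichever name `screen -ls` actually reports):

```
# List active sessions; the Triton server session appears here.
screen -ls
# Attach to it and check that the models report a READY status.
screen -r triton_server
# Detach without stopping the server: press Ctrl+A, then D.
```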
official/projects/waste_identification_ml/circularnet-docs/content/retrain-models/_index.md

Lines changed: 12 additions & 0 deletions

@@ -0,0 +1,12 @@
+# Retrain CircularNet models
+
+CircularNet's models are initially trained with images captured from Material
+Recovery Facilities (MRFs). As a result, these open-source models are
+specialized in a limited set of materials, which might not fully align with the
+specific materials you collect.
+
+To address your specific use case, you can
+retrain the models by utilizing a pipeline built with
+[Vertex AI](https://cloud.google.com/vertex-ai/docs). This pipeline is ideal for
+analyzing materials from diverse sources, adapting to different business
+scenarios, or retraining the model with your own images.
official/projects/waste_identification_ml/circularnet-docs/content/retrain-models/before-you-begin.md

Lines changed: 22 additions & 0 deletions

@@ -0,0 +1,22 @@
+## Before you begin
+
+Before starting the retraining process, ensure you meet the following
+requirements:
+
+1. [Get access to Google Cloud](https://console.cloud.google.com/).
+1. [Open the Google Cloud console](https://cloud.google.com/cloud-console).
+1. [Create a project on your Google Cloud account](https://cloud.google.com/resource-manager/docs/creating-managing-projects).
+1. Enable the Vertex AI and Cloud Storage APIs to manage programmatic
+   access and authentication.
+
+   To enable APIs, see
+   [Enabling an API in your Google Cloud project](https://cloud.google.com/endpoints/docs/openapi/enable-api).
+
+1. [Create a Cloud Storage bucket](https://cloud.google.com/storage/docs/creating-buckets)
+   to store files.
+1. Allocate at least four GPUs for the training job on Vertex AI. For more
+   information, see
+   [Configure compute resources for custom training](https://cloud.google.com/vertex-ai/docs/training/configure-compute).
+1. Set up a service account for Vertex AI, with permissions to perform
+   training jobs. For more information, see
+   [Use a custom service account](https://cloud.google.com/vertex-ai/docs/general/custom-service-account).
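The API and bucket requirements in this list can also be satisfied from the command line. A sketch assuming the gcloud CLI, with a hypothetical project ID, bucket name, and location:

```
# Enable the Vertex AI and Cloud Storage APIs on the project.
gcloud services enable aiplatform.googleapis.com storage.googleapis.com \
    --project=my-project

# Create a Cloud Storage bucket to hold the training files.
gcloud storage buckets create gs://my-retraining-bucket \
    --project=my-project \
    --location=us-central1
```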
