Commit 82200ba
Refine Readme for Animation. (#1873)
* Refine Readme for Animation.

  Signed-off-by: Yao, Qing <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

  For more information, see https://pre-commit.ci

---------

Signed-off-by: Yao, Qing <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 parent e7c88ff commit 82200ba

2 files changed: +212 -133 lines changed
comps/animation/deployment/docker_compose/README.md

Lines changed: 194 additions & 0 deletions
# Deploying Avatar Animation Service

This document provides a comprehensive guide to deploying the Avatar Animation microservice pipeline on Intel platforms.

This guide covers two deployment methods:

- [🚀 1. Quick Start with Docker Compose](#-1-quick-start-with-docker-compose): The recommended method for a fast and easy setup.
- [🚀 2. Manual Step-by-Step Deployment (Advanced)](#-2-manual-step-by-step-deployment-advanced): For users who want to build and run each container individually.

## 🚀 1. Quick Start with Docker Compose

This method uses Docker Compose to start all necessary services with a single command. It is the fastest and easiest way to get the service running.

### 1.1. Access the Code

Clone the repository and navigate to the deployment directory:

```bash
git clone https://github.com/opea-project/GenAIComps.git
cd GenAIComps/comps/animation/deployment/docker_compose
```
### 1.2. Deploy the Service

Choose the command corresponding to your target platform.

- **For Intel® Xeon® CPU:**

  ```bash
  docker compose -f compose.yaml up animation -d
  ```

- **For Intel® Gaudi® 2 HPU:**

  ```bash
  docker compose -f compose.yaml up animation-gaudi -d
  ```
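Before moving on to validation, you can confirm that the containers came up cleanly. A quick check using the same Compose file (the service names are the ones defined above):

```bash
# List the Compose services and their current status
docker compose -f compose.yaml ps
```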
### 1.3. Validate the Service

Once the containers are running, you can validate the service. **Note:** Run these commands from the root of the `GenAIComps` repository.

```bash
# Navigate back to the root directory if you are in the docker_compose folder
cd ../../..

# Validate the Animation service endpoint
export ip_address=$(hostname -I | awk '{print $1}')
curl http://${ip_address}:9066/v1/animation -X POST \
  -H "Content-Type: application/json" \
  -d @comps/animation/src/assets/audio/sample_question.json
```
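If you want to capture the response for scripting, the same call can be made with the output saved to a variable. A minimal sketch, assuming `jq` is installed for pretty-printing (omit the final pipe otherwise):

```bash
# Send the request and store the JSON response
response=$(curl -s http://${ip_address}:9066/v1/animation -X POST \
  -H "Content-Type: application/json" \
  -d @comps/animation/src/assets/audio/sample_question.json)

# Pretty-print the result
echo "$response" | jq .
```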
The expected output will be a JSON object containing the path to the generated video file:

```json
{ "wav2lip_result": ".../GenAIComps/comps/animation/src/assets/outputs/result.mp4" }
```

The generated video `result.mp4` will be available in the `comps/animation/src/assets/outputs/` directory.
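If the request fails or times out, the container logs are the first place to look. For example, from the `comps/animation/deployment/docker_compose` directory (use `animation-gaudi` instead of `animation` on Gaudi):

```bash
# Inspect the animation service logs for errors
docker compose -f compose.yaml logs animation
```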
### 1.4. Clean Up the Deployment

To stop and remove the containers, run the following command from the `comps/animation/deployment/docker_compose` directory:

```bash
docker compose down
```

---
## 🚀 2. Manual Step-by-Step Deployment (Advanced)

This section provides detailed instructions for building the Docker images and running each microservice container individually.

### 2.1. Clone the Repository

If you haven't already, clone the repository and navigate to the root directory:

```bash
git clone https://github.com/opea-project/GenAIComps.git
cd GenAIComps
```
### 2.2. Build the Docker Images

#### 2.2.1. Build Wav2Lip Server Image

- **For Intel® Xeon® CPU:**

  ```bash
  docker build -t opea/wav2lip:latest -f comps/third_parties/wav2lip/src/Dockerfile .
  ```

- **For Intel® Gaudi® 2 HPU:**

  ```bash
  docker build -t opea/wav2lip-gaudi:latest -f comps/third_parties/wav2lip/src/Dockerfile.intel_hpu .
  ```

#### 2.2.2. Build Animation Server Image

```bash
docker build -t opea/animation:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/animation/src/Dockerfile .
```
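To confirm the builds succeeded, you can list the resulting images (the pattern below also matches `opea/wav2lip-gaudi` if you built for Gaudi):

```bash
# Show only the freshly built OPEA images
docker images --format '{{.Repository}}:{{.Tag}}' | grep -E 'opea/(wav2lip|animation)'
```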
### 2.3. Configure Environment Variables

Set the necessary environment variables for the containers. Note that `CHECKPOINT_PATH` and `PYTHON_PATH` refer to locations inside the container images, not on the host.

- **For Intel® Xeon® CPU:**

  ```bash
  export ip_address=$(hostname -I | awk '{print $1}')
  export DEVICE="cpu"
  export WAV2LIP_PORT=7860
  export CHECKPOINT_PATH='/usr/local/lib/python3.11/site-packages/Wav2Lip/checkpoints/wav2lip_gan.pth'
  export PYTHON_PATH='/usr/bin/python3.11'
  ```

- **For Intel® Gaudi® 2 HPU:**

  ```bash
  export ip_address=$(hostname -I | awk '{print $1}')
  export DEVICE="hpu"
  export WAV2LIP_PORT=7860
  export CHECKPOINT_PATH='/usr/local/lib/python3.10/dist-packages/Wav2Lip/checkpoints/wav2lip_gan.pth'
  export PYTHON_PATH='/usr/bin/python3.10'
  ```
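A quick sanity check that the variables are set before launching the containers (purely illustrative):

```bash
# Print the exported values; empty fields indicate a missed export
printf '%s\n' "ip_address=$ip_address" "DEVICE=$DEVICE" \
  "WAV2LIP_PORT=$WAV2LIP_PORT" "CHECKPOINT_PATH=$CHECKPOINT_PATH" \
  "PYTHON_PATH=$PYTHON_PATH"
```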
### 2.4. Run the Microservice Containers

#### 2.4.1. Run Wav2Lip Microservice

- **For Intel® Xeon® CPU:**

  ```bash
  docker run --privileged -d --name "wav2lip-service" -p $WAV2LIP_PORT:$WAV2LIP_PORT --ipc=host \
    -w /home/user/comps/animation/src \
    -v $(pwd)/comps/animation/src/assets:/home/user/comps/animation/src/assets \
    -e PYTHON=$PYTHON_PATH \
    -e DEVICE=$DEVICE \
    -e CHECKPOINT_PATH=$CHECKPOINT_PATH \
    -e WAV2LIP_PORT=$WAV2LIP_PORT \
    opea/wav2lip:latest
  ```

- **For Intel® Gaudi® 2 HPU:**

  ```bash
  docker run --privileged -d --name "wav2lip-gaudi-service" -p $WAV2LIP_PORT:$WAV2LIP_PORT --runtime=habana --cap-add=sys_nice --ipc=host \
    -w /home/user/comps/animation/src \
    -v $(pwd)/comps/animation/src/assets:/home/user/comps/animation/src/assets \
    -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none \
    -e PYTHON=$PYTHON_PATH \
    -e DEVICE=$DEVICE \
    -e CHECKPOINT_PATH=$CHECKPOINT_PATH \
    -e WAV2LIP_PORT=$WAV2LIP_PORT \
    opea/wav2lip-gaudi:latest
  ```
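Before starting the animation service, you can verify the Wav2Lip server is responding using the check script shipped in the repository (run from the `GenAIComps` root):

```bash
python3 comps/third_parties/wav2lip/src/check_wav2lip_server.py
```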
#### 2.4.2. Run Animation Microservice

```bash
docker run -d --name "animation-service" -p 9066:9066 --ipc=host \
  -e http_proxy=$http_proxy \
  -e https_proxy=$https_proxy \
  -e WAV2LIP_ENDPOINT=http://$ip_address:$WAV2LIP_PORT \
  opea/animation:latest
```
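The animation container reaches the Wav2Lip container through the host IP set in `WAV2LIP_ENDPOINT`, so both containers must be up before validation. A quick status check:

```bash
# Show name, status, and ports for both services
docker ps --filter name=wav2lip --filter name=animation \
  --format 'table {{.Names}}\t{{.Status}}\t{{.Ports}}'
```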
### 2.5. Validate the Service

After starting both containers, test the animation service endpoint. Make sure you are in the root directory of the `GenAIComps` repository.

```bash
# The ip_address variable should be set from step 2.3
curl http://${ip_address}:9066/v1/animation -X POST \
  -H "Content-Type: application/json" \
  -d @comps/animation/src/assets/audio/sample_question.json
```

You should see a successful response with the path to the output video.
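Alternatively, the repository provides a validation script for the animation service (run from the `GenAIComps` root):

```bash
python3 comps/third_parties/wav2lip/src/check_animation_server.py
```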
### 2.6. Clean Up the Deployment

To stop and remove the containers you started manually, use the `docker stop` and `docker rm` commands.

- **For Intel® Xeon® CPU:**

  ```bash
  docker stop wav2lip-service animation-service
  docker rm wav2lip-service animation-service
  ```

- **For Intel® Gaudi® 2 HPU:**

  ```bash
  docker stop wav2lip-gaudi-service animation-service
  docker rm wav2lip-gaudi-service animation-service
  ```
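If you also want to remove the images built in step 2.2:

```bash
docker rmi opea/wav2lip:latest opea/animation:latest
# On Gaudi, also: docker rmi opea/wav2lip-gaudi:latest
```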

comps/animation/src/README.md

Lines changed: 18 additions & 133 deletions
@@ -2,146 +2,31 @@
 
 The avatar animation model is a combination of two models: Wav2Lip and GAN-based face generator (GFPGAN). The Wav2Lip model is used to generate lip movements from an audio file, and the GFPGAN model is used to generate a high-quality face image from a low-quality face image. The avatar animation microservices takes an audio piece and a low-quality face image/video as input, fuses mel-spectrogram from the audio with frame(s) from the image/video, and generates a high-quality video of the face image with lip movements synchronized with the audio.
 
-# 🚀1. Start Microservice with Docker (option 1)
+## Table of contents
 
-## 1.1 Build the Docker images
+1. [Architecture](#architecture)
+2. [Deployment Options](#deployment-options)
+3. [Validated Configurations](#validated-configurations)
 
-### 1.1.1 Wav2Lip Server image
+## Architecture
 
-```bash
-git clone https://github.com/opea-project/GenAIComps.git
-cd GenAIComps
-```
+The Avatar Animation service consists of two primary microservices:
 
-- Xeon CPU
+- **Wav2Lip Server**: This microservice is the core engine for lip synchronization. It takes an audio file and a face image/video as input and generates a video where the lip movements match the provided audio. It can be deployed on both CPU and HPU.
+- **Animation Server**: This microservice acts as an orchestrator or gateway. It exposes a single endpoint for the user, receives the request, forwards it to the Wav2Lip server for processing, and then returns the final generated video to the user.
 
-```bash
-docker build -t opea/wav2lip:latest -f comps/third_parties/wav2lip/src/Dockerfile .
-```
+## Deployment Options
 
-- Gaudi2 HPU
+For detailed, step-by-step instructions on how to deploy the Avatar Animation microservice using Docker Compose on different Intel platforms, please refer to the deployment guide. The guide contains all necessary steps, including building images, configuring the environment, and running the service.
 
-```bash
-docker build -t opea/wav2lip-gaudi:latest -f comps/third_parties/wav2lip/src/Dockerfile.intel_hpu .
-```
+| Platform          | Deployment Method | Link                                                       |
+| ----------------- | ----------------- | ---------------------------------------------------------- |
+| Intel Xeon/Gaudi2 | Docker Compose    | [Deployment Guide](../deployment/docker_compose/README.md) |
 
-### 1.1.2 Animation server image
+## Validated Configurations
 
-```bash
-docker build -t opea/animation:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/animation/src/Dockerfile .
-```
+The following configurations have been validated for the Avatar Animation microservice.
 
-## 1.2. Set environment variables
-
-- Xeon CPU
-
-```bash
-export ip_address=$(hostname -I | awk '{print $1}')
-export DEVICE="cpu"
-export WAV2LIP_PORT=7860
-export ANIMATION_PORT=9066
-export INFERENCE_MODE='wav2lip+gfpgan'
-export CHECKPOINT_PATH='/usr/local/lib/python3.11/site-packages/Wav2Lip/checkpoints/wav2lip_gan.pth'
-export FACE="assets/img/avatar1.jpg"
-# export AUDIO='assets/audio/eg3_ref.wav' # audio file path is optional, will use base64str in the post request as input if is 'None'
-export AUDIO='None'
-export FACESIZE=96
-export OUTFILE="assets/outputs/result.mp4"
-export GFPGAN_MODEL_VERSION=1.4 # latest version, can roll back to v1.3 if needed
-export UPSCALE_FACTOR=1
-export FPS=10
-```
-
-- Gaudi2 HPU
-
-```bash
-export ip_address=$(hostname -I | awk '{print $1}')
-export DEVICE="hpu"
-export WAV2LIP_PORT=7860
-export ANIMATION_PORT=9066
-export INFERENCE_MODE='wav2lip+gfpgan'
-export CHECKPOINT_PATH='/usr/local/lib/python3.10/dist-packages/Wav2Lip/checkpoints/wav2lip_gan.pth'
-export FACE="assets/img/avatar1.jpg"
-# export AUDIO='assets/audio/eg3_ref.wav' # audio file path is optional, will use base64str in the post request as input if is 'None'
-export AUDIO='None'
-export FACESIZE=96
-export OUTFILE="assets/outputs/result.mp4"
-export GFPGAN_MODEL_VERSION=1.4 # latest version, can roll back to v1.3 if needed
-export UPSCALE_FACTOR=1
-export FPS=10
-```
-
-# 🚀2. Run the Docker container
-
-## 2.1 Run Wav2Lip Microservice
-
-- Xeon CPU
-
-```bash
-docker run --privileged -d --name "wav2lip-service" -p 7860:7860 --ipc=host -w /home/user/comps/animation/src -e PYTHON=/usr/bin/python3.11 -v $(pwd)/comps/animation/src/assets:/home/user/comps/animation/src/assets -e DEVICE=$DEVICE -e INFERENCE_MODE=$INFERENCE_MODE -e CHECKPOINT_PATH=$CHECKPOINT_PATH -e FACE=$FACE -e AUDIO=$AUDIO -e FACESIZE=$FACESIZE -e OUTFILE=$OUTFILE -e GFPGAN_MODEL_VERSION=$GFPGAN_MODEL_VERSION -e UPSCALE_FACTOR=$UPSCALE_FACTOR -e FPS=$FPS -e WAV2LIP_PORT=$WAV2LIP_PORT opea/wav2lip:latest
-```
-
-- Gaudi2 HPU
-
-```bash
-docker run --privileged -d --name "wav2lip-gaudi-service" -p 7860:7860 --runtime=habana --cap-add=sys_nice --ipc=host -w /home/user/comps/animation/src -v $(pwd)/comps/animation/src/assets:/home/user/comps/animation/src/assets -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none -e PYTHON=/usr/bin/python3.10 -e DEVICE=$DEVICE -e INFERENCE_MODE=$INFERENCE_MODE -e CHECKPOINT_PATH=$CHECKPOINT_PATH -e FACE=$FACE -e AUDIO=$AUDIO -e FACESIZE=$FACESIZE -e OUTFILE=$OUTFILE -e GFPGAN_MODEL_VERSION=$GFPGAN_MODEL_VERSION -e UPSCALE_FACTOR=$UPSCALE_FACTOR -e FPS=$FPS -e WAV2LIP_PORT=$WAV2LIP_PORT opea/wav2lip-gaudi:latest
-```
-
-## 2.2 Run Animation Microservice
-
-```bash
-docker run -d -p 9066:9066 --ipc=host --name "animation-service" -e http_proxy=$http_proxy -e https_proxy=$https_proxy -e WAV2LIP_ENDPOINT=http://$ip_address:7860 opea/animation:latest
-```
-
-# 🚀3. Start Microservice with Docker Compose
-
-Alternatively, you can also start the Animation microservice with Docker Compose.
-
-- Xeon CPU
-
-```bash
-cd comps/animation/deployment/docker_compose
-docker compose -f compose.yaml up animation -d
-
-```
-
-- Gaudi2 HPU
-
-```bash
-cd comps/animation/deployment/docker_compose
-docker compose -f compose.yaml up animation-gaudi -d
-```
-
-# 🚀4. Validate Microservice
-
-Once microservice starts, user can use below script to validate the running microservice.
-
-## 4.1 Validate Wav2Lip service
-
-```bash
-cd GenAIComps
-python3 comps/third_parties/wav2lip/src/check_wav2lip_server.py
-```
-
-## 4.2 Validate Animation service
-
-```bash
-cd GenAIComps
-export ip_address=$(hostname -I | awk '{print $1}')
-curl http://${ip_address}:9066/v1/animation -X POST -H "Content-Type: application/json" -d @comps/animation/src/assets/audio/sample_question.json
-```
-
-or
-
-```bash
-cd GenAIComps
-python3 comps/third_parties/wav2lip/src/check_animation_server.py
-```
-
-The expected output will be a message similar to the following:
-
-```bash
-{'wav2lip_result': '....../GenAIComps/comps/animation/src/assets/outputs/result.mp4'}
-```
-
-Please find "comps/animation/src/assets/outputs/result.mp4" as a reference generated video.
+| **Deploy Method** | **Core Models** | **Platform**      |
+| ----------------- | --------------- | ----------------- |
+| Docker Compose    | Wav2Lip, GFPGAN | Intel Xeon/Gaudi2 |
