# 🎓 Smart Classroom
The **Smart Classroom** project is a modular, extensible framework designed to process and summarize educational content using advanced AI models. It supports transcription, summarization, and future capabilities such as video understanding and real-time analysis.
The main features are as follows:

- 🔊 Audio transcription with ASR models (e.g., Whisper, Paraformer)
- 🧠 Summarization using powerful LLMs (e.g., Qwen, LLaMA)
- 📦 Plug-and-play architecture for integrating new ASR and LLM models
- ⚙️ API-first design ready for frontend integration
- 🛠️ Extensible roadmap for real-time streaming, diarization, translation, and video analysis

The goal is to transform raw classroom recordings into concise, structured summaries for students, educators, and learning platforms.

---

### 💻 System Requirements
- **OS:** Windows 11
- **Processor:** Intel® Core Ultra Series 1 (with integrated GPU support)
- **Memory:** 32 GB RAM (minimum recommended)
- **Storage:** At least 50 GB free (for models and logs)
- **GPU/Accelerator:** Intel® iGPU (Intel® Core Ultra Series 1, Arc GPU, or higher) for summarization acceleration
- **Python:** 3.12
- **Node.js:** v18+ (for frontend)

---

### 🧩 Supported Models
#### 🔊 ASR (Automatic Speech Recognition)
- **Whisper** (all models supported)
  - Recommended: `whisper-small` or smaller for CPU efficiency
  - Runs on **CPU** (Whisper is CPU-centric)
- **FunASR (Paraformer)**
  - Recommended for **Chinese transcription** (`paraformer-zh`)
- ✅ Supports transcription of audio files up to 45 minutes in MP3 and WAV formats
#### 🧠 Summarization (LLMs)
- **Qwen Models (OpenVINO / IPEX)**
  - ✅ `Qwen2.0-7B-Instruct`
  - ✅ `Qwen2.5-7B-Instruct`
- 💡 Summarization supports up to 7,500 tokens (≈ 45 minutes of audio) on GPU
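As a back-of-envelope check of the limit above, 7,500 tokens over 45 minutes works out to roughly 167 transcript tokens per minute. The helper below is illustrative only, assuming the budget scales linearly with audio length:

```python
# Back-of-envelope: 7,500 tokens ~ 45 minutes of audio.
MAX_TOKENS = 7500
MAX_MINUTES = 45
tokens_per_minute = MAX_TOKENS / MAX_MINUTES  # ~166.7 tokens per minute

def fits_in_budget(minutes: float) -> bool:
    """Estimate whether a recording of the given length fits the token budget."""
    return minutes * tokens_per_minute <= MAX_TOKENS

print(round(tokens_per_minute))           # 167
print(fits_in_budget(30), fits_in_budget(60))  # True False
```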
#### ⚖️ Supported Weight Formats
- **int8** → Recommended for lower-end CPUs (fast and efficient)
- **fp16** → Recommended for higher-end systems (better accuracy, GPU acceleration)
- **int4** → Supported, but may reduce accuracy (use only if memory-constrained)

💡 Run summarization on **GPU** (Intel® iGPU / Arc GPU) for faster performance.
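The guidance above can be captured as a small heuristic. This is an illustrative sketch, not project code; the function name and RAM thresholds are assumptions:

```python
def pick_weight_format(has_gpu: bool, ram_gb: int) -> str:
    """Heuristic mapping of hardware to a weight format, per the guidance above."""
    if has_gpu and ram_gb >= 32:
        return "fp16"   # higher-end systems: better accuracy, GPU acceleration
    if ram_gb < 16:
        return "int4"   # memory-constrained: accepts some accuracy loss
    return "int8"       # default for lower-end CPUs: fast and efficient

print(pick_weight_format(True, 32))   # fp16
print(pick_weight_format(False, 8))   # int4
print(pick_weight_format(False, 16))  # int8
```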
---

### 🔄 How It Works

The basic architecture follows a modular pipeline designed for efficient audio summarization. It begins with **audio preprocessing**, where FFmpeg chunks the input audio into smaller segments for optimal handling. These segments are processed by an **ASR transcriber** (e.g., Whisper or Paraformer) to convert speech into text. Finally, an **LLM summarizer** (such as Qwen or Llama), optimized through frameworks like OpenVINO IR, Llama.cpp, or IPEX, generates concise summaries, which are delivered via the **output handler** for downstream use.
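The stages described above can be sketched in a few lines. The class and callables below are illustrative stand-ins, not the project's actual API:

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Pipeline:
    chunk: Callable[[str], List[str]]  # audio preprocessing (FFmpeg chunking)
    transcribe: Callable[[str], str]   # ASR transcriber (Whisper / Paraformer)
    summarize: Callable[[str], str]    # LLM summarizer (Qwen / Llama)

    def run(self, audio_path: str) -> str:
        # chunk -> transcribe each segment -> summarize the joined transcript
        segments = self.chunk(audio_path)
        transcript = " ".join(self.transcribe(s) for s in segments)
        return self.summarize(transcript)

# Toy stand-ins so the sketch runs end to end:
pipe = Pipeline(
    chunk=lambda path: [f"{path}#0", f"{path}#1"],
    transcribe=lambda seg: f"text({seg})",
    summarize=lambda t: f"summary of: {t}",
)
print(pipe.run("lecture.mp3"))
```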


---

### ✅ 1. Install Dependencies
**a. Install [FFmpeg](https://ffmpeg.org/download.html)** (required for audio processing):
- Download it from [https://ffmpeg.org/download.html](https://ffmpeg.org/download.html) and add the `ffmpeg/bin` folder to your system `PATH`.
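To confirm FFmpeg is reachable after updating `PATH`, a quick check from Python (an illustrative helper, not part of the project) is:

```python
import shutil

def ffmpeg_available() -> bool:
    """Return True if an `ffmpeg` executable is found on the system PATH."""
    return shutil.which("ffmpeg") is not None

print("ffmpeg found" if ffmpeg_available() else "ffmpeg NOT on PATH - fix your PATH first")
```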
---

**Run your shell with admin privileges before starting the application.**

**b. Clone the repository:**
```bash
git clone --no-checkout https://github.com/open-edge-platform/edge-ai-suites.git
cd edge-ai-suites
git sparse-checkout init --cone
git sparse-checkout set education-ai-suite
git checkout
cd education-ai-suite
```

---
**c. Install Python dependencies**

It's recommended to create a **dedicated Python virtual environment** for the base dependencies.

```bash
python -m venv smartclassroom
smartclassroom\Scripts\activate

cd smart-classroom
python.exe -m pip install --upgrade pip
pip install --upgrade -r requirements.txt
pip install py-cpuinfo
```
---

**d. [Optional] Create a Python venv for the IPEX-based summarizer**

If you plan to use IPEX, create a separate virtual environment.

**Note:** `smartclassroom_ipex` should only be used with FunASR and IPEX-related models (specified in Section 2). Do not configure OpenVINO-related models in `smartclassroom_ipex`.
```bash
python -m venv smartclassroom_ipex
smartclassroom_ipex\Scripts\activate

python.exe -m pip install --upgrade pip
cd smart-classroom
pip install --upgrade -r requirements.txt
pip install --pre --upgrade ipex-llm[xpu_2.6] --extra-index-url https://download.pytorch.org/whl/xpu
```

> 💡 *Use `smartclassroom` if you don't need IPEX. Use `smartclassroom_ipex` if you want IPEX summarization.*

---
### ⚙️ 2. Configuration

#### a. Default Configuration

By default, the project uses Whisper for transcription and OpenVINO-based Qwen models for summarization. You can modify these settings in the configuration file (`smart-classroom/config.yaml`):
```yaml
asr:
  provider: openvino            # Supported: openvino, openai, funasr
  name: whisper-tiny            # Options: whisper-tiny, whisper-small, paraformer-zh, etc.
  device: CPU                   # Whisper currently supports only CPU
  temperature: 0.0

summarizer:
  provider: openvino            # Options: openvino or ipex
  name: Qwen/Qwen2-7B-Instruct  # Examples: Qwen/Qwen1.5-7B-Chat, Qwen/Qwen2-7B-Instruct, Qwen/Qwen2.5-7B-Instruct
  device: GPU                   # Options: GPU or CPU
  weight_format: int8           # Supported: fp16, fp32, int4, int8
  max_new_tokens: 1024          # Maximum tokens to generate in summaries
```
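A small sanity check of the option values listed in the config comments can catch typos before startup. This is an illustrative sketch, not project code:

```python
# Allowed values, taken from the comments in config.yaml:
SUPPORTED = {
    ("asr", "provider"): {"openvino", "openai", "funasr"},
    ("summarizer", "provider"): {"openvino", "ipex"},
    ("summarizer", "device"): {"GPU", "CPU"},
    ("summarizer", "weight_format"): {"fp16", "fp32", "int4", "int8"},
}

def validate(config: dict) -> list:
    """Return a list of error strings for unsupported option values."""
    errors = []
    for (section, key), allowed in SUPPORTED.items():
        value = config.get(section, {}).get(key)
        if value is not None and value not in allowed:
            errors.append(f"{section}.{key}={value!r} not in {sorted(allowed)}")
    return errors

cfg = {"asr": {"provider": "openvino"}, "summarizer": {"provider": "ipex", "weight_format": "int8"}}
print(validate(cfg))  # [] -> no errors
```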
#### b. Chinese Audio Transcription

For Chinese audio transcription, switch to FunASR with Paraformer in your config (`smart-classroom/config.yaml`):

```yaml
asr:
  provider: funasr
  name: paraformer-zh
```
#### c. IPEX-Based Summarization

To use IPEX for summarization, ensure:
- IPEX-LLM is installed.
- The IPEX environment is activated.
- The configuration (`smart-classroom/config.yaml`) is updated as shown below:

```yaml
summarizer:
  provider: ipex
```

**Important:** After updating the configuration, reload the application for changes to take effect.

---
### ✅ 3. Run the Application

Activate the environment before running the application:

```bash
smartclassroom\Scripts\activate   # or smartclassroom_ipex
```

Run the backend:

```bash
python main.py
```

Bring up the frontend:

```bash
cd ui
npm install
npm run dev -- --host 0.0.0.0 --port 5173
```

> ℹ️ Open a second (new) Command Prompt / terminal window for the frontend. The backend terminal stays busy serving requests.
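If you script the startup, you can poll the backend until it answers instead of watching the logs. A minimal sketch (illustrative, assuming the default port 8000):

```python
import time
import urllib.error
import urllib.request

def wait_for_backend(url: str = "http://127.0.0.1:8000", timeout_s: int = 60) -> bool:
    """Poll the backend URL until it responds or the timeout expires."""
    deadline = time.time() + timeout_s
    while time.time() < deadline:
        try:
            urllib.request.urlopen(url, timeout=2)
            return True
        except urllib.error.HTTPError:
            return True  # server is up, even if this path returns a 4xx
        except (urllib.error.URLError, OSError):
            time.sleep(1)  # not up yet; retry
    return False
```

Call `wait_for_backend()` before launching the frontend or sending requests.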
💡 Tip: You should see backend logs similar to this:

```
pipeline initialized
[INFO] __main__: App started, Starting Server...
INFO: Started server process [21616]
INFO: Waiting for application startup.
INFO: Application startup complete.
INFO: Uvicorn running on http://0.0.0.0:8000 (Press CTRL+C to quit)
```

This means your pipeline server has started successfully and is ready to accept requests.

---
### 🖥️ 4. Access the UI

After starting the frontend, you can open the Smart Classroom UI in a browser.

Local machine:
- http://localhost:5173
- http://127.0.0.1:5173

From another device on the same network (replace `<HOST_IP>` with your computer's IP):
- http://<HOST_IP>:5173

Find your IP (Windows PowerShell):
```
ipconfig
```
Use the IPv4 address from your active network adapter.

If you changed the port, adjust the URL accordingly.
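As a programmatic alternative to `ipconfig`, the local IPv4 address can also be discovered from Python (illustrative; the UDP socket sends no packets):

```python
import socket

def local_ip() -> str:
    """Return the local IPv4 address, or a loopback fallback when offline."""
    try:
        with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as s:
            s.connect(("8.8.8.8", 80))  # any routable address works; nothing is sent
            return s.getsockname()[0]
    except OSError:
        return "127.0.0.1"  # no network route available

print(f"http://{local_ip()}:5173")
```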
---

### 🔍 5. Troubleshooting

- Frontend not opening: Ensure you ran `npm run dev` in a second terminal after starting `python main.py`.
- Backend not ready: Wait until Uvicorn shows "Application startup complete" and is listening on port 8000.
- URL fails from another device: Confirm you used `--host 0.0.0.0` and replaced `<HOST_IP>` correctly.
- Nothing at localhost:5173: Check that the frontend terminal shows the Vite server running and there is no port conflict.
- Firewall blocks access: Allow inbound traffic on ports 5173 (frontend) and 8000 (backend) on Windows.
- Auto reload not happening: Refresh manually if the backend was restarted after the initial UI load.
- If you encounter the error "Port for tensor name cache_position was not found." in the backend, the models were not configured as per the instructions in this README. To fix the issue, run:
  ```bash
  pip install --upgrade -r requirements.txt
  ```
  Then delete the models directory from `edge-ai-suites/education-ai-suite/smart-classroom/models` and try again.
- If you face a tokenizer load issue like this:
  ```
  Either openvino_tokenizer.xml was not provided or it was not loaded correctly. Tokenizer::encode is not available
  ```
  Delete the models folder from `edge-ai-suites/education-ai-suite/smart-classroom/models` and try again.
---

### 📚 Learn More

- [System Requirements](./docs/user-guide/system-requirements.md): Check the hardware and software requirements for deploying the application.
- [Get Started](./docs/user-guide/get-started.md): Follow step-by-step instructions to set up the application.
- [Release Notes](./docs/user-guide/release-notes.md)