NVIDIA-AI-IOT
diff --git a/‎CHANGELOG.md‎
Lines changed: 30 additions & 8 deletions b/‎CHANGELOG.md‎
Lines changed: 30 additions & 8 deletions
diff --git a/‎docs/troubleshooting.md‎
Lines changed: 99 additions & 0 deletions b/‎docs/troubleshooting.md‎
Lines changed: 99 additions & 0 deletions
diff --git a/‎pyproject.toml‎
Lines changed: 1 addition & 1 deletion b/‎pyproject.toml‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎scripts/pre_commit_check.sh‎
100644100755 b/‎scripts/pre_commit_check.sh‎
100644100755
diff --git a/‎scripts/start_container.sh‎
Lines changed: 2 additions & 2 deletions b/‎scripts/start_container.sh‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎scripts/start_server.sh‎
Lines changed: 3 additions & 3 deletions b/‎scripts/start_server.sh‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎src/live_vlm_webui/__init__.py‎
Lines changed: 1 addition & 1 deletion b/‎src/live_vlm_webui/__init__.py‎
Lines changed: 1 addition & 1 deletion
@@ -7,14 +7,33 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ## [Unreleased]
 
+### Planned for 0.4.0
+- **Multi-session support for cloud deployment**: Scope multi-user / multi-session architecture for cloud deployments (see current limitations in 0.3.0).
+
+---
+
+## [0.3.0] - 2026-03-02
+
+**UI upgrade and robotics-oriented prompts**
+
+### Added
+- **Video overlay controls (play / stop)**:
+  - Big green PLAY button centered on video; animates to top-left and fades when streaming starts
+  - Small red STOP button in top-left while streaming (higher opacity for visibility)
+  - Sidebar start/stop replaced by overlay flow for cleaner UX
+- **Fullscreen mode**: Toggle fullscreen on the video card with VLM output overlay; shrink and mirror buttons remain clickable (z-index fix)
+- **Robotics-oriented prompt preset**: "Robot Navigation (Simple)" system prompt—describe scene and output 5 navigation commands (`linear_x`, `angular_z`) with reasons, e.g. for bathroom-finding or similar tasks
+
 ### Fixed
-- **Model initialization race condition**: Fixed auto-selected models not being sent to server
-  - Previously, if the UI auto-selected a model on page load, it wouldn't be sent to the server
-  - This happened because `fetchModels()` ran before WebSocket connection completed
-  - Symptom: Camera opens but no VLM processing until manually selecting a model
-  - Fix: Send current model to server immediately after WebSocket connects
-  - Ensures server always uses the model shown in UI, even when auto-selected
-  - Result: VLM processing starts automatically without requiring manual model selection
+- **Model initialization race condition**: Auto-selected model is sent to server as soon as WebSocket connects so VLM processing starts without manually re-selecting the model
+- **MediaStreamError on stop**: Track end when user stops is handled as normal shutdown (logged at DEBUG only, no error/traceback)
+- **Fullscreen controls**: Shrink (minimize) and Mirror buttons stay above the VLM overlay and remain clickable in fullscreen
+- **Jetson Thor Docker** ([#14](https://github.com/NVIDIA-AI-IOT/live-vlm-webui/issues/14)): `start_container.sh` now uses `--runtime=nvidia` instead of `--gpus all` on Jetson (Thor and Orin) so containers start correctly
+
+### Changed
+- **WebRTC**: Wait for ICE gathering to complete before sending offer (reduces stuck "checking" connections)
+- **Troubleshooting**: New "WebRTC connection issues" section (ICE stuck, firewall, STUN, verification steps)
+- **Scripts**: `start_server.sh` suggests `kill -9` when port is in use
 
 ---
 
@@ -360,6 +379,9 @@ This is the initial public release of Live VLM WebUI - a real-time vision langua
 
 ---
 
-[Unreleased]: https://github.com/NVIDIA-AI-IOT/live-vlm-webui/compare/v0.1.1...HEAD
+[Unreleased]: https://github.com/NVIDIA-AI-IOT/live-vlm-webui/compare/v0.3.0...HEAD
+[0.3.0]: https://github.com/NVIDIA-AI-IOT/live-vlm-webui/compare/v0.2.1...v0.3.0
+[0.2.1]: https://github.com/NVIDIA-AI-IOT/live-vlm-webui/compare/v0.2.0...v0.2.1
+[0.2.0]: https://github.com/NVIDIA-AI-IOT/live-vlm-webui/compare/v0.1.1...v0.2.0
 [0.1.1]: https://github.com/NVIDIA-AI-IOT/live-vlm-webui/compare/v0.1.0...v0.1.1
 [0.1.0]: https://github.com/NVIDIA-AI-IOT/live-vlm-webui/releases/tag/v0.1.0
@@ -395,6 +395,105 @@ For production use, get a proper SSL certificate from Let's Encrypt or a certifi
 
 ---
 
+## WebRTC Connection Issues
+
+### No VLM analysis results / GPU not increasing / Connection stuck
+
+**Symptoms:**
+- ✅ Server starts successfully
+- ✅ Web UI loads properly
+- ✅ Camera permission granted
+- ❌ No VLM analysis results appear
+- ❌ GPU utilization stays at 0%
+- ❌ Video preview may show but no processing happens
+
+**Root Cause:** WebRTC connection is not completing. The ICE (Interactive Connectivity Establishment) connection gets stuck in "checking" state and never reaches "connected".
+
+**How to verify this is the issue:**
+
+Check server logs for this pattern:
+```log
+ICE gathering state: complete
+Created answer with 1 transceivers
+ICE connection state: checking
+Connection state: connecting
+# ❌ Connection never progresses to "connected"
+```
+
+Check browser console (F12 → Console tab):
+```javascript
+ICE connection state: checking
+# ❌ Should show "connected" but doesn't
+```
+
+**Solution:** This issue has been fixed in recent versions. Update to the latest version:
+
+```bash
+# Update to latest version
+pip install --upgrade live-vlm-webui
+
+# Or if using git:
+cd live-vlm-webui
+git pull
+pip install -e .
+```
+
+**If updating doesn't help, check these:**
+
+1. **Firewall blocking WebRTC:**
+   ```bash
+   # Allow UDP for WebRTC
+   sudo ufw allow 8090/tcp
+   sudo ufw allow 49152:65535/udp  # WebRTC ports
+   ```
+
+2. **STUN server unreachable:**
+   ```bash
+   # Test STUN server connectivity
+   curl -I stun.l.google.com:19302
+   ```
+
+3. **Corporate/Network restrictions:**
+   - Some corporate networks block WebRTC traffic
+   - Try from a different network or use mobile hotspot for testing
+   - Check if UDP traffic is blocked by your router/firewall
+
+4. **Browser compatibility:**
+   - ✅ Chrome/Edge (recommended - best WebRTC support)
+   - ✅ Firefox (good support)
+   - ⚠️ Safari (limited support)
+   - Use latest browser version
+
+5. **SSL certificate issues:**
+   - Make sure you accepted the self-signed certificate warning
+   - Clear browser cache and reload: Ctrl+Shift+R (Cmd+Shift+R on Mac)
+
+**Technical Details:**
+
+The fix ensures ICE candidates are properly gathered before exchanging WebRTC offers. Without this, the peers can't find network paths to connect, leaving the connection in "checking" state indefinitely.
+
+**Verify the fix worked:**
+
+After starting camera, you should see in server logs:
+```log
+✅ ICE gathering state: complete
+✅ Created answer with 1 transceivers
+✅ ICE connection state: checking
+✅ ICE connection state: connected    # ← This line should appear!
+✅ Connection state: connected
+```
+
+And browser console should show:
+```javascript
+ICE connection state: connected  // ← Must see this!
+```
+
+Once connected, you should immediately see:
+- VLM analysis results appearing in the UI
+- GPU utilization increasing (check with `nvidia-smi` or `jtop`)
+
+---
+
 ## VLM Backend Issues
 
 > 📖 **Reference:** For a complete list of available Vision-Language Models across different providers, see [List of VLMs](usage/list-of-vlms.md).
 
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 
 [project]
 name = "live-vlm-webui"
-version = "0.2.1"
+version = "0.3.0"
 description = "Real-time Vision Language Model interaction web interface"
 readme = "README.md"
 requires-python = ">=3.10"
 
@@ -1029,7 +1029,7 @@ elif [ "$ARCH" = "aarch64" ]; then
             if echo "$GPU_NAME" | grep -qi "thor"; then
                 PLATFORM="jetson-thor"
                 PLATFORM_SUFFIX="-jetson-thor"
-                GPU_FLAG="--gpus all"
+                RUNTIME_FLAG="--runtime=nvidia"
                 echo -e "   Platform: ${GREEN}NVIDIA Jetson Thor${NC} (detected via GPU: ${GPU_NAME})"
             else
                 PLATFORM="jetson-orin"
@@ -1046,7 +1046,7 @@ elif [ "$ARCH" = "aarch64" ]; then
         if [ "$L4T_VERSION" -ge 38 ]; then
             PLATFORM="jetson-thor"
             PLATFORM_SUFFIX="-jetson-thor"
-            GPU_FLAG="--gpus all"
+            RUNTIME_FLAG="--runtime=nvidia"
             echo -e "   Platform: ${GREEN}NVIDIA Jetson Thor${NC} (L4T R${L4T_VERSION})"
         else
             PLATFORM="jetson-orin"
 
@@ -184,14 +184,14 @@ if [ "$PORT_IN_USE" = true ]; then
             if [ -n "$PID" ]; then
                 PROC_INFO=$(ps -p $PID -o comm= 2>/dev/null || echo "unknown")
                 echo "  Process using port 8090: PID $PID ($PROC_INFO)"
-                echo "  kill $PID"
+                echo "  kill -9 $PID"
             else
                 echo "  lsof -ti :8090  # Find the process"
-                echo "  kill <PID>      # Stop it"
+                echo "  kill -9 <PID>   # Force stop it"
             fi
         else
             echo "  netstat -tulpn | grep :8090  # Find the process"
-            echo "  kill <PID>                    # Stop it"
+            echo "  kill -9 <PID>                 # Force stop it"
         fi
         echo ""
         echo "Option 3: Use a different port"
 
@@ -20,7 +20,7 @@
 with real-time AI analysis and system monitoring.
 """
 
-__version__ = "0.2.1"
+__version__ = "0.3.0"
 __author__ = "NVIDIA Corporation"
 __license__ = "Apache-2.0"