You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<pclass="ch-small">All updates will be posted on this page.</p>
202
207
</section>
203
208
@@ -380,15 +385,16 @@ <h3>Key Requirements</h3>
380
385
<ul>
381
386
<li>Hardware:
382
387
<ul>
383
-
<li>Raspberry Pi 4 (or similar Arm SBC).</li>
388
+
<li>Raspberry Pi 5 or 4 (or similar Arm SBC).</li>
384
389
<li>USB microphone.</li>
385
390
<li>Speaker via 3.5 mm jack or HDMI.</li>
391
+
<li>Where possible, aim to use the CPU without additional accelerators/hats. Solutions that are well-optimised through use of Quantisation, KleidiAI, and appropriate model selection - and therefore able to run entirely on CPU - are of great interest.</li>
386
392
</ul>
387
393
</li>
388
394
<li>Software:
389
395
<ul>
390
396
<li>Python with PyAudio for audio I/O.</li>
391
-
<li>Coqui STT or fine-tuned wav2vec2 for ASR.</li>
397
+
<li>Coqui STT or fine-tuned wav2vec2 for ASR, or similar.</li>
392
398
<li>eSpeak-NG or Festival for TTS.</li>
393
399
<li>Custom Python logic for intent recognition and command execution.</li>
394
400
</ul>
@@ -509,7 +515,7 @@ <h3>Objective</h3>
509
515
<h3>Project Description</h3>
510
516
<p>
511
517
Students will choose a lightweight object detector (e.g., MobileNet-SSD, YOLOv5s), convert it to an
512
-
edge-optimized format (TensorFlow Lite / ONNX Runtime), and integrate it with an OpenCV video pipeline.
518
+
edge-optimized format (TensorFlow Lite / ONNX Runtime / ExecuTorch), and integrate it with an OpenCV video pipeline.
513
519
Detected anomalies should trigger timestamped logs or saved clips.
<li>Raspberry Pi Camera Module v2 or USB webcam.</li>
531
537
<li>High-write-speed microSD card.</li>
538
+
<li>Where possible, aim to use the CPU without additional accelerators/hats. Solutions that are well-optimised through use of Quantisation, KleidiAI, and appropriate model selection - and therefore able to run entirely on CPU - are of great interest.</li>
532
539
</ul>
533
540
</li>
534
541
<li>Software:
535
542
<ul>
536
543
<li>Raspberry Pi OS.</li>
537
544
<li>Python, OpenCV.</li>
538
-
<li>TensorFlow Lite / ONNX Runtime with a pre-trained, quantized detection model.</li>
545
+
<li>TensorFlow Lite / ONNX Runtime / ExecuTorch with a pre-trained, quantized detection model.</li>
539
546
</ul>
540
547
</li>
541
548
</ul>
@@ -576,7 +583,7 @@ <h3>Learning Outcomes</h3>
576
583
577
584
<h3>Objective</h3>
578
585
<p>
579
-
Build a fully local, real-time speech-to-speech translation system optimized for Arm-based CPUs, leveraging SME2 where available (preferred) or NEON/NPU otherwise. The system must perform speech recognition, LLM-based translation or semantic rewriting, and speech synthesis entirely on-device, meeting mobile latency, power, and thermal constraints.
586
+
Build a fully local, real-time speech-to-speech translation system optimized for Arm-based CPUs, leveraging SME2 where available (preferred) or NEON or an onboard NPU otherwise. The system must perform speech recognition, LLM-based translation or semantic rewriting, and speech synthesis entirely on-device, meeting mobile latency, power, and thermal constraints.
580
587
</p>
581
588
582
589
<h3>Project Description</h3>
@@ -590,7 +597,7 @@ <h3>Project Description</h3>
590
597
<li>LLM-based translation or semantic rewriting, and</li>
591
598
<li>Text-to-speech (TTS) synthesis,</li>
592
599
</ol>
593
-
to produce natural, fluent spoken output in Language B. All inference must run locally with no cloud dependency, demonstrating efficient use of Arm CPU acceleration and mobile-friendly optimizations.
600
+
to produce natural, fluent spoken output in Language B. All inference must run locally with no cloud dependency, demonstrating efficient use of Arm CPU acceleration and mobile-friendly optimizations. To see all SME2-enabled devices, and for resources to get started, see this related Arm Developer Labs project. <ahref="https://arm-university.github.io/Arm-Developer-Labs/2025/11/27/Edge-AI-On-Mobile.html">Edge AI on Mobile using SME2</a>
<li>SME2-enabled device preferred; otherwise NEON-optimized CPU or optional onboard NPU</li>
617
+
<li>SME2-enabled device preferred; otherwise utilise an Arm-based CPU and NEON instructions or optional onboard NPU</li>
611
618
<li>Microphone and audio output (speaker or headphones)</li>
619
+
<li>Where possible, aim to use the CPU (leveraging SME2 if available) without using an onboard NPU. Solutions that are well-optimised through use of Quantisation, KleidiAI, and appropriate model selection - and therefore able to run entirely on CPU - are of great interest.</li>
0 commit comments