Skip to content

Commit 711ba22

Browse files
authored
Merge pull request #156 from arm-university/c_page_changes
Update Challenge_Page.html
2 parents 4fb854c + 0680a61 commit 711ba22

1 file changed

Lines changed: 18 additions & 11 deletions

File tree

docs/Challenge_Page.html

Lines changed: 18 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -198,6 +198,11 @@ <h2>Support & Queries</h2>
198198
clearly mentioning the problem statement number in the subject line.
199199
</p>
200200

201+
<p>For get started resources:</p>
202+
<p>
203+
<a href="https://arm-university.github.io/online-resources-arm/">Arm Education Resources</a>
204+
</p>
205+
201206
<p class="ch-small">All updates will be posted on this page.</p>
202207
</section>
203208

@@ -380,15 +385,16 @@ <h3>Key Requirements</h3>
380385
<ul>
381386
<li>Hardware:
382387
<ul>
383-
<li>Raspberry Pi 4 (or similar Arm SBC).</li>
388+
<li>Raspberry Pi 5 or 4 (or similar Arm SBC).</li>
384389
<li>USB microphone.</li>
385390
<li>Speaker via 3.5 mm jack or HDMI.</li>
391+
<li>Where possible, aim to use the CPU without additional accelerators/hats. Solutions that are well-optimised through use of Quantisation, KleidiAI, and appropriate model selection - and therefore able to run entirely on CPU - are of great interest.</li>
386392
</ul>
387393
</li>
388394
<li>Software:
389395
<ul>
390396
<li>Python with PyAudio for audio I/O.</li>
391-
<li>Coqui STT or fine-tuned wav2vec2 for ASR.</li>
397+
<li>Coqui STT or fine-tuned wav2vec2 for ASR, or similar.</li>
392398
<li>eSpeak-NG or Festival for TTS.</li>
393399
<li>Custom Python logic for intent recognition and command execution.</li>
394400
</ul>
@@ -509,7 +515,7 @@ <h3>Objective</h3>
509515
<h3>Project Description</h3>
510516
<p>
511517
Students will choose a lightweight object detector (e.g., MobileNet-SSD, YOLOv5s), convert it to an
512-
edge-optimized format (TensorFlow Lite / ONNX Runtime), and integrate it with an OpenCV video pipeline.
518+
edge-optimized format (TensorFlow Lite / ONNX Runtime / ExecuTorch), and integrate it with an OpenCV video pipeline.
513519
Detected anomalies should trigger timestamped logs or saved clips.
514520
</p>
515521
<div style="margin:24px 0; display:flex; justify-content:center;">
@@ -526,16 +532,17 @@ <h3>Key Requirements</h3>
526532
<ul>
527533
<li>Hardware:
528534
<ul>
529-
<li>Raspberry Pi 4 with proper cooling.</li>
535+
<li>Raspberry Pi 5 or 4.</li>
530536
<li>Raspberry Pi Camera Module v2 or USB webcam.</li>
531537
<li>High-write-speed microSD card.</li>
538+
<li>Where possible, aim to use the CPU without additional accelerators/hats. Solutions that are well-optimised through use of Quantisation, KleidiAI, and appropriate model selection - and therefore able to run entirely on CPU - are of great interest.</li>
532539
</ul>
533540
</li>
534541
<li>Software:
535542
<ul>
536543
<li>Raspberry Pi OS.</li>
537544
<li>Python, OpenCV.</li>
538-
<li>TensorFlow Lite / ONNX Runtime with a pre-trained, quantized detection model.</li>
545+
<li>TensorFlow Lite / ONNX Runtime / ExecuTorch with a pre-trained, quantized detection model.</li>
539546
</ul>
540547
</li>
541548
</ul>
@@ -576,7 +583,7 @@ <h3>Learning Outcomes</h3>
576583

577584
<h3>Objective</h3>
578585
<p>
579-
Build a fully local, real-time speech-to-speech translation system optimized for Arm-based CPUs, leveraging SME2 where available (preferred) or NEON/NPU otherwise. The system must perform speech recognition, LLM-based translation or semantic rewriting, and speech synthesis entirely on-device, meeting mobile latency, power, and thermal constraints.
586+
Build a fully local, real-time speech-to-speech translation system optimized for Arm-based CPUs, leveraging SME2 where available (preferred) or NEON or an onboard NPU otherwise. The system must perform speech recognition, LLM-based translation or semantic rewriting, and speech synthesis entirely on-device, meeting mobile latency, power, and thermal constraints.
580587
</p>
581588

582589
<h3>Project Description</h3>
@@ -590,7 +597,7 @@ <h3>Project Description</h3>
590597
<li>LLM-based translation or semantic rewriting, and</li>
591598
<li>Text-to-speech (TTS) synthesis,</li>
592599
</ol>
593-
to produce natural, fluent spoken output in Language B. All inference must run locally with no cloud dependency, demonstrating efficient use of Arm CPU acceleration and mobile-friendly optimizations.
600+
to produce natural, fluent spoken output in Language B. All inference must run locally with no cloud dependency, demonstrating efficient use of Arm CPU acceleration and mobile-friendly optimizations. To see all SME2-enabled devices, and for resources to get started, see this related Arm Developer Labs project. <a href="https://arm-university.github.io/Arm-Developer-Labs/2025/11/27/Edge-AI-On-Mobile.html">Edge AI on Mobile using SME2</a>
594601
</p>
595602
<div style="margin:24px 0; display:flex; justify-content:center;">
596603
<iframe width="560" height="315"
@@ -607,8 +614,10 @@ <h3>Key Requirements</h3>
607614
<li>Hardware:
608615
<ul>
609616
<li>Arm-based smartphone CPU</li>
610-
<li>SME2-enabled device preferred; otherwise NEON-optimized CPU or optional onboard NPU</li>
617+
<li>SME2-enabled device preferred; otherwise utilise an Arm-based CPU and NEON instructions or optional onboard NPU</li>
611618
<li>Microphone and audio output (speaker or headphones)</li>
619+
<li>Where possible, aim to use the CPU (leveraging SME2 if available) without using an onboard NPU. Solutions that are well-optimised through use of Quantisation, KleidiAI, and appropriate model selection - and therefore able to run entirely on CPU - are of great interest.</li>
620+
<li>No cloud inference permitted</li>
612621
</ul>
613622
</li>
614623
<li>Software:
@@ -632,8 +641,6 @@ <h3>Key Requirements</h3>
632641
<li>Examples: FastSpeech2 + HiFiGAN, VITS-lite</li>
633642
</ul>
634643
</li>
635-
<li><strong>No cloud inference permitted</strong></li>
636-
<li><strong>Quantization and Arm-specific optimizations required (SME2/NEON)</strong></li>
637644
</ul>
638645
</li>
639646
</ul>
@@ -840,4 +847,4 @@ <h2 style="color:#7c3aed; margin:0 0 15px; font-size:24px;">Thank You!</h2>
840847
});
841848
}
842849
});
843-
</script>
850+
</script>

0 commit comments

Comments
 (0)