Need expert help fine-tuning Whisper for Heimish Yiddish (production ASR)

Hi,

I’m looking for hands-on expert guidance on fine-tuning Whisper for Heimish / Hasidic Yiddish (Hebrew script).

Goal

High-quality transcription from seconds-long clips up to multi-hour audio

End-to-end transcription in ~1–3 minutes

The ASR core must be production-ready and ready to hand off for later web deployment
(I'm not building the website yet)

Current state

Whisper-small experimental fine-tune

Output quality is inconsistent (phonetic drift, mixed words)

What I already have

Clean 16 kHz WAV audio

Human-verified transcripts (I’m an experienced transcriber)

Proper UTF-8 CSV metadata

Long-audio chunking is understood (not my blocker)
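
To make the data claims above checkable, here is a minimal sketch of a sanity check over the metadata and audio. It assumes a hypothetical CSV layout with `path` and `text` columns and audio paths relative to the CSV file; those names are illustrative, not the actual layout. It also flags transcripts that are not NFC-normalized, since the same Hebrew-script letter can be encoded in more than one Unicode form. Whether that plays any role in the inconsistent output is unconfirmed.

```python
import csv
import unicodedata
import wave
from pathlib import Path

EXPECTED_RATE = 16_000  # Hz, matching the 16 kHz audio described above


def check_dataset(csv_path, audio_col="path", text_col="text"):
    """Return a list of (row_number, problem) tuples; empty means nothing was flagged.

    Column names are hypothetical; adjust them to the real metadata layout.
    """
    problems = []
    csv_path = Path(csv_path)
    # utf-8-sig tolerates an optional BOM; invalid bytes raise UnicodeDecodeError.
    with open(csv_path, encoding="utf-8-sig", newline="") as f:
        # Row numbers assume one record per line, with the header on line 1.
        for row_no, row in enumerate(csv.DictReader(f), start=2):
            text = (row.get(text_col) or "").strip()
            if not text:
                problems.append((row_no, "empty transcript"))
            elif text != unicodedata.normalize("NFC", text):
                problems.append((row_no, "transcript is not NFC-normalized"))

            wav_path = csv_path.parent / (row.get(audio_col) or "")
            try:
                with wave.open(str(wav_path), "rb") as w:
                    if w.getframerate() != EXPECTED_RATE:
                        problems.append((row_no, f"sample rate {w.getframerate()} Hz"))
                    if w.getnchannels() != 1:
                        problems.append((row_no, f"{w.getnchannels()} channels"))
            except (OSError, wave.Error, EOFError) as exc:
                problems.append((row_no, f"unreadable audio: {exc}"))
    return problems
```

Calling `check_dataset("metadata.csv")` (file name hypothetical) returns an empty list when every row has a non-empty NFC transcript and a readable mono 16 kHz WAV file.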

What I need

Practical guidance from someone who has:

fine-tuned Whisper for dialects / low-resource languages

improved linguistic accuracy and output stability

Not looking for

setup / CUDA issues

basic Whisper explanations

model theory

If you’ve done similar work or can point me to the right person, I’d appreciate it.
Happy to continue privately.

Thanks,


Need expert help fine-tuning Whisper for Heimish Yiddish (production ASR) #1420