Is long-form inference possible with whisper_trt?
I tried inference on a 4m16s audio clip and it appeared to transcribe only the first 30s. Here is my script:
from whisper_trt import load_trt_model
model = load_trt_model("small.en")
result = model.transcribe("test.wav")
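
In case it helps frame the question: Whisper models work on 30-second windows, so one possible workaround is to split the clip into 30s chunks and transcribe each one separately. Below is a minimal sketch of that idea, not something taken from the whisper_trt docs. `chunk_bounds` and `transcribe_long` are hypothetical helper names, and the sketch assumes `model.transcribe()` also accepts an audio array (not just a file path) and returns a dict with a `"text"` key.

```python
def chunk_bounds(num_samples, sample_rate=16000, chunk_seconds=30):
    """Return (start, end) sample indices covering the clip in fixed-size windows."""
    chunk = sample_rate * chunk_seconds
    return [
        (start, min(start + chunk, num_samples))
        for start in range(0, num_samples, chunk)
    ]


def transcribe_long(model, audio, sample_rate=16000, chunk_seconds=30):
    """Transcribe each window separately and join the resulting text.

    Assumption: model.transcribe() accepts an audio array and returns
    a dict with a "text" key.
    """
    pieces = []
    for start, end in chunk_bounds(len(audio), sample_rate, chunk_seconds):
        result = model.transcribe(audio[start:end])
        pieces.append(result["text"].strip())
    return " ".join(pieces)


# Possible usage (needs openai-whisper and whisper_trt installed):
# import whisper
# from whisper_trt import load_trt_model
# audio = whisper.load_audio("test.wav")  # 16 kHz mono float32 array
# model = load_trt_model("small.en")
# print(transcribe_long(model, audio))
```

Fixed windows can cut a word in half at a chunk boundary, so overlapping the chunks or splitting on silence would probably give cleaner results.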