batched-wav-nnet3-cuda2 and batched-wav-nnet3-cuda-online : online CPU feature extraction with pitch when decoding

batched-wav-nnet3-cuda2 and batched-wav-nnet3-cuda-online both use the same online CPU feature extraction code.
For reference, see void BatchedThreadedNnet3CudaOnlinePipeline::ComputeOneFeature(int element) in cudadecoder/batched-threaded-nnet3-cuda-online-pipeline.cc

This code does CPU feature extraction chunk by chunk.
If I use the batched-wav-nnet3-cuda command, I get a total of 366 frames for a test utterance.
If I use batched-wav-nnet3-cuda2, print the number of frames for each chunk, and sum them all, I get only 289 frames.
This is because pitch feature extraction has latency, so the last frames are not ready while chunks are still being fed.
The code also never calls feature_pipeline.InputFinished(), so those remaining frames are never flushed and the total cannot reach 366.
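
To make the effect concrete, here is a minimal toy sketch of what I think is happening. It is not Kaldi code: ToyLatencyExtractor and CountOutputFrames are hypothetical names, and the latency of 77 frames is only the difference 366 - 289 from my test, not a measured property of the pitch extractor. It only models an extractor that holds back its last frames until it is told the input has finished.

```cpp
#include <algorithm>

// Hypothetical stand-in for an online feature extractor with lookahead.
// This is NOT Kaldi's API, only a model of the behaviour described above.
class ToyLatencyExtractor {
 public:
  explicit ToyLatencyExtractor(int latency_frames)
      : latency_(latency_frames) {}

  // Feed a chunk that contains `n` new input frames.
  void AcceptFrames(int n) { received_ += n; }

  // Signal end of input so that the held-back frames can be flushed.
  void InputFinished() { finished_ = true; }

  // Number of frames that can be consumed right now.
  int NumFramesReady() const {
    if (finished_) return received_;
    return std::max(0, received_ - latency_);
  }

 private:
  int latency_;
  int received_ = 0;
  bool finished_ = false;
};

// Feeds `total` frames in chunks of `chunk` frames, sums the frames that
// become ready after each chunk, and optionally flushes at the end.
int CountOutputFrames(int total, int chunk, int latency, bool call_finished) {
  ToyLatencyExtractor extractor(latency);
  int previously_ready = 0;
  int sum = 0;
  for (int offset = 0; offset < total; offset += chunk) {
    extractor.AcceptFrames(std::min(chunk, total - offset));
    int ready = extractor.NumFramesReady();
    sum += ready - previously_ready;  // frames newly produced by this chunk
    previously_ready = ready;
  }
  if (call_finished) {
    extractor.InputFinished();
    sum += extractor.NumFramesReady() - previously_ready;  // flushed tail
  }
  return sum;
}
```

With these assumed numbers, CountOutputFrames(366, 50, 77, false) gives 289 and CountOutputFrames(366, 50, 77, true) gives 366, which matches the two totals I see. The chunk size of 50 is arbitrary; in this toy model the result does not depend on it.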

I hope someone can help fix it, thanks.

batched-wav-nnet3-cuda2 and batched-wav-nnet3-cuda-online : online CPU feature extraction with pitch when decoding #4365
