Skip to content

batched-wav-nnet3-cuda2 and batched-wav-nnet3-cuda-online : online CPU feature extraction with pitch when decoding #4365

Open
@garymmi

Description

@garymmi

batched-wav-nnet3-cuda2 and batched-wav-nnet3-cuda-online both use the same online cpu feature extraction code
reference void BatchedThreadedNnet3CudaOnlinePipeline::ComputeOneFeature(int element) in cudadecoder/batched-threaded-nnet3-cuda-online-pipeline.cc

the code do cpu feature extraction chunk by chunk.
if use batched-wav-nnet3-cuda command , i got a total 366 frames for a test utterance.
if i use batched-wav-nnet3-cuda2 , i print frames of each chunk , and sum all, got 289 frames.
this is because pitch feature extraction has latency
the code also no do feature_pipeline.InputFinished(), so cannot got total 366 frames.

hope someone can help fix it, thanks

Metadata

Metadata

Assignees

Labels

bugin progressIssue has been taken and is being worked on

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions