This document explains the Redis-based audio processing system implemented in the audio transcription service.
The audio transcription system has been updated to store and process audio data in memory using Redis instead of writing to the file system. This approach offers several benefits:
- Improved Performance: Reduces disk I/O operations
- Enhanced Scalability: Better compatibility with containerized environments like Cloud Run
- Simplified Deployment: Reduces dependency on persistent volumes
- Flexibility: Can still fallback to file-based processing when needed
- Resilience: Automatically restores audio buffers from Redis on restart
-
Audio Chunk Collection:
- Audio chunks are sent to the service via the API
- Chunks are initially stored in Redis using
AudioChunkDAL
-
In-Memory Processing:
- The
writestream2file
method inProcessor
class has been updated to:- Retrieve chunks from Redis
- Append chunks to an in-memory buffer
- Store the complete buffer in Redis with a backup TTL (24 hours)
- Track the last activity timestamp for each connection
- Return the same metadata as before for compatibility
- The
-
Audio Slicing:
- New methods in
AudioSlicer
allow creating and slicing audio segments directly from Redis data:from_redis
: Creates an AudioSlicer from complete audio data in Redisfrom_redis_slice
: Creates a sliced audio segment from Redis data
- New methods in
-
Transcription Processing:
- The
read
method in the transcriptionProcessor
class tries to:- First retrieve and slice audio data from Redis
- Fall back to file-based processing if Redis data is not available
- The
-
Activity-Based Flushing:
- The system tracks when each audio buffer was last updated
- Connections that haven't been updated for a configurable time period (default: 60 seconds) are:
- Flushed to disk for persistence
- Removed from memory to free resources
- Kept in Redis as a backup
-
Resilient Initialization:
- On startup, the system loads existing audio buffers from Redis
- This ensures data persistence across service restarts
-
Redis Keys:
AUDIO_BUFFER
: Stores complete audio buffers (e.g.,audio_buffer:connection_id
)AUDIO_BUFFER_LAST_UPDATED
: Tracks when buffers were last updated (e.g.,audio_buffer_last_updated:connection_id
)
-
Updated Classes:
Processor
:- Added in-memory buffer management and Redis storage
- Added activity tracking and automatic flushing of inactive connections
- Added recovery of buffers on initialization
AudioSlicer
: Added Redis-based methods for creating and slicing audioTranscriptionProcessor
: Updated to use Redis-based audio slicing
A test script is provided at app/tests/test_redis_audio.py
to demonstrate the Redis-based audio processing:
python -m app.tests.test_redis_audio
A utility script is available to manage Redis audio buffers:
-
List all audio buffers:
python -m app.scripts.flush_redis_buffers list
-
Flush a specific buffer to disk:
python -m app.scripts.flush_redis_buffers flush <connection_id> [output_dir]
-
Flush all buffers to disk:
python -m app.scripts.flush_redis_buffers flush-all [output_dir]
-
Inactivity Timeout: The time (in seconds) of inactivity before a connection is flushed to disk. Default is 60 seconds:
processor = Processor() await processor.set_inactive_timeout(120) # Set to 120 seconds
-
Redis Backup TTL: Audio buffers in Redis have a safety TTL of 24 hours. This can be adjusted in the
writestream2file
method inprocessor.py
.
The in-memory approach uses more Redis memory, but connections are automatically flushed to disk after periods of inactivity, which helps manage memory use. Monitor Redis memory usage during high-traffic periods to ensure enough capacity.
Three layers of persistence are provided:
- In-Memory Buffers: Fastest access for active connections
- Redis Storage: Backup that survives service restarts
- Disk Storage: Long-term persistence for inactive connections
Inactive connections are automatically flushed to disk after 60 seconds (configurable) of inactivity.
The system includes multiple fallback mechanisms:
- Redis data is restored on service restart
- If Redis data is not available, it attempts to read from disk
- On both Redis and disk failure, connections are gracefully removed
Both automatic and manual cleanup options are available:
- Automatic: Inactive connections are automatically flushed to disk and removed from memory
- Manual: The utility script can be used to flush specific or all buffers as needed
-
Redis Memory Issues:
- Symptom: Redis out of memory errors
- Solution:
- Decrease the inactivity timeout to flush connections to disk more quickly
- Increase Redis memory limit
-
Missing Audio Data:
- Symptom: "No audio data found in Redis" errors
- Solution:
- Check Redis connectivity
- Verify that automatic flushing isn't removing buffers too aggressively
-
Audio Format Issues:
- Symptom: "Failed to create AudioSlicer from Redis data" errors
- Solution: Verify audio format compatibility and ensure audio chunks are valid
- Implement a more sophisticated buffer management system with priority-based eviction
- Add automatic backup of Redis buffers to Cloud Storage
- Optimize memory usage by compressing audio data in Redis
- Implement buffer sharing across multiple containers for enhanced scalability