* Confirmed working on 38 languages: AFR, ARA, BUL, CAT, CYM, CES, DAN, DEU, ELL, ENG, EPO, EST, FAS, FIN, FRA, GLE, GRC, HRV, HUN, ISL, ITA, JPN, LAT, LAV, LIT, NLD, NOR, RON, RUS, POL, POR, SLK, SPA, SRP, SWA, SWE, TUR, UKR
* MFCC and DTW computed via Python C extensions to reduce the processing time
-* Several built-in TTS engine wrappers: eSpeak (default), eSpeak-ng, Festival, Nuance TTS API
+* Several built-in TTS engine wrappers: AWS Polly TTS API, eSpeak (default), eSpeak-ng, Festival, Nuance TTS API
* Default TTS (eSpeak) called via a Python C extension for fast audio synthesis
* Possibility of running a custom, user-provided TTS engine Python wrapper (e.g., included example for speect)
* Batch processing of multiple audio/text pairs
* Download audio from a YouTube video
* In multilevel mode, recursive alignment from paragraph to sentence to word level
-* In multilevel mode, time resolution and/or TTS engine can be specified for each level independently
+* In multilevel mode, MFCC resolution, MFCC masking, DTW margin, and TTS engine can be specified for each level independently
* Robust against misspelled/mispronounced words, local rearrangements of words, background noise/sporadic spikes
* Adjustable splitting times, including a max character/second constraint for CC applications
* Automated detection of audio head/tail
* Output an HTML file for fine tuning the sync map manually (`finetuneas` project)
* Execution parameters tunable at runtime
-* Code suitable for Web app deployment (e.g., on-demand cloud computing)
-* Extensive test suite including 800+ unit/integration/performance tests, that run and must pass before each release
+* Code suitable for Web app deployment (e.g., on-demand cloud computing instances)
+* Extensive test suite including 1,200+ unit/integration/performance tests, that run and must pass before each release

## Limitations and Missing Features

* Audio should match the text: large portions of spurious text or audio might produce a wrong sync map
* Audio is assumed to be spoken: not suitable for song captioning, YMMV for CC applications
-* No protection against memory trashing if you feed extremely long audio files (>1.5h per single audio file)
+* No protection against memory swapping: be sure your amount of RAM is adequate for the maximum duration of a single audio file (e.g., 4 GB RAM => max 2h audio; 16 GB RAM => max 10h audio)
If you use **aeneas** as a library, just set the appropriate
+``RuntimeConfiguration`` parameters.
+Please see the
+[command line tutorial](http://www.readbeyond.it/aeneas/docs/clitutorial.html)
+for details.
+
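As a rough illustration of the library usage mentioned above, the sketch below assembles a runtime configuration string of the `key=value|key=value` form. The helper name `build_rconf_string` is hypothetical, and the parameter names (`dtw_margin`, `mfcc_window_shift`, `tts`) and their values are assumptions chosen for illustration; check the **aeneas** documentation for the parameters supported by your version. The code uses only the standard library and does not import **aeneas**.

```python
def build_rconf_string(params):
    """Join parameters into a 'key=value|key=value' string (assumed format)."""
    parts = []
    for key, value in params.items():
        text = str(value)
        # '|' separates pairs and '=' separates key from value,
        # so reject input that would make the string ambiguous.
        if "|" in key or "=" in key or "|" in text:
            raise ValueError("reserved character in %r=%r" % (key, value))
        parts.append("%s=%s" % (key, text))
    return "|".join(parts)


# Hypothetical settings: names and values are assumptions for illustration.
settings = {"dtw_margin": 30, "mfcc_window_shift": 0.04, "tts": "espeak"}
rconf_string = build_rconf_string(settings)
print(rconf_string)
```

With **aeneas** installed, a string like this should be usable to construct a ``RuntimeConfiguration`` object or be passed to the command line tools; the command line tutorial linked above describes the exact options.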

## License

@@ -282,6 +316,8 @@ No copy rights were harmed in the making of this project.

* **April 2016**: the Fruch Foundation kindly sponsored the development and documentation of v1.5.0

+* **December 2016**: the [Centro Internazionale Del Libro Parlato "Adriano Sernagiotto"](http://www.libroparlato.org/) (Feltre, Italy) partially sponsored the development of v1.7.0
+
### Supporting

Would you like to support the development of **aeneas**?
@@ -291,8 +327,7 @@ I accept sponsorships to
* fix bugs,
* add new features,
* improve the quality and the performance of the code,
-* port the code to other languages/platforms,
-* support of third party installations, and
+* port the code to other languages/platforms, and
* improve the documentation.

Feel free to
@@ -341,6 +376,9 @@ packaged the installers for Mac OS X and Windows.
**Firat Ozdemir** contributed the `finetuneas`
HTML/JS code for fine tuning sync maps in the browser.

+**Willem van der Walt** contributed the code snippet