**aeneas** is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment).

- * Version: 1.5.1.0
- * Date: 2016-07-25
+ * Version: 1.6.0.0
+ * Date: 2016-09-26
* Developed by: [ReadBeyond](http://www.readbeyond.it/)
* Lead Developer: [Alberto Pettarin](http://www.albertopettarin.it/)
* License: the GNU Affero General Public License Version 3 (AGPL v3)
@@ -61,8 +61,8 @@ or raw AUD/CSV/SSV/TSV/TXT/XML for further processing.
2. [Python](https://python.org/) 2.7 (Linux, OS X, Windows) or 3.4 or later (Linux, OS X)
3. [FFmpeg](https://www.ffmpeg.org/)
4. [eSpeak](http://espeak.sourceforge.net/)
- 5. Python modules `BeautifulSoup4`, `lxml`, and `numpy`
- 6. Python C headers to compile the Python C extensions (optional but strongly recommended)
+ 5. Python packages `BeautifulSoup4`, `lxml`, and `numpy`
+ 6. Python headers to compile the Python C/C++ extensions (optional but strongly recommended)
7. A shell supporting UTF-8 (optional but strongly recommended)
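
The requirements listed above can be checked before installing. The following is a minimal sketch, not part of **aeneas**: the function name `check_requirements` is hypothetical, the executable names `ffmpeg` and `espeak` are assumed to be the ones on your `PATH`, and the script itself needs Python 3.4 or later because it relies on `shutil.which` and `importlib.util.find_spec`.

```python
import importlib.util
import shutil
import sys


def check_requirements():
    """Return a dict mapping each requirement name to True or False.

    Hypothetical helper for illustration only; it is not an aeneas API.
    """
    results = {}
    # Python 2.7, or 3.4 or later, as stated in the requirements list
    version = sys.version_info[:2]
    results["python"] = version == (2, 7) or version >= (3, 4)
    # External programs; the executable names are assumptions
    for program in ("ffmpeg", "espeak"):
        results[program] = shutil.which(program) is not None
    # Python packages; BeautifulSoup4 is imported under the name "bs4"
    for module in ("bs4", "lxml", "numpy"):
        results[module] = importlib.util.find_spec(module) is not None
    return results


if __name__ == "__main__":
    for name, found in sorted(check_requirements().items()):
        print("{:8s} {}".format(name, "ok" if found else "MISSING"))
```

A `MISSING` entry only means the name was not found in the current environment; it does not check versions or whether the C extensions were compiled.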
### Supported Platforms
@@ -228,21 +228,22 @@ which explains how to use the built-in command line tools.
* Arbitrary text fragment granularity (single word, subphrase, phrase, paragraph, etc.)
* Input audio file formats: all those readable by `ffmpeg`
* Output sync map formats: AUD, CSV, EAF, JSON, SMIL, SRT, SSV, SUB, TSV, TTML, TXT, VTT, XML
- * Confirmed working on languages: ARA, BUL, CAT, CYM, CES, DAN, DEU, ELL, ENG, EPO, EST, FAS, FIN, FRA, GLE, GRC, HRV, HUN, ISL, ITA, JPN, LAT, LAV, LIT, NLD, NOR, RON, RUS, POL, POR, SLK, SPA, SRP, SWA, SWE, TUR, UKR
+ * Confirmed working on 37 languages: ARA, BUL, CAT, CYM, CES, DAN, DEU, ELL, ENG, EPO, EST, FAS, FIN, FRA, GLE, GRC, HRV, HUN, ISL, ITA, JPN, LAT, LAV, LIT, NLD, NOR, RON, RUS, POL, POR, SLK, SPA, SRP, SWA, SWE, TUR, UKR
* MFCC and DTW computed via Python C extensions to reduce the processing time
- * Several built-in TTS engine wrappers: eSpeak (default, FLOSS), Festival (FLOSS), Nuance TTS API (commercial)
+ * Several built-in TTS engine wrappers: eSpeak (default), eSpeak-ng, Festival, Nuance TTS API
* Default TTS (eSpeak) called via a Python C extension for fast audio synthesis
- * A custom, user-provided TTS engine Python wrapper can be used instead of the built-in ones (included example for speect)
+ * Possibility of running a custom, user-provided TTS engine Python wrapper (e.g., included example for speect)
* Batch processing of multiple audio/text pairs
* Download audio from a YouTube video
* In multilevel mode, recursive alignment from paragraph to sentence to word level
+ * In multilevel mode, time resolution and/or TTS engine can be specified for each level independently
* Robust against misspelled/mispronounced words, local rearrangements of words, background noise/sporadic spikes
* Adjustable splitting times, including a max character/second constraint for CC applications
* Automated detection of audio head/tail
* Output an HTML file for fine tuning the sync map manually (`finetuneas` project)
* Execution parameters tunable at runtime
* Code suitable for Web app deployment (e.g., on-demand cloud computing)
- * Extensive test suite including 898 unit/integration/performance tests, that run and must pass before each release
+ * Extensive test suite including 800+ unit/integration/performance tests, that run and must pass before each release
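
The feature list mentions DTW (dynamic time warping) as one of the computations moved to C extensions. For readers unfamiliar with the technique, here is a generic textbook sketch in pure Python over sequences of plain numbers. It is not the **aeneas** implementation: the function name `dtw_cost` is hypothetical, and a real aligner would compare feature vectors (such as MFCC frames) rather than scalars.

```python
def dtw_cost(a, b):
    """Return the minimum cumulative cost of aligning sequences a and b.

    Illustrative only: scalar samples, absolute difference as local cost,
    and the classic step pattern (match, insertion, deletion).
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the best cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b stays
                cost[i][j - 1],      # b advances, a stays
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


if __name__ == "__main__":
    # The second sequence repeats one sample; DTW absorbs the stretch.
    print(dtw_cost([1, 2, 3], [1, 2, 2, 3]))
```

The double loop is quadratic in the sequence lengths, which is one reason a compiled extension helps on long audio files.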
## Limitations and Missing Features
@@ -299,37 +300,21 @@ Feel free to
### Contributing

- If you think you found a bug,
+ If you think you found a bug
+ or you have a feature request,
please use the
[GitHub issue tracker](https://github.com/readbeyond/aeneas/issues)
- to file a bug report.
+ to submit it.

- If you are able to contribute code directly, that is awesome!
- I will be glad to merge it!
- Just a few rules, to make life easier for both you and me:
+ If you want to ask a question
+ about using **aeneas**,
+ your best option consists in sending an email to the
+ [mailing list](https://groups.google.com/d/forum/aeneas-forced-alignment).

- 1. Please do not work on the `master` branch.
- Instead, create a new branch on your GitHub repo
- by cheking out the `devel` branch.
- Open a pull request from your branch on your repo
- to the `devel` branch on this GitHub repo.
-
- 2. Please make your code consistent with
- the existing code base style
- (see the
- [Google Python Style Guide](https://google-styleguide.googlecode.com/svn/trunk/pyguide.html)
- ), and test your contributed code
- against the unit tests
- before opening the pull request.
-
- 3. Ideally, add some unit tests for the code you are submitting,
- either adding them to the existing unit tests or creating a new file
- in `aeneas/tests/`.
-
- 4. **Please note that, by opening a pull request,
- you automatically agree to apply
- the AGPL v3 license
- to the code you contribute.**
+ Finally, code contributions are welcome!
+ Please refer to the
+ [Code Contribution Guide](https://github.com/readbeyond/aeneas/blob/master/wiki/CONTRIBUTING.md)
+ for details about the branch policies and the code style to follow.


## Acknowledgments
@@ -347,6 +332,9 @@ for its asynchronous usage.
**Chris Hubbard** prepared the files for
packaging aeneas as a Debian/Ubuntu `.deb`.

+ **Daniel Bair** prepared the `brew` formula
+ for installing **aeneas** and its dependencies on Mac OS X.
+
**Daniel Bair**, **Chris Hubbard**, and **Richard Margetts**
packaged the installers for Mac OS X and Windows.