Skip to content

Commit 5a39faf

Browse files
committed
Changelog Update
1 parent afa84c1 commit 5a39faf

3 files changed

Lines changed: 65 additions & 16 deletions

File tree

CHANGELOG.md

Lines changed: 61 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,69 @@
1-
## [0.0.2a2] - 2023-07-20
1+
# Changelog
2+
3+
All notable changes to this project will be documented in this file.
4+
5+
## [1.2.Surn] - 2025-04-02
6+
7+
Implemented Unlimited Music Generation (UMG) with the [hf checkpoints](https://huggingface.co/facebook/unlimited-music-generation).
8+
9+
## [1.4.0a2] - 2025-01-14
10+
11+
Add training and inference code for JASCO (https://arxiv.org/abs/2406.10970) along with the [hf checkpoints](https://huggingface.co/facebook/jasco-chords-drums-melody-1B).
12+
13+
## [1.4.0a1] - 2024-06-03
14+
15+
Adding new metric PesqMetric ([Perceptual Evaluation of Speech Quality](https://doi.org/10.5281/zenodo.6549559))
16+
17+
Adding multiple audio augmentation functions: generating pink noises, up-/downsampling, low-/highpass filtering, banpass filtering, smoothing, duck masking, boosting. All are wrapped in the `audiocraft.utils.audio_effects.AudioEffects` and can be called with the API `audiocraft.utils.audio_effects.select_audio_effects`.
18+
19+
Add training code for AudioSeal (https://arxiv.org/abs/2401.17264) along with the [hf checkpoints]( https://huggingface.co/facebook/audioseal).
20+
21+
## [1.3.0] - 2024-05-02
22+
23+
Adding the MAGNeT model (https://arxiv.org/abs/2401.04577) along with hf checkpoints and a gradio demo app.
24+
25+
Typo fixes.
26+
27+
Fixing setup.py to install only audiocraft, not the unit tests and scripts.
28+
29+
Fix FSDP support with PyTorch 2.1.0.
30+
31+
## [1.2.0] - 2024-01-11
232

3-
Music Generation set to a max of 720 seconds (12 minutes) to avoid memory issues.
33+
Adding stereo models.
434

5-
Video editing options (thanks @Surn and @oncorporation).
35+
Fixed the commitment loss, which was until now only applied to the first RVQ layer.
636

7-
Music Conditioning segment options
37+
Removed compression model state from the LM checkpoints, for consistency, it
38+
should always be loaded from the original `compression_model_checkpoint`.
839

940

10-
## [0.0.2a] - TBD
41+
## [1.1.0] - 2023-11-06
42+
43+
Not using torchaudio anymore when writing audio files, relying instead directly on the commandline ffmpeg. Also not using it anymore for reading audio files, for similar reasons.
44+
45+
Fixed DAC support with non default number of codebooks.
46+
47+
Fixed bug when `two_step_cfg` was overriden when calling `generate()`.
48+
49+
Fixed samples being always prompted with audio, rather than having both prompted and unprompted.
50+
51+
**Backward incompatible change:** A `torch.no_grad` around the computation of the conditioning made its way in the public release.
52+
The released models were trained without this. Those impact linear layers applied to the output of the T5 or melody conditioners.
53+
We removed it, so you might need to retrain models.
54+
55+
**Backward incompatible change:** Fixing wrong sample rate in CLAP (WARNING if you trained model with CLAP before).
56+
57+
**Backward incompatible change:** Renamed VALLEPattern to CoarseFirstPattern, as it was wrongly named. Probably no one
58+
retrained a model with this pattern, so hopefully this won't impact you!
59+
60+
61+
## [1.0.0] - 2023-09-07
62+
63+
Major revision, added training code for EnCodec, AudioGen, MusicGen, and MultiBandDiffusion.
64+
Added pretrained model for AudioGen and MultiBandDiffusion.
65+
66+
## [0.0.2] - 2023-08-01
1167

1268
Improved demo, fixed top p (thanks @jnordberg).
1369

@@ -24,10 +80,3 @@ Note that other implementations exist: https://github.com/camenduru/MusicGen-col
2480
## [0.0.1] - 2023-06-09
2581

2682
Initial release, with model evaluation only.
27-
28-
29-
# Changelog
30-
31-
All notable changes to this project will be documented in this file.
32-
33-
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/).

app.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -491,7 +491,7 @@ def ui(**kwargs):
491491
[
492492
"4/4 120bpm 320kbps 48khz, An 80s driving pop song with heavy drums and synth pads in the background",
493493
"./assets/bach.mp3",
494-
"melody",
494+
"stereo-melody-large",
495495
"80s Pop Synth"
496496
],
497497
[
@@ -503,13 +503,13 @@ def ui(**kwargs):
503503
[
504504
"4/4 120bpm 320kbps 48khz, 90s rock song with electric guitar and heavy drums",
505505
None,
506-
"medium",
506+
"stereo-medium",
507507
"90s Rock Guitar"
508508
],
509509
[
510510
"4/4 120bpm 320kbps 48khz, a light and cheerly EDM track, with syncopated drums, aery pads, and strong emotions",
511511
"./assets/bach.mp3",
512-
"melody",
512+
"melody-large",
513513
"EDM my Bach"
514514
],
515515
[

audiocraft/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,4 +7,4 @@
77
# flake8: noqa
88
from . import data, modules, models
99

10-
__version__ = '1.2.2a4'
10+
__version__ = '1.4.Surn'

0 commit comments

Comments
 (0)