
Conversation

@WojciechMat

Initial version of the blog post, without the final sections.
I was unable to change the width of the audio controller, so I can only fit two columns of samples in a row.
Let me know your thoughts and suggestions :))

Comment on lines 30 to 31
{% algrtmImg MIDI-velocity-transformer/samples/9115-pred-untrained.png pianoroll 170px %}
{% algrtmAudio MIDI-velocity-transformer/samples/9115-pred-untrained.mp3 %}
Member

Missing files 🚨

Author

should be ok now 👍

@WojciechMat
Author

I have added some more text about tokenization, as well as the conclusion, references, and the missing file from before. Everything should work now. I'm not sure about the contact info at the end of the post; maybe you know how you'd like it to look?
I'd love to hear your thoughts on what more to include and what to change in the text. If I should add more clarifications or make the content more engaging, let me know! 🔥

@@ -0,0 +1,266 @@
---
title: MIDI Velocity Prediction with Transformer
Member

Suggested change:
- title: MIDI Velocity Prediction with Transformer
+ title: Modelling dynamic expression in piano performance

I think we shouldn't mention MIDI or velocity in the title, as it's too technical; my suggestion aims to give a broader goal for what we're doing. You can add "with Transformers" if you like.

Comment on lines 12 to 13
MIDI velocity is a crucial element in music dynamics, determining the force with which a note is played,
which profoundly influences the emotional quality of music.
Member

I think you should try to explain the problem without referencing MIDI, and then you could introduce MIDI as a data structure related to this problem 🤔


If you were to take a sequence of notes and have an untrained model predict their velocities, this is what you would end up with:
Member

Those samples could be a good demonstration of what velocity is and how it affects the music - you could say that the velocities were randomized here.

Saying it's generated by an untrained model is confusing here, because you haven't introduced the problem that the model has to solve yet.
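
If you go with that framing, a minimal sketch of the randomization could look like this (hypothetical helper, not from the post):

import random

# Hypothetical sketch: keep each note's pitch and timing,
# but draw its velocity uniformly at random.
def randomize_velocities(velocities: list[int]) -> list[int]:
    return [random.randint(0, 127) for _ in velocities]

print(randomize_velocities([64, 80, 96]))  # e.g. [13, 127, 42]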

within quantized MIDI data.


### Model Overview
Member

I think we don't need a model overview at all here. We can assume that the readers know what a transformer is, and we can include some references in the text.

What this and the next section are missing for me is a description of how we convert the piano performance problem into a list-of-tokens problem.
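
For example, a minimal sketch of how velocities could be quantized into a small target vocabulary (the bin count and helper name are hypothetical, not taken from the post):

# Hypothetical sketch: quantize 0-127 velocities into a small
# target vocabulary of dynamic-level tokens.
def velocity_to_token(velocity: int, n_bins: int = 10) -> int:
    return min(velocity * n_bins // 128, n_bins - 1)

print([velocity_to_token(v) for v in [34, 67, 101, 127]])  # [2, 5, 7, 9]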

Comment on lines 60 to 65
MIDI data describes notes using 5 features:
1. Pitch - Represented as a number between 0 and 127 (or 21 to 108 for piano keys, reflecting the standard 88-key keyboard).
2. Start - The moment a key is pressed, measured in seconds.
3. End - The moment the key is released, measured in seconds.
4. Duration - The time elapsed between the key's press and release.
5. Velocity - A number from 0 to 127 indicating the intensity of the key press.
Member

A list of points is not the best format for describing a data structure. One alternative would be a code snippet showing a "note" class design, referring to a piano performance as a list of notes:

from dataclasses import dataclass

@dataclass
class Note:
    pitch: int      # 0-127 (21-108 for piano keys)
    start: float    # key press time, in seconds
    end: float      # key release time, in seconds
    velocity: int   # key press intensity, 0-127

You can also use https://mermaid.live/ to make a class diagram; I think our blog should support it just like GitHub:

classDiagram
    class Note{
      pitch: int
      velocity: int
      start: float
      end: float
    }

(this does not look great, but you could play around with it)

Author

I don't think it supports mermaid :(

Member

I just updated master, and now it does 🎉

Author

🔥🔥🔥

Comment on lines 133 to 142
### Model Architecture
{% algrtmImgBanner MIDI-velocity-transformer/transformer.png transformer%}
A transformer built as described in the [Attention Is All You Need](https://arxiv.org/abs/1706.03762) paper was used for this task.
The important hyperparameters:
| hyperparameter | number |
| -------------- | :-----: |
| Number of layers in encoder and decoder | **6** |
| Number of heads in attention layers | **8** |
| Dimension of encoder and decoder outputs | **512** |
| Dimension of the hidden layer of the position-wise feed-forward network in each layer of encoder and decoder | **2048** |
Member

Like I mentioned, we don't need to explain what transformers are. I think the only relevant information we should share about our architecture is the number of trainable parameters.

Also, I'm against using diagrams from external sources; either make your own, or just link to the source :)
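
For reference, one quick way to get that single number from the hyperparameters in the table (a PyTorch sketch; nn.Transformer excludes the embedding and output layers, so the real count would be somewhat higher):

import torch.nn as nn

# Encoder-decoder transformer with the hyperparameters from the table above.
model = nn.Transformer(
    d_model=512,
    nhead=8,
    num_encoder_layers=6,
    num_decoder_layers=6,
    dim_feedforward=2048,
)
n_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"{n_params:,} trainable parameters")  # roughly 44M for this config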

Comment on lines 149 to 150
Our training dataset comprised approximately 200 hours of musical data sourced from the
[roszcz/maestro-v1](https://huggingface.co/datasets/roszcz/maestro-v1) dataset, which includes 1276 pieces of classical music performed during piano competitions. Each musical piece was segmented into 128-note sequences, with a 64-note overlap between adjacent samples. These sequences were quantized, and each note was mapped to its corresponding index in the source and target vocabularies.
Member

I'm not the owner of maestro; you should cite the original source: https://magenta.tensorflow.org/datasets/maestro
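
As an aside, the segmentation described in those lines is easy to show concretely; a minimal sketch, assuming a piece is a plain list of notes (the helper name is hypothetical):

# Hypothetical sketch: slice a piece into 128-note windows
# with a 64-note overlap (i.e. a hop of 64 notes).
def segment(notes: list, window: int = 128, hop: int = 64) -> list[list]:
    return [
        notes[i : i + window]
        for i in range(0, len(notes) - window + 1, hop)
    ]

piece = list(range(300))               # stand-in for a list of Note objects
samples = segment(piece)
print(len(samples), len(samples[0]))   # 3 128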

@WojciechMat
Author

Mermaid does work, thanks :))
I've added a notes dataframe representation and a code snippet, and removed the redundant info on the transformer and its architecture.
I invite you to read, suggest changes, and modify the text too.
