Labels of strokes in a subset of HOMUS Dataset

From MIPAL
Revision as of 23:27, 12 July 2016 by Admin (Talk | contribs)


This page is under construction!
The Handwritten Online Musical Symbols (HOMUS) dataset<ref>http://grfia.dlsi.ua.es/homus/</ref> consists of 15200 musical symbol samples collected from 100 musicians. Each symbol belongs to one of 32 classes, where each of the eighth, sixteenth, thirty-second, and sixty-fourth note symbols shares a class with its horizontally inverted counterpart. Each symbol sample consists of at least one stroke. A stroke is defined as a sequence of two-dimensional points: the successive locations of a stylus pen on a device, in time order, while the pen touches the device. However, the dataset does not provide labels for the individual strokes of its symbols. Excluding the 3200 samples that belong to the 8 time-signature classes, we analyzed all 31768 strokes in the remaining samples, which cover the 24 symbol classes shown in Fig. 1.

Figure 1. Examples of the 24 symbols in the subset of the HOMUS dataset
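
The definitions above (a symbol is a list of strokes, and a stroke is a time-ordered list of 2D points) can be sketched in Python as follows. This is only an illustration: the text layout parsed here (symbol name on the first line, then one stroke per line written as `x,y` pairs separated by semicolons) is an assumption and should be checked against the actual dataset files. The `Symbol` class, the `parse_symbol` function, and the sample data are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]   # one stylus location (x, y)
Stroke = List[Point]          # points recorded while the pen touches the device


@dataclass
class Symbol:
    label: str                # symbol class name
    strokes: List[Stroke]     # at least one stroke per symbol


def parse_symbol(text: str) -> Symbol:
    """Parse one symbol from the ASSUMED layout: label line, then stroke lines."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) < 2:
        raise ValueError("expected a label line and at least one stroke line")
    label, stroke_lines = lines[0], lines[1:]
    strokes: List[Stroke] = []
    for ln in stroke_lines:
        points: Stroke = []
        for pair in ln.split(";"):
            if not pair:      # skip the empty piece after a trailing ';'
                continue
            x, y = pair.split(",")
            points.append((float(x), float(y)))
        strokes.append(points)
    return Symbol(label, strokes)


# Hypothetical sample: a two-stroke symbol (not taken from the dataset).
sample = "Quarter-Note\n10,20;11,22;12,25;\n12,25;12,60;\n"
symbol = parse_symbol(sample)
print(symbol.label, len(symbol.strokes))
```

A stroke label in this setting would be one class per entry of `symbol.strokes`, which is the information the original dataset does not provide.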

As a result, we chose 23 basic strokes, shown in Fig. 2, and labeled every stroke as one of 24 classes: the 23 basic strokes plus a None class. The None class covers strokes that cannot be categorized as any of the 23 basic strokes.

Figure 2. Examples of the 23 basic strokes

The stroke labels can be downloaded from this link: [download].

Table 1. Number of strokes in each stroke class for the 24 musical symbols in Fig. 1
Stroke class  Stroke         # of strokes    Stroke class  Stroke    # of strokes
0             None           4281            12            RestArc   554
1             VLine          5377            13            RestArc2  890
2             HLine          1222            14            QRest     152
3             CommonTimeArc  810             15            Fill      324
4             Dot            1888            16            WRest     89
5             WHead          1053            17            GClef     388
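
As a small worked example, the rows of Table 1 that appear on this page can be held in a lookup table and compared with the 31768 strokes analyzed in total. The sketch below is illustrative only: it covers just the 12 classes listed above (not all 24), and the names `STROKE_CLASSES`, `TOTAL_STROKES`, and `share` are hypothetical.

```python
# Partial copy of Table 1: stroke class ID -> (stroke name, number of strokes).
# Only the 12 rows shown on this page are included; the other classes are omitted.
STROKE_CLASSES = {
    0: ("None", 4281),
    1: ("VLine", 5377),
    2: ("HLine", 1222),
    3: ("CommonTimeArc", 810),
    4: ("Dot", 1888),
    5: ("WHead", 1053),
    12: ("RestArc", 554),
    13: ("RestArc2", 890),
    14: ("QRest", 152),
    15: ("Fill", 324),
    16: ("WRest", 89),
    17: ("GClef", 388),
}

TOTAL_STROKES = 31768  # all strokes analyzed, as stated in the text


def share(class_id: int) -> float:
    """Fraction of all analyzed strokes that carry the given class label."""
    _, count = STROKE_CLASSES[class_id]
    return count / TOTAL_STROKES


# Strokes accounted for by the listed rows, and the most frequent listed class.
listed_total = sum(count for _, count in STROKE_CLASSES.values())
most_common_id = max(STROKE_CLASSES, key=lambda cid: STROKE_CLASSES[cid][1])

print(f"listed rows cover {listed_total} of {TOTAL_STROKES} strokes")
print(f"most frequent listed class: {STROKE_CLASSES[most_common_id][0]}")
print(f"share of None strokes: {share(0):.3f}")
```

The listed rows sum to 17028 strokes, so the classes not shown here account for the remaining 14740.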



References

<references />

Contact

Sung Joon Son, Ph.D. candidate, E-mail: sjson718_at_snu_dot_ac_dot_kr