Difference between revisions of "SNU Dataset for Online Music Symbol Recognition"

From MIPAL
Jump to: navigation, search
(The labels of strokes in SNU dataset)
 
(16 intermediate revisions by one user not shown)
Line 1: Line 1:
Main features:
+
==Download==
 +
- [[Media:SNU DB.zip| SNU dataset]]
 
<br>
 
<br>
- 1716 Samples
+
- [[Media:SNU labeling.zip| The labels of strokes in SNU dataset]]
- 18 Different Musicians
+
- 16 Musical Symbols
+
 
<br>
 
<br>
 +
<br>
 +
 +
==SNU dataset==
 +
For research on the recognition of online handwritten music symbols the SNU dataset provides handwritten music symbol samples which were drawn by 18 musicians. Similar to the HOMUS dataset<ref>[http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6977236&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D6977236  J. Calvo-Zaragoza and J. Oncina, Recognition of Pen-Based Music Notation: the HOMUS dataset, ICPR 2014.]</ref>, the SNU dataset consists of total 1716 symbol samples and the symbol set in SNU dataset is a subset of HOMUS dataset. Each symbol sample in the SNU dataset, like the HOMUS dataset, consists of at least one stroke and a stroke is defined as a sequence of two dimensional points, which are the successive locations of a stylus pen on a device in time sequence while the pen touches the device. The symbols in the SNU dateset are a subset of the HOMUS dataset. There are 16 musical symbols in SNU dataset which is summarized in Table 1 and Table 2. Each directory contains text files which were contain handwritten music symbol data drawn by each musician.
 +
<br>
 +
The text files consist of
 +
- Symbol Name
 +
- Sets of 2D points representing each stroke seperated by a semicolon.
 +
An example of a text file:
 +
1. Natural
 +
2. 482.22,409.96;481.84,411.24;481.19,414.07;480.81,419.50;480.38,425.88;480.17,436.76;481.06,438.69;484.66,439.67;488.73,439.41;493.95,438.00;496.18,436.89;496.01,435.99;496.01,435.99;
 +
3. 482.56,418.09;482.56,418.09;483.29,418.73;485.30,420.19;488.56,421.77;492.03,423.53;495.19,427.13;493.61,435.52;491.60,439.97;490.87,454.14;490.78,460.31;492.07,470.37;492.07,470.37;
 +
 +
<br>
 +
[[file:Symbol.png|thumb|x480px|frame|none|alt=Alt text|Figure 1. Examples of 16 symbols in SNU dataset]]
 
<br>
 
<br>
 
{| class="wikitable"
 
{| class="wikitable"
Line 20: Line 34:
 
|}
 
|}
 
<br>
 
<br>
 
The Handwritten Online Musical Symbols (HOMUS) dataset<ref>[http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6977236&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D6977236  J. Calvo-Zaragoza and J. Oncina, Recognition of Pen-Based Music Notation: the HOMUS dataset, ICPR 2014.]</ref> consists of 15200 musical symbol samples collected from 100 musicians.
 
Each sample belongs to one of 32 symbols.
 
It can be download at [http://grfia.dlsi.ua.es/homus/ the dataset homepage].
 
Each symbol sample in this dataset consists of at least one stroke and a stroke is defined as a sequence of two dimensional points, which are the successive locations of a stylus pen on a device in time sequence while the pen touches the device.
 
Nonetheless, the dataset does not serve labels corresponding to the strokes of symbols in the dataset.
 
Excluding 3200 symbol samples corresponding to 8 symbols of time signatures, we analyzed all of 31768 strokes in the other samples for 24 symbols as shown in Fig. 1 and chose 23 basic strokes in Fig. 2.
 
 
<br>
 
[[file:Symbol.png|thumb|x480px|frame|none|alt=Alt text|Figure 1. Examples of 16 symbols in SNU dataset]]
 
<br>
 
 
 
{| class="wikitable"
 
{| class="wikitable"
|+ Table 1. The number of each symbol in Fig. 1
+
|+ Table 2. The number of each symbol in Fig. 1
 
! scope="col" style="width: 120px;" |  
 
! scope="col" style="width: 120px;" |  
 
! scope="col" style="width: 120px;" | Symbol
 
! scope="col" style="width: 120px;" | Symbol
Line 62: Line 64:
 
Note that most samples corresponding to 24 symbols in Fig. 1 can be represented in the combinations of the basic strokes in Fig. 2.
 
Note that most samples corresponding to 24 symbols in Fig. 1 can be represented in the combinations of the basic strokes in Fig. 2.
  
 +
==The labels of strokes in SNU dataset==
 +
In the same manner as described at [http://mipal.snu.ac.kr/index.php/Labels_of_Strokes_in_HOMUS_Dataset/ Labels of Strokes in HOMUS Dataset] the SNU dataset was labeled.<ref>[http://link.springer.com/article/10.1007/s10032-006-0026-9  H. Miyao and M. Maruyama, An online handwritten music symbol recognition system, International Journal of Document Analysis and Recognition, vol. 9, pp. 49-58, 2007.]</ref> The difference is that basic stroke set included in SNU dataset is smaller than it in HOMUS dataset. Same as symbol set in SNU dataset the stroke set is a subset of the HOMUS dataset. Table 3. shows more detail. Also there are no time signature in the SNU dataset the labels doesn't include 100.
 
<br>
 
<br>
[[Media:HOMUS_Labeling.zip|Download the labels of strokes in HOMUS dataset (v1.0)]]
+
The label files consist of  
<br><br>
+
- Symbol Name
 
+
- Label number of each stroke
None stroke in Table 1 means the strokes that can not be categorized into any of the 23 basic strokes.
+
An example of a label file:
 
+
1. Natural
 +
2. 10
 +
3. 21
 
<br>
 
<br>
 
[[file:Stroke.png|thumb|x480px|frame|none|alt=Alt text|Figure 2. Examples of 19 basic strokes in SNU dataset]]
 
[[file:Stroke.png|thumb|x480px|frame|none|alt=Alt text|Figure 2. Examples of 19 basic strokes in SNU dataset]]
 
<br>
 
<br>
 
+
{| class="wikitable"
 
+
|+ Table 3. The number of each stroke in Fig. 2
When labeling the samples of Dot symbol, some of the samples were regarded as None strokes not Dot stroke because their sizes were as large as BHead strokes, but this can be compensated in symbol recognition step, which was shown in a previous study<ref>[http://link.springer.com/article/10.1007/s10032-006-0026-9  H. Miyao and M. Maruyama, An online handwritten music symbol recognition system, International Journal of Document Analysis and Recognition, vol. 9, pp. 49-58, 2007.]</ref>.
+
! scope="col" style="width: 120px;" | Stroke label
 
+
! scope="col" style="width: 120px;" | Stroke
Each data file contains a name of symbol and the labels of its strokes in a similar manner that a symbol sample are written in a file.
+
! scope="col" style="width: 120px;" | Num. of strokes
The only difference is that the stroke labels are included in the files instead of raw data of strokes.
+
! scope="col" style="width: 120px;" | Stroke label
For consistency, the strokes of symbol samples corresponding to 8 time signatures are commonly labeled as 100.
+
! scope="col" style="width: 120px;" | Stroke
 
+
! scope="col" style="width: 120px;" | Num. of strokes
We have developed an algorithm for online handwritten musical symbol recognition using the labels of strokes.
+
|- style="text-align: center;"
The algorithm is specifically described in our paper<ref>J. Oh, S. J. Son, S. Lee, and N. Kwak, Online Recognition of Handwritten Music Symbols, ''to be submitted''.</ref>.
+
| 0 || None || 75 || 12 || RestArc || 52
 +
|- style="text-align: center;"
 +
| 1 || VLine || 914 || 13 || RestArc2 || 10
 +
|- style="text-align: center;"
 +
| 2 || HLine || 141 || 14 || QRest || 90
 +
|- style="text-align: center;"
 +
| 3 || CommonTimeArc || 0 || 15 || Fill || 52
 +
|- style="text-align: center;"
 +
| 4 || Dot || 268 || 16 || WRest || 6
 +
|- style="text-align: center;"
 +
| 5 || WHead || 174 || 17 || GClef || 141
 +
|- style="text-align: center;"
 +
| 6 || BHead || 357 || 18 || FClefArc || 98
 +
|- style="text-align: center;"
 +
| 7 || LSlash || 303 || 19 || CClef2Arc || 0
 +
|- style="text-align: center;"
 +
| 8 || RSlash || 151 || 20 || RevNaturalRt || 0
 +
|- style="text-align: center;"
 +
| 9 || StUHook || 1 || 21 || NaturalRt || 79
 +
|- style="text-align: center;"
 +
| 10 || StLHook || 119 || 22 || Lightning || 0
 +
|- style="text-align: center;"
 +
| 11 || 8Rest || 150 || 23 || Flat || 81
 +
|}
 +
<br>
 +
The SNU dataset and its label were used in our paper<ref>[https://link.springer.com/article/10.1007/s10032-017-0281-y?wt_mc=Internal.Event.1.SEM.ArticleAuthorOnlineFirst  J. Oh, S. J. Son, S. Lee, J.-W. Kwon, and N. Kwak, Online Recognition of Handwritten Music Symbols, International Journal of Document Analysis and Recognition, vol. 20, pp. 79-89, June 2017.]</ref>.
  
 
<br>
 
<br>
 +
 
==References==
 
==References==
 
<references />
 
<references />

Latest revision as of 11:22, 15 June 2017

Download

- SNU dataset
- The labels of strokes in SNU dataset

SNU dataset

For research on the recognition of online handwritten music symbols the SNU dataset provides handwritten music symbol samples which were drawn by 18 musicians. Similar to the HOMUS dataset<ref>J. Calvo-Zaragoza and J. Oncina, Recognition of Pen-Based Music Notation: the HOMUS dataset, ICPR 2014.</ref>, the SNU dataset consists of total 1716 symbol samples and the symbol set in SNU dataset is a subset of HOMUS dataset. Each symbol sample in the SNU dataset, like the HOMUS dataset, consists of at least one stroke and a stroke is defined as a sequence of two dimensional points, which are the successive locations of a stylus pen on a device in time sequence while the pen touches the device. The symbols in the SNU dateset are a subset of the HOMUS dataset. There are 16 musical symbols in SNU dataset which is summarized in Table 1 and Table 2. Each directory contains text files which were contain handwritten music symbol data drawn by each musician.
The text files consist of

- Symbol Name
- Sets of 2D points representing each stroke seperated by a semicolon.

An example of a text file:

1. Natural
2. 482.22,409.96;481.84,411.24;481.19,414.07;480.81,419.50;480.38,425.88;480.17,436.76;481.06,438.69;484.66,439.67;488.73,439.41;493.95,438.00;496.18,436.89;496.01,435.99;496.01,435.99;
3. 482.56,418.09;482.56,418.09;483.29,418.73;485.30,420.19;488.56,421.77;492.03,423.53;495.19,427.13;493.61,435.52;491.60,439.97;490.87,454.14;490.78,460.31;492.07,470.37;492.07,470.37;


Alt text
Figure 1. Examples of 16 symbols in SNU dataset


Table 1. Types of musical symbols in the SNU dataset
Note whole, half, quarter, eighth, sixteenth
Rest whole/half, quarter, eighth, sixteenth
Accidentals flat, sharp, natural
Clef G-clef, F-clef
Others dot, barline


Table 2. The number of each symbol in Fig. 1
Symbol Num. of symboles Symbol Num. of symboles
1 BarLine 263 9 Natural 80
2 Dot 81 10 Quarter-Note 108
3 Eighth-Note 142 11 Quarter-Rest 90
4 Eighth-Rest 71 12 Sharp 111
5 F-Clef 90 13 Sixteenth-Note 140
6 Flat 88 14 Sixteenth-Rest 72
7 G-Clef 147 15 Whole-Half-Rest 54
8 Half-Note 108 16 Whole-Note 71


In the choice, we tried to define as small number of the basic strokes as possible keeping the similarity between any pair of the strokes to be as small as possble, and each basic stroke was enforced to contain somewhat large variations. With those basis strokes, we labeled all the strokes as one of the twenty four classes, which is summarized in Table 1. Note that most samples corresponding to 24 symbols in Fig. 1 can be represented in the combinations of the basic strokes in Fig. 2.

The labels of strokes in SNU dataset

In the same manner as described at Labels of Strokes in HOMUS Dataset the SNU dataset was labeled.<ref>H. Miyao and M. Maruyama, An online handwritten music symbol recognition system, International Journal of Document Analysis and Recognition, vol. 9, pp. 49-58, 2007.</ref> The difference is that basic stroke set included in SNU dataset is smaller than it in HOMUS dataset. Same as symbol set in SNU dataset the stroke set is a subset of the HOMUS dataset. Table 3. shows more detail. Also there are no time signature in the SNU dataset the labels doesn't include 100.
The label files consist of

- Symbol Name
- Label number of each stroke

An example of a label file:

1. Natural
2. 10
3. 21


Alt text
Figure 2. Examples of 19 basic strokes in SNU dataset


Table 3. The number of each stroke in Fig. 2
Stroke label Stroke Num. of strokes Stroke label Stroke Num. of strokes
0 None 75 12 RestArc 52
1 VLine 914 13 RestArc2 10
2 HLine 141 14 QRest 90
3 CommonTimeArc 0 15 Fill 52
4 Dot 268 16 WRest 6
5 WHead 174 17 GClef 141
6 BHead 357 18 FClefArc 98
7 LSlash 303 19 CClef2Arc 0
8 RSlash 151 20 RevNaturalRt 0
9 StUHook 1 21 NaturalRt 79
10 StLHook 119 22 Lightning 0
11 8Rest 150 23 Flat 81


The SNU dataset and its label were used in our paper<ref>J. Oh, S. J. Son, S. Lee, J.-W. Kwon, and N. Kwak, Online Recognition of Handwritten Music Symbols, International Journal of Document Analysis and Recognition, vol. 20, pp. 79-89, June 2017.</ref>.


References

<references />


Contact

Sung Joon Son, Ph.D. candidate, E-mail: sjson718_at_snu_dot_ac_dot_kr