Water-Meter-Number-DataSet icon indicating copy to clipboard operation
Water-Meter-Number-DataSet copied to clipboard

The water-meter images are captured by camera and labeled with water-meter number, for the research of the water-meter image recognition.

Water-Meter-Number-DataSet

The Water Meter Number DataSet for the research of Water Meter Number Recognition is released by Deep Leaning and Visual Computing Lab of South China University of Technology.

Note: The Water Meter Number DataSet can only be used for non-commercial research purpose.

Download link

Download the SCUT-WMN DataSet from OneDrive.

Description

The Water Meter Number DataSet consists of two parts. The first part contains 5000 noisy samples (as shown in Fig. 1a). Within the noisy samples, there is a wide range of variation caused by illumination, refraction, occlusion, etc. The second part contains 1000 clean samples (as shown in Fig. 1b). Both the noisy and clean samples are labeled with sequential characters like "1; 2; 2; 5; 8".

Figure 1

The number of each character in the dataset is shown in Table 1.

Table 1

In Water Meter Number Recognition, there exist some "mid-state" characters, as shown in Fig. 2. Taking the 4th image (row 2, column 1) in Fig. 2b as an example, the number exceeds "20369" but does not reach "20370", so that the last two characters in the number appear at the "mid-state". And the proper water-meter number should be "20369.5".

Figure 2

To manage "mid-state" characters, we treat the "mid-state" characters as separate classes, with labels ranging from '10' to '19', as shown in Fig. 3.

Figure 3

A label l in [10, 19], denotes a character that exceeds "l − 10" (called the "lower-state") but does not reach "l − 9" (called the "higher-state"). As a special case, label "l = 19" indicates that the character is at "mid-state" between ‘9’ and ‘0’ (ignoring carry). In such setting, the label for the 4th image in Fig. 2b is "2; 0; 3; 16; 19". This label sequence could be further processed to the water-meter number "20369.5", as shown in Equation (1):

Equation 1

If the predicted digit is at the end of the sequence, and is at "mid-state", we subtract the digit by 9.5. For example, a predicted result 12, which means that the corresponding number exceeds 2 but does not reach 3, we choose to predict it as 2 and append a "mid-state" flag to the result by adding the final number with 0.5. If the "mid-state" character is not the last one, then we simply subtract the digit by 10, saying the "lower-state" is selected, which is intuitively more reasonable.

Citation and Contact

Please consider to cite our paper when you use our datasets:

@ARTICLE{8606091,
    title     = {Fully Convolutional Sequence Recognition Network for Water Meter Number Reading},
    author    = {F. Yang and L. Jin and S. Lai and X. Gao and Z. Li},
    journal   = {IEEE Access},
    year      = {2019},
}

For any questions about the dataset, please contact the authors by sending email to [email protected] and [email protected].