SCB-dataset icon indicating copy to clipboard operation
SCB-dataset copied to clipboard

Student Classroom Behavior dataset

1 SCB-dataset

Student Classroom Behavior dataset

Paper Name Dataset and Trained weights Link class
Student Classroom Behavior Detection based on Improved YOLOv7 Baidu Netdisk dataset extraction code: kjek
Baidu Netdisk models extraction code: epvk
hand-raising, reading, writing
SCB-dataset Baidu Netdisk Dataset & Models extraction code:SCB5
huggingface
image

Baidu Netdisk Dataset & Models extraction code:qg8k

This is the data of SCB-Dataset3, which includes university classroom data. However, we are remaking it because the previous data had issues such as insufficient quantity and single scenarios.

IMG_9915

2 Contact

If you encounter any issues with the SCB Dataset or cooperation, please feel free to contact me via email at : [email protected]

3 YOLO + CrowdHuman

model Trained weights Link
YOLOv5 crowdhuman_vbody_yolov5m Baidu Netdisk YOLOv5 crowdhuman_vbody_yolov5m extraction code: 5qcv
google drive YOLOv5 crowdhuman_vbody_yolov5m
YOLOv7x Baidu Netdisk YOLOv7x CrowdHuman extraction code: ewop
google drive YOLOv7x CrowdHuman
YOLOv7 Baidu Netdisk YOLOv7 CrowdHuman extraction code: ll6n
google drive YOLOv7 CrowdHuman
... ...

4 Commercial Use Restrictions

Please note that this project (including but not limited to code, datasets, model weights, etc.) is intended for academic research, personal learning, and non-commercial use only. Any commercial use (including but not limited to commercial software development, commercial data services, commercial product development, etc.) is strictly prohibited without the explicit written permission of the copyright holder. Any unauthorized commercial use will constitute an infringement of the copyright holder's rights, and the copyright holder reserves the right to pursue legal action. If you have a need for commercial use, please contact the copyright holder for authorization through the following means:

  • Contact Email: [email protected] The copyright holder will decide whether to grant commercial use permissions based on the specific circumstances and may require the signing of a corresponding authorization agreement.

5 Copyright Statement

All content of this project, including but not limited to code, datasets, model weights, documents, etc., is protected by copyright. The copyright holder reserves all rights. Without authorization, no individual or organization is permitted to copy, distribute, modify, or otherwise use the content of this project unless explicitly granted permission by the copyright holder. Thank you for respecting and supporting the rights of the copyright holder.

Citation

Please cite the following paper if you use our dataset.

@article{yang2023scb,
  title={SCB-dataset: A dataset for detecting student classroom behavior},
  author={Yang, Fan},
  journal={arXiv preprint arXiv:2304.02488},
  year={2023}
}

6 Papers and Software copyright

6.1 Papers

29 October 2023, ICIG 2023 , Student Classroom Behavior Detection Based on YOLOv7+BRA and Multi-model Fusion

杨帆,詹泽慧.基于RT-DETR-ASF的学生科学探究实验行为检测研究[J].数字教育,2024,10(05):14-23.

6.2 Software copyright

Register number Full name of software File link
2023SR0518754 基于via的课堂学生行为数据标注与yolov7目标检测系统 Baidu Netdisk extraction code: fixx
2023SR0645718 基于via的课堂学生行为数据标注与yolov7目标检测与自动标注系统 Baidu Netdisk extraction code: ssda
2023SR1443796 基于改进的YOLO的学生课堂数据自动标注与目标检测系统 Baidu Netdisk extraction code: jluk

7 Acknowledgements

For the development of this dataset, we would like to thank the following individuals, companies and universities for their support:

ELEMENT TECH(成都元素科技有限公司 https://www.elementech.net/), SICHUAN UNIVERSITY OF SCIENCE AND TECHNOLOGY, Wang Wengang(四川科技职业技术学院 王文刚), Urban Vocational College Of Sichuan, Li Jianlong(四川城市职业学院 李健龙), Chengdu Neusoft University(成都东软学院), Sichuan Normal University(四川师范大学), Beijing Normal University(北京师范大学)

8 Open Source Dataset

8.1 STBD-08

Thanks to the CBPH-Net https://github.com/icedle/CBPH-Net authors for their contributions, here is the download link for the CBPH-Net contribution dataset STBD-08:

Paper Name Paper Link Dataset Link
CBPH-Net: A Small Object Detector for Behavior Recognition in Classroom Scenarios https://ieeexplore.ieee.org/document/10185142 or https://docs.qq.com/pdf/DWFpWbWhnaHpaRm9x Baidu Netdisk dataset extraction code: 6tvu

BiTNet: A lightweight object detection network for real-time classroom behavior recognition with transformer and bi-directional pyramid network

CBPH-Net: A Small Object Detector for Behavior Recognition in Classroom Scenarios

The STBD-08 paper claims that the dataset includes 4432 images and 151574 annotations. Nevertheless, when we calculated the data provided by the author, we discovered that the actual quantities are much larger than these figures. Our statistics indicate that the dataset has 8884 images (with 7052 in the training set and 1,832 in the validation set) and 267,888 annotations (including 212,728 in the training set and 55,160 in the validation set).

Through online search, we found that the STBD-08 dataset is completed based on the dataset publicly sale online (the dataset also has 8,884 pieces), and the data volume is far lower than that of the dataset publicly sold online.

However, when we cleaned the STBD-08 dataset, we found that there are still many problematic data in it, such as non-standard bounding boxes (bbox) and some class labeling errors.

In other words, the author of STBD-08 has not made the dataset they created public, and only the original dataset purchased online is disclosed.

Train Val Total
Writing 57,164 15,298 72,462
Reading 46,872 12,060 58,932
Listening 93,509 24,019 117,528
Turning around 4,314 1,025 5,339
Raising hand 3,336 847 4,183
Standing 3,287 814 4,101
Discussing 3,710 953 4,663
Guiding 536 144 680
Total 212,728 55,160 267,888

8.2 A dataset of student classroom behavior from a paid website

This dataset is highly similar to STBD-08, mainly in that the images are highly similar and the quality of annotations is also poor

The original website Resource link (including dataset)
https://mbd.pub/o/bread/ZZiTl5lw Baidu Netdisk dataset extraction code: fwie

8.3 universe roboflow website

name link
classroom Computer Vision Project 1.6k link
Student Behaviour Detection Computer Vision Project 2.5k link
Class Monitoring Computer Vision Project 2.7k link
S.B.C Computer Vision Project 5.8k link
Multi_all Computer Vision Project 5.8k link
per Computer Vision Project 1.6k link
... ...

8.4 ClaBehavior

The ClaBehavior paper mentions 1,342 images and 9,911 annotations. However, we actually found only 400 images and 8,083 annotations on GitHub, including:

  • Train Dataset: 360 images and 7,250 annotations
  • Val Dataset: 40 images and 833 annotations

The categories include: Write, Read, Lookup, Turn_head, Raise_hand, Stand, Discuss.

Train Val Total
Write 520 59 579
read 920 101 1021
lookup 4045 435 4480
turn_head 915 96 1011
raise_hand 569 115 684
stand 58 8 66
discuss 223 19 242
Total 7250 833 8083

8.5 SCBehavior

The SCBehavior paper mentions that there are 1346 images. However, when we checked the author's GitHub, we found only 400 damaged images that cannot be viewed (360 in the Train Dataset and 40 in the Val Dataset).

8.6 UK_Datasets

UK_Datasets paper is derived from the 2019 elementary school classroom videos collected from the National Education Resources Public Service Platform (NERPSP).

UK_Datasets extracted 8754 images by frame, and considering the detection needs in real classroom scenarios, it classified these images into eight categories of typical student behaviors: writing, reading, listening, raising hands, turning, standing, discussing, and accepting teacher instructions.

The author categorized the test set portions of UK_Datasets according to the degree of occlusion: "Heavy Occlusion (HO)" and "Low Occlusion (LO)".

Unfortunately, when we downloaded the UK_Datasets for statistics, we found that the data was not original. Specifically, it originated from the 2.1 STBD-08 section and the dataset publicly available online as introduced in this paper. The author merely divided and counted these existing datasets.

Since the data itself is plagiarized and not original, this paper will not conduct statistical analysis on its data.

8.7 BNU-Wu Student-Class-Behavior-Dataset

not open source

9 Close Source Dataset

Paper Class and Static
YOLO-CBD: Classroom Behavior Detection Method Based on Behavior Feature Extraction and Aggregation focus and distract, 1000 images. Note: "Focus and distract" is a very special class, which is quite interesting. However, as a relatively subjective class, it should be rather difficult to implement in actual annotation.
CB Dataset -- Object Detector Based on Center Keypoints for Behavior Recognition in Classroom Scenes listening (11,934),noting (8,727), playing (5,649), and groveling (2,977)
HRSW dataset -- PACR-DETR: A Real-Time End-to-End Object Detector for Behavior Recognition in Various Classroom Scenarios rise hand, read, sleep, and write, 4881 images, 12631 annotations.
TCBDS -- Improving YOLOv7 for Large Target Classroom Behavior Recognition of Teachers in Smart Classroom Scenarios facing the board (1,410), facing the students(1,415), writing on the board (1,034), teaching while facing the board(869), teaching while facing the students (978), and interactive (1,525). 6,660 images ( 5,328 train images and 1,332 val images). Note: Teacher Classroom Behavior Data Set (TCBDS)
SCB-E -- SCB-LEDN: Lightweight and Efficient Object Detection Network for Student Classroom Behavior raising hands, reading, sleeping, writing, and using a mobile phone, 6,489 trainval images and 722 testing images
RSCB-Dataset -- LDSBC: Lightweight Detection Network for Student Behavior in Classroom Scenario raising hands, reading, writing, sleeping, and using mobile phones, 5,221 images and 19,000 instances of specific behaviors.
SB dataset -- Multi-object behaviour recognition based on object detection cascaded image classification in classroom scenes listening (9,343), noting (7,243), playing (5,215), and grovelling (3,504)
ActRec-Classroom -- Learning behavior analysis in classroom based on deep learning listening carefully, hand raising to answer questions, participating in discussions, reading and note taking, 5,126 images
A large-scale dataset for student behavior -- Intelligent student behavior analysis system for real classrooms hand-raising (70,000), standing (20,000), sleeping (3,000), 29000 training images, 11000 validate images
BNU-LCSAD -- Student Class Behavior Dataset: a video dataset for recognizing, detecting, and captioning students’ behaviors in classroom scenes listening carefully (984), taking notes (582), using mobile phones (545), yawning (520), eating or drinking (515), reading (365), discussing (265), looking around (252), using computers (168), sleeping or snoring (80), and raising hands (15)
Student Classroom Behavior Dataset -- Classroom learning status assessment based on deep learning raising hands (10,000), bending over (10,00), walking back and forth(10,000), writing on the blackboard (10,000), looking up (10000), bowing their heads (10,000), standing (10,00), lying on their desks (1,000).
Student behavior dataset -- Student behavior recognition for interaction detection in the classroom environment look at phone, listen to, stand, sleep, sit, talk, and write, 20,409 frames
Student action dataset -- Real-Time Attention Monitoring System for Classroom: A Deep Learning Approach for Student’s Behavior Recognition high and low attention, high: focused and raising hands, low: feeling bored, eating/drinking, laughing, reading, using a phone, distracted, and writing, 3,881 images
A large-scale student behavior dataset -- StuArt: Individualized Classroom Observation of Students with Automatic Behavior Recognition and Tracking hand-raising(70k), standing(21k), sleeping(3k), yawning(3,216) and smiling(129k), techear(15k), 36k images
Classroom behavior dataset -- BiTNet: A lightweight object detection network for real-time classroom behavior recognition with transformer and bi-directional pyramid network writing, reading, listening, raising hand, turning around, standing, discussing, and guiding, 4432 images and 151574 annotation boxes
Improved YOLOv8 algorithm for classroom student behavior detection eating(1,200), raising hands(1,000), reading(1,000), sleeping on the desk(1,000), and writing(1,000), 5200 images

Star History

Star History Chart