SCB-dataset
SCB-dataset copied to clipboard
Student Classroom Behavior dataset
1 SCB-dataset
Student Classroom Behavior dataset
| Paper Name | Dataset and Trained weights Link | class |
|---|---|---|
| Student Classroom Behavior Detection based on Improved YOLOv7 | Baidu Netdisk dataset extraction code: kjek Baidu Netdisk models extraction code: epvk |
hand-raising, reading, writing |
| SCB-dataset | Baidu Netdisk Dataset & Models extraction code:SCB5 huggingface |
Baidu Netdisk Dataset & Models extraction code:qg8k
This is the data of SCB-Dataset3, which includes university classroom data. However, we are remaking it because the previous data had issues such as insufficient quantity and single scenarios.
2 Contact
If you encounter any issues with the SCB Dataset or cooperation, please feel free to contact me via email at : [email protected]
3 YOLO + CrowdHuman
| model | Trained weights Link |
|---|---|
| YOLOv5 crowdhuman_vbody_yolov5m | Baidu Netdisk YOLOv5 crowdhuman_vbody_yolov5m extraction code: 5qcv google drive YOLOv5 crowdhuman_vbody_yolov5m |
| YOLOv7x | Baidu Netdisk YOLOv7x CrowdHuman extraction code: ewop google drive YOLOv7x CrowdHuman |
| YOLOv7 | Baidu Netdisk YOLOv7 CrowdHuman extraction code: ll6n google drive YOLOv7 CrowdHuman |
| ... | ... |
4 Commercial Use Restrictions
Please note that this project (including but not limited to code, datasets, model weights, etc.) is intended for academic research, personal learning, and non-commercial use only. Any commercial use (including but not limited to commercial software development, commercial data services, commercial product development, etc.) is strictly prohibited without the explicit written permission of the copyright holder. Any unauthorized commercial use will constitute an infringement of the copyright holder's rights, and the copyright holder reserves the right to pursue legal action. If you have a need for commercial use, please contact the copyright holder for authorization through the following means:
- Contact Email: [email protected] The copyright holder will decide whether to grant commercial use permissions based on the specific circumstances and may require the signing of a corresponding authorization agreement.
5 Copyright Statement
All content of this project, including but not limited to code, datasets, model weights, documents, etc., is protected by copyright. The copyright holder reserves all rights. Without authorization, no individual or organization is permitted to copy, distribute, modify, or otherwise use the content of this project unless explicitly granted permission by the copyright holder. Thank you for respecting and supporting the rights of the copyright holder.
Citation
Please cite the following paper if you use our dataset.
@article{yang2023scb,
title={SCB-dataset: A dataset for detecting student classroom behavior},
author={Yang, Fan},
journal={arXiv preprint arXiv:2304.02488},
year={2023}
}
6 Papers and Software copyright
6.1 Papers
29 October 2023, ICIG 2023 , Student Classroom Behavior Detection Based on YOLOv7+BRA and Multi-model Fusion
杨帆,詹泽慧.基于RT-DETR-ASF的学生科学探究实验行为检测研究[J].数字教育,2024,10(05):14-23.
6.2 Software copyright
| Register number | Full name of software | File link |
|---|---|---|
| 2023SR0518754 | 基于via的课堂学生行为数据标注与yolov7目标检测系统 | Baidu Netdisk extraction code: fixx |
| 2023SR0645718 | 基于via的课堂学生行为数据标注与yolov7目标检测与自动标注系统 | Baidu Netdisk extraction code: ssda |
| 2023SR1443796 | 基于改进的YOLO的学生课堂数据自动标注与目标检测系统 | Baidu Netdisk extraction code: jluk |
7 Acknowledgements
For the development of this dataset, we would like to thank the following individuals, companies and universities for their support:
ELEMENT TECH(成都元素科技有限公司 https://www.elementech.net/), SICHUAN UNIVERSITY OF SCIENCE AND TECHNOLOGY, Wang Wengang(四川科技职业技术学院 王文刚), Urban Vocational College Of Sichuan, Li Jianlong(四川城市职业学院 李健龙), Chengdu Neusoft University(成都东软学院), Sichuan Normal University(四川师范大学), Beijing Normal University(北京师范大学)
8 Open Source Dataset
8.1 STBD-08
Thanks to the CBPH-Net https://github.com/icedle/CBPH-Net authors for their contributions, here is the download link for the CBPH-Net contribution dataset STBD-08:
| Paper Name | Paper Link | Dataset Link |
|---|---|---|
| CBPH-Net: A Small Object Detector for Behavior Recognition in Classroom Scenarios | https://ieeexplore.ieee.org/document/10185142 or https://docs.qq.com/pdf/DWFpWbWhnaHpaRm9x | Baidu Netdisk dataset extraction code: 6tvu |
BiTNet: A lightweight object detection network for real-time classroom behavior recognition with transformer and bi-directional pyramid network
CBPH-Net: A Small Object Detector for Behavior Recognition in Classroom Scenarios
The STBD-08 paper claims that the dataset includes 4432 images and 151574 annotations. Nevertheless, when we calculated the data provided by the author, we discovered that the actual quantities are much larger than these figures. Our statistics indicate that the dataset has 8884 images (with 7052 in the training set and 1,832 in the validation set) and 267,888 annotations (including 212,728 in the training set and 55,160 in the validation set).
Through online search, we found that the STBD-08 dataset is completed based on the dataset publicly sale online (the dataset also has 8,884 pieces), and the data volume is far lower than that of the dataset publicly sold online.
However, when we cleaned the STBD-08 dataset, we found that there are still many problematic data in it, such as non-standard bounding boxes (bbox) and some class labeling errors.
In other words, the author of STBD-08 has not made the dataset they created public, and only the original dataset purchased online is disclosed.
| Train | Val | Total | |
|---|---|---|---|
| Writing | 57,164 | 15,298 | 72,462 |
| Reading | 46,872 | 12,060 | 58,932 |
| Listening | 93,509 | 24,019 | 117,528 |
| Turning around | 4,314 | 1,025 | 5,339 |
| Raising hand | 3,336 | 847 | 4,183 |
| Standing | 3,287 | 814 | 4,101 |
| Discussing | 3,710 | 953 | 4,663 |
| Guiding | 536 | 144 | 680 |
| Total | 212,728 | 55,160 | 267,888 |
8.2 A dataset of student classroom behavior from a paid website
This dataset is highly similar to STBD-08, mainly in that the images are highly similar and the quality of annotations is also poor
| The original website | Resource link (including dataset) |
|---|---|
| https://mbd.pub/o/bread/ZZiTl5lw | Baidu Netdisk dataset extraction code: fwie |
8.3 universe roboflow website
| name | link |
|---|---|
| classroom Computer Vision Project 1.6k | link |
| Student Behaviour Detection Computer Vision Project 2.5k | link |
| Class Monitoring Computer Vision Project 2.7k | link |
| S.B.C Computer Vision Project 5.8k | link |
| Multi_all Computer Vision Project 5.8k | link |
| per Computer Vision Project 1.6k | link |
| ... | ... |
8.4 ClaBehavior
The ClaBehavior paper mentions 1,342 images and 9,911 annotations. However, we actually found only 400 images and 8,083 annotations on GitHub, including:
- Train Dataset: 360 images and 7,250 annotations
- Val Dataset: 40 images and 833 annotations
The categories include: Write, Read, Lookup, Turn_head, Raise_hand, Stand, Discuss.
| Train | Val | Total | |
|---|---|---|---|
| Write | 520 | 59 | 579 |
| read | 920 | 101 | 1021 |
| lookup | 4045 | 435 | 4480 |
| turn_head | 915 | 96 | 1011 |
| raise_hand | 569 | 115 | 684 |
| stand | 58 | 8 | 66 |
| discuss | 223 | 19 | 242 |
| Total | 7250 | 833 | 8083 |
8.5 SCBehavior
The SCBehavior paper mentions that there are 1346 images. However, when we checked the author's GitHub, we found only 400 damaged images that cannot be viewed (360 in the Train Dataset and 40 in the Val Dataset).
8.6 UK_Datasets
UK_Datasets paper is derived from the 2019 elementary school classroom videos collected from the National Education Resources Public Service Platform (NERPSP).
UK_Datasets extracted 8754 images by frame, and considering the detection needs in real classroom scenarios, it classified these images into eight categories of typical student behaviors: writing, reading, listening, raising hands, turning, standing, discussing, and accepting teacher instructions.
The author categorized the test set portions of UK_Datasets according to the degree of occlusion: "Heavy Occlusion (HO)" and "Low Occlusion (LO)".
Unfortunately, when we downloaded the UK_Datasets for statistics, we found that the data was not original. Specifically, it originated from the 2.1 STBD-08 section and the dataset publicly available online as introduced in this paper. The author merely divided and counted these existing datasets.
Since the data itself is plagiarized and not original, this paper will not conduct statistical analysis on its data.
8.7 BNU-Wu Student-Class-Behavior-Dataset
9 Close Source Dataset
| Paper | Class and Static |
|---|---|
| YOLO-CBD: Classroom Behavior Detection Method Based on Behavior Feature Extraction and Aggregation | focus and distract, 1000 images. Note: "Focus and distract" is a very special class, which is quite interesting. However, as a relatively subjective class, it should be rather difficult to implement in actual annotation. |
| CB Dataset -- Object Detector Based on Center Keypoints for Behavior Recognition in Classroom Scenes | listening (11,934),noting (8,727), playing (5,649), and groveling (2,977) |
| HRSW dataset -- PACR-DETR: A Real-Time End-to-End Object Detector for Behavior Recognition in Various Classroom Scenarios | rise hand, read, sleep, and write, 4881 images, 12631 annotations. |
| TCBDS -- Improving YOLOv7 for Large Target Classroom Behavior Recognition of Teachers in Smart Classroom Scenarios | facing the board (1,410), facing the students(1,415), writing on the board (1,034), teaching while facing the board(869), teaching while facing the students (978), and interactive (1,525). 6,660 images ( 5,328 train images and 1,332 val images). Note: Teacher Classroom Behavior Data Set (TCBDS) |
| SCB-E -- SCB-LEDN: Lightweight and Efficient Object Detection Network for Student Classroom Behavior | raising hands, reading, sleeping, writing, and using a mobile phone, 6,489 trainval images and 722 testing images |
| RSCB-Dataset -- LDSBC: Lightweight Detection Network for Student Behavior in Classroom Scenario | raising hands, reading, writing, sleeping, and using mobile phones, 5,221 images and 19,000 instances of specific behaviors. |
| SB dataset -- Multi-object behaviour recognition based on object detection cascaded image classification in classroom scenes | listening (9,343), noting (7,243), playing (5,215), and grovelling (3,504) |
| ActRec-Classroom -- Learning behavior analysis in classroom based on deep learning | listening carefully, hand raising to answer questions, participating in discussions, reading and note taking, 5,126 images |
| A large-scale dataset for student behavior -- Intelligent student behavior analysis system for real classrooms | hand-raising (70,000), standing (20,000), sleeping (3,000), 29000 training images, 11000 validate images |
| BNU-LCSAD -- Student Class Behavior Dataset: a video dataset for recognizing, detecting, and captioning students’ behaviors in classroom scenes | listening carefully (984), taking notes (582), using mobile phones (545), yawning (520), eating or drinking (515), reading (365), discussing (265), looking around (252), using computers (168), sleeping or snoring (80), and raising hands (15) |
| Student Classroom Behavior Dataset -- Classroom learning status assessment based on deep learning | raising hands (10,000), bending over (10,00), walking back and forth(10,000), writing on the blackboard (10,000), looking up (10000), bowing their heads (10,000), standing (10,00), lying on their desks (1,000). |
| Student behavior dataset -- Student behavior recognition for interaction detection in the classroom environment | look at phone, listen to, stand, sleep, sit, talk, and write, 20,409 frames |
| Student action dataset -- Real-Time Attention Monitoring System for Classroom: A Deep Learning Approach for Student’s Behavior Recognition | high and low attention, high: focused and raising hands, low: feeling bored, eating/drinking, laughing, reading, using a phone, distracted, and writing, 3,881 images |
| A large-scale student behavior dataset -- StuArt: Individualized Classroom Observation of Students with Automatic Behavior Recognition and Tracking | hand-raising(70k), standing(21k), sleeping(3k), yawning(3,216) and smiling(129k), techear(15k), 36k images |
| Classroom behavior dataset -- BiTNet: A lightweight object detection network for real-time classroom behavior recognition with transformer and bi-directional pyramid network | writing, reading, listening, raising hand, turning around, standing, discussing, and guiding, 4432 images and 151574 annotation boxes |
| Improved YOLOv8 algorithm for classroom student behavior detection | eating(1,200), raising hands(1,000), reading(1,000), sleeping on the desk(1,000), and writing(1,000), 5200 images |