large-dataset topic
chart-fx
A scientific charting library focused on performance optimised real-time data visualisation at 25 Hz update rates for data sets with a few 10 thousand up to 5 million data points.
disk.frame
Fast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data
Discovery
Mining Discourse Markers for Unsupervised Sentence Representation Learning
TensorFlow-Input-Pipeline
TensorFlow Input Pipeline Examples based on multi-thread and FIFOQueue
Fraud-Detection-in-Online-Transactions
Detecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Alg...
bigreadr
R package to read large text files based on splitting + data.table::fread
saxophone
Fast and lightweight event-driven streaming XML parser in pure JavaScript
large-network-analysis-tools
Tools and code samples for solving large network analysis problems in ArcGIS Pro
SPEC5G
This repository contains the code and data of the paper titled "SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis" published at AACL 2023.