big-data topic

List big-data repositories

crate

4.0k
Stars
546
Forks
Watchers

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

cogcomp-nlp

469
Stars
144
Forks
Watchers

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, t...

hazelcast

6.0k
Stars
1.8k
Forks
Watchers

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

data-accelerator

295
Stars
89
Forks
Watchers

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...

fastjson2

3.5k
Stars
452
Forks
Watchers

🚄 FASTJSON2 is a Java JSON library with excellent performance.

scikit-learn-intelex

1.2k
Stars
169
Forks
Watchers

Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application

genie

1.7k
Stars
364
Forks
Watchers

Distributed Big Data Orchestration Service

Decentralized-Internet

488
Stars
193
Forks
Watchers

A SDK/library for decentralized web and distributing computing projects

kafka-ui

8.7k
Stars
1.1k
Forks
Watchers

Open-Source Web UI for Apache Kafka Management

aws-etl-orchestrator

324
Stars
136
Forks
Watchers

A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.