VIDA-NYU

Results: 16 repositories owned by VIDA-NYU

ache

438 Stars, 134 Forks

ACHE is a web crawler for domain-specific search.

reprozip

298 Stars, 33 Forks

ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently used common denominator in computational science.

auctus

41 Stars, 10 Forks

Dataset search engine that discovers data from a variety of sources, profiles it, and supports advanced queries on the index.

data-polygamy

43 Stars, 19 Forks

Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.

domain_discovery_tool

47 Stars, 18 Forks

This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.

domain_discovery_tool_deprecated

23 Stars, 8 Forks

Seed acquisition tool to bootstrap focused crawlers.

openclean

61 Stars, 4 Forks

openclean is a data cleaning and data profiling library for Python.

PipelineVis

82 Stars, 6 Forks

Pipeline Profiler is a tool for visualizing machine learning pipelines generated by AutoML tools.

shadow-accrual-maps

26 Stars, 3 Forks

Accumulated shadow data computed for New York City.