Search issues

Alternative solutions for NER with N>>1 entity types

1

## Scope If we need to implement a NER system supporting mining of `N` different entity types, we can do so in different way. In particular, if `N>>1` different strategies...

FrancescoCasalegno

🔤 named-entity-recognition

Parameters extraction from tables in papers

All the features built so far support parameter extraction from unstructured natural language. This allows us to efficiently extract structured information from the text paragraphs of a paper, but in...

FrancescoCasalegno

Evaluate requirements for scaling on large number of papers (> 1 M)

Until now the runtimes of both "Search" and "Mine" functionalities have been acceptable. But the code was tested only up to ~100,000 full-text papers (size of CORD-19 v65). As we...

FrancescoCasalegno

🗄️ database

Implement utils for downloading large amounts of papers

1

The goal of this ticket is to create capabilities to download large numbers of neuroscientific papers. Ideally these papers should be in a machine readable format like text, json, html,...

FrancescoCasalegno

🗄️ database

Compare NER models obtained from different pre-trained base models

## Scope When we train a NER model, we can choose various pre-trained base models to fine-tune them on the NER task. For instance, we can choose any of `scispacy`'s...

FrancescoCasalegno

🔤 named-entity-recognition

Search
Search copied to clipboard

Metadata

Alternative solutions for NER with N>>1 entity types

Parameters extraction from tables in papers

Evaluate requirements for scaling on large number of papers (> 1 M)

Implement utils for downloading large amounts of papers

Compare NER models obtained from different pre-trained base models

Revisit benchmarks and use .env file

Implement custom model selection for BERT training

Better integration between Dockerfile and DVC

SearchServer and compute_embeddings should support same models

Automatize MySQL database setup

← Metadata

Owner

Metadata

Search Search copied to clipboard

Metadata

← Metadata

Owner

Metadata

Search
Search copied to clipboard