extraction-framework icon indicating copy to clipboard operation
extraction-framework copied to clipboard

Dbpedia History

Open datalogism opened this issue 3 years ago • 6 comments

First prototype of DBpedia History

Summary by CodeRabbit

  • New Features

    • REST-based NIF extraction path.
    • New History extraction module producing HistoryData and HistoryStats (standard and Spark jobs).
    • Sample/minidump generation scripts for testing and demos.
  • Improvements

    • More robust link parsing and HTML cleaning in NIF extraction.
    • Expanded MediaWiki connection options; default parallelism reduced for stability.
    • Updated template mappings for English, French, and Hungarian.
  • Documentation

    • Added History module README and abstract test guide.
  • Tests

    • New end-to-end extraction tests and utilities.

datalogism avatar Dec 06 '22 14:12 datalogism

Kudos, SonarCloud Quality Gate passed!    Quality Gate passed

Bug A 0 Bugs
Vulnerability A 0 Vulnerabilities
Security Hotspot A 0 Security Hotspots
Code Smell A 0 Code Smells

No Coverage information No Coverage information
No Duplication information No Duplication information

sonarqubecloud[bot] avatar Dec 08 '22 08:12 sonarqubecloud[bot]

Please retry analysis of this Pull-Request directly on SonarCloud.

sonarqubecloud[bot] avatar Jan 06 '23 12:01 sonarqubecloud[bot]

Please retry analysis of this Pull-Request directly on SonarCloud.

sonarqubecloud[bot] avatar Jan 06 '23 12:01 sonarqubecloud[bot]