article-extractor topic

List article-extractor repositories

article-parser

50
Stars
6
Forks
Watchers

Extract article or news by url or html, parse the title and content, output in markdown format.

saber

31
Stars
21
Forks
Watchers

【 Spring Boot 实战开发】10 分钟快速构建一个自己的技术文章博客

wos-excel-converter

30
Stars
7
Forks
Watchers

This is a small and easy-to-use desktop application that allows exporting Web of Science API Expanded and InCites API data in Excel/CSV/JSON/XML with a configurable and flexible data export structure.

dnlp

21
Stars
5
Forks
Watchers

📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа

IKFB

33
Stars
3
Forks
Watchers

Involution King Fun Book (IKFB, Chinese: 快卷, 卷王快乐本) is an integrated management system for papers and literature. Powered by Electron.

Dcinside_Explorer_Python

18
Stars
2
Forks
Watchers

디시인사이드 Client-Side 글 검색기 입니다.

textractor

15
Stars
4
Forks
Watchers

从html中提取正文,用于新闻类网页

extractor

57
Stars
6
Forks
57
Watchers

Using LLMs and AI browser automation to robustly extract web data

MinerU-HTML

154
Stars
18
Forks
154
Watchers

MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.