An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pdf

aphp/edspdf

EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.

Language: Python - Size: 8.93 MB - Last synced at: about 12 hours ago - Pushed at: 3 months ago - Stars: 48 - Forks: 7

aphp/edspdf-poppler

Poppler extension for EDS-PDF

Language: Python - Size: 1.53 MB - Last synced at: about 12 hours ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

aphp/edspdf-mupdf

MuPDF extension for EDS-PDF

Language: Python - Size: 1.56 MB - Last synced at: about 12 hours ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0