mirror of
https://github.com/bung87/webpage_extractors
synced 2026-01-14 13:31:34 +00:00
No description
| evaluation | ||
| src | ||
| tests | ||
| .gitignore | ||
| README.md | ||
| webpage_extractors.nimble | ||
webpage_extractors
web page html content extractors
The goal is providing serveral extractors and compare their performance.
Note: Under development, Apis can be changed at any time.
Apis
Basic content extractor, no need for language detection and stop words.
proc extractContentBasic*(s: string, textOnly = false): string =