Skip to content

reorx/readability

Repository files navigation

Readability

Another algorithm & implementation of widely known readability conception.

Usage:

import requests
from readability import Readability

html = requests.get('http://blog.hucheng.com/articles/482.html').content
parser = Readability(html.decode('utf8'))

parser.title
parser.article
parser.article.get_text()

About

html main body extractor

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published