breadability

Reworked Python Readability parsing library.

breadability - another readability Python port

I've tried to work with the various forks of an ancient codebase that ported readability_ to Python. The lack of tests, the unused regexes, and the commented-out sections of code in the other Python ports just drove me nuts.

I made an effort to merge several of the better forks into one codebase, but they've diverged so much that I just can't work with them.

So what's any sane person to do? Re-port it with my own repo, add some tests, infrastructure, and try to make this port better. OSS FTW (and yea, NIH FML, but oh well I did try)

This is a pretty straight port of the JS here:

Installation

This depends on lxml, so you'll need the libxml2 and libxslt C headers installed before pip can compile it.

::

    sudo apt-get install libxml2-dev libxslt-dev
    pip install breadability
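
If the compile step fails, pip's output can be noisy, so it may help to check afterwards that both packages actually landed on the import path. The snippet below is only a rough sketch using the standard library; the helper name ``is_installed`` is made up for this example and is not part of breadability.

```python
import importlib.util


def is_installed(module_name):
    """Return True if `module_name` can be located on the import path.

    Hypothetical helper for this README, not part of breadability.
    """
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # lxml is the compiled dependency; breadability is the package itself.
    for name in ("lxml", "breadability"):
        status = "found" if is_installed(name) else "MISSING"
        print(f"{name}: {status}")
```

If lxml shows up as missing, the usual suspect is the C headers from the apt-get line above.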

Usage

Command line


::

    $ breadability http://wiki.python.org/moin/BeginnersGuide

Options