better-exceptions - Pretty and useful exceptions in Python, automatically. (more…)
Read more »
I have a particular view of Python performance from my experience on the Pyston project, and since this view is somewhat nonstandard I wanted to take some time to explain it and give a motivating example. (more…)
Read more »
MusicTools - Python library with tools to download, label and sort music files. (more…)
Read more »
Shuffle is a key primitive in large-scale data processing applications. The
difficulty of large-scale shuffle has inspired the development of many
specialized shuffle systems. While these systems greatly improve shuffle
performance and reliability, they c... (more…)
Read more »
In Part 2 of our comparison of Python and go from a Data Engineering perspective, we'll finally take a look at Apache Beam and Google Dataflow and how the go SDK and the Python SDK differ, what drawbacks we're dealing with, how fast it is by running exten... (more…)
Read more »