Synthetic Data Tool
See Synthetic Data.
I've updated the repository with a "Noisy Data" feature.
This will generate bulk data with invalid field values.
It helps with testing ETL pipelines to be sure they will scale to the expected volumes.
more ...See Synthetic Data.
I've updated the repository with a "Noisy Data" feature.
This will generate bulk data with invalid field values.
It helps with testing ETL pipelines to be sure they will scale to the expected volumes.
more ...I've published a framework for doing SQL-like programming in Pure Python -- no database required.
Here: https://github.com/slott56/functional-SQL. See the functional-SQL documentation.
This allows us to transform SQL:
SELECT n.name, v.c2
FROM names_table n, values_table v
WHERE n.code = v.c1
To pure Python:
Select(name …I've published a framework for making tools to leverage formal schema definitions to synthesize bulk data for performance tuning.
See the original talk.
See the documentation
more ...Book? Second draft (with tech review comments addressed) off to editors.
Boat? Still on the hard during Hurricane Season.
I've had a passing interest in data synthesis for decades.
Early on in my career, I figured out how the Z/OS IEDBG utility worked. See https://www.ibm …