The current scripts do two things:
-
"Normalizing" a
SELECT-FROM-WHERE
query, by striping away everything that is not a join predicate (t1.col1 = t2.col2
) from theWHERE
clause. -
Translate the normalized query to CQ (a.k.a. tensor algebra), dropping any column that does not participate in any join. If we kept all columns the CQ would be way too large, and we would generate extremely deep loop nests in TACO.
-
Compile with
cargo build --release
(install cargo here) -
Enter pipenv with
pipenv shell
(install pipenv here) -
Run with
cargo run 9b.sql
, where9b.sql
is a file containing 1 SQL query. -
Use
fmtsql.py
to format the SQL output; runsed -E 's/(\_|\.)//g' 9b.cq
with9b.cq
containing the CQ output to strip.
and_
away from the CQ.