Skip to content

Conversation

@luliqwerty
Copy link
Contributor

This PR is to Integrating DQE (Differential Query Execution) Testing Approaches into SQLancer. More information see DQE paper.

Take MySQL as the first step.
The code is divided into three main parts:

  1. DQEBase.class: defines the core logic of DQE, including the addition and deletion of auxiliary columns, and the comparison of query results.

  2. SQLQueryError.java: provide standardized error representation and comparison functionality for SQL query operations in DQE.

  3. MySQLDQEOracle.class: Inherits from DQEBase and implements the specific logic of executing DQE in MySQL, including generating queries, executing queries, comparing results, etc.

@mrigger
Copy link
Contributor

mrigger commented Jun 30, 2025

Thanks, that's great! Some high-level comments first:

  • for the SQLQueryError, this looks quite similar to ExpectedErrors. Do we really need this class or can we combine these classes?
  • for statements like UPDATE, DELETE, and ALTER TABLE, do you think we can use and extend the generator classes rather than implementing the logic in the DQE oracle?

@luliqwerty
Copy link
Contributor Author

Thanks, that's great! Some high-level comments first:

  • for the SQLQueryError, this looks quite similar to ExpectedErrors. Do we really need this class or can we combine these classes?
  • for statements like UPDATE, DELETE, and ALTER TABLE, do you think we can use and extend the generator classes rather than implementing the logic in the DQE oracle?

Regarding the SQLQueryError and ExpectedErrors classes, while they do share some surface-level similarities, I think it's important to highlight that their purposes and usage patterns are quite different in practice:

  • SQLQueryError is specifically designed to represent actual errors returned by the database, with structured fields like error level, code, and message. It plays a crucial role in DQE logic to distinguish between different types of errors returned by statements such as UPDATE, SELECT, or DELETE.
  • On the other hand, ExpectedErrors serves as a declarative mechanism for defining acceptable errors during testing, primarily relying on substring or regex-based matching. It does not require the same level of structural detail as SQLQueryError.

Given these differences, merging them into a single class could lead to confusion, increased coupling, and reduced clarity in terms of responsibility separation. I believe keeping them separate helps maintain a clean architecture and supports better extensibility in the future.

That said, if you have any alternative suggestions or design ideas on how we can better organize this part of the code, I’d be more than happy to discuss them further. Thanks again for your thoughtful feedback! @mrigger @JensonSung

@JensonSung
Copy link
Contributor

I think current implementation is good but specialized for DQE method.

@JensonSung
Copy link
Contributor

Thanks, that's great! Some high-level comments first:

  • for the SQLQueryError, this looks quite similar to ExpectedErrors. Do we really need this class or can we combine these classes?
  • for statements like UPDATE, DELETE, and ALTER TABLE, do you think we can use and extend the generator classes rather than implementing the logic in the DQE oracle?

First, revising ExepctedErrors like SQLQueryError to provide a structured information may help ease to identify errors compared to a string-formed information. While DQE does not distinguish which error is expected but compare whether errors are consistent. A clear name could be required.
Second, it seems that every kind of statements like update and delete deserve a proper generator, preparing for new testing methods.
These two improvements can be put on the agenda.

@luliqwerty luliqwerty deleted the branch sqlancer:main July 4, 2025 12:56
@luliqwerty luliqwerty closed this Jul 4, 2025
@luliqwerty luliqwerty deleted the main branch July 4, 2025 12:56
@luliqwerty luliqwerty restored the main branch July 4, 2025 12:57
@luliqwerty luliqwerty reopened this Jul 4, 2025
boolean generateLimit = false;
int limit = 0;
if (operateOnSingleTable) {
if (Randomly.getBooleanWithSmallProbability()) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the expression generator already has methods for generating ORDER BYs, so we should try to re-use those. Could we perhaps also re-use or factor out components from the random query generator?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have add some tests for your mentioned method generateOrderBys() but I got some unexpected errors my repo commit. Sometimes test can't pass with many errors, sometimes this method generated statement like EXISTS (SELECT 1 WHERE FALSE) DESC, which never return rows in SQL statement. I don't know if the methed have some logic errors, or maybe my test have some wrongs, can you give me some suggestions? Thank you very much.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe only generating columns in order by expressions is enough for applying DQE, which can identify the correctness of complex expressions through WHERE clauses.

String selectRowId = String.format("SELECT %s FROM %s", rowId, tableName);
SQLancerResultSet resultSet = new SQLQueryAdapter(selectRowId).executeAndGet(state);
HashSet<String> rows = new HashSet<>();
if (resultSet != null) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we have some auxiliary methods for fetching rows (used by other test oracles). Can we perhaps re-use any of these here?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@luliqwerty Try to find if we can re-use some auxiliary methods for fetching rows in other test oracles.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hello @mrigger @JensonSung
I can't find a suitable method to get rows you mentioned. The confusing method is CERTOracle.getRow(), which is used to get the count of rows. Can you give me some advice or maybe we can just use the way I used to do.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Current implementation can reuse the method getResultSetFirstColumnAsString.

@mrigger
Copy link
Contributor

mrigger commented Jul 6, 2025

Given these differences, merging them into a single class could lead to confusion, increased coupling, and reduced clarity in terms of responsibility separation. I believe keeping them separate helps maintain a clean architecture and supports better extensibility in the future.

Okay, then let's leave them separate for now. I guess there is a conceptual commonality (both are about errors returned by the database system), but if that does not (yet) make sense at an implementation level, we could try to address this later.

Copy link
Contributor

@mrigger mrigger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks close to being merge ready. Could you have another look at my comments? @JensonSung, perhaps you could also do another review then?

String selectRowId = String.format("SELECT %s FROM %s", rowId, tableName);
SQLancerResultSet resultSet = new SQLQueryAdapter(selectRowId).executeAndGet(state);
HashSet<String> rows = new HashSet<>();
if (resultSet != null) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

String rowId = tableName + "." + COLUMN_ROWID;
String selectRowId = String.format("SELECT %s FROM %s", rowId, tableName);
SQLancerResultSet resultSet = new SQLQueryAdapter(selectRowId).executeAndGet(state, false);
HashSet<String> rows = new HashSet<>();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some of the code seems identical to the one above for the update case. Can we factor those out?

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, some methods could be extracted for fetching these helper columns, like fetching rowID. @luliqwerty

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mrigger you are right, thanks a lot, I've reused the existing method.

@JensonSung
Copy link
Contributor

Sorry for possible confusing. @sayJason is my another account. I try to use the same one. Any problem, @JensonSung

Copy link
Contributor

@mrigger mrigger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just did another quick and noticed some final small requests. Could you please have a last look at them before we merge the PR? Other than that, @JensonSung, do the changes look good to you?

Excited to finally have DQE supported in SQLancer! Thanks a lot, @luliqwerty!

- DQEBase: remove unnecessary method
- MySQLDQEOracle: simplify adding expectedErrors
- MySQLErrors: add a expectedErrors
@JensonSung
Copy link
Contributor

Good jobs @luliqwerty . LGTM. Happy to merge @mrigger .

@mrigger
Copy link
Contributor

mrigger commented Aug 11, 2025

Agreed, lgtm, thanks a lot!

@mrigger mrigger merged commit 061a337 into sqlancer:main Aug 11, 2025
13 of 26 checks passed
@mrigger
Copy link
Contributor

mrigger commented Aug 11, 2025

I just noticed that when executing the tests, there seems to be an issue:

[INFO] -------------------------------------------------------
[INFO]  T E S T S
[INFO] -------------------------------------------------------
[INFO] Running sqlancer.dbms.TestMySQLDQE
java.sql.SQLNonTransientConnectionException: No operations allowed after connection closed.
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:110)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:73)

Could you check whether you can reproduce this locally? This subsequently also results in a crash.

@luliqwerty
Copy link
Contributor Author

I just noticed that when executing the tests, there seems to be an issue:

[INFO] -------------------------------------------------------
[INFO]  T E S T S
[INFO] -------------------------------------------------------
[INFO] Running sqlancer.dbms.TestMySQLDQE
java.sql.SQLNonTransientConnectionException: No operations allowed after connection closed.
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:110)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:73)

Could you check whether you can reproduce this locally? This subsequently also results in a crash.

This is an unexpected error, as the CI for the merge commit passed. The error indicates that the code is attempting to execute an SQL operation on a database connection that has already been closed. However, it seems that DQE did not perform this action. @mrigger @JensonSung

@JensonSung
Copy link
Contributor

I just noticed that when executing the tests, there seems to be an issue:

[INFO] -------------------------------------------------------
[INFO]  T E S T S
[INFO] -------------------------------------------------------
[INFO] Running sqlancer.dbms.TestMySQLDQE
java.sql.SQLNonTransientConnectionException: No operations allowed after connection closed.
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:110)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
	at com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:73)

Could you check whether you can reproduce this locally? This subsequently also results in a crash.

This is an unexpected error, as the CI for the merge commit passed. The error indicates that the code is attempting to execute an SQL operation on a database connection that has already been closed. However, it seems that DQE did not perform this action. @mrigger @JensonSung

It seems to be a crash bug. I run DQE locally in SQLancer, which executes without such errors.

@mrigger
Copy link
Contributor

mrigger commented Aug 14, 2025

Could it be that the local version of MySQL and the one in the CI are different? If they are the same, perhaps it could be helpful to run for DQE a bit longer. It could be that it triggered a crash bug, as pointed out by @JensonSung.

@mrigger
Copy link
Contributor

mrigger commented Aug 14, 2025

Btw, it would also be great to add DQE as part of the documentation of SQLancer in the table of the main README.md.

@JensonSung
Copy link
Contributor

Could it be that the local version of MySQL and the one in the CI are different? If they are the same, perhaps it could be helpful to run for DQE a bit longer. It could be that it triggered a crash bug, as pointed out by @JensonSung.

Sorry for the late message. The local MySQL has a different version. I try a bit longer time to run DQE, which runs well and does not report errors.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants