test/cqlpy: add test for long table names by nyh · Pull Request #23229 · scylladb/scylladb

nyh · 2025-03-10T17:00:30Z

Scylla inherited a 48-character limit on the length of table (and keyspace) names from Cassandra 3. It turns out that Cassandra 4 and 5 unintentionally dropped this limit (see history lesson in CASSANDRA-20425), and now Cassandra accepts longer table names. Some Cassandra users are using such longer names and disappointed that Scylla doesn't allow them.

This patch includes tests for this feature. One test tries a 48-character table name - it passes on Scylla and all versions of Cassandra. A second test tries a 100-character table name - this one passes on Cassandra version 4 and above (but not on 3), and fails on Scylla so marked "xfail". A third test tries a 500-character table name. This one fails badly on Cassandra (see CASSANDRA-20389), but passes on Scylla today. This test is important because we need to be sure that it continues to pass on Scylla even after the Scylla is fixed to allow the 100-character test.

Refs #4480 - an issue we already have about supporting longer names

Note on the test implementation:
Ideally, the test for a particular table-name length shouldn't just
create the table - it should also make sure we can write table to it
and flush it, i.e., that sstables can get written correctly. But in
practice, these complications are not needed, because in modern Scylla
it is the directory name which contains the table's name, and the
individual sstable files do not contain the table's name. Just creating
the table already creates the long directory name, so that is the part
that needs to be tested. If we created this directory successfully,
later creating the short-named sstables inside it can't fail.

swasik · 2025-03-10T17:07:24Z

+        yield
+        # The user's "with" code is running during the yield. If it didn't
+        # throw we return from the function - the raises_or_not() passed as
+        # the "or not" case.


What is rises_or_not?

Oops, wrong reference in the comment... I saw it and thought I fixed it... I'll fix it.

nyh · 2025-03-10T17:11:11Z

A note to reviewers (which I should probably have mentioned in the commit message):

Ideally, the test for a particular table-name length shouldn't just create the table - it should also make sure we can write table to it and flush it, so sstables get written correctly. But in practice, these complications are not needed, because in modern Scylla it is the directory name which contains the table's name, and the individual sstable files do not contain the table's name. Just creating the table already creates the long directory name, so that is the part that needs to be tested. If we created this directory successfully, later creating the short-named sstables inside it can't fail.

Scylla inherited a 48-character limit on the length of table (and keyspace) names from Cassandra 3. It turns out that Cassandra 4 and 5 unintentionally dropped this limit (see history lesson in CASSANDRA-20425), and now Cassandra accepts longer table names. Some Cassandra users are using such longer names and disappointed that Scylla doesn't allow them. This patch includes tests for this feature. One test tries a 48-character table name - it passes on Scylla and all versions of Cassandra. A second test tries a 100-character table name - this one passes on Cassandra version 4 and above (but not on 3), and fails on Scylla so marked "xfail". A third test tries a 500-character table name. This one fails badly on Cassandra (see CASSANDRA-20389), but passes on Scylla today. This test is important because we need to be sure that it continues to pass on Scylla even after the Scylla is fixed to allow the 100-character test. Refs scylladb#4480 - an issue we already have about supporting longer names Note on the test implementation: Ideally, the test for a particular table-name length shouldn't just create the table - it should also make sure we can write table to it and flush it, i.e., that sstables can get written correctly. But in practice, these complications are not needed, because in modern Scylla it is the directory name which contains the table's name, and the individual sstable files do not contain the table's name. Just creating the table already creates the long directory name, so that is the part that needs to be tested. If we created this directory successfully, later creating the short-named sstables inside it can't fail. Signed-off-by: Nadav Har'El <[email protected]>

nyh · 2025-03-10T17:16:57Z

Pushed a new version with fixed comment and an implementation note copied from the "note to reviewers" above.

scylladb-promoter · 2025-03-11T01:20:05Z

🟢 CI State: SUCCESS

✅ - Build
✅ - Unit Tests

Build Details:

Duration: 8 hr 3 min
Builder: spider4.cloudius-systems.com

swasik · 2025-03-10T17:08:45Z

+from cassandra.protocol import InvalidRequest
+from .util import unique_name
+
+# passes_or_raises() is similar to pytest.raises(), except that while raises()


Don't we have some separate file to store our own extensions to pytest? So that we have all of them in a single place.

I have test/cqlpy/util.py.
My philosophy is that we should only move things there if they have multiple users. We should not pretend that every "nice" used-once function should be a library function. If we do, the result will be a library, not a test suite. And it will be a bad library. What often happens (see dtest as an example) is that:

One person creates a "convenience function" everyone will want and puts it in a central file.

The second person can't use this function because it doesn't do exactly what he needs, so creates another function and puts it in a central file.

A third person wants to use these functions, but neither does exactly what he needs, so he adds an options to the first function.

...

A few years later, you have 3 different functions doing similar functions, each of them have 7 different options to configure them in just the right way.

When someone wants to read tests, it's impossible to understand what you're seeing. Instead of seeing 5 lines of understandable CQL or DynamoDB calls, you see a single function call with 7 different parameters and have no idea what is actually happening.

So I want to move this into util.py is we see we need exactly the same code in multiple places, and only then.

By the way, passes_or_raises() is definitely a good candidate to being promoted to a library. I also used the same code in test/alternator, so it's already two different users (although in two separate test suites), and I tried to write it in a pretty-general way (I think) that isn't very specific to one particular use case.

But it's a slippery slope - should new_named_table() also be moved into a library? What about padded_name()? I vote no, until additional tests would like to use such a utility. At this point we have 1700 test functions in cqlpy, and this was the first time I needed new_named_table() :-) Actually, we already had new_named_table() in test/alternator, but the implementation is completely different (it creates an Alternator table, vs. a CQL table) so it can't be shared anyway.

Thinking about this some more, an interesting feature of passes_or_raises(), which is different from the other functions I mentioned in my above rant, is that it is a "pure" pytest function - it does not use CQL, Alternator API, async io, or any of the specifics of any of our test suites. So we could have something like pylib/pytest.py which will contain pytest-only code that both test/cqlpy and test/alternator (and all other test suites) can share.

While I can do that, I'm worried this pylib/pytest.py will become a kitchen sink (in the good case) or garbage can (in the worst case) of dozens of random utility functions, and am really hesitant about getting the ball rolling in that direction.

Ok, if it does not have maintainer who makes sure that it does not start to be messy then probably it is fine to keep this function local.

nyh assigned swasik and guy9 Mar 10, 2025

swasik reviewed Mar 10, 2025

View reviewed changes

nyh mentioned this pull request Mar 10, 2025

Allow longer table name length #4480

Closed

scylladbbot added the status/ci in progress label Mar 10, 2025

nyh added the backport/none Backport is not required label Mar 10, 2025

nyh force-pushed the test-4480 branch from a593be6 to cab5d5c Compare March 10, 2025 17:16

scylladbbot added status/ci in progress and removed status/ci in progress labels Mar 10, 2025

swasik approved these changes Mar 11, 2025

View reviewed changes

scylladb-promoter closed this in a72dde2 Mar 14, 2025

scylladbbot added the promoted-to-master label Mar 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test/cqlpy: add test for long table names#23229

test/cqlpy: add test for long table names#23229
nyh wants to merge 1 commit intoscylladb:masterfrom
nyh:test-4480

nyh commented Mar 10, 2025 •

edited

Loading

Uh oh!

swasik Mar 10, 2025

Uh oh!

nyh Mar 10, 2025

Uh oh!

nyh commented Mar 10, 2025

Uh oh!

nyh commented Mar 10, 2025

Uh oh!

scylladb-promoter commented Mar 11, 2025

Uh oh!

swasik Mar 10, 2025

Uh oh!

nyh Mar 11, 2025

Uh oh!

nyh Mar 11, 2025

Uh oh!

nyh Mar 11, 2025

Uh oh!

swasik Mar 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

nyh commented Mar 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

swasik Mar 10, 2025

Choose a reason for hiding this comment

Uh oh!

nyh Mar 10, 2025

Choose a reason for hiding this comment

Uh oh!

nyh commented Mar 10, 2025

Uh oh!

nyh commented Mar 10, 2025

Uh oh!

scylladb-promoter commented Mar 11, 2025

🟢 CI State: SUCCESS

Build Details:

Uh oh!

swasik Mar 10, 2025

Choose a reason for hiding this comment

Uh oh!

nyh Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

nyh Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

nyh Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

swasik Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

nyh commented Mar 10, 2025 •

edited

Loading