Introduction
The New York Times publishes “mini-crosswords,” which are crossword puzzles on a relatively small grid (usually 5×5), without any black squares, so that every row and column of the grid must spell a word. The figure below shows an example of a solution to such a puzzle.
How hard is it to create these puzzles? This post is motivated by a recent College Mathematics Journal article (see Reference (1) below) that considers this question, and describes an approach using the Metropolis-Hastings algorithm to randomly sample instances of puzzles.
But instead of just randomly sampling one puzzle at a time, can we actually enumerate all possible puzzles? In particular, my idea was to reduce the problem of finding a crossword puzzle solution to that of finding a (generalized) exact cover with appropriately crafted constraints. This would be handy, because we already have code for solving exact cover problems, using Knuth’s Dancing Links (DLX) algorithm (see here and here for similar past exercises).
Crossword as an exact cover
To state the problem more precisely: given a positive integer indicating the size of the grid (
in the above example), and a dictionary
of
words each of length
over alphabet
, we must construct an exact cover problem whose solution corresponds to a placement of letters in all
grid positions such that each row and column spells a word in
.
To do this, we construct a zero-one matrix with rows, each corresponding to placing one of the
dictionary words either “across” (in one of the
rows of the puzzle) or “down” (in one of the
columns). A solution will consist of a subset of
rows of the matrix:
“across” words, one in each row of the puzzle, and
“down” words, one in each column, each pair of which intersect in the appropriate common letter.
To represent the constraints, we initially need columns in our matrix (where
), each indexed by a tuple (puzzle row
, puzzle column
, alphabet letter
, across or down). For a given matrix row– representing placement of a word with letters
in a particular location and orientation in the puzzle grid– we set to one those
columns corresponding to the
different
grid locations where the word will be placed in the puzzle… where for each alphabet letter
, we set the “across” column to one if and only if either
- the word is “across” and
matches the letter of the word in this location (i.e.,
), or
- the word is “down” and
does not match the letter of the word in this location (i.e.,
).
If neither of these conditions is satisfied, we instead set the “down” column to one. The following Python code produces the number and list of (row, column) pairs of the resulting sparse matrix.
letters = 'abcdefghijklmnopqrstuvwxyz'
m = len(words)
n = len(words[0])
print(m * n * 2 * (n * 26), file=file)
b = [True, False]
for row, ((w, word), k, across) in enumerate(
product(enumerate(words), range(n), b)):
for col, (i, j, letter, horiz) in enumerate(
product(range(n), range(n), letters, b)):
if ((i if across else j) == k and
(word[j if across else i] == letter) == (across == horiz)):
print(row, col, file=file)
However, we’re not quite finished. Although the desired end result is a crossword with distinct words in each row and column, such as the 6×6 solution shown in (a) below, as Howard observes in the CMJ article, there are many valid solutions that use the same word more than once, including the extreme cases of symmetric “word squares” such as the one shown in (b) below.

(a) 6×6 mini-crossword, where the word in each row and column is unique. (b) 6×6 “word square,” a symmetric grid with each row and corresonding column containing the same word.
We can eliminate this duplication by adding “optional” columns to the zero-one matrix, one for each word in the dictionary, and solve the resulting generalized exact cover problem, so that each word may be used at most once in a solution.
Results
All of the source code is available here, as well as on GitHub. My initial test used the 4×4 case discussed in Howard’s paper, with his dictionary of 1826 words. He describes a process for estimating the total number of possible puzzles by repeated sampling using the Markov chain Monte Carlo approach: “We estimate that there are approximately
73,000–74,000 distinct puzzles each with no repeated words.” This is pretty accurate; it turns out that there are exactly 74,339 (each contributing two symmetric pairs of solutions to the generalized exact cover problem, for a total of 148,678 solutions).
References:
- Howard, C. Douglas, It’s Puzzling, College Mathematics Journal, 49(4) September 2018, p. 242-249 [DOI]
- Knuth, D., Dancing Links, Millenial Perspectives in Computer Science, 2000, p. 187-214 (arXiv)
