Bump Windows max open files from 512 to 2048 #620

Thireus · 2025-07-16T23:46:05Z

Allows up to 2048 shards to be loaded on Windows builds, up from the current default of 512. This change is specific to Windows: it calls _setmaxstdio to tell the C runtime that the binary may keep up to 2048 files open at once. This is roughly the equivalent of raising ulimit -n on Linux.

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/setmaxstdio?view=msvc-160
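
Editorial sketch (not part of the PR): to make the ulimit -n comparison concrete, the snippet below queries the current limit on each platform. The helper name current_open_file_limit is hypothetical; _getmaxstdio and getrlimit are the documented CRT and POSIX calls. Note that the two limits are only roughly comparable, since the Windows value applies to C runtime streams.

```cpp
#ifdef _WIN32
#include <stdio.h>          // _getmaxstdio (MSVC C runtime)
#else
#include <sys/resource.h>   // getrlimit, RLIMIT_NOFILE (POSIX)
#endif

// Hypothetical helper: returns the current per-process limit on open files,
// or -1 if it cannot be determined or is reported as unlimited.
long long current_open_file_limit() {
#ifdef _WIN32
    // Limit on simultaneously open streams at the stdio level.
    return _getmaxstdio();
#else
    struct rlimit lim;
    if (getrlimit(RLIMIT_NOFILE, &lim) != 0) return -1;
    if (lim.rlim_cur == RLIM_INFINITY) return -1;
    // Soft limit, i.e. the value that `ulimit -n` prints.
    return static_cast<long long>(lim.rlim_cur);
#endif
}
```

On an unmodified Windows process this would be expected to report the 512 default discussed in this PR.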

ikawrakow · 2025-07-17T05:39:22Z

src/llama.cpp

        }

+        #ifdef _WIN32
+        int _setmaxstdio_ret = _setmaxstdio(2048); // 8,192 may be supported - https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/setmaxstdio?view=msvc-160


Don't you want to make this dependent on the value of GGML_MAX_CONTEXTS instead of it being simply set to 2048?

I don't know much about Windows, but if I understand correctly the description of the _setmaxstdio function, it changes the max. number of files that can be open at the same time at the stream I/O level. The default for this is 512. The Microsoft engineers must have had a reason to keep it at 512 instead of just setting it to the 8192 limit of the low I/O level. If they did have a reason, then my thinking is that it would be wise to not increase the stream I/O limit unless necessary. It only becomes necessary if we want to use more than 512 shards, which is only possible if we have changed the value of GGML_MAX_CONTEXTS.
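
Editorial sketch (not the merged code): one way to express the suggestion above is to raise the stream limit only when the shard count can exceed the default. The helper names and constants are hypothetical; 512 and 8192 are the figures quoted in this thread, and the caller is assumed to pass GGML_MAX_CONTEXTS.

```cpp
#include <cassert>
#ifdef _WIN32
#include <stdio.h>   // _getmaxstdio, _setmaxstdio (MSVC C runtime)
#endif

// Hypothetical constants, taken from the figures quoted in this thread.
const int kDefaultStdioLimit = 512;   // default stream I/O limit
const int kMaxStdioLimit     = 8192;  // upper bound mentioned in the MS docs

// Returns the stream limit to request, or 0 when the default already suffices.
inline int stdio_target(int max_contexts) {
    assert(max_contexts >= 0);
    if (max_contexts <= kDefaultStdioLimit) return 0;
    return max_contexts < kMaxStdioLimit ? max_contexts : kMaxStdioLimit;
}

// Raises the limit only when needed. Returns false only if the request failed.
inline bool raise_stdio_limit_if_needed(int max_contexts) {
    const int target = stdio_target(max_contexts);
    if (target == 0) return true;                  // leave the default untouched
#ifdef _WIN32
    if (_getmaxstdio() >= target) return true;     // already high enough
    return _setmaxstdio(target) == target;         // returns -1 on failure
#else
    return true;  // not applicable: the limit is governed by ulimit -n
#endif
}
```

A caller would invoke raise_stdio_limit_if_needed(GGML_MAX_CONTEXTS) once before loading shards, so builds that keep the default context count never touch the CRT limit.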

saood06 · 2025-07-17

I agree, and this is what I was saying here: #611 (comment)

> The default for this is 512. The Microsoft engineers must have had a reason to keep it at 512 instead of just setting it to the 8192 limit of the low I/O level.

Since this came up, I've looked into it. The best reason I found was this (from a time when the true maximum was 2048):

> I believe the limit has to do with the ability to inherit the open files from a CreateProcess call. The CreateProcess has only 2048 slots for passing handles (both on 32-bit and 64-bit). You can debug a program and step into the system, exec, or spawn CRT functions to see the limit of the 2048 slots.
>
> If you use the Win32 file API (CreateFile, WriteFile, ReadFile, CloseHandle, etc.), then you don't have a limit on open files (well, you do but I believe it is based on your resources like memory).

Source: https://stackoverflow.com/questions/1803552/setmaxstdio-max-open-files-is-2048-only

This is corroborated by a comment at https://bugs.mysql.com/bug.php?id=24509 (that page also mentions Win32):

> It's a hard windows limit due to the fact of using posix-like functions in some places. I will open 2nd bug report about a handle leak when that 2048 limit is hit.

If more than 2048 (or 8192) open files are wanted, the Win32 API might be needed (not sure how big a change that would be).
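
Editorial sketch (not part of the PR): for a sense of scale, this is roughly what reading a file through the Win32 file API looks like compared with stdio. The helper name read_prefix is hypothetical; CreateFileA, ReadFile and CloseHandle are the documented Win32 calls named in the quote above. The non-Windows branch falls back to stdio only so the sketch compiles everywhere.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#ifdef _WIN32
#define WIN32_LEAN_AND_MEAN
#include <windows.h>   // CreateFileA, ReadFile, CloseHandle
#endif

// Hypothetical helper: reads up to n bytes from the start of `path` into `buf`.
// Returns the number of bytes read, or -1 on failure.
long read_prefix(const char * path, char * buf, unsigned long n) {
    assert(path != nullptr && buf != nullptr);
#ifdef _WIN32
    // A Win32 handle does not count against the CRT stream limit.
    HANDLE h = CreateFileA(path, GENERIC_READ, FILE_SHARE_READ, nullptr,
                           OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL, nullptr);
    if (h == INVALID_HANDLE_VALUE) return -1;
    DWORD got = 0;
    const BOOL ok = ReadFile(h, buf, static_cast<DWORD>(n), &got, nullptr);
    CloseHandle(h);
    return ok ? static_cast<long>(got) : -1;
#else
    // Fallback so the sketch builds on other platforms.
    std::FILE * f = std::fopen(path, "rb");
    if (f == nullptr) return -1;
    const std::size_t got = std::fread(buf, 1, n, f);
    const bool failed = std::ferror(f) != 0;
    std::fclose(f);
    return failed ? -1 : static_cast<long>(got);
#endif
}
```

Switching a loader to this style would mean replacing every stdio call on the shard files, which is why it is a larger change than the single _setmaxstdio call in this PR.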

ikawrakow · 2025-07-17

If we are sure that a limitation in the CreateProcess implementation is the only reason, then it wouldn't be an issue, as llama.cpp does not actually spawn new processes. A file handle leak each time one starts a llama.cpp process is not too bad either: one simply needs to reboot their Windows box from time to time, just like in the old days. Just joking. If there is indeed a file handle leak, then it is even more important to make the increase conditional upon GGML_MAX_CONTEXTS > 512.

Thireus · 2025-07-17

Change made. Please let me know if this is now acceptable.

saood06 · 2025-07-17

> If there is indeed a file handle leak, then it is even more important to make the increase conditional upon GGML_MAX_CONTEXTS > 512.

I wouldn't take the "leak" part seriously, as it is from "10 Dec 2006"; I only included it because it mentioned the handles. The Win32 API should only be needed if models get large enough (much larger than DeepSeek) and people are limited to 2048 (instead of 8192).

Bump windows max open files from 512 to 2048

e7c048d

https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/setmaxstdio?view=msvc-160

Thireus changed the title ~~Bump windows max open files from 512 to 2048~~ Bump Windows max open files from 512 to 2048 Jul 16, 2025

Thireus mentioned this pull request Jul 16, 2025

Bump GGML_MAX_CONTEXTS to allow loading more shards #611

Merged

4 tasks

ikawrakow reviewed Jul 17, 2025

Make _GGML_STDIO_TARGET dependent of GGML_MAX_CONTEXTS for Windows

695dcc2

ikawrakow approved these changes Jul 17, 2025

ikawrakow merged commit 6950c82 into ikawrakow:main Jul 17, 2025

Thireus added a commit to Thireus/llama.cpp that referenced this pull request Aug 10, 2025

Bump Windows max open files from 512 to 2048

d7b5465

ikawrakow/ik_llama.cpp#620
