I’ve just received my Spark. I’m trying to use it as an appliance, and used xfreerdp3 to connect to it.
I’ve been playing with Ollama, using the Docker setup from the playbook, with the web UI.
I’ve noticed that when I pull a big model like gpt-oss:120b, all other connections seem to fail somehow (my ssh becomes unresponsive, and my xfreerdp session disconnects…) until the download has finished.
Once it has finished, everything returns to normal…
Am I the only one experiencing this (meaning it’s my setup, somehow), or is it something else?
(The Spark is connected to my wired network through an RJ45 port at 1 Gb/s. My other Linux boxes on the same switch don’t suffer from this… I’ve disabled IPv6 on the Spark in case it was interfering with something, since the firewall blocks IPv6…)
Something is wrong with your setup. No networking problems here, either on DGX OS or on Fedora 43: everything works as expected, with no slowdowns even when the network link is saturated.
@dumesnil I get exactly the same symptom when I run gpt-oss:120b. I can see that almost the entire 120GB of memory is being used up. And I get 5 Gb/s on the Ethernet (NOT WiFi) link. So I just gave up running that model. And I don’t even use Ollama; it’s too slow. I use llama.cpp directly. Same issue!
@eugr, you don’t get the same symptom? Did you do any optimization, like turning swap off, or persistence mode off… anything?
@abull I thought NVIDIA Sync is just a fancy UI using SSH in the backend. If SSH gets disconnected, how is NVIDIA Sync going to work?
@eugr hmmm, ok, I can accept that something is messed up on my side… like… both the cable and the WiFi are connected (WiFi is still on from first bootup), so maybe the routes get mixed up somehow even though they have different priorities… Also my setup doesn’t allow IPv6 on the local network (firewalls and such beyond my control)… I’ll investigate a bit, and try to sniff whatever is happening if nothing else works out…
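If it helps anyone checking the same dual-link theory, here’s a minimal diagnostic sketch. It assumes DGX OS uses NetworkManager (so `nmcli` is available), and the IP address is a placeholder for the Spark’s actual address:

```shell
# Show the default routes; the entry with the lowest metric wins,
# so the wired link should normally beat Wi-Fi when both are up
ip route show default

# Temporarily turn Wi-Fi off entirely to rule out route flapping
nmcli radio wifi off

# While the model pull is running, ping the Spark from another box;
# if latency explodes, the host itself is struggling, not just the link
ping -c 50 -i 0.2 192.168.1.42   # placeholder: use the Spark's address
```

If ssh stays responsive with Wi-Fi off, the two simultaneous links were the culprit; if it still stalls, the problem is on the host side.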
@Neurfer this is interesting. I’ve noticed that too, even though the model is only around 60GB… I’ve also noticed (correct me if I’m wrong) that no swap is set up, so everything has to be held in memory. (EDIT: there is no swap partition, but there is a 16GB swapfile… I’ve extended it to 80GB just in case; it’s slower than memory, but better than nothing.)
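In case anyone wants to do the same, growing the swapfile is the usual dance (the `/swapfile` path and the 80GB figure are from this thread, not an official recommendation; all of this needs root):

```shell
# Inspect the current swap setup (on DGX OS this showed a 16GB /swapfile)
swapon --show
free -h

# Take the swapfile offline, grow it to 80GB, and re-enable it
sudo swapoff /swapfile
sudo fallocate -l 80G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile
```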
For the moment, I’m trying to use as many official playbooks as possible before tweaking the OS too much.
[btw, THIS is the reason I’m using xfreerdp in the first place: the whole NVIDIA AI stack only agrees to run on Windows, macOS, and Ubuntu (I’m on Debian, and the installer won’t let me run anything because it doesn’t recognize my Debian as a valid platform, so I turned to the Spark to handle the Ubuntu side of things, install the stack, etc… that’s how I noticed the sluggish behaviour).
I was planning on using the Spark as some sort of portable appliance that would fit in my backpack, but this idea seems to be just a dream… I was wondering if there was any way to just connect the USB-C of the Spark to the USB-C of my Linux laptop and turn the Spark into a local workhorse, but from what I read elsewhere, that seems to be a misconception on my part…
It’s a bit too soon for that, though… I need to test stuff beforehand…]
How do you run gpt-oss? Which version? What llama-server command? It should only take about 65GB, even with full context; if it takes 120GB on your system, something is not right.
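For comparison, a typical invocation looks something like this. The model path is a placeholder, and the flags are standard llama.cpp server options, not a Spark-specific recipe:

```shell
# gpt-oss-120b ships in MXFP4, so the GGUF weights are roughly 60-65 GB;
# resident memory should land in that ballpark, not 120GB.
# The model path below is a placeholder for wherever you downloaded the GGUF.
# -ngl 99 offloads all layers to the GPU; -c sets the context length
# (a bigger context means more KV-cache memory on top of the weights).
llama-server -m ~/models/gpt-oss-120b-mxfp4.gguf -ngl 99 -c 32768 \
  --host 127.0.0.1 --port 8080
```

If memory use is still near 120GB with a command like this, it would point at a quantization or configuration problem rather than the model itself.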
NVIDIA Sync does install on Debian; see the instructions in the playbook:
Step 1: Set Up Local Network Access | DGX Spark
Step 2, for the other things you can do with NVIDIA Sync.
BTW: there is no firewall configured on the DGX Spark by default.