You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Use the `uv run <command>` to execute scripts within the managed environment. This helps maintain consistency across different shells and sessions.
136
136
- Ensure you have the necessary CUDA drivers and PyTorch installed compatible with your hardware.
137
137
- On the first install, `flash-attn` can take a while to install (~45min with 48 CPU hyperthreads). After it is built once, it is cached in your `uv`'s cache dir making subsequent installs much quicker.
138
+
- If you update your environment in `pyproject.toml`, it is necessary to force a rebuild of the virtual environments by setting `NRL_FORCE_REBUILD_VENVS=true` next time you launch a run.
138
139
-**Reminder**: Don't forget to set your `HF_HOME`, `WANDB_API_KEY`, and `HF_DATASETS_CACHE` (if needed). You'll need to do a `huggingface-cli login` as well for Llama models.
139
140
140
141
## Training Backends
@@ -413,7 +414,7 @@ For detailed instructions on how to set up and launch NeMo RL on Slurm or Kubern
413
414
git submodule update --init --recursive
414
415
```
415
416
416
-
and then force a rebuild of the virutal environments by setting `NRL_FORCE_REBUILD_VENVS=true` next time you launch a run:
417
+
and then force a rebuild of the virtual environments by setting `NRL_FORCE_REBUILD_VENVS=true` next time you launch a run:
417
418
418
419
```sh
419
420
NRL_FORCE_REBUILD_VENVS=true uv run examples/run_grpo.py ...
0 commit comments