Commit 000c13f

Merge branch 'main' into fix-3713
2 parents 88b9399 + 06ba7e6 commit 000c13f

55 files changed: 78604 additions & 329 deletions

CHANGELOG.md (7 additions, 0 deletions)

@@ -1,6 +1,13 @@
 # Changelog
 
+## [9.10.1](https://github.com/snakemake/snakemake/compare/v9.10.0...v9.10.1) (2025-09-01)
+
+### Performance Improvements
+
+* optimize persistence implementation (only write metadata once, reduce file operations for improving glusterfs performance) ([#3679](https://github.com/snakemake/snakemake/issues/3679)) ([122c713](https://github.com/snakemake/snakemake/commit/122c71379eeef6799a4448428594ec4f9b5b43ec))
+
 ## [9.10.0](https://github.com/snakemake/snakemake/compare/v9.9.0...v9.10.0) (2025-08-19)
 
docs/snakefiles/reporting.rst (23 additions, 0 deletions)

@@ -329,3 +329,26 @@ For example, this allows you to set a logo at the top (by using CSS to inject a
 For an example with a custom stylesheet defining a logo, see :download:`the report here <../../tests/test_report/expected-results/report.html>` (with a custom branding for the University of Duisburg-Essen).
 For the complete mechanics, you can also have a look at the `full example source code <https://github.com/snakemake/snakemake/tree/main/tests/test_report/>`__ and :download:`the custom stylesheet with the logo definition <../../tests/test_report/custom-stylesheet.css>`.
+
+Custom report metadata
+^^^^^^^^^^^^^^^^^^^^^^
+
+You can define custom metadata that is displayed on the landing page of the report.
+The metadata is provided as a `YTE <https://yte-template-engine.github.io>`_ YAML template.
+
+.. code-block:: bash
+
+    snakemake --report report.html --report-metadata yte_template.yaml
+
+An example metadata YAML template that records the working directory in which the workflow was run looks as follows.
+
+.. code-block:: yaml
+
+    __definitions__:
+      - import os
+
+    Workflow name: Test Workflow
+    Workdir: ?os.getcwd()
+    Contributors:
+      - Test Contributor
+      - Another Contributor
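The YTE semantics used in the template above (a value prefixed with `?` is evaluated as a Python expression against the names imported in `__definitions__`) can be illustrated with a minimal, self-contained sketch. `render_metadata` is a hypothetical helper for illustration only, not part of YTE or Snakemake:

```python
import os

def render_metadata(template, env):
    """Hypothetical helper: recursively evaluate values in a parsed YAML
    mapping. Strings starting with '?' are evaluated as Python expressions
    against `env`, mimicking YTE's value-expression syntax."""
    if isinstance(template, str) and template.startswith("?"):
        return eval(template[1:], env)
    if isinstance(template, dict):
        return {k: render_metadata(v, env) for k, v in template.items()}
    if isinstance(template, list):
        return [render_metadata(v, env) for v in template]
    return template

# Parsed form of the YAML template from the docs above; __definitions__ is
# modeled here by passing `os` in the evaluation environment.
template = {
    "Workflow name": "Test Workflow",
    "Workdir": "?os.getcwd()",
    "Contributors": ["Test Contributor", "Another Contributor"],
}
rendered = render_metadata(template, {"os": os})
print(rendered["Workdir"])  # prints the current working directory
```

Plain values pass through unchanged, while `?os.getcwd()` is resolved at render time, which is what makes the metadata dynamic per workflow run.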

docs/snakefiles/testing.rst (18 additions, 6 deletions)

@@ -13,14 +13,26 @@ By running
 
 Snakemake is instructed to take one representative job for each rule and copy its input files to a hidden folder ``.tests/unit``,
 along with generating test cases for Pytest_.
+Pytest_ tests can be run as:
 
-Importantly, note that such unit tests shall not be generated from big data, as they should usually be finished in a few seconds.
-Further, it makes sense to store the generated unit tests in version control (e.g. git), such that huge files are not recommended.
-Instead, we suggest to first execute the workflow that shall be tested with some kind of small dummy datasets, and then use the results thereof to generate the unit tests.
-The small dummy datasets can in addition be used to generate an integration test, that could e.g. be stored under ``.tests/integration``, next to the unit tests.
+.. code-block:: bash
+
+    pytest .tests/unit/
+
+or, optionally, if you want to use a local conda cache and disable pytest caching:
+
+.. code-block:: bash
+
+    pytest -p no:cacheprovider .tests/unit/ --conda-prefix /path/to/cache/conda/
 
 Each auto-generated unit test is stored in a file ``.tests/unit/test_<rulename>.py``, and executes just the one representative job of the respective rule.
 After successful execution of the job, it will compare the obtained results with those that have been present when running ``snakemake --generate-unit-tests``.
-By default, the comparison happens byte by byte (using ``cmp``). This behavior can be overwritten by modifying the test file.
+By default, the comparison happens byte by byte (using ``cmp/zcmp/bzcmp/xzcmp``). This behavior can be overridden by modifying the test file.
+
+NOTE: Importantly, such unit tests shall not be generated from big data, as they should usually finish within a few seconds.
+Furthermore, it makes sense to store the generated unit tests in version control (e.g. git), so huge files are not recommended.
+Instead, we suggest first executing the workflow to be tested with small dummy datasets while keeping all temp files (``--notemp``),
+and then using the results thereof to generate the unit tests.
+The small dummy datasets can in addition be used to build an integration test, stored e.g. under ``.tests/integration``, next to the unit tests.
 
-.. _Pytest: https://pytest.org
+.. _Pytest: https://pytest.org
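The byte-by-byte comparison described above can be approximated in plain Python with the standard library's `filecmp` module. This is a sketch of the idea, not the helper code Snakemake actually generates:

```python
import filecmp
import pathlib
import tempfile

def files_identical(a, b):
    # shallow=False forces a byte-by-byte content comparison,
    # analogous to running `cmp` on the two files.
    return filecmp.cmp(a, b, shallow=False)

tmp = pathlib.Path(tempfile.mkdtemp())
(tmp / "expected.txt").write_bytes(b"result\n")
(tmp / "obtained.txt").write_bytes(b"result\n")
(tmp / "different.txt").write_bytes(b"other\n")

print(files_identical(tmp / "expected.txt", tmp / "obtained.txt"))   # True
print(files_identical(tmp / "expected.txt", tmp / "different.txt"))  # False
```

Without `shallow=False`, `filecmp.cmp` would compare only `os.stat` signatures, which is why the explicit flag matters for a true content check.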

pyproject.toml (1 addition, 1 deletion)

@@ -51,7 +51,7 @@ dependencies = [
     "snakemake-interface-executor-plugins>=9.3.2,<10.0",
     "snakemake-interface-common>=1.20.1,<2.0",
     "snakemake-interface-storage-plugins>=4.1.0,<5.0",
-    "snakemake-interface-report-plugins>=1.1.0,<2.0.0",
+    "snakemake-interface-report-plugins>=1.2.0,<2.0.0",
     "snakemake-interface-logger-plugins>=1.1.0,<2.0.0",
     "snakemake-interface-scheduler-plugins>=2.0.0,<3.0.0",
     "tabulate",
src/snakemake/api.py (8 additions, 1 deletion)

@@ -19,6 +19,7 @@
     GroupSettings,
     SchedulingSettings,
     WorkflowSettings,
+    GlobalReportSettings,
 )
 
 if sys.version_info < MIN_PY_VERSION:

@@ -677,13 +678,15 @@ def create_report(
         self,
         reporter: str = "html",
         report_settings: Optional[ReportSettingsBase] = None,
+        global_report_settings: Optional[GlobalReportSettings] = None,
     ):
         """Create a report for the workflow.
 
         Arguments
         ---------
         report: Path -- The path to the report.
-        report_stylesheet: Optional[Path] -- The path to the report stylesheet.
+        report_settings: Optional[ReportSettingsBase] -- Report settings for the html report.
+        global_report_settings: Optional[GlobalReportSettings] -- Report settings that apply to all report plugins.
         reporter: str -- report plugin to use (default: html)
         """

@@ -693,9 +696,13 @@ def create_report(
         if report_settings is not None:
             report_plugin.validate_settings(report_settings)
 
+        if global_report_settings is None:
+            global_report_settings = GlobalReportSettings()
+
         self.workflow_api._workflow.create_report(
             report_plugin=report_plugin,
             report_settings=report_settings,
+            global_report_settings=global_report_settings,
         )
 
     @_no_exec
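The default handling of the new optional parameter can be sketched in isolation. The `GlobalReportSettings` dataclass below is a stand-in with the field name taken from the CLI change in this commit, not the real class:

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional

@dataclass
class GlobalReportSettings:
    # Stand-in for the real class; field name taken from the CLI diff.
    metadata_template: Optional[Path] = None

def create_report(
    global_report_settings: Optional[GlobalReportSettings] = None,
) -> GlobalReportSettings:
    # Mirrors the new default handling: callers may omit the settings
    # object and downstream code still receives a populated instance.
    if global_report_settings is None:
        global_report_settings = GlobalReportSettings()
    return global_report_settings

print(create_report().metadata_template)  # None
```

Normalizing `None` to a default instance at the API boundary keeps the workflow-internal `create_report` free of `None` checks.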

src/snakemake/cli.py (14 additions, 1 deletion)

@@ -67,6 +67,7 @@
     WorkflowSettings,
     StrictDagEvaluation,
     PrintDag,
+    GlobalReportSettings,
 )
 from snakemake.target_jobs import parse_target_jobs_cli_args
 from snakemake.utils import available_cpu_count, update_config

@@ -719,7 +720,9 @@ def get_argument_parser(profiles=None):
         "--keep-going",
         "-k",
         action="store_true",
-        help="Go on with independent jobs if a job fails.",
+        help="Go on with independent jobs if a job fails during execution. "
+        "This only applies to runtime failures in job execution, "
+        "not to errors during workflow parsing or DAG construction.",
     )
     group_exec.add_argument(
         "--rerun-triggers",

@@ -945,6 +948,13 @@ def get_argument_parser(profiles=None):
         help="Custom stylesheet to use for report. In particular, this can be used for "
         "branding the report with e.g. a custom logo, see docs.",
     )
+    group_report.add_argument(
+        "--report-metadata",
+        metavar="FILE",
+        type=Path,
+        help="Custom metadata to use for the landing page of the report. In particular, "
+        "this can be used to provide metadata in the report, e.g. the work directory, see docs.",
+    )
     group_report.add_argument(
         "--reporter",
         metavar="PLUGIN",

@@ -2102,6 +2112,9 @@ def args_to_api(args, parser):
             dag_api.create_report(
                 reporter=args.reporter,
                 report_settings=report_settings,
+                global_report_settings=GlobalReportSettings(
+                    metadata_template=args.report_metadata
+                ),
             )
         elif args.generate_unit_tests:
             dag_api.generate_unit_tests(args.generate_unit_tests)
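The new option follows the standard argparse pattern for `Path`-typed flags. A minimal self-contained sketch (a standalone parser, not Snakemake's full CLI):

```python
import argparse
from pathlib import Path

parser = argparse.ArgumentParser()
parser.add_argument(
    "--report-metadata",
    metavar="FILE",
    type=Path,  # argparse converts the string argument to a pathlib.Path
    help="Custom metadata template for the report landing page.",
)

args = parser.parse_args(["--report-metadata", "yte_template.yaml"])
print(args.report_metadata)                    # yte_template.yaml
print(isinstance(args.report_metadata, Path))  # True

# When the flag is omitted, the attribute defaults to None:
print(parser.parse_args([]).report_metadata)   # None
```

The `None` default is what allows the API side to fall back to an empty `GlobalReportSettings` when no metadata template was given.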

src/snakemake/persistence.py (93 additions, 78 deletions)

@@ -22,6 +22,7 @@
 from snakemake_interface_executor_plugins.persistence import (
     PersistenceExecutorInterface,
 )
+from snakemake_interface_executor_plugins.settings import ExecMode
 
 from snakemake.common.tbdstring import TBDString
 import snakemake.exceptions
@@ -311,58 +312,64 @@ async def finished(self, job):
             # do not store metadata if not requested
             return
 
-        code = self._code(job.rule)
-        input = self._input(job)
-        log = self._log(job)
-        params = self._params(job)
-        shellcmd = job.shellcmd
-        conda_env = self._conda_env(job)
-        software_stack_hash = self._software_stack_hash(job)
-        fallback_time = time.time()
-        for f in job.output:
-            rec_path = self._record_path(self._incomplete_path, f)
-            starttime = os.path.getmtime(rec_path) if os.path.exists(rec_path) else None
-            # Sometimes finished is called twice, if so, lookup the previous starttime
-            if not os.path.exists(rec_path):
-                starttime = self._read_record(self._metadata_path, f).get(
-                    "starttime", None
-                )
-
-            endtime = (
-                (await f.mtime()).local_or_storage()
-                if await f.exists()
-                else fallback_time
-            )
-
-            checksums = (
-                (infile, await infile.checksum(self.max_checksum_file_size))
-                for infile in job.input
-            )
-            self._record(
-                self._metadata_path,
-                {
-                    "record_format_version": RECORD_FORMAT_VERSION,
-                    "code": code,
-                    "rule": job.rule.name,
-                    "input": input,
-                    "log": log,
-                    "params": params,
-                    "shellcmd": shellcmd,
-                    "incomplete": False,
-                    "starttime": starttime,
-                    "endtime": endtime,
-                    "job_hash": hash(job),
-                    "conda_env": conda_env,
-                    "software_stack_hash": software_stack_hash,
-                    "container_img_url": job.container_img_url,
-                    "input_checksums": {
-                        infile: checksum
-                        async for infile, checksum in checksums
-                        if checksum is not None
-                    },
-                },
-                f,
-            )
+        if (
+            self.dag.workflow.exec_mode == ExecMode.DEFAULT
+            or self.dag.workflow.remote_execution_settings.immediate_submit
+        ):
+            code = self._code(job.rule)
+            input = self._input(job)
+            log = self._log(job)
+            params = self._params(job)
+            shellcmd = job.shellcmd
+            conda_env = self._conda_env(job)
+            software_stack_hash = self._software_stack_hash(job)
+            fallback_time = time.time()
+            for f in job.output:
+                rec_path = self._record_path(self._incomplete_path, f)
+                starttime = (
+                    os.path.getmtime(rec_path) if os.path.exists(rec_path) else None
+                )
+                # Sometimes finished is called twice, if so, lookup the previous starttime
+                if not os.path.exists(rec_path):
+                    starttime = self._read_record(self._metadata_path, f).get(
+                        "starttime", None
+                    )
+
+                endtime = (
+                    (await f.mtime()).local_or_storage()
+                    if await f.exists()
+                    else fallback_time
+                )
+
+                checksums = (
+                    (infile, await infile.checksum(self.max_checksum_file_size))
+                    for infile in job.input
+                )
+                self._record(
+                    self._metadata_path,
+                    {
+                        "record_format_version": RECORD_FORMAT_VERSION,
+                        "code": code,
+                        "rule": job.rule.name,
+                        "input": input,
+                        "log": log,
+                        "params": params,
+                        "shellcmd": shellcmd,
+                        "incomplete": False,
+                        "starttime": starttime,
+                        "endtime": endtime,
+                        "job_hash": hash(job),
+                        "conda_env": conda_env,
+                        "software_stack_hash": software_stack_hash,
+                        "container_img_url": job.container_img_url,
+                        "input_checksums": {
+                            infile: checksum
+                            async for infile, checksum in checksums
+                            if checksum is not None
+                        },
+                    },
+                    f,
+                )
         # remove incomplete marker only after creation of metadata record.
         # otherwise the job starttime will be missing.
         self._remove_incomplete_marker(job)
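The effect of the new guard is that metadata is persisted only by the main process (or by immediately submitted remote jobs), so each record is written once. A standalone sketch of just the gating condition, using a stand-in enum with illustrative member names rather than the real `ExecMode`:

```python
from enum import Enum, auto

class ExecMode(Enum):
    # Stand-in for snakemake_interface_executor_plugins.settings.ExecMode;
    # member names here are illustrative.
    DEFAULT = auto()
    REMOTE = auto()

def should_write_metadata(exec_mode, immediate_submit):
    # Mirrors the guard added in Persistence.finished(): only the default
    # (main) execution mode writes metadata, unless jobs are submitted
    # immediately, in which case the submitting side never sees completion.
    return exec_mode == ExecMode.DEFAULT or immediate_submit

print(should_write_metadata(ExecMode.DEFAULT, False))  # True
print(should_write_metadata(ExecMode.REMOTE, False))   # False
print(should_write_metadata(ExecMode.REMOTE, True))    # True
```

Writing each record exactly once is what reduces the file-operation count that motivated this change (see the GlusterFS note in the changelog).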
@@ -639,30 +646,32 @@ def _params(self, job):
     def _output(self, job):
         return sorted(job.output)
 
-    def _record(self, subject, json_value, id):
+    def _record(
+        self,
+        subject,
+        json_value,
+        id,
+        mode=stat.S_IRUSR | stat.S_IWUSR | stat.S_IRGRP | stat.S_IWGRP,
+    ):
         recpath = self._record_path(subject, id)
-        recdir = os.path.dirname(recpath)
-        os.makedirs(recdir, exist_ok=True)
-        # Write content to temporary file and rename it to the final file.
-        # This avoids race-conditions while writing (e.g. on NFS when the main job
-        # and the cluster node job propagate their content and the system has some
-        # latency including non-atomic propagation processes).
-        with tempfile.NamedTemporaryFile(
-            mode="w",
-            dir=recdir,
-            delete=False,
-            # Add short prefix to final filename for better debugging.
-            # This may not be the full one, because that may be too long
-            # for the filesystem in combination with the prefix from the temp
-            # file.
-            suffix=f".{os.path.basename(recpath)[:8]}",
-        ) as tmpfile:
-            json.dump(json_value, tmpfile)
-        # ensure read and write permissions for user and group
-        os.chmod(
-            tmpfile.name, stat.S_IRUSR | stat.S_IWUSR | stat.S_IRGRP | stat.S_IWGRP
-        )
-        os.replace(tmpfile.name, recpath)
+        try:
+            recpath_stat = os.stat(recpath)
+        except FileNotFoundError:
+            recpath_stat = None
+        recdir = os.path.dirname(recpath)
+        os.makedirs(recdir, exist_ok=True)
+
+        with open(recpath, "w") as recfile:
+            json.dump(json_value, recfile)
+
+        # ensure read and write permissions for user and group if they don't
+        # include the required mode
+        if recpath_stat is None:
+            os.chmod(recpath, mode)
+        else:
+            existing = stat.S_IMODE(recpath_stat.st_mode)
+            new_mode = existing | mode
+            if existing != new_mode:
+                os.chmod(recpath, new_mode)
 
     def _delete_record(self, subject, id):
         try:
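The permission logic above only calls `chmod` when the existing bits do not already include the required mode, saving a metadata operation per record on filesystems like GlusterFS. A runnable sketch of just that widening step (`ensure_mode` is an illustrative helper, not Snakemake code):

```python
import os
import stat
import tempfile

REQUIRED = stat.S_IRUSR | stat.S_IWUSR | stat.S_IRGRP | stat.S_IWGRP  # 0o660

def ensure_mode(path, required=REQUIRED):
    """Widen permissions only when needed: chmod is skipped entirely
    when the existing bits already include the required ones."""
    existing = stat.S_IMODE(os.stat(path).st_mode)
    new_mode = existing | required
    if existing != new_mode:
        os.chmod(path, new_mode)
    return oct(stat.S_IMODE(os.stat(path).st_mode))

fd, path = tempfile.mkstemp()
os.close(fd)
os.chmod(path, 0o600)     # start with user read/write only
print(ensure_mode(path))  # adds group read/write -> '0o660'
```

OR-ing in the required bits (rather than overwriting the mode) preserves any extra permissions the file already had, e.g. from a permissive umask.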
@@ -687,15 +696,21 @@ def _read_record_cached(self, subject, id):
     def _read_record_uncached(self, subject, id):
         if not self._exists_record(subject, id):
             return dict()
-        with open(self._record_path(subject, id), "r") as f:
+        path = self._record_path(subject, id)
+        with open(path, "r") as f:
             try:
                 return json.load(f)
-            except json.JSONDecodeError as e:
-                pass
-        # case: file is corrupted, delete it
-        logger.warning("Deleting corrupted metadata record.")
-        self._delete_record(subject, id)
-        return dict()
+            except json.JSONDecodeError:
+                # Since record writing cannot be reliably made atomic (some
+                # network filesystems, e.g. gluster, have issues with writing
+                # to a temp file and then moving it), we ignore corrupted or
+                # incompletely written records here.
+                # They can only occur if a snakemake process is running and one
+                # does a dry-run (or intentionally disables locking) at the
+                # same time.
+                logger.warning(
+                    f"Ignoring corrupted or currently written metadata record {path}."
+                )
+                return dict()
 
     def _exists_record(self, subject, id):
         return os.path.exists(self._record_path(subject, id))
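Because record writes are no longer funneled through a temp-file-plus-rename, a concurrent reader may observe a half-written file; the tolerant read above simply treats it like a missing record. The pattern can be sketched in isolation (`read_record` is an illustrative stand-in):

```python
import json
import os
import tempfile

def read_record(path):
    """Return the parsed record, or an empty dict when the file is missing,
    corrupted, or still being written (non-atomic writes on some network
    filesystems can expose partial content)."""
    if not os.path.exists(path):
        return {}
    with open(path) as f:
        try:
            return json.load(f)
        except json.JSONDecodeError:
            return {}

fd, path = tempfile.mkstemp()
with os.fdopen(fd, "w") as f:
    f.write('{"starttime": 1')  # truncated, as if another process is mid-write
print(read_record(path))  # {}
```

Returning an empty dict instead of deleting the file (as the old code did) avoids destroying a record that another process is in the middle of writing.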
