Make sure that the tests ensure that functions _create_description_xml() and _create_run_from_xml() are each others exact complements.
e.g.,
- a parameter free classifier
- a classifier with multiple components
- a tagged run
- runs that come from the server (and thus have global evaluation measures, measures per sample/fold set)
- ... feel free to add