WIP: Remove Asserts by brandenchan · Pull Request #444 · deepset-ai/FARM

brandenchan · 2020-07-06T13:57:06Z

The idea of this PR generally is to remove assertion statement so that running systems do not get interrupted by errors. What used to be asserts will, where appropriate, be converted into logging.error() or logging.warning().

This PR focuses only on QA related components. Asserts may still be present if they can be caught by try catch statements such as Processor._featurize_samples() or Processor._init_samples_in_baskets().

Timoeller

I made some suggestions in the code

Timoeller · 2020-07-08T12:38:47Z

farm/modeling/prediction_head.py

+        logit_input = logits is not None
+        preds_input = preds is not None
+
+        if logit_input and preds_input:


syntax seems confusing:
how about

if logits and preds: logger.warning("Both logits and preds have been passed as input to the TextClassificationHead") if (logits is None) and (preds is None): logger.error("Neither logits nor preds have been passed as input to the TextClassificationHead")

Timoeller · 2020-07-08T12:40:59Z

farm/modeling/adaptive_model.py

-                (f"Label_tensor_names are missing inside the {head.task_name} Prediction Head. Did you connect the model"
-                " with the processor through either 'model.connect_heads_with_processor(processor.tasks)'"
-                " or by passing the processor to the Adaptive Model?")
+            if not hasattr(head, "label_tensor_name"):


Not sure about this one. Can the model work without this? e.g. if it is in inference mode.

If so we should remove the assert, if not and the functionality must break further downstream, then lets keep the assert here

Timoeller · 2020-07-08T12:42:48Z

farm/modeling/prediction_head.py

+        if logit_input:
+            logger.warning("QuestionAnsweringHead.formatted_preds() received logit input when it only expects pred input")
+        if not preds_input:
+            logger.warning("QuestionAnsweringHead.formatted_preds() did not receive the preds input it expects")


this should be at least an logger.error or even kept as assert, since the app breaks without preds

Timoeller · 2020-07-08T12:45:37Z

farm/modeling/prediction_head.py

        # are prediction spans
        preds_d = self.aggregate_preds(preds, passage_start_t, ids, seq_2_start_t)

-        assert len(preds_d) == len(baskets)


we should through a logger error here, since we need as many preds_d (on document level) as we have baskets.
If that is not the case it isnt necessarily breaking a cosuming app. Example: haystack retrieves 10 docs, one of those documents is malformatted so it cannot contain preds. Still we want to give back preds_d on all other 9 docs so the reader produces some answer

Timoeller · 2020-07-08T12:49:01Z

farm/modeling/tokenization.py

            else:
                start_of_word.append(False)

-    assert len(tokens) == len(token_offsets) == len(start_of_word)


here we either need tests or keep the assert?

Timoeller · 2020-07-08T12:49:59Z

farm/modeling/prediction_head.py


        # This fn is used to align QA output of len=n_docs and Classification output of len=n_passages
        def chunk(iterable, lengths):
-            assert sum(lengths) == len(iterable)


I do not know what this assert does... lets discuss in detail or throw a logger.error?

brandenchan · 2020-07-22T14:22:02Z

See #468 for how these changes were actually implemented

First attempt

1c94ef6

Timoeller suggested changes Jul 8, 2020

View reviewed changes

brandenchan mentioned this pull request Jul 22, 2020

Remove assertions or replace with logging error #468

Merged

brandenchan closed this Jul 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Remove Asserts#444

WIP: Remove Asserts#444
brandenchan wants to merge 1 commit intomasterfrom
remove_assert

brandenchan commented Jul 6, 2020

Uh oh!

Timoeller left a comment

Uh oh!

Timoeller Jul 8, 2020

Uh oh!

Timoeller Jul 8, 2020

Uh oh!

Timoeller Jul 8, 2020

Uh oh!

Timoeller Jul 8, 2020

Uh oh!

Timoeller Jul 8, 2020

Uh oh!

Timoeller Jul 8, 2020

Uh oh!

brandenchan commented Jul 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

brandenchan commented Jul 6, 2020

Uh oh!

Timoeller left a comment

Choose a reason for hiding this comment

Uh oh!

Timoeller Jul 8, 2020

Choose a reason for hiding this comment

Uh oh!

Timoeller Jul 8, 2020

Choose a reason for hiding this comment

Uh oh!

Timoeller Jul 8, 2020

Choose a reason for hiding this comment

Uh oh!

Timoeller Jul 8, 2020

Choose a reason for hiding this comment

Uh oh!

Timoeller Jul 8, 2020

Choose a reason for hiding this comment

Uh oh!

Timoeller Jul 8, 2020

Choose a reason for hiding this comment

Uh oh!

brandenchan commented Jul 22, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants