Making mixed precision work with all optimizers #7654
Conversation
Force-pushed from 936e2bd to 5becd3d
python/mxnet/optimizer.py
Outdated
| """ | ||
| weight_master_copy = None | ||
| if self.multi_precision and weight.dtype == numpy.float16: | ||
| weight_master_copy = array(weight, ctx=weight.context, dtype=numpy.float32) |
Use weight.astype(float32) instead.
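For context, a minimal sketch of what the suggested change amounts to (assumes MXNet's NDArray API; the shape is illustrative). astype returns a copy on the same context, so the explicit ctx= argument becomes unnecessary:

    import mxnet as mx
    import numpy

    weight = mx.nd.ones((4,), dtype=numpy.float16)

    # instead of: array(weight, ctx=weight.context, dtype=numpy.float32)
    weight_master_copy = weight.astype(numpy.float32)  # fp32 copy on the same context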
python/mxnet/optimizer.py
Outdated
# Wrapper for mixed precision
weight_master_copy = state[0]
original_state = state[1]
grad32 = array(grad, ctx=grad.context, dtype=numpy.float32)
Same here: use grad.astype.
python/mxnet/optimizer.py
Outdated
original_state = state[1]
grad32 = array(grad, ctx=grad.context, dtype=numpy.float32)
self.update(index, weight_master_copy, grad32, original_state)
weight[:] = weight_master_copy.astype(weight.dtype)
Use nd.cast(weight_master_copy, dtype=weight.dtype, out=weight) to avoid a copy.
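To illustrate the difference (a sketch with illustrative values, not the PR's final code): .astype allocates a temporary fp16 array that is then copied into weight, whereas nd.cast with out= writes the result directly into weight's buffer:

    import mxnet as mx
    import numpy

    weight = mx.nd.zeros((4,), dtype=numpy.float16)
    weight_master_copy = mx.nd.ones((4,), dtype=numpy.float32)

    # allocates a temporary fp16 array, then copies it into weight:
    weight[:] = weight_master_copy.astype(weight.dtype)

    # writes the cast result straight into weight, no temporary:
    mx.nd.cast(weight_master_copy, dtype=weight.dtype, out=weight)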
python/mxnet/optimizer.py
Outdated
    The state associated with the weight.
    """

def create_mp_state(self, index, weight):
Use the full name create_state_multi_precision.
python/mxnet/optimizer.py
Outdated
| """ | ||
|
|
||
| def create_mp_state(self, index, weight): | ||
| """Creates auxiliary state for a given weight, including FP32 master |
"including FP32 master copy if necessary." -> "including fp32 high precision copy if original weight is fp16"
python/mxnet/optimizer.py
Outdated
| """ | ||
| raise NotImplementedError() | ||
|
|
||
| def update_mp(self, index, weight, grad, state): |
Likewise, use the full name update_multi_precision.
python/mxnet/optimizer.py
Outdated
    return momentum

def update(self, index, weight, grad, state):
def create_state(self, index, weight):
There should be a _create_state_impl function that both create_mp_state and create_state use. And create_state should keep the original behavior (always multi_precision=False).
I don't think _create_state_impl is necessary, since this is basically create_state.
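Putting the threads above together, a hedged sketch of the shape the API could take after the renames, condensed from the diff hunks in this review (the multi_precision flag is assumed to be set in __init__; this is not necessarily the merged code verbatim):

    import mxnet as mx
    import numpy

    class Optimizer(object):
        def __init__(self, multi_precision=False):
            self.multi_precision = multi_precision

        def create_state(self, index, weight):
            # original single-precision behavior, unchanged
            raise NotImplementedError()

        def update(self, index, weight, grad, state):
            raise NotImplementedError()

        def create_state_multi_precision(self, index, weight):
            """Creates auxiliary state, including an fp32 high precision
            copy if the original weight is fp16."""
            if self.multi_precision and weight.dtype == numpy.float16:
                weight_master_copy = weight.astype(numpy.float32)
                return (weight_master_copy,
                        self.create_state(index, weight_master_copy))
            return self.create_state(index, weight)

        def update_multi_precision(self, index, weight, grad, state):
            """Updates the weight and keeps the fp32 master copy in sync."""
            if self.multi_precision and weight.dtype == numpy.float16:
                weight_master_copy, original_state = state
                grad32 = grad.astype(numpy.float32)
                self.update(index, weight_master_copy, grad32, original_state)
                # write the fp16 result back without an intermediate copy
                mx.nd.cast(weight_master_copy, dtype=weight.dtype, out=weight)
            else:
                self.update(index, weight, grad, state)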
rg_options = [{}, {'rescale_grad': 0.14}, {'rescale_grad': 0.8}]
wd_options = [{}, {'wd': 0.03}, {'wd': 0.05}, {'wd': 0.07}]
mp_options = [{}, {'multi_precision': False}, {'multi_precision': True}]
for dtype in [np.float16, np.float32]:
This is exploding exponentially...
And that is fine - previously there was a list of test cases where it was really hard to make sure that all possible combinations of parameters were tested (and all possible combinations should be tested). This design guarantees that.
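For illustration, a sketch of how those option lists expand into an exhaustive grid (compare_optimizer here is a stand-in for whatever check the test actually runs):

    import itertools
    import numpy as np

    rg_options = [{}, {'rescale_grad': 0.14}, {'rescale_grad': 0.8}]
    wd_options = [{}, {'wd': 0.03}, {'wd': 0.05}, {'wd': 0.07}]
    mp_options = [{}, {'multi_precision': False}, {'multi_precision': True}]

    for dtype in [np.float16, np.float32]:
        for rg, wd, mp in itertools.product(rg_options, wd_options, mp_options):
            kwargs = {}
            for opt in (rg, wd, mp):
                kwargs.update(opt)
            # 2 * 3 * 4 * 3 = 72 combinations in total, e.g.:
            # compare_optimizer(opt1(**kwargs), opt2(**kwargs), shape, dtype)
            print(dtype, kwargs)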
Force-pushed from 6bd2368 to df3c841
Force-pushed from df3c841 to 9cab44a
I think you need to merge in master.
I did (and it fixed a lot of errors) but I still got an unrelated segfault :-(.
@piiswrong Passed!
Thanks
Commits:
* Making mixed precision work with all optimizers
* Restart CI
* Restart CI
No description provided.