[NativeAOT] Implements eager finalization of weak references #75436

VSadov · 2022-09-12T06:01:41Z

Fixes:#75107

eager finalization is an important perf and reliability improvement for code that uses a lot of weak references.
now that we have eager finalization, simplified the WeakReference<T> and WeakReference implementations a bit.
extra method table flags can now be stored in place of ComponentSize, which is rarely used (only arrays and strings need that). A similar pattern as used in CoreClr to increase data density in method tables. The trick allows 16 more bits for type traits, as long as the traits do not apply to arrays.

VSadov · 2022-09-12T14:48:17Z

/azp run runtime-extra-platforms

azure-pipelines · 2022-09-12T14:48:40Z

Azure Pipelines successfully started running 1 pipeline(s).

jkotas · 2022-09-12T19:57:25Z

src/coreclr/tools/Common/TypeSystem/Common/TypeFlags.cs

EagerFinalizer and CriticalFinalizer are not generic type system concepts. They do not need to have a flag here.

We only need to check these when producing EETypes. We can do the check only when building EEType.

It is the first time I need to change something in the type system, so Iwas just following existing patterns - like HasFinalizer.
What I hear is that instead of setting this flags we could just check for the same info that we capture in flags, but later.
Do we have a more appropriate example?

We only need to check these when producing EETypes. We can do the check only when building EEType.

Where is that done? At the locations where these flags are currently consumed?

You should only need to compute these flags here: https://github.com/dotnet/runtime/blob/e3cd737cea578c629a194474fd67d660fc7a9903/src/coreclr/tools/aot/ILCompiler.Compiler/Compiler/DependencyAnalysis/EETypeNode.cs#L657

Something like:

if (type is MetadataType mdType && mdType.Module == context.SystemModule && mdType.Name == "WeakReference" pr "WeakReference`1" mdType.Namespace == "System") { flags |= TypeFlags.HasEagerFinalizer; }

You should not need the one in EETypeBuilderHelpers. Copying from the template type should be enough.

jkotas · 2022-09-12T20:02:25Z

src/coreclr/nativeaot/System.Private.CoreLib/src/System/WeakReference.NativeAot.cs

Why do we need KeepAlive in a finalizer?

In theory a finalizer never runs concurrently with itself, thus neither Interlocked.Exchange nor KeepAlive are necessary, since noone could be recycling the handle while we are freeing it.

However, since nongeneric WeakReference is not sealed, it is hard to guarantee anything.

The original code looked like it tried to handle concurrent finalization. It does look like an attempt to handle just a particular kind of misuse, so I kept such assumption.

I guess we can remove both Interlocked.Exchange and KeepAlive. If things are broken, they can be broken in many ways.

I hope deriving from WeakReference or at least overriding the finalizer and doing weird things in it is not a common practice.

It would be nice to match what CoreCLR does (work towards sharing the code eventually).

CoreCLR has a bunch of code that handles COM and I am not very familiar with requirements. It looks a lot more complicated, perhaps because it is in native code and maybe because of the COM stuff.

Ignoring COM, there is nothing AOT specific here. WeakReferences are just thin facades to GC and handles, which are the same for CoreCLR.
CoreCLR could be trivially switched to use managed implementation (which seems simpler), just would need to handle the COM stuff.

The COM stuff is for http://github.com/microsoft/cswinRT/ support. If once we want to make nativeaot work well for cswinrt, we will need to implement the COM stuff too.

we will need to implement the COM stuff too

I'd hope there is a way to do that in managed code.
There is some code in NativeAOT under #if ENABLE_WINRT, but maybe it is a different kind of WINRT.

I looked at the native implementation a bit more. The COM is really about detecting RCWs and storing them via a different kind of handle.

The native implementation seems to be adding a lot of complexity just for being native. There is gc protection, for the reference itself and for the referenced object.

There is also a spinlock in the setter. Since the kind of the handle can be changing, the assignment is not a single operation.

There is also a comment saying that the lock synchronizes with finalization done by GC, but GC does not run concurrently with FCalls, (not the blocking mark phases when we know which finalizables are not reachable), so I am not sure the spinlock helps with that. Besides, the getter does not take the spinlock anyways. There is comment saying it is ok, but it is not very convincing. If handle recycling is possible, it would be possible to fetch an object of a wrong type and that is still a GC hole.

I think with the eager finalization the handle recycling is not possible in WeakReference<T>, because eager finalization does not run concurrently with managed code. The same should hold for the nongeneric WeakReference as well, unless it is overriden, but then there are endless opportunities for breakage.
Same should hold for the FCalls as well. I see some GCX_PREEMP(); in the code, but gc.pThis protection should work as KeepAlive.

Maybe it is worth to actually make the native implementation closer to managed, or even switch to managed, if COM stuff can be handled via FCall callouts. Hard to tell without actually trying.

src/coreclr/nativeaot/Common/src/Internal/Runtime/MethodTable.cs

jkotas · 2022-09-12T20:08:11Z

src/coreclr/nativeaot/Runtime/ObjectLayout.h

I think uintptr_t would be more appropriate type to use for the handle.

src/coreclr/nativeaot/Runtime/ObjectLayout.h

jkotas · 2022-09-12T20:12:13Z

src/coreclr/tools/Common/Internal/Runtime/MethodTable.Constants.cs

We should match the tricks that we do for this in MethodTable. It is 0x80000000 so that the JIT can optimize this as sign check. Also, the flags are fetched as dword so that it can use the smaller instructions (on x64).

Like - merge the Flags and FlagsEx into one uint32 and move the HasComponentSize to the sign bit?

or just move the bit?

Check the codegen for go_through_object for CoreCLR and make sure that it is as good.

Right, we want mT->HasComponentSize() in the following just be a sign check.
(now it probably loads a short into a 32bit register and does a bit test)

inline size_t my_get_size (Object* ob) { MethodTable* mT = header(ob)->GetMethodTable(); return (mT->GetBaseSize() + (mT->HasComponentSize() ? ((size_t)((CObjectHeader*)ob)->GetNumComponents() * mT->RawGetComponentSize()) : 0)); }

The perf here is typically gated by the cache misses when fetching the mT pointer - we are looking at headers of semi-random objects, and that is often a miss.

It would not hurt if codegen for HasComponentSize is tighter though.

jkotas · 2022-09-12T22:07:12Z

src/coreclr/nativeaot/System.Private.CoreLib/src/System/WeakReference.NativeAot.cs

This is subtle change. Target property is virtual and it call be overridden to do whatever in the inherited type.

I did not even think the Target could be virtual. GetGeneration makes even less sense if Target is overridden.

I guess I will revert to the original implementation, except the KeepAlive(wo);. We do not need to protect the handle once we fetched the obj. The handle can be gone by the time we return to the caller anyways.

The old code also does not do RuntimeImports.RhHandleGet(h) ?? TryGetComTarget(). Another difference.

Oops, sorry. I confused IsAlive and GC.GetGeneration. Both have subtle changes if Target is overridden.
I'll revert both to the old implementation.

src/coreclr/tools/Common/Internal/Runtime/EETypeBuilderHelpers.cs

src/coreclr/nativeaot/Common/src/Internal/Runtime/MethodTable.cs

src/coreclr/nativeaot/Runtime/MethodTable.cpp

src/coreclr/nativeaot/Runtime/ObjectLayout.cpp

src/coreclr/nativeaot/System.Private.CoreLib/src/System/WeakReference.NativeAot.cs

jkotas · 2022-09-14T00:38:13Z

src/coreclr/nativeaot/Runtime/DebugHeader.cpp

cc @noahfalk This is revving the debugger contract.

There are more changes when native side is updated, but the most observable part is that a few MT flags have moved around

MichalStrehovsky · 2022-09-13T23:28:42Z

src/coreclr/nativeaot/Common/src/Internal/Runtime/MethodTable.cs

Suggested change

// - type arg count for typedefs,

// - type arg count for generic type definitions MethodTables,

Doesn't hurt to spell it out. Generic type definition MethodTables can be weird because they never show up as allocated.

fixed the comment, on the native side as well.

src/coreclr/nativeaot/Common/src/Internal/Runtime/MethodTable.cs

MichalStrehovsky · 2022-09-14T00:33:51Z

src/coreclr/nativeaot/Runtime/inc/MethodTable.h

We made this a single int on the managed side and made emission also happen as a single int. Should we collapse to a single integer here too?

The native changes are coming shortly.

It is not a lot, but we can optimize important things like computing object size in GC.

MichalStrehovsky · 2022-09-14T00:39:17Z

src/coreclr/tools/aot/ILCompiler.Compiler/Compiler/DependencyAnalysis/EETypeNode.cs

Could you update the large comment at the top of the file? We no longer emit/access this as two ushorts, but as a single int.

...coreclr/nativeaot/System.Private.TypeLoader/src/Internal/Runtime/TypeLoader/EETypeCreator.cs

jkotas

LGTM otherwise. Thanks!

VSadov · 2022-09-14T01:13:58Z

With the native changes, we have

=== object size computation:

                s = size (oo);
00007FF6202E23FA  mov         rcx,qword ptr [r10]  
00007FF6202E23FD  and         rcx,0FFFFFFFFFFFFFFF8h  
00007FF6202E2401  mov         eax,dword ptr [rcx]  
00007FF6202E2403  test        eax,eax  
HasComponentSize     => 00007FF6202E2405  jns         SVR::gc_heap::mark_object_simple1+4DAh (07FF6202E282Ah)  
num components => 00007FF6202E240B  mov         edx,dword ptr [r10+8]  
comp size      => 00007FF6202E240F  movzx       eax,ax  
arr body size  => 00007FF6202E2412  imul        rdx,rax  
00007FF6202E2416  jmp         SVR::gc_heap::mark_object_simple1+4DCh (07FF6202E282Ch)

=== figuring whether an instanse is eagerly finalizable:

bool GCToEEInterface::EagerFinalized(Object* obj)
{
00007FF7C9279C07  mov         r8,rcx  
    if (!obj->GetGCSafeMethodTable()->HasEagerFinalizer())
00007FF7C9279C0A  and         rax,0FFFFFFFFFFFFFFF8h  
00007FF7C9279C0E  mov         edx,dword ptr [rax]  
HasEagerFinalizer => 00007FF7C9279C10  test        dl,1  
00007FF7C9279C13  je          GCToEEInterface::EagerFinalized+42h (07FF7C9279C42h)  
HasComponentSize  => 00007FF7C9279C15  test        edx,edx  
00007FF7C9279C17  js          GCToEEInterface::EagerFinalized+42h (07FF7C9279C42h)

Co-authored-by: Jan Kotas <[email protected]>

VSadov · 2022-09-14T08:04:16Z

Thanks!!

VSadov requested a review from MichalStrehovsky as a code owner September 12, 2022 06:01

ghost added the area-NativeAOT-coreclr label Sep 12, 2022

ghost assigned VSadov Sep 12, 2022

VSadov requested a review from marek-safar as a code owner September 12, 2022 06:41

VSadov requested a review from jkotas September 12, 2022 14:51

jkotas reviewed Sep 12, 2022

View reviewed changes

src/coreclr/nativeaot/Common/src/Internal/Runtime/MethodTable.cs Outdated Show resolved Hide resolved

jkotas reviewed Sep 12, 2022

View reviewed changes

src/coreclr/nativeaot/Runtime/ObjectLayout.h Outdated Show resolved Hide resolved

jkotas reviewed Sep 12, 2022

View reviewed changes

VSadov force-pushed the wr branch from aad88cb to c194b1a Compare September 13, 2022 02:59

VSadov added 11 commits September 12, 2022 20:59

HasEagerFinalizerFlag

49954ba

EETypeFlagsEx

c8e8059

comments

07386b3

Simplify WeakReference<T>

cf6e19b

Reconcile nongeneric WeakReference with changes in WeakReference<T>

2da23ad

a few touch ups

8c1fb5f

adjusted comment

f29e65d

fix mono build

7617786

make GCC happy

8de1eaa

simple PR feedback.

0a08d92

HasEagerFinalizer is not a typesystem concept

f6171e3

VSadov force-pushed the wr branch from c194b1a to f6171e3 Compare September 13, 2022 04:18

jkotas reviewed Sep 13, 2022

View reviewed changes

src/coreclr/tools/Common/Internal/Runtime/EETypeBuilderHelpers.cs Outdated Show resolved Hide resolved

VSadov added 2 commits September 12, 2022 22:32

do not call Name twice

ab6afdc

reorder bits

3d2babe