[X64] [tannergooding] Minor cleanup of the Vector64/128/256/512 implementations to improve fall ... #404

Open
MihuBot opened this issue Jun 6, 2024 · 2 comments


MihuBot commented Jun 6, 2024

Job completed in 27 minutes.
dotnet/runtime#103095

Diffs

Found 263 files with textual diffs.

Summary of Code Size diffs:
(Lower is better)

Total bytes of base: 39807267
Total bytes of diff: 39778503
Total bytes of delta: -28764 (-0.07 % of base)
Total relative delta: -107.50
    diff is an improvement.
    relative diff is an improvement.


Total byte diff includes -9022 bytes from reconciling methods
	Base had   96 unique methods,     9022 unique bytes
	Diff had    0 unique methods,        0 unique bytes

Top file regressions (bytes):
        2602 : System.Numerics.Tensors.dasm (0.33 % of base)

Top file improvements (bytes):
      -31366 : System.Private.CoreLib.dasm (-0.45 % of base)

2 total files with Code Size differences (1 improved, 1 regressed), 255 unchanged.
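The summary figures above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the diff tooling itself; the variable names are illustrative and the numbers are copied from the report.

```python
# Figures copied from the summary above; the names are illustrative only.
base_bytes = 39_807_267
diff_bytes = 39_778_503

# Delta is diff minus base, so a negative value means the code got smaller.
delta = diff_bytes - base_bytes
percent_of_base = 100.0 * delta / base_bytes

# Per-file deltas listed under the top file regressions/improvements (bytes).
file_deltas = {
    "System.Numerics.Tensors.dasm": 2602,
    "System.Private.CoreLib.dasm": -31366,
}

print(delta)
print(round(percent_of_base, 2))
print(sum(file_deltas.values()))
```

The two per-file deltas sum to the reported total delta, which is consistent with only two files having code size differences.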

Top method regressions (bytes):
         671 (126.37 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
         373 (37.75 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
         183 (26.52 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
         138 (418.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
         101 (17.50 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):long (FullOpts)
          87 (26.44 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          87 (32.95 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          87 (26.44 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          87 (32.71 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          86 (25.90 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          86 (25.90 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          81 (19.71 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          79 (14.11 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          73 (10.93 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          69 (20.72 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          69 (21.50 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          66 (18.80 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxMagnitudeOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          66 (19.47 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinMagnitudeOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          61 (34.08 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
          61 (5.07 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ShiftLeft(System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          61 (5.07 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          47 (427.27 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[ubyte]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts)
          46 (24.73 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxNumberOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          44 (30.99 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+AbsoluteOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          44 (23.40 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinNumberOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          42 (7.75 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:CosineSimilarityCore[double](System.ReadOnlySpan`1[double],System.ReadOnlySpan`1[double]):double (FullOpts)
          42 (20.90 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxPropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          42 (20.69 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinPropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          40 (66.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[double](System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          40 (66.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          39 (354.55 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[short]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[short] (FullOpts)
          34 (170.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[double](System.Runtime.Intrinsics.Vector128`1[double]):double (FullOpts)
          33 (18.75 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          33 (18.03 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[ubyte]:Invoke(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          33 (18.03 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[ubyte]:Invoke(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          33 (18.75 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          32 (10.32 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxMagnitudeOperator`1[long]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[long],byref,System.Runtime.Intrinsics.Vector512`1[long]) (FullOpts)
          32 (10.32 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinMagnitudeOperator`1[long]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[long],byref,System.Runtime.Intrinsics.Vector512`1[long]) (FullOpts)
          32 (34.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):short (FullOpts)
          32 (34.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):short (FullOpts)
          32 (28.32 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector64`1[ubyte]):ubyte (FullOpts)
          29 (15.68 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[ubyte]:Invoke(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          29 (15.68 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[ubyte]:Invoke(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          29 (207.14 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[long](System.Runtime.Intrinsics.Vector128`1[long]):long (FullOpts)
          29 (263.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[int]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[int] (FullOpts)
          28 (46.67 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+AddOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          28 (12.17 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxMagnitudeOperator`1[int]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[int],byref,System.Runtime.Intrinsics.Vector512`1[int]) (FullOpts)
          28 (12.17 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxMagnitudeOperator`1[short]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[short],byref,System.Runtime.Intrinsics.Vector512`1[short]) (FullOpts)
          28 (10.94 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxMagnitudeOperator`1[ubyte]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[ubyte],byref,System.Runtime.Intrinsics.Vector512`1[ubyte]) (FullOpts)
          28 (12.96 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxOperator`1[int]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[int],byref,System.Runtime.Intrinsics.Vector512`1[int]) (FullOpts)
          28 (12.84 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxOperator`1[long]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[long],byref,System.Runtime.Intrinsics.Vector512`1[long]) (FullOpts)
          28 (12.96 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxOperator`1[short]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[short],byref,System.Runtime.Intrinsics.Vector512`1[short]) (FullOpts)
          28 (10.53 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxOperator`1[ubyte]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[ubyte],byref,System.Runtime.Intrinsics.Vector512`1[ubyte]) (FullOpts)
          28 (12.28 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinMagnitudeOperator`1[int]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[int],byref,System.Runtime.Intrinsics.Vector512`1[int]) (FullOpts)
          28 (12.28 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinMagnitudeOperator`1[short]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[short],byref,System.Runtime.Intrinsics.Vector512`1[short]) (FullOpts)
          28 (10.94 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinMagnitudeOperator`1[ubyte]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[ubyte],byref,System.Runtime.Intrinsics.Vector512`1[ubyte]) (FullOpts)
          28 (12.96 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinOperator`1[int]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[int],byref,System.Runtime.Intrinsics.Vector512`1[int]) (FullOpts)
          28 (12.84 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinOperator`1[long]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[long],byref,System.Runtime.Intrinsics.Vector512`1[long]) (FullOpts)
          28 (12.96 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinOperator`1[short]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[short],byref,System.Runtime.Intrinsics.Vector512`1[short]) (FullOpts)
          28 (10.61 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinOperator`1[ubyte]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[ubyte],byref,System.Runtime.Intrinsics.Vector512`1[ubyte]) (FullOpts)
          28 (155.56 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[int](System.Runtime.Intrinsics.Vector128`1[int]):int (FullOpts)
          28 (46.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Sum[double](System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          28 (46.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Sum(System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          24 (30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):int (FullOpts)
          24 (30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):int (FullOpts)
          24 (28.24 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector64`1[short]):ubyte (FullOpts)
          23 (27.06 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[short](System.Runtime.Intrinsics.Vector128`1[short]):short (FullOpts)
          23 (209.09 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[double]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[double] (FullOpts)
          23 (209.09 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[long]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[long] (FullOpts)
          19 (15.57 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          19 (15.57 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          14 (46.67 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+AddOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector256`1[double]):double (FullOpts)
          14 (38.89 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[double](System.Numerics.Vector`1[double],System.Numerics.Vector`1[double]):double (FullOpts)
          14 (46.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Sum(System.Runtime.Intrinsics.Vector256`1[double]):double (FullOpts)
          14 (24.56 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector64`1[int]):ubyte (FullOpts)
          13 (3.10 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector512`1[long],int):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[double](System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          12 (3.53 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+RoundToEvenOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (0.91 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+TanOperatorDouble:Invoke(System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (1.55 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+TanOperatorSingle:Invoke(System.Runtime.Intrinsics.Vector512`1[float]):System.Runtime.Intrinsics.Vector512`1[float] (FullOpts)
          12 (12.90 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+TruncateOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[double](System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
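Each entry in the list above follows the same `delta (percent % of base) : file - method` shape, so the original method size can be estimated from the two numbers. The sketch below assumes that shape; `MethodDiff`, `parse_method_line`, and `approx_base_bytes` are hypothetical helper names, and the recovered base size is only approximate because the percentage is rounded to two decimals in the report.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed line shape: "<delta> (<pct> % of base) : <file> - <method signature>"
LINE_RE = re.compile(r"^\s*(-?\d+) \((-?\d+(?:\.\d+)?) % of base\) : (\S+) - (.+)$")


@dataclass
class MethodDiff:
    delta_bytes: int
    percent_of_base: float
    file: str
    method: str

    def approx_base_bytes(self) -> Optional[int]:
        """Estimate the base size from delta and percent (approximate)."""
        if self.percent_of_base == 0:
            return None
        return round(self.delta_bytes / (self.percent_of_base / 100.0))


def parse_method_line(line: str) -> Optional[MethodDiff]:
    """Parse one report line; return None if it does not match the assumed shape."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    return MethodDiff(int(m.group(1)), float(m.group(2)), m.group(3), m.group(4))


# One entry copied from the regressions list above.
sample = (
    "         138 (418.18 % of base) : System.Private.CoreLib.dasm - "
    "System.Runtime.Intrinsics.Vector128:Sum[ubyte]"
    "(System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)"
)
entry = parse_method_line(sample)
print(entry.delta_bytes, entry.approx_base_bytes())
```

For this entry the estimate comes out at about 33 bytes of base code, which puts the large percentage in context: a 138-byte regression on a very small method.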

Top method improvements (bytes):
       -2083 (-87.89 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):ubyte (FullOpts)
       -1265 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts) (1 base, 0 diff methods)
       -1265 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts) (1 base, 0 diff methods)
       -1095 (-79.23 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):ubyte (FullOpts)
       -1080 (-88.02 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
        -713 (-82.91 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Dot[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
        -523 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -523 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -433 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[double](System.Runtime.Intrinsics.Vector512`1[double],int):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts) (1 base, 0 diff methods)
        -433 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[long](System.Runtime.Intrinsics.Vector512`1[long],int):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts) (1 base, 0 diff methods)
        -423 (-83.60 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Dot(System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
        -411 (-44.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftLeft(System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
        -411 (-44.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
        -294 (-46.96 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[short],short):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts)
        -266 (-26.42 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
        -255 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -255 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -251 (-84.80 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[short](System.Runtime.Intrinsics.Vector128`1[short],System.Runtime.Intrinsics.Vector128`1[short]):short (FullOpts)
        -237 (-21.35 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
        -233 (-26.72 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
        -221 (-24.58 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):long (FullOpts)
        -218 (-13.81 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
        -202 (-70.88 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
        -202 (-91.82 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:op_UnaryNegation(System.Runtime.Intrinsics.Vector128`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
        -191 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[double](System.Runtime.Intrinsics.Vector256`1[double],int):System.Runtime.Intrinsics.Vector256`1[double] (FullOpts) (1 base, 0 diff methods)
        -191 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[long](System.Runtime.Intrinsics.Vector256`1[long],int):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts) (1 base, 0 diff methods)
        -187 (-81.30 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.__Canon[],int) (FullOpts)
        -186 (-90.73 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.TryCopyTo(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Span`1[System.__Canon]):ubyte (FullOpts)
        -183 (-81.33 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Create(System.__Canon[],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -174 (-87.44 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.__Canon[]) (FullOpts)
        -174 (-87.44 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Create(System.__Canon[]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -173 (-88.72 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Span`1[System.__Canon]) (FullOpts)
        -173 (-88.72 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Create(System.ReadOnlySpan`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -173 (-88.72 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Negate(System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -169 (-44.83 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[int],int):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts)
        -167 (-34.94 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):long (FullOpts)
        -165 (-32.93 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[int],System.Runtime.Intrinsics.Vector256`1[int]):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts)
        -157 (-20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[short],System.Runtime.Intrinsics.Vector256`1[short]):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts)
        -147 (-77.78 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Create(System.__Canon[],int):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
        -147 (-88.55 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.TryCopyTo(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Span`1[System.__Canon]):ubyte (FullOpts)
        -144 (-77.01 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.__Canon[],int) (FullOpts)
        -140 (-88.05 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Negate(System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
        -138 (-84.66 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Create(System.__Canon[]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
        -137 (-86.16 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Create(System.ReadOnlySpan`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
        -136 (-75.98 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:CopyTo[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.__Canon[],int) (FullOpts)
        -135 (-84.38 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.__Canon[]) (FullOpts)
        -135 (-76.27 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Create[System.__Canon](System.__Canon[],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -135 (-84.38 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Create[System.__Canon](System.__Canon[]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -134 (-85.90 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Span`1[System.__Canon]) (FullOpts)
        -134 (-85.90 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Create[System.__Canon](System.ReadOnlySpan`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -134 (-87.58 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:TryCopyTo[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Span`1[System.__Canon]):ubyte (FullOpts)
        -133 (-24.45 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
        -133 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector64`1[ubyte],int):System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -133 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector64`1[ubyte],int):System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -133 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[ubyte](System.Runtime.Intrinsics.Vector64`1[ubyte],int):System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -132 (-84.08 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:CopyTo[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.__Canon[]) (FullOpts)
        -131 (-85.62 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:CopyTo[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Span`1[System.__Canon]) (FullOpts)
        -128 (-87.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:op_UnaryNegation(System.Runtime.Intrinsics.Vector128`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
        -124 (-59.62 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.WithElement(System.Runtime.Intrinsics.Vector512`1[System.__Canon],int,System.__Canon):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -121 (-29.44 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Multiply(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
        -118 (-38.19 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector256`1[double],int):System.Runtime.Intrinsics.Vector256`1[double] (FullOpts)
        -117 (-39.26 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Multiply[long](long,System.Runtime.Intrinsics.Vector256`1[long]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
        -117 (-28.47 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
        -117 (-39.26 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Multiply(System.Runtime.Intrinsics.Vector256`1[long],long):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
        -117 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
        -117 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
        -117 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
        -114 (-30.89 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.ShiftLeft(System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
        -114 (-30.89 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
        -114 (-57.58 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:get_Item(int):System.__Canon:this (FullOpts)
        -114 (-59.38 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.GetElement(System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.__Canon (FullOpts)
        -111 (-85.38 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.TryCopyTo(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Span`1[System.__Canon]):ubyte (FullOpts)
        -111 (-71.15 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:GetElement[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.__Canon (FullOpts)
        -111 (-71.15 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Negate[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -111 (-71.15 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:WithElement[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int,System.__Canon):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -110 (-36.91 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[long],long):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
        -109 (-40.52 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[short],short):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
        -108 (-71.52 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.__Canon[],int) (FullOpts)
        -108 (-72.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.__Canon[],int):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
        -101 (-84.17 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Negate(System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
        -101 (-84.17 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:get_AllBitsSet():System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -101 (-84.17 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:get_Count():int (FullOpts)
        -101 (-84.17 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:op_UnaryNegation(System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
        -101 (-84.17 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector512`1[System.__Canon]):ubyte (FullOpts)
         -99 (-79.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.__Canon[]) (FullOpts)
         -99 (-79.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.__Canon[]):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
         -99 (-70.21 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Create[System.__Canon](System.__Canon[],int):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
         -98 (-81.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Span`1[System.__Canon]) (FullOpts)
         -98 (-81.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Create(System.ReadOnlySpan`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
         -98 (-83.76 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:TryCopyTo[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Span`1[System.__Canon]):ubyte (FullOpts)
         -98 (-81.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:GetHashCode():int:this (FullOpts)
         -96 (-69.06 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:CopyTo[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.__Canon[],int) (FullOpts)
         -96 (-79.34 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:CopyTo[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.__Canon[]) (FullOpts)
         -96 (-79.34 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Create[System.__Canon](System.__Canon[]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
         -95 (-81.20 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:CopyTo[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Span`1[System.__Canon]) (FullOpts)
         -95 (-81.20 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Create[System.__Canon](System.ReadOnlySpan`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
         -93 (-80.17 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[double](System.Runtime.Intrinsics.Vector128`1[double],System.Runtime.Intrinsics.Vector128`1[double]):double (FullOpts)
         -93 (-32.75 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector256`1[long],int):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
         -93 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[short](System.Runtime.Intrinsics.Vector64`1[short],int):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts) (1 base, 0 diff methods)
         -93 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[short](System.Runtime.Intrinsics.Vector64`1[short],int):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts) (1 base, 0 diff methods)

Top method regressions (percentages):
          47 (427.27 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[ubyte]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts)
         138 (418.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
          39 (354.55 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[short]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[short] (FullOpts)
          29 (263.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[int]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[int] (FullOpts)
          23 (209.09 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[double]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[double] (FullOpts)
          23 (209.09 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[long]:get_AllBitsSet():System.Runtime.Intrinsics.Vector64`1[long] (FullOpts)
          29 (207.14 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[long](System.Runtime.Intrinsics.Vector128`1[long]):long (FullOpts)
          34 (170.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[double](System.Runtime.Intrinsics.Vector128`1[double]):double (FullOpts)
          28 (155.56 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[int](System.Runtime.Intrinsics.Vector128`1[int]):int (FullOpts)
         671 (126.37 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          40 (66.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[double](System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          40 (66.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          14 (46.67 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+AddOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector256`1[double]):double (FullOpts)
          28 (46.67 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+AddOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          14 (46.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Sum(System.Runtime.Intrinsics.Vector256`1[double]):double (FullOpts)
          28 (46.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Sum[double](System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          28 (46.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Sum(System.Runtime.Intrinsics.Vector512`1[double]):double (FullOpts)
          14 (38.89 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[double](System.Numerics.Vector`1[double],System.Numerics.Vector`1[double]):double (FullOpts)
         373 (37.75 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          61 (34.08 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
          32 (34.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):short (FullOpts)
          32 (34.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):short (FullOpts)
          87 (32.95 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          87 (32.71 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[int]:Invoke(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          79 (31.85 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[short]:Invoke(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          44 (30.99 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+AbsoluteOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          24 (30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):int (FullOpts)
          24 (30.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):int (FullOpts)
          32 (28.32 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector64`1[ubyte]):ubyte (FullOpts)
          24 (28.24 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector64`1[short]):ubyte (FullOpts)
          23 (27.06 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Sum[short](System.Runtime.Intrinsics.Vector128`1[short]):short (FullOpts)
         183 (26.52 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          87 (26.44 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          87 (26.44 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          86 (25.90 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          86 (25.90 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudePropagateNaNOperator`1[long]:Invoke(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          46 (24.73 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxNumberOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          14 (24.56 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector64`1[int]):ubyte (FullOpts)
          44 (23.40 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinNumberOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          69 (21.50 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          42 (20.90 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxPropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          69 (20.72 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          42 (20.69 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinPropagateNaNOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AndNot[double](System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AndNot[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AndNot[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AndNot[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AndNot[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.AndNot(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.AndNot(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.AndNot(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.AndNot(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
           8 (20.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.AndNot(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          81 (19.71 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
          66 (19.47 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMinMagnitudeOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          66 (18.80 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+IndexOfMaxMagnitudeOperator`1[double]:Invoke(byref,System.Runtime.Intrinsics.Vector512`1[double],byref,System.Runtime.Intrinsics.Vector512`1[double]) (FullOpts)
          33 (18.75 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          33 (18.75 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MinMagnitudeOperator`1[double]:Invoke(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[double](System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          12 (18.46 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[double](System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double],System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          12 (18.18 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          33 (18.03 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudeOperator`1[ubyte]:Invoke(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          33 (18.03 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives+MaxMagnitudePropagateNaNOperator`1[ubyte]:Invoke(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
         101 (17.50 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):long (FullOpts)
           8 (17.39 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AnyWhereAllBitsSet(System.Runtime.Intrinsics.Vector64`1[long]):ubyte (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Abs[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]]):System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Abs[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]]):System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Ceiling[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]]):System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Ceiling[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]]):System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Floor[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]]):System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Floor[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]]):System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Abs[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]]):System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Abs[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]]):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Ceiling[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]]):System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Ceiling[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]]):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Floor[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]]):System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Floor[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]]):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:GetElement[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]],int):System.Numerics.Vector`1[float] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:WithElement[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]],int,System.Numerics.Vector`1[float]):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.GetElement(System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]],int):System.Numerics.Vector`1[float] (FullOpts)
           3 (15.79 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.WithElement(System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]],int,System.Numerics.Vector`1[float]):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts)

Top method improvements (percentages):
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[double](System.Runtime.Intrinsics.Vector128`1[double],int):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[int](System.Runtime.Intrinsics.Vector128`1[int],int):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[long](System.Runtime.Intrinsics.Vector128`1[long],int):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[short](System.Runtime.Intrinsics.Vector128`1[short],int):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts) (1 base, 0 diff methods)
         -45 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],int):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
        -255 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -89 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[double](System.Runtime.Intrinsics.Vector128`1[double],int):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[int](System.Runtime.Intrinsics.Vector128`1[int],int):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts) (1 base, 0 diff methods)
         -89 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[long](System.Runtime.Intrinsics.Vector128`1[long],int):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[short](System.Runtime.Intrinsics.Vector128`1[short],int):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts) (1 base, 0 diff methods)
         -45 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],int):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
        -255 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[double](System.Runtime.Intrinsics.Vector128`1[double],int):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[int](System.Runtime.Intrinsics.Vector128`1[int],int):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[long](System.Runtime.Intrinsics.Vector128`1[long],int):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts) (1 base, 0 diff methods)
         -25 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[short](System.Runtime.Intrinsics.Vector128`1[short],int):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts) (1 base, 0 diff methods)
         -45 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],int):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector128`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector128`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
         -48 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ShiftRightLogical[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[double](System.Runtime.Intrinsics.Vector256`1[double],int):System.Runtime.Intrinsics.Vector256`1[double] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[int](System.Runtime.Intrinsics.Vector256`1[int],int):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[long](System.Runtime.Intrinsics.Vector256`1[long],int):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[short](System.Runtime.Intrinsics.Vector256`1[short],int):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts) (1 base, 0 diff methods)
         -81 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],int):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
        -523 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -191 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[double](System.Runtime.Intrinsics.Vector256`1[double],int):System.Runtime.Intrinsics.Vector256`1[double] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[int](System.Runtime.Intrinsics.Vector256`1[int],int):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts) (1 base, 0 diff methods)
        -191 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[long](System.Runtime.Intrinsics.Vector256`1[long],int):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[short](System.Runtime.Intrinsics.Vector256`1[short],int):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts) (1 base, 0 diff methods)
         -81 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],int):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
        -523 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[double](System.Runtime.Intrinsics.Vector256`1[double],int):System.Runtime.Intrinsics.Vector256`1[double] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[int](System.Runtime.Intrinsics.Vector256`1[int],int):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[long](System.Runtime.Intrinsics.Vector256`1[long],int):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts) (1 base, 0 diff methods)
         -28 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[short](System.Runtime.Intrinsics.Vector256`1[short],int):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts) (1 base, 0 diff methods)
         -81 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],int):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector256`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector256`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
         -51 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ShiftRightLogical[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[double](System.Runtime.Intrinsics.Vector512`1[double],int):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[int](System.Runtime.Intrinsics.Vector512`1[int],int):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[long](System.Runtime.Intrinsics.Vector512`1[long],int):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[short](System.Runtime.Intrinsics.Vector512`1[short],int):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts) (1 base, 0 diff methods)
        -117 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
       -1265 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -433 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[double](System.Runtime.Intrinsics.Vector512`1[double],int):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[int](System.Runtime.Intrinsics.Vector512`1[int],int):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts) (1 base, 0 diff methods)
        -433 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[long](System.Runtime.Intrinsics.Vector512`1[long],int):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[short](System.Runtime.Intrinsics.Vector512`1[short],int):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts) (1 base, 0 diff methods)
        -117 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
       -1265 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[double](System.Runtime.Intrinsics.Vector512`1[double],int):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[int](System.Runtime.Intrinsics.Vector512`1[int],int):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[long](System.Runtime.Intrinsics.Vector512`1[long],int):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts) (1 base, 0 diff methods)
         -43 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[short](System.Runtime.Intrinsics.Vector512`1[short],int):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts) (1 base, 0 diff methods)
        -117 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],int):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
         -76 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ShiftRightLogical[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -47 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[double](System.Runtime.Intrinsics.Vector64`1[double],int):System.Runtime.Intrinsics.Vector64`1[double] (FullOpts) (1 base, 0 diff methods)
         -59 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[int](System.Runtime.Intrinsics.Vector64`1[int],int):System.Runtime.Intrinsics.Vector64`1[int] (FullOpts) (1 base, 0 diff methods)
         -47 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[long](System.Runtime.Intrinsics.Vector64`1[long],int):System.Runtime.Intrinsics.Vector64`1[long] (FullOpts) (1 base, 0 diff methods)
         -93 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[short](System.Runtime.Intrinsics.Vector64`1[short],int):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon],int):System.Runtime.Intrinsics.Vector64`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
        -133 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftLeft[ubyte](System.Runtime.Intrinsics.Vector64`1[ubyte],int):System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -47 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[double](System.Runtime.Intrinsics.Vector64`1[double],int):System.Runtime.Intrinsics.Vector64`1[double] (FullOpts) (1 base, 0 diff methods)
         -59 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[int](System.Runtime.Intrinsics.Vector64`1[int],int):System.Runtime.Intrinsics.Vector64`1[int] (FullOpts) (1 base, 0 diff methods)
         -47 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[long](System.Runtime.Intrinsics.Vector64`1[long],int):System.Runtime.Intrinsics.Vector64`1[long] (FullOpts) (1 base, 0 diff methods)
         -93 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[short](System.Runtime.Intrinsics.Vector64`1[short],int):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon],int):System.Runtime.Intrinsics.Vector64`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
        -133 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightArithmetic[ubyte](System.Runtime.Intrinsics.Vector64`1[ubyte],int):System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts) (1 base, 0 diff methods)
         -47 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[double](System.Runtime.Intrinsics.Vector64`1[double],int):System.Runtime.Intrinsics.Vector64`1[double] (FullOpts) (1 base, 0 diff methods)
         -59 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[int](System.Runtime.Intrinsics.Vector64`1[int],int):System.Runtime.Intrinsics.Vector64`1[int] (FullOpts) (1 base, 0 diff methods)
         -47 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[long](System.Runtime.Intrinsics.Vector64`1[long],int):System.Runtime.Intrinsics.Vector64`1[long] (FullOpts) (1 base, 0 diff methods)
         -89 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[short](System.Runtime.Intrinsics.Vector64`1[short],int):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon],int):System.Runtime.Intrinsics.Vector64`1[System.__Canon] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]],int):System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]] (FullOpts) (1 base, 0 diff methods)
         -19 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]],int):System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]] (FullOpts) (1 base, 0 diff methods)
        -133 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ShiftRightLogical[ubyte](System.Runtime.Intrinsics.Vector64`1[ubyte],int):System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts) (1 base, 0 diff methods)
        -202 (-91.82 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:op_UnaryNegation(System.Runtime.Intrinsics.Vector128`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
        -186 (-90.73 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.TryCopyTo(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Span`1[System.__Canon]):ubyte (FullOpts)
        -173 (-88.72 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.CopyTo(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Span`1[System.__Canon]) (FullOpts)
        -173 (-88.72 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Create(System.ReadOnlySpan`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)

663 total methods with Code Size differences (442 improved, 221 regressed), 245326 unchanged.

--------------------------------------------------------------------------------


MihuBot commented Jun 6, 2024

Top method regressions

671 (126.37 % of base) - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte]
 ; Assembly listing for method System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX - Unix
 ; FullOpts code
 ; optimized code
 ; rbp based frame
 ; partially interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 17 single block inlinees; 9 inlinees without PGO data
+; 0 inlinees with PGO data; 34 single block inlinees; 27 inlinees without PGO data
 ; Final local variable assignments
 ;
-;  V00 RetBuf       [V00,T09] (  4,  4   )   byref  ->  rbx         single-def
+;  V00 RetBuf       [V00,T01] (  5,  5   )   byref  ->  rdi         single-def
 ;* V01 arg0         [V01    ] (  0,  0   )  struct (64) zero-ref    single-def <System.Runtime.Intrinsics.Vector512`1[ubyte]>
-;  V02 arg1         [V02,T10] (  3,  3   )   ubyte  ->  rsi         single-def
-;  V03 OutArgs      [V03    ] (  1,  1   )  struct (64) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace"
-;  V04 tmp1         [V04,T18] (  2,  4   )  simd32  ->  [rbp-0x50]  spill-single-def "impAppendStmt"
-;  V05 tmp2         [V05,T19] (  2,  4   )  simd32  ->  [rbp-0x70]  do-not-enreg[HS] hidden-struct-arg "spilled call-like call argument"
-;  V06 tmp3         [V06,T15] (  3,  6   )  simd32  ->  [rbp-0x90]  spill-single-def "Inlining Arg" <System.Runtime.Intrinsics.Vector256`1[ubyte]>
-;  V07 tmp4         [V07,T20] (  2,  4   )  simd16  ->  [rbp-0xA0]  spill-single-def "impAppendStmt"
+;  V02 arg1         [V02,T02] (  3,  3   )   ubyte  ->  rsi         single-def
+;# V03 OutArgs      [V03    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace"
+;  V04 tmp1         [V04,T25] (  2,  4   )  simd32  ->  mm0         "impAppendStmt"
+;  V05 tmp2         [V05,T26] (  2,  4   )  simd32  ->  mm1         "spilled call-like call argument"
+;  V06 tmp3         [V06,T19] (  3,  6   )  simd32  ->  mm0         "Inlining Arg" <System.Runtime.Intrinsics.Vector256`1[ubyte]>
+;  V07 tmp4         [V07,T27] (  2,  4   )  simd16  ->  mm1         "impAppendStmt"
 ;* V08 tmp5         [V08    ] (  0,  0   )  simd16  ->  zero-ref    "spilled call-like call argument"
-;  V09 tmp6         [V09,T16] (  3,  6   )  simd16  ->  [rbp-0xB0]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;  V09 tmp6         [V09,T20] (  3,  6   )  simd16  ->  [rbp-0x10]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
 ;* V10 tmp7         [V10    ] (  0,  0   )  struct ( 8) zero-ref    "impAppendStmt" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
 ;* V11 tmp8         [V11    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V12 tmp9         [V12    ] (  2,  5   )  struct ( 8) [rbp-0xB8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V13 tmp10        [V13,T01] (  5, 17   )     int  ->  r15         "Inline stloc first use temp"
-;  V14 tmp11        [V14    ] (  2, 10   )  struct ( 8) [rbp-0xC0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V12 tmp9         [V12    ] (  9,  9   )  struct ( 8) [rbp-0x18]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V13 tmp10        [V13,T11] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V14 tmp11        [V14    ] (  9, 18   )  struct ( 8) [rbp-0x20]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
 ;* V15 tmp12        [V15    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;  V16 tmp13        [V16    ] (  2,  5   )  struct ( 8) [rbp-0xC8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V17 tmp14        [V17,T02] (  5, 17   )     int  ->  r13         "Inline stloc first use temp"
-;  V18 tmp15        [V18    ] (  2, 10   )  struct ( 8) [rbp-0xD0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;* V19 tmp16        [V19    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;  V20 tmp17        [V20,T21] (  3,  3   )  simd16  ->  [rbp-0xE0]  do-not-enreg[SF] ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
-;  V21 tmp18        [V21,T17] (  3,  6   )  simd16  ->  [rbp-0xF0]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
-;* V22 tmp19        [V22    ] (  0,  0   )  struct ( 8) zero-ref    "impAppendStmt" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;* V23 tmp20        [V23    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V24 tmp21        [V24    ] (  2,  5   )  struct ( 8) [rbp-0xF8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V25 tmp22        [V25,T03] (  5, 17   )     int  ->  r15         "Inline stloc first use temp"
-;  V26 tmp23        [V26    ] (  2, 10   )  struct ( 8) [rbp-0x100]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;* V27 tmp24        [V27    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;  V28 tmp25        [V28    ] (  2,  5   )  struct ( 8) [rbp-0x108]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V29 tmp26        [V29,T04] (  5, 17   )     int  ->  r13         "Inline stloc first use temp"
-;  V30 tmp27        [V30    ] (  2, 10   )  struct ( 8) [rbp-0x110]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V16 tmp13        [V16    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V17 tmp14        [V17    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V18 tmp15        [V18    ] (  9,  9   )  struct ( 8) [rbp-0x28]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V19 tmp16        [V19,T12] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V20 tmp17        [V20    ] (  9, 18   )  struct ( 8) [rbp-0x30]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V21 tmp18        [V21    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;* V22 tmp19        [V22    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V23 tmp20        [V23    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V24 tmp21        [V24,T29] (  3,  3   )  simd16  ->  [rbp-0x40]  do-not-enreg[SF] ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;  V25 tmp22        [V25,T21] (  3,  6   )  simd16  ->  [rbp-0x50]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;* V26 tmp23        [V26    ] (  0,  0   )  struct ( 8) zero-ref    "impAppendStmt" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V27 tmp24        [V27    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V28 tmp25        [V28    ] (  9,  9   )  struct ( 8) [rbp-0x58]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V29 tmp26        [V29,T13] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V30 tmp27        [V30    ] (  9, 18   )  struct ( 8) [rbp-0x60]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
 ;* V31 tmp28        [V31    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;  V32 tmp29        [V32,T22] (  3,  3   )  simd16  ->  [rbp-0x120]  do-not-enreg[SF] ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
-;* V33 tmp30        [V33    ] (  0,  0   )  simd32  ->  zero-ref    "Inline return value spill temp" <System.Runtime.Intrinsics.Vector256`1[ubyte]>
-;  V34 tmp31        [V34,T23] (  1,  1   )  simd32  ->  [rbp+0x10]  single-def "field V01._lower (fldOffset=0x0)" P-INDEP
-;  V35 tmp32        [V35,T24] (  1,  1   )  simd32  ->  [rbp+0x30]  single-def "field V01._upper (fldOffset=0x20)" P-INDEP
-;  V36 tmp33        [V36,T11] (  2,  2   )    long  ->  r15         "field V10._00 (fldOffset=0x0)" P-INDEP
-;  V37 tmp34        [V37,T12] (  2,  2   )    long  ->  rdi         "field V11._00 (fldOffset=0x0)" P-INDEP
-;  V38 tmp35        [V38    ] (  2,  5   )    long  ->  [rbp-0xB8]  do-not-enreg[X] addr-exposed "field V12._00 (fldOffset=0x0)" P-DEP
-;  V39 tmp36        [V39    ] (  2,  9   )    long  ->  [rbp-0xC0]  do-not-enreg[X] addr-exposed "field V14._00 (fldOffset=0x0)" P-DEP
-;  V40 tmp37        [V40    ] (  2,  5   )    long  ->  [rbp-0xC8]  do-not-enreg[X] addr-exposed "field V16._00 (fldOffset=0x0)" P-DEP
-;  V41 tmp38        [V41    ] (  2,  9   )    long  ->  [rbp-0xD0]  do-not-enreg[X] addr-exposed "field V18._00 (fldOffset=0x0)" P-DEP
-;  V42 tmp39        [V42,T13] (  2,  2   )    long  ->  r15         "field V22._00 (fldOffset=0x0)" P-INDEP
-;  V43 tmp40        [V43,T14] (  2,  2   )    long  ->  rsi         "field V23._00 (fldOffset=0x0)" P-INDEP
-;  V44 tmp41        [V44    ] (  2,  5   )    long  ->  [rbp-0xF8]  do-not-enreg[X] addr-exposed "field V24._00 (fldOffset=0x0)" P-DEP
-;  V45 tmp42        [V45    ] (  2,  9   )    long  ->  [rbp-0x100]  do-not-enreg[X] addr-exposed "field V26._00 (fldOffset=0x0)" P-DEP
-;  V46 tmp43        [V46    ] (  2,  5   )    long  ->  [rbp-0x108]  do-not-enreg[X] addr-exposed "field V28._00 (fldOffset=0x0)" P-DEP
-;  V47 tmp44        [V47    ] (  2,  9   )    long  ->  [rbp-0x110]  do-not-enreg[X] addr-exposed "field V30._00 (fldOffset=0x0)" P-DEP
-;  V48 cse0         [V48,T00] (  6, 18   )     int  ->  r14         hoist "CSE #01: aggressive"
-;  V49 cse1         [V49,T05] (  3, 12   )    long  ->  r13         "CSE #02: aggressive"
-;  V50 cse2         [V50,T06] (  3, 12   )    long  ->  r12         "CSE #03: aggressive"
-;  V51 cse3         [V51,T07] (  3, 12   )    long  ->  r13         "CSE #04: aggressive"
-;  V52 cse4         [V52,T08] (  3, 12   )    long  ->  r12         "CSE #05: aggressive"
+;* V32 tmp29        [V32    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V33 tmp30        [V33    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V34 tmp31        [V34    ] (  9,  9   )  struct ( 8) [rbp-0x68]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V35 tmp32        [V35,T14] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V36 tmp33        [V36    ] (  9, 18   )  struct ( 8) [rbp-0x70]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V37 tmp34        [V37    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;* V38 tmp35        [V38    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V39 tmp36        [V39    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V40 tmp37        [V40,T30] (  3,  3   )  simd16  ->  [rbp-0x80]  do-not-enreg[SF] ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;* V41 tmp38        [V41    ] (  0,  0   )  simd32  ->  zero-ref    "Inline return value spill temp" <System.Runtime.Intrinsics.Vector256`1[ubyte]>
+;  V42 tmp39        [V42,T22] (  3,  6   )  simd32  ->  mm1         "Inlining Arg" <System.Runtime.Intrinsics.Vector256`1[ubyte]>
+;  V43 tmp40        [V43,T28] (  2,  4   )  simd16  ->  mm2         "impAppendStmt"
+;* V44 tmp41        [V44    ] (  0,  0   )  simd16  ->  zero-ref    "spilled call-like call argument"
+;  V45 tmp42        [V45,T23] (  3,  6   )  simd16  ->  [rbp-0x90]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;* V46 tmp43        [V46    ] (  0,  0   )  struct ( 8) zero-ref    "impAppendStmt" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V47 tmp44        [V47    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V48 tmp45        [V48    ] (  9,  9   )  struct ( 8) [rbp-0x98]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V49 tmp46        [V49,T15] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V50 tmp47        [V50    ] (  9, 18   )  struct ( 8) [rbp-0xA0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V51 tmp48        [V51    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;* V52 tmp49        [V52    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V53 tmp50        [V53    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V54 tmp51        [V54    ] (  9,  9   )  struct ( 8) [rbp-0xA8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V55 tmp52        [V55,T16] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V56 tmp53        [V56    ] (  9, 18   )  struct ( 8) [rbp-0xB0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V57 tmp54        [V57    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;* V58 tmp55        [V58    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V59 tmp56        [V59    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V60 tmp57        [V60,T31] (  3,  3   )  simd16  ->  [rbp-0xC0]  do-not-enreg[SF] ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;  V61 tmp58        [V61,T24] (  3,  6   )  simd16  ->  [rbp-0xD0]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;* V62 tmp59        [V62    ] (  0,  0   )  struct ( 8) zero-ref    "impAppendStmt" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V63 tmp60        [V63    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V64 tmp61        [V64    ] (  9,  9   )  struct ( 8) [rbp-0xD8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V65 tmp62        [V65,T17] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V66 tmp63        [V66    ] (  9, 18   )  struct ( 8) [rbp-0xE0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V67 tmp64        [V67    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;* V68 tmp65        [V68    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V69 tmp66        [V69    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V70 tmp67        [V70    ] (  9,  9   )  struct ( 8) [rbp-0xE8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V71 tmp68        [V71,T18] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V72 tmp69        [V72    ] (  9, 18   )  struct ( 8) [rbp-0xF0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V73 tmp70        [V73    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;* V74 tmp71        [V74    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V75 tmp72        [V75    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V76 tmp73        [V76,T32] (  3,  3   )  simd16  ->  [rbp-0x100]  do-not-enreg[SF] ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;* V77 tmp74        [V77    ] (  0,  0   )  simd32  ->  zero-ref    "Inline return value spill temp" <System.Runtime.Intrinsics.Vector256`1[ubyte]>
+;* V78 tmp75        [V78    ] (  0,  0   )  struct (64) zero-ref    ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector512`1[ubyte]>
+;  V79 tmp76        [V79,T35] (  1,  1   )  simd32  ->  [rbp+0x10]  single-def "field V01._lower (fldOffset=0x0)" P-INDEP
+;  V80 tmp77        [V80,T36] (  1,  1   )  simd32  ->  [rbp+0x30]  single-def "field V01._upper (fldOffset=0x20)" P-INDEP
+;  V81 tmp78        [V81,T03] (  2,  2   )    long  ->  rsi         "field V10._00 (fldOffset=0x0)" P-INDEP
+;  V82 tmp79        [V82,T04] (  2,  2   )    long  ->  rax         "field V11._00 (fldOffset=0x0)" P-INDEP
+;  V83 tmp80        [V83    ] (  9,  9   )    long  ->  [rbp-0x18]  do-not-enreg[X] addr-exposed "field V12._00 (fldOffset=0x0)" P-DEP
+;  V84 tmp81        [V84    ] (  9, 17   )    long  ->  [rbp-0x20]  do-not-enreg[X] addr-exposed "field V14._00 (fldOffset=0x0)" P-DEP
+;  V85 tmp82        [V85    ] (  9,  9   )    long  ->  [rbp-0x28]  do-not-enreg[X] addr-exposed "field V18._00 (fldOffset=0x0)" P-DEP
+;  V86 tmp83        [V86    ] (  9, 17   )    long  ->  [rbp-0x30]  do-not-enreg[X] addr-exposed "field V20._00 (fldOffset=0x0)" P-DEP
+;  V87 tmp84        [V87,T05] (  2,  2   )    long  ->  rsi         "field V26._00 (fldOffset=0x0)" P-INDEP
+;  V88 tmp85        [V88,T06] (  2,  2   )    long  ->  rax         "field V27._00 (fldOffset=0x0)" P-INDEP
+;  V89 tmp86        [V89    ] (  9,  9   )    long  ->  [rbp-0x58]  do-not-enreg[X] addr-exposed "field V28._00 (fldOffset=0x0)" P-DEP
+;  V90 tmp87        [V90    ] (  9, 17   )    long  ->  [rbp-0x60]  do-not-enreg[X] addr-exposed "field V30._00 (fldOffset=0x0)" P-DEP
+;  V91 tmp88        [V91    ] (  9,  9   )    long  ->  [rbp-0x68]  do-not-enreg[X] addr-exposed "field V34._00 (fldOffset=0x0)" P-DEP
+;  V92 tmp89        [V92    ] (  9, 17   )    long  ->  [rbp-0x70]  do-not-enreg[X] addr-exposed "field V36._00 (fldOffset=0x0)" P-DEP
+;  V93 tmp90        [V93,T07] (  2,  2   )    long  ->  rsi         "field V46._00 (fldOffset=0x0)" P-INDEP
+;  V94 tmp91        [V94,T08] (  2,  2   )    long  ->  rax         "field V47._00 (fldOffset=0x0)" P-INDEP
+;  V95 tmp92        [V95    ] (  9,  9   )    long  ->  [rbp-0x98]  do-not-enreg[X] addr-exposed "field V48._00 (fldOffset=0x0)" P-DEP
+;  V96 tmp93        [V96    ] (  9, 17   )    long  ->  [rbp-0xA0]  do-not-enreg[X] addr-exposed "field V50._00 (fldOffset=0x0)" P-DEP
+;  V97 tmp94        [V97    ] (  9,  9   )    long  ->  [rbp-0xA8]  do-not-enreg[X] addr-exposed "field V54._00 (fldOffset=0x0)" P-DEP
+;  V98 tmp95        [V98    ] (  9, 17   )    long  ->  [rbp-0xB0]  do-not-enreg[X] addr-exposed "field V56._00 (fldOffset=0x0)" P-DEP
+;  V99 tmp96        [V99,T09] (  2,  2   )    long  ->  rsi         "field V62._00 (fldOffset=0x0)" P-INDEP
+;  V100 tmp97       [V100,T10] (  2,  2   )    long  ->  rax         "field V63._00 (fldOffset=0x0)" P-INDEP
+;  V101 tmp98       [V101    ] (  9,  9   )    long  ->  [rbp-0xD8]  do-not-enreg[X] addr-exposed "field V64._00 (fldOffset=0x0)" P-DEP
+;  V102 tmp99       [V102    ] (  9, 17   )    long  ->  [rbp-0xE0]  do-not-enreg[X] addr-exposed "field V66._00 (fldOffset=0x0)" P-DEP
+;  V103 tmp100      [V103    ] (  9,  9   )    long  ->  [rbp-0xE8]  do-not-enreg[X] addr-exposed "field V70._00 (fldOffset=0x0)" P-DEP
+;  V104 tmp101      [V104    ] (  9, 17   )    long  ->  [rbp-0xF0]  do-not-enreg[X] addr-exposed "field V72._00 (fldOffset=0x0)" P-DEP
+;  V105 tmp102      [V105,T33] (  2,  2   )  simd32  ->  mm0         "field V78._lower (fldOffset=0x0)" P-INDEP
+;  V106 tmp103      [V106,T34] (  2,  2   )  simd32  ->  mm1         "field V78._upper (fldOffset=0x20)" P-INDEP
+;  V107 cse0        [V107,T00] ( 65, 65   )     int  ->  rcx         "CSE #01: aggressive"
 ;
-; Lcl frame size = 312
+; Lcl frame size = 256
 
 G_M7625_IG01:
        push     rbp
-       push     r15
-       push     r14
-       push     r13
-       push     r12
-       push     rbx
-       sub      rsp, 312
-       lea      rbp, [rsp+0x160]
-       mov      rbx, rdi
-						;; size=28 bbWeight=1 PerfScore 7.00
+       sub      rsp, 256
+       lea      rbp, [rsp+0x100]
+						;; size=16 bbWeight=1 PerfScore 1.75
 G_M7625_IG02:
        vmovups  ymm0, ymmword ptr [rbp+0x10]
-       vmovups  ymmword ptr [rbp-0x90], ymm0
        vmovaps  ymm1, ymm0
-       vmovaps  xmmword ptr [rbp-0xB0], xmm1
-       mov      rdi, qword ptr [rbp-0xB0]
-       mov      qword ptr [rbp-0xC0], rdi
-       xor      r15d, r15d
-       movzx    r14, sil
-						;; size=46 bbWeight=1 PerfScore 8.75
+       vmovaps  xmmword ptr [rbp-0x10], xmm1
+       mov      rax, qword ptr [rbp-0x10]
+       mov      qword ptr [rbp-0x20], rax
+       movzx    rax, byte  ptr [rbp-0x20]
+       movzx    rcx, sil
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x18], al
+       movzx    rax, byte  ptr [rbp-0x1F]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x17], al
+       movzx    rax, byte  ptr [rbp-0x1E]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x16], al
+       movzx    rax, byte  ptr [rbp-0x1D]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x15], al
+       movzx    rax, byte  ptr [rbp-0x1C]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x14], al
+       movzx    rax, byte  ptr [rbp-0x1B]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x13], al
+       movzx    rax, byte  ptr [rbp-0x1A]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x12], al
+       movzx    rax, byte  ptr [rbp-0x19]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x11], al
+       mov      rsi, qword ptr [rbp-0x18]
+       mov      rax, qword ptr [rbp-0x08]
+       mov      qword ptr [rbp-0x30], rax
+       movzx    rax, byte  ptr [rbp-0x30]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x28], al
+       movzx    rax, byte  ptr [rbp-0x2F]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x27], al
+       movzx    rax, byte  ptr [rbp-0x2E]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x26], al
+       movzx    rax, byte  ptr [rbp-0x2D]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x25], al
+       movzx    rax, byte  ptr [rbp-0x2C]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x24], al
+       movzx    rax, byte  ptr [rbp-0x2B]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x23], al
+       movzx    rax, byte  ptr [rbp-0x2A]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x22], al
+						;; size=203 bbWeight=1 PerfScore 419.25
 G_M7625_IG03:
-       lea      rdi, [rbp-0xC0]
-       movsxd   r13, r15d
-       movzx    rdi, byte  ptr [rdi+r13]
-       mov      esi, r14d
-       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       lea      rdi, [rbp-0xB8]
-       mov      byte  ptr [rdi+r13], al
-       inc      r15d
-       cmp      r15d, 8
-       jl       SHORT G_M7625_IG03
-						;; size=50 bbWeight=4 PerfScore 37.00
+       movzx    rax, byte  ptr [rbp-0x29]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x21], al
+       mov      rax, qword ptr [rbp-0x28]
+       mov      qword ptr [rbp-0x40], rsi
+       mov      qword ptr [rbp-0x38], rax
+       vmovaps  xmm1, xmmword ptr [rbp-0x40]
+       vextractf128 xmm0, ymm0, 1
+       vmovaps  xmmword ptr [rbp-0x50], xmm0
+       mov      rax, qword ptr [rbp-0x50]
+       mov      qword ptr [rbp-0x60], rax
+       movzx    rax, byte  ptr [rbp-0x60]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x58], al
+       movzx    rax, byte  ptr [rbp-0x5F]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x57], al
+       movzx    rax, byte  ptr [rbp-0x5E]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x56], al
+       movzx    rax, byte  ptr [rbp-0x5D]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x55], al
+       movzx    rax, byte  ptr [rbp-0x5C]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x54], al
+       movzx    rax, byte  ptr [rbp-0x5B]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x53], al
+       movzx    rax, byte  ptr [rbp-0x5A]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x52], al
+       movzx    rax, byte  ptr [rbp-0x59]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x51], al
+       mov      rsi, qword ptr [rbp-0x58]
+       mov      rax, qword ptr [rbp-0x48]
+       mov      qword ptr [rbp-0x70], rax
+       movzx    rax, byte  ptr [rbp-0x70]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x68], al
+       movzx    rax, byte  ptr [rbp-0x6F]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x67], al
+       movzx    rax, byte  ptr [rbp-0x6E]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x66], al
+       movzx    rax, byte  ptr [rbp-0x6D]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x65], al
+       movzx    rax, byte  ptr [rbp-0x6C]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x64], al
+       movzx    rax, byte  ptr [rbp-0x6B]
+       xor      edx, edx
+						;; size=208 bbWeight=1 PerfScore 396.75
 G_M7625_IG04:
-       mov      r15, qword ptr [rbp-0xB8]
-       mov      rdi, qword ptr [rbp-0xA8]
-       mov      qword ptr [rbp-0xD0], rdi
-       xor      r13d, r13d
-						;; size=24 bbWeight=1 PerfScore 3.25
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x63], al
+       movzx    rax, byte  ptr [rbp-0x6A]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x62], al
+       movzx    rax, byte  ptr [rbp-0x69]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x61], al
+       mov      rax, qword ptr [rbp-0x68]
+       mov      qword ptr [rbp-0x80], rsi
+       mov      qword ptr [rbp-0x78], rax
+       vinserti128 ymm0, ymm1, xmmword ptr [rbp-0x80], 1
+       vmovups  ymm1, ymmword ptr [rbp+0x30]
+       vmovaps  ymm2, ymm1
+       vmovaps  xmmword ptr [rbp-0x90], xmm2
+       mov      rax, qword ptr [rbp-0x90]
+       mov      qword ptr [rbp-0xA0], rax
+       movzx    rax, byte  ptr [rbp-0xA0]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x98], al
+       movzx    rax, byte  ptr [rbp-0x9F]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x97], al
+       movzx    rax, byte  ptr [rbp-0x9E]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x96], al
+       movzx    rax, byte  ptr [rbp-0x9D]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x95], al
+       movzx    rax, byte  ptr [rbp-0x9C]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x94], al
+       movzx    rax, byte  ptr [rbp-0x9B]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x93], al
+       movzx    rax, byte  ptr [rbp-0x9A]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x92], al
+       movzx    rax, byte  ptr [rbp-0x99]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0x91], al
+       mov      rsi, qword ptr [rbp-0x98]
+       mov      rax, qword ptr [rbp-0x88]
+       mov      qword ptr [rbp-0xB0], rax
+       movzx    rax, byte  ptr [rbp-0xB0]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xA8], al
+       movzx    rax, byte  ptr [rbp-0xAF]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xA7], al
+       movzx    rax, byte  ptr [rbp-0xAE]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xA6], al
+       movzx    rax, byte  ptr [rbp-0xAD]
+       xor      edx, edx
+       div      edx:eax, ecx
+						;; size=296 bbWeight=1 PerfScore 423.75
 G_M7625_IG05:
-       lea      rdi, [rbp-0xD0]
-       movsxd   r12, r13d
-       movzx    rdi, byte  ptr [rdi+r12]
-       mov      esi, r14d
-       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       lea      rdi, [rbp-0xC8]
-       mov      byte  ptr [rdi+r12], al
-       inc      r13d
-       cmp      r13d, 8
-       jl       SHORT G_M7625_IG05
-						;; size=50 bbWeight=4 PerfScore 37.00
+       mov      byte  ptr [rbp-0xA5], al
+       movzx    rax, byte  ptr [rbp-0xAC]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xA4], al
+       movzx    rax, byte  ptr [rbp-0xAB]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xA3], al
+       movzx    rax, byte  ptr [rbp-0xAA]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xA2], al
+       movzx    rax, byte  ptr [rbp-0xA9]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xA1], al
+       mov      rax, qword ptr [rbp-0xA8]
+       mov      qword ptr [rbp-0xC0], rsi
+       mov      qword ptr [rbp-0xB8], rax
+       vmovaps  xmm2, xmmword ptr [rbp-0xC0]
+       vextractf128 xmm1, ymm1, 1
+       vmovaps  xmmword ptr [rbp-0xD0], xmm1
+       mov      rax, qword ptr [rbp-0xD0]
+       mov      qword ptr [rbp-0xE0], rax
+       movzx    rax, byte  ptr [rbp-0xE0]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD8], al
+       movzx    rax, byte  ptr [rbp-0xDF]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD7], al
+       movzx    rax, byte  ptr [rbp-0xDE]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD6], al
+       movzx    rax, byte  ptr [rbp-0xDD]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD5], al
+       movzx    rax, byte  ptr [rbp-0xDC]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD4], al
+       movzx    rax, byte  ptr [rbp-0xDB]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD3], al
+       movzx    rax, byte  ptr [rbp-0xDA]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD2], al
+       movzx    rax, byte  ptr [rbp-0xD9]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xD1], al
+       mov      rsi, qword ptr [rbp-0xD8]
+       mov      rax, qword ptr [rbp-0xC8]
+       mov      qword ptr [rbp-0xF0], rax
+       movzx    rax, byte  ptr [rbp-0xF0]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE8], al
+       movzx    rax, byte  ptr [rbp-0xEF]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE7], al
+						;; size=322 bbWeight=1 PerfScore 396.50
 G_M7625_IG06:
-       mov      rdi, qword ptr [rbp-0xC8]
-       mov      qword ptr [rbp-0xE0], r15
-       mov      qword ptr [rbp-0xD8], rdi
-       vmovaps  xmm1, xmmword ptr [rbp-0xE0]
-       vmovaps  xmmword ptr [rbp-0xA0], xmm1
-       vmovups  ymm0, ymmword ptr [rbp-0x90]
-       vextractf128 xmm0, ymm0, 1
-       vmovaps  xmmword ptr [rbp-0xF0], xmm0
-       mov      rdi, qword ptr [rbp-0xF0]
-       mov      qword ptr [rbp-0x100], rdi
-       xor      r15d, r15d
-						;; size=76 bbWeight=1 PerfScore 16.25
+       movzx    rax, byte  ptr [rbp-0xEE]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE6], al
+       movzx    rax, byte  ptr [rbp-0xED]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE5], al
+       movzx    rax, byte  ptr [rbp-0xEC]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE4], al
+       movzx    rax, byte  ptr [rbp-0xEB]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE3], al
+       movzx    rax, byte  ptr [rbp-0xEA]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE2], al
+       movzx    rax, byte  ptr [rbp-0xE9]
+       xor      edx, edx
+       div      edx:eax, ecx
+       mov      byte  ptr [rbp-0xE1], al
+       mov      rax, qword ptr [rbp-0xE8]
+       mov      qword ptr [rbp-0x100], rsi
+       mov      qword ptr [rbp-0xF8], rax
+       vinserti128 ymm1, ymm2, xmmword ptr [rbp-0x100], 1
+       vmovups  ymmword ptr [rdi], ymm0
+       vmovups  ymmword ptr [rdi+0x20], ymm1
+       mov      rax, rdi
+						;; size=145 bbWeight=1 PerfScore 174.75
 G_M7625_IG07:
-       lea      rdi, [rbp-0x100]
-       movsxd   r13, r15d
-       movzx    rdi, byte  ptr [rdi+r13]
-       mov      esi, r14d
-       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       lea      rdi, [rbp-0xF8]
-       mov      byte  ptr [rdi+r13], al
-       inc      r15d
-       cmp      r15d, 8
-       jl       SHORT G_M7625_IG07
-						;; size=50 bbWeight=4 PerfScore 37.00
-G_M7625_IG08:
-       mov      r15, qword ptr [rbp-0xF8]
-       mov      rdi, qword ptr [rbp-0xE8]
-       mov      qword ptr [rbp-0x110], rdi
-       xor      r13d, r13d
-						;; size=24 bbWeight=1 PerfScore 3.25
-G_M7625_IG09:
-       lea      rdi, [rbp-0x110]
-       movsxd   r12, r13d
-       movzx    rdi, byte  ptr [rdi+r12]
-       mov      esi, r14d
-       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Divide(ubyte,ubyte):ubyte
-       lea      rsi, [rbp-0x108]
-       mov      byte  ptr [rsi+r12], al
-       inc      r13d
-       cmp      r13d, 8
-       jl       SHORT G_M7625_IG09
-						;; size=50 bbWeight=4 PerfScore 37.00
-G_M7625_IG10:
-       mov      rsi, qword ptr [rbp-0x108]
-       mov      qword ptr [rbp-0x120], r15
-       mov      qword ptr [rbp-0x118], rsi
-       vmovaps  xmm1, xmmword ptr [rbp-0xA0]
-       vinserti128 ymm0, ymm1, xmmword ptr [rbp-0x120], 1
-       vmovups  ymmword ptr [rbp-0x50], ymm0
-       vmovups  ymm1, ymmword ptr [rbp+0x30]
-       vmovups  ymmword ptr [rsp], ymm1
-       mov      esi, r14d
-       lea      rdi, [rbp-0x70]
-       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Vector256`1[ubyte]:op_Division(System.Runtime.Intrinsics.Vector256`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector256`1[ubyte]
-       call     [rax]System.Runtime.Intrinsics.Vector256`1[ubyte]:op_Division(System.Runtime.Intrinsics.Vector256`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector256`1[ubyte]
-       vmovups  ymm0, ymmword ptr [rbp-0x50]
-       vmovups  ymmword ptr [rsp], ymm0
-       vmovups  ymm0, ymmword ptr [rbp-0x70]
-       vmovups  ymmword ptr [rsp+0x20], ymm0
-       mov      rdi, rbx
-       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Vector512:Create[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte]
-       call     [rax]System.Runtime.Intrinsics.Vector512:Create[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte]
-       mov      rax, rbx
-						;; size=112 bbWeight=1 PerfScore 33.75
-G_M7625_IG11:
        vzeroupper 
-       add      rsp, 312
-       pop      rbx
-       pop      r12
-       pop      r13
-       pop      r14
-       pop      r15
+       add      rsp, 256
        pop      rbp
        ret      
-						;; size=21 bbWeight=1 PerfScore 5.25
+						;; size=12 bbWeight=1 PerfScore 2.75
 
-; Total bytes of code 531, prolog size 25, PerfScore 225.50, instruction count 109, allocated bytes for code 531 (MethodHash=ae93e236) for method System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
+; Total bytes of code 1202, prolog size 16, PerfScore 1815.50, instruction count 313, allocated bytes for code 1202 (MethodHash=ae93e236) for method System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)

Larger list of diffs: https://gist.github.com/MihuBot/28100e9970bd0c22ecc871216381a5d7

@MihuBot

MihuBot commented Jun 6, 2024

@tannergooding
