Chapter 12 edits #78

dankamongmen · 2024-09-15T07:11:42Z

Very few edits here. You suddenly start using MiB in 12-5. I'm all for the MB/MiB distinction, but you should probably use it everywhere if you're going to use it here? I didn't convert MiB to MB (this is your call), but I did change several instances of Mib that clearly meant MiB to MiB.

12.1.2: i'd talk about GCC Function Multiversioning (https://gcc.gnu.org/wiki/FunctionMultiVersioning)
12.1.3 p208: avoiding FMA might improve performance, but it might hurt accuracy (FMA uses an internal buffer to hold the first result, avoiding the loss of intermediate precision)

dendibakh · 2024-09-26T15:31:31Z

Very few edits here. You suddenly start using MiB in 12-5. I'm all for the MB/MiB distinction, but you should probably use it everywhere if you're going to use it here? I didn't convert MiB to MB (this is your call), but I did change several instances of Mib that clearly meant MiB to MiB.

Thanks, Nick. I didn't pay too much attention when I was writing... I will clean up across the book.

12.1.2: i'd talk about GCC Function Multiversioning (https://gcc.gnu.org/wiki/FunctionMultiVersioning)

This is GCC- and x86-specific, and I haven't seen it used in any production software.

12.1.3 p208: avoiding FMA might improve performance, but it might hurt accuracy (FMA uses an internal buffer to hold the first result, avoiding the loss of intermediate precision)

I appreciate that you've mentioned it. It's surprising how many software vendors care about bit-exact results, so FMA for them is a no-go, even though technically, yes, it will give them more accurate FP calculations.

dendibakh · 2024-09-26T15:39:47Z

chapters/12-Other-Tuning-Areas/12-1 CPU-Specific Optimizations.md

-* dotprod: support for dot product instructions for accelerating machine learning workloads.
-* sve: enables scalable vector length instructions.
-* sme: Scalable Matrix Extension for accelerating matrix multiplication.
+* Advanced SIMD: also known as NEON, provides arithmetic SIMD instructions.


ARM likes lowercase names, I've seen it in many places. Even disassembled code is shown in lowercase. But now I checked their official website, at it all capitalized. And also it will look consistent with everything else. So I guess I'll leave capitalized. Fine.

dankamongmen added 3 commits September 15, 2024 01:37

12-1: capitalize ARM extensions, number agreement, Alder Lake

0fcdb2f

11-2: small changes

82b2b31

12-7: Mib -> MiB

3a08953

dendibakh approved these changes Sep 26, 2024

View reviewed changes

dendibakh merged commit bbc4c1c into dendibakh:main Sep 26, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chapter 12 edits #78

Chapter 12 edits #78

dankamongmen commented Sep 15, 2024 •

edited

Loading

dendibakh commented Sep 26, 2024

dendibakh Sep 26, 2024

Chapter 12 edits #78

Chapter 12 edits #78

Conversation

dankamongmen commented Sep 15, 2024 • edited Loading

dendibakh commented Sep 26, 2024

dendibakh Sep 26, 2024

Choose a reason for hiding this comment

dankamongmen commented Sep 15, 2024 •

edited

Loading