Skip to content

OptimizeFacesLRU

Chuck Walbourn edited this page Feb 5, 2018 · 9 revisions

Reorders faces to improve post-transform vertex cache reuse using the Forsyth algorithm.

HRESULT OptimizeFacesLRU(
   const uint16_t* indices, size_t nFaces,
   uint32_t* faceRemap,
   uint32_t lruCacheSize = OPTFACES_LRU_DEFAULT );

HRESULT OptimizeFacesLRU(
   const uint32_t* indices, size_t nFaces,
   uint32_t* faceRemap,
   uint32_t lruCacheSize = OPTFACES_LRU_DEFAULT );

HRESULT OptimizeFacesLRUEx(
   const uint16_t* indices, size_t nFaces,
   const uint32_t* attributes,
   uint32_t* faceRemap,
   uint32_t lruCacheSize = OPTFACES_LRU_DEFAULT );

HRESULT OptimizeFacesEx(
   const uint32_t* indices, size_t nFaces,
   const uint32_t* attributes,
   uint32_t* faceRemap,
   uint32_t lruCacheSize = OPTFACES_LRU_DEFAULT );

See also OptimizeFaces

Parameters

faceRemap is an array describing the reordering, and must have nFaces entries: oldLoc = faceRemap[newLoc]. See ReorderIB for details.

Note this matches the D3DXMesh::Optimize method and D3DXOptimizeFaces function definitions of the face remap array.

vertexCache is the size of the vertex cache to simulate for the optimization. This number should range from 1 to 64, and defaults to 32.

Remarks

The primary benefits of using Forsyth's algorithm are:

  1. Adjacency is not required
  2. If the simulated cache size is too large for the target hardware, performance drop-off tends to be more graceful than the Hoppe algorithm. Therefore a less conservative simulated cache size is required in practice.

Degenerate and 'unused' faces are skipped by the optimization, so they do not appear in the remap order.

There are models where the optimization fails to produce a faster ordering. For this reason, you should evaluate the original and optimized models with ComputeVertexCacheMissRate. If the new model is worse, use the original.

Example

auto mesh = std::unique_ptr<WaveFrontReader<uint16_t>>();

if ( FAILED( mesh->Load( L"test.obj" ) ) )
   // Error

size_t nFaces = mesh->indices.size() / 3;

std::unique_ptr<uint32_t[]> faceRemap( new uint32_t[ nFaces ] );
if ( FAILED( OptimizeFacesLRU( &mesh->indices.front(), nFaces,
   faceRemap.get() ) ) )
   // Error

std::unique_ptr<uint16_t[]> newIndices( new uint16_t[ nFaces * 3 ] );
if ( FAILED( ReorderIB( &mesh->indices.front(), nFaces, faceRemap.get(),
   newIndices.get() ) ) )
   // Error

Further Reading

Post Transform Cache

Forsyth, T.; "Linear-Speed Vertex Cache Optimisation" link

For Use

  • Universal Windows Platform apps
  • Windows desktop apps
  • Windows 11
  • Windows 10
  • Windows 8.1
  • Xbox One
  • Xbox Series X|S
  • Windows Subsystem for Linux

Architecture

  • x86
  • x64
  • ARM64

For Development

  • Visual Studio 2022
  • Visual Studio 2019 (16.11)
  • clang/LLVM v12 - v18
  • GCC 10.5, 11.4, 12.3
  • MinGW 12.2, 13.2
  • CMake 3.20

Related Projects

DirectX Tool Kit for DirectX 11

DirectX Tool Kit for DirectX 12

DirectXTex

DirectXMath

Tools

Test Suite

Content Exporter

DxCapsViewer

See also

DirectX Landing Page

Clone this wiki locally