Computer Architecture

University of British Columbia

Isabelle Andre

Computer Architecture

A series of labs on relevant and state of the art computer architecture topics using C++ and the ChampSim Simulator

Table of Content

Lab 1 Instrumentation, Program Analysis, and Modeling

This project discusses program analysis using Pintool and writing efficient sorting algorithms. A Pintool was designed to count the number of instructions for various sorting algortihms such as Bubble sort. The number of memory and non-memory instructions were counted as well as the total execution time required to execute the algorithm. Another sorting algorithm was implemented to execute at 25x lower total instructions than Bubble Sort.

Lab 2 High-Performing Cache Replacement Policies

This assignment consisted in implementing high-performance and industry-grade cache-replacement policies for the shared Last Level Cache (LLC). The ChampSim simulator is used to implement, execute, and test caching policies. More specifically, we focus on the implementation and surface-level performance analysis of 4 different caching policies: LIP, BIP, DIP, and pLRU. The performance of these algorithms are showcased using spec2006 integer benchmarks. Finally, we compare the performance of these caching policies using cumulative IPC, hit rate, miss latency, and their IPC geomean, as compared with LRU.

Lab 3 Architectural Simulator and Code Optimization

This assignment consisted in understanding the implementation of architectural simulators and how to generate traces for the simulator from applications. A naive matrix multiplication function was provided to be optimized in speedup and memory. In Part A, we explore the simulator’s pipelined architecture and implement a non-pipelined core by modifying the current out-of-order core implementation of ChampSim. In Part B and this report, we optimize the basic matrix multiplication implementation by achieving 30% speed-up using loop reordering and matrix transposition, and reducing the L1D cache miss-rate by 8x by using blocked matrix multiplication.

Lab 4 Branch Predictor Implementation

This assignment consisted in understanding the implementation of branch prediction policies using ChampSim. Two branch predictors were implemented. A 2-bit correlated branch predictor with a single bit of history was first created, followed by a hashed-gselect predictor with 5 bits of branch history. The hashing employs a XOR function to index the table. The performance of both branch predictors were compared against the hashed perceptron predictor across 5 different spec2006 workloads using geomean.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Lab 1 - Pintool		Lab 1 - Pintool
Lab 2 - High-Performance Cache Replacement Policies		Lab 2 - High-Performance Cache Replacement Policies
Lab 3 - Deep-Diving into the Architectural Simulator and Code Optomization		Lab 3 - Deep-Diving into the Architectural Simulator and Code Optomization
Lab 4 - Branch Predictor implementations		Lab 4 - Branch Predictor implementations
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computer Architecture

Table of Content

Lab 1 Instrumentation, Program Analysis, and Modeling

Lab 2 High-Performing Cache Replacement Policies

Lab 3 Architectural Simulator and Code Optimization

Lab 4 Branch Predictor Implementation

About

Releases

Packages

Languages

Abeilles14/Computer-Architecture

Folders and files

Latest commit

History

Repository files navigation

Computer Architecture

Table of Content

Lab 1 Instrumentation, Program Analysis, and Modeling

Lab 2 High-Performing Cache Replacement Policies

Lab 3 Architectural Simulator and Code Optimization

Lab 4 Branch Predictor Implementation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages