This tutorial walks through a series of incremental changes to a sample application. Each change was prompted by advice and statistics from the tool. Together they yield a 46x performance boost on a single core and more than 83x on an 8-core system, where the final version displays almost perfect linear scaling.
