6.7. Avoid False Sharing

6.7. Avoid False Sharing
	Chapter 6. Optimization Workflow

In the presence of multiple threads that share data, there are a number of sharing effects that may affect performance. One such sharing pattern is false sharing. It arises if at least two threads are both using unrelated data placed close enough to end up in the same cache line. False sharing occurs when they repeatedly update their respective data in such a way that the cache line migrates back and forth between the two threads' caches.

Often this can be avoided by giving explicit alignment pragmas to the compiler.

In OpenMP programs False sharing arises when several threads maintain their respective partial result in a vector indexed by the thread rank. Replacing this with thread local variables often helps.

	Tip
	Avoid writing to global data that is accessed from multiple threads.

	Tip
	Align shared global data to cache line boundaries.

	Tip
	Don't store temporary, thread specific data in an array indexed by the thread id or rank.

	Tip
	When parallelizing an algorithm, partition data sets along cache lines, not across cache lines.


6.6. Use Non-Temporal Hints for Data without Temporal Reuse		6.8. Avoid Communication between Caches (Coherence Traffic)