Introduction - If you have any usage issues, please Google them yourself
The timestamp write below is a non-zero post-sync op, which on Gen6 necessitates a CS stall. CS stalls need stall at scoreboard set. See the comments for intel_emit_post_sync_nonzero_flush().