parallelism race-condition refactoring c C++ multicore TBB performance