We develop the HPCToolkit performance measurement, analysis, and attribution tool suite and the Dyninst binary analysis and instrumentation tool suite. We parallelize the tools with multithreading to analyze large scale applications and develop data race detectors for OpenMP programs.