* add cuda kernel launch overhead benchmark - source part. * can customize the nvcc_archs_support. * set SB_MICRO_PATH for azure pipeline tests.
__Major Revisions__ * add clang-format to lint cpp sources * add cpp lint in GitHub Actions