Commit 2346e03

Fix: Compiling w/out Intel TBB
1 parent 7ae2c01 commit 2346e03

3 files changed, with 17 additions and 4 deletions

.vscode/settings.json

Lines changed: 5 additions & 2 deletions
@@ -17,7 +17,7 @@
 "bootcamps",
 "Byrne",
 "Cawley",
-"cblas",
+"cBLAS",
 "CCCL",
 "chriskohlhoffasio",
 "clflush",
@@ -29,7 +29,8 @@
 "cppcoro",
 "cstddef",
 "CTRE",
-"cublas",
+"cuBLAS",
+"cuBLASLt",
 "CUDA",
 "denormal",
 "DOTPROD",
@@ -97,6 +98,7 @@
 "prefetcher",
 "pthread",
 "PTXAS",
+"quadpair",
 "RDMA",
 "reorderable",
 "Shankhdhar",
@@ -127,6 +129,7 @@
 "VHDL",
 "VNNI",
 "VPCLMULQDQ",
+"warpgroup",
 "Weis",
 "WGMMA",
 "wmma",

README.md

Lines changed: 7 additions & 0 deletions
@@ -83,6 +83,13 @@ The build will pull and compile several third-party dependencies from the source
 - Nvidia's [CCCL](https://github.com/nvidia/cccl) for GPU-accelerated algorithms.
 - Nvidia's [CUTLASS](https://github.com/nvidia/cutlass) for GPU-accelerated Linear Algebra.
 
+To build without Parallel STL, Intel TBB, and CUDA:
+
+```sh
+cmake -B build_release -D CMAKE_BUILD_TYPE=Release -D USE_INTEL_TBB=OFF -D USE_NVIDIA_CCCL=OFF
+cmake --build build_release --config Release
+```
+
 To control the output or run specific benchmarks, use the following flags:
 
 ```sh

less_slow.cpp

Lines changed: 5 additions & 2 deletions
@@ -465,8 +465,11 @@ BENCHMARK(sorting)->Args({8196, false})->Args({8196, true});
  *
  * @see Feature testing macros: https://en.cppreference.com/w/cpp/utility/feature_test
  */
+#if !defined(USE_INTEL_TBB)
+#define USE_INTEL_TBB 1
+#endif // !defined(USE_INTEL_TBB)
 
-#if defined(__cpp_lib_parallel_algorithm)
+#if defined(__cpp_lib_parallel_algorithm) && USE_INTEL_TBB
 #include <execution> // `std::execution::par_unseq`
 
 template <typename execution_policy_>
@@ -548,7 +551,7 @@ BENCHMARK_CAPTURE(sorting_with_executors, par_unseq, std::execution::par_unseq)
  * by Bryce Adelstein Lelbach at CppCon 2016: https://youtu.be/Vck6kzWjY88
  */
 
-#endif // defined(__cpp_lib_parallel_algorithm) && USE_INTEL_TBB
+#endif // defined(__cpp_lib_parallel_algorithm) && USE_INTEL_TBB
 
 #if defined(_OPENMP)
 /**
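
The guard added to `less_slow.cpp` can be tried in isolation. The sketch below is a minimal illustration, not code from the repository. The helper `sort_values` and its sequential fallback are hypothetical, and the leading `#define USE_INTEL_TBB 0` only simulates passing `-D USE_INTEL_TBB=0` to the compiler. It assumes the macro reaches the compiler as a number. Inside `#if`, an identifier that is not a macro evaluates to `0`, so a value such as `ON` or `OFF` forwarded verbatim would both disable the parallel path. How the CMake option is mapped to the macro is not shown in this commit.

```cpp
#include <algorithm> // `std::sort`; defines `__cpp_lib_parallel_algorithm` on implementations that provide it
#include <vector>    // `std::vector`

// Assumption: simulates `-D USE_INTEL_TBB=0` on the compiler command line.
// Delete this line to fall through to the default-on guard below.
#define USE_INTEL_TBB 0

// Same pattern as the commit: the macro defaults to 1 unless the build overrides it.
#if !defined(USE_INTEL_TBB)
#define USE_INTEL_TBB 1
#endif // !defined(USE_INTEL_TBB)

// `<execution>` is included only when the library advertises parallel
// algorithms AND the build has not opted out.
#if defined(__cpp_lib_parallel_algorithm) && USE_INTEL_TBB
#include <execution> // `std::execution::par_unseq`
#endif

// Hypothetical helper, not part of `less_slow.cpp`.
// Sorts in place and reports whether the parallel overload was compiled in.
bool sort_values(std::vector<int> &values) {
#if defined(__cpp_lib_parallel_algorithm) && USE_INTEL_TBB
    std::sort(std::execution::par_unseq, values.begin(), values.end());
    return true;
#else
    std::sort(values.begin(), values.end()); // sequential fallback
    return false;
#endif
}
```

With the simulated opt-out in place, `sort_values` always takes the sequential branch and `<execution>` is never included, so the translation unit has no dependency on TBB headers or libraries.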
