BKM for intra-op vs. inter-op parallelism? #4764

Answered by martin-frbg
moderato asked this question in Q&A

Yes, it sounds like you'd get the best performance either from a single-threaded OpenBLAS (build options USE_THREAD=0 USE_LOCKING=1) or from calling openblas_set_num_threads(1) before entering a multithreaded code section that makes BLAS calls. A small number greater than 1 may also work if your machine has many cores or your program is not running many threads at that moment.

Answer selected by moderato