Darryl Gove's blog

Friday Sep 28, 2007

Perflib and parallel regions

If perflib is called from within a parallel region, you get the serial version rather than a parallel version. This is probably better than ending up with N^2 threads when you set OMP_NUM_THREADS to be N.

Comments:

Strange. I would have expected a global thread pool (in which case I can't see the issue to handle nested parallelism) or at least a heuristic of starting parallel code with a value of OMP_NUM_THREADS divided by the number of parallel threads started.

I guess there is room for improvement here...

Posted by Marc on September 30, 2007 at 10:13 AM PDT #

Post a Comment:
Comments are closed for this entry.

Calendar

Search this blog

About

Solaris Application Programming

Book resources

The Developer's Edge

Book resources

OpenSPARC Internals

Book resources

Recent entries

Custom search

Tag cloud

book cmt communityone compiler cooltools cpu2006 dtrace gcc libraries linker multithreading openmp opensolaris opensparc optimisation optimization parallelisation parallelization performance performanceanalyzer programming secondlife solaris solarisapplicationprogramming sparc spot sunstudio ultrasparc ultrasparct2 x86

Links

Webcasts

Articles

Presentations

Interesting docs

Navigation

Referers

Feeds