1[/=========================================================================== 2 Copyright (c) 2017 Steven Ross, Francisco Tapia, Orson Peters 3 4 5 Distributed under the Boost Software License, Version 1.0 6 See accompanying file LICENSE_1_0.txt or copy at 7 http://www.boost.org/LICENSE_1_0.txt 8=============================================================================/] 9 10[section:linux_parallel 3.4- Linux Benchmarks] 11[: 12The following results are from complex benchmarks, not include in the library because they use non free software. 13(If you are interested in them, contact fjtapia@gmail.com) 14 15There are 3 types of benchmarks, 16[: 17*64 bits integers 18 19*strings 20 21*objects of several sizes. 22 23The objects are arrays of integers. The heavy comparison sums all the elements in each, and the light comparison uses only the first number in the array. 24 25These were run on an Intel(R) Core(TM) i7-5820K CPU @ 3.30GHz, with 6 cores and 2 threads by core, and 15M of cache. The compiler used was a GCC 6.3 64 bits 26 27] 28 29 30[h4[_100 000 000 NUMBERS OF 64 BITS RANDOMLY FILLED]] 31 32[*[teletype] 33`` 34 | | | 35 | Time | Maximum | 36 | secs | Memory Used | 37 -------------------------+---------+-------------+ 38 OMP parallel sort | 1.1990 | 1564 MB | 39 TBB parallel_sort | 1.6411 | 789 MB | 40 block_indirect_sort | 0.9270 | 790 MB | 41 | | | 42 OMP parallel stable sort | 1.5814 | 1972 MB | 43 TBB parallel stable sort | 1.1745 | 1570 MB | 44 sample sort | 1.2872 | 1566 MB | 45 parallel stable sort | 1.7158 | 1176 MB | 46 | | | 47 48`` 49] 50 51[h4[_10 000 000 STRINGS RANDOMLY FILLED]] 52[*[teletype] 53`` 54 | | | 55 | Time | Maximum | 56 | secs | Memory Used | 57 -------------------------+---------+-------------+ 58 OMP parallel sort | 1.5738 | 2023 MB | 59 TBB parallel_sort | 1.8626 | 826 MB | 60 block_indirect_sort | 1.2411 | 825 MB | 61 | | | 62 OMP parallel stable sort | 2.3214 | 2024 MB | 63 TBB parallel stable sort | 1.4383 | 1143 MB | 64 sample sort | 1.5097 | 1135 MB | 65 parallel stable sort | 2.0970 | 978 MB | 66 | | | 67`` 68] 69 70 71[h4[_OBJECTS RANDOMLY FILLED]] 72 73[: 74The objects are arrays of 64 bits numbers 75 76They are compared in two ways : 77[: 78 (H) Heavy : The comparison is the sum of all the numbers of the array. 79 80 (L) Light : The comparison is using only the first element of the array, 81 as a key 82] 83] 84[*[teletype] 85`` 86 87 88 | | | | | | | | 89 | 100000000 | 50000000 | 25000000 | 12500000 | 6250000 | 1562500 | | 90 | objects of| objects of|objects of |objects of |objects of |objects of | Maximum | 91 | 8 bytes | 16 bytes | 32 bytes | 64 bytes | 128 bytes | 512 bytes | Memory | 92 | | | | | | | Used | 93 | H L | H L | H L | H L | H L | H L | | 94 --------------------+-----------+-----------+-----------+-----------+-----------+-----------+-------------+ 95 OMP parallel sort | 1.18 1.17| 0.73 0.66| 0.51 0.45| 0.43 0.39| 0.41 0.37| 0.37 0.32| 1569 MB | 96 TBB parallel_sort | 1.71 1.59| 0.85 0.81| 0.56 0.54| 0.51 0.42| 0.45 0.39| 0.36 0.32| 788 MB | 97 block_indirect_sort | 1.11 1.08| 0.66 0.63| 0.49 0.46| 0.43 0.39| 0.40 0.37| 0.37 0.33| 794 MB | 98 | | | | | | | | 99 OMP par stable sort | 1.55 1.55| 1.38 1.35| 1.23 1.22| 1.17 1.17| 1.09 1.08| 0.97 0.97| 1980 MB | 100 TBB par stable sort | 1.23 1.23| 0.89 0.85| 0.74 0.72| 0.71 0.69| 0.69 0.69| 0.68 0.68| 1573 MB | 101 | | | | | | | | 102 sample sort | 1.32 1.32| 0.84 0.78| 0.66 0.63| 0.63 0.62| 0.62 0.61| 0.60 0.60| 1568 MB | 103 parallel stable sort| 1.80 1.90| 1.17 1.07| 0.83 0.75| 0.76 0.71| 0.72 0.70| 0.70 0.69| 1177 MB | 104 | | | | | | | | 105 106 107 108`` 109] 110] 111[endsect] 112 113 114 115