• Home
  • Line#
  • Scopes#
  • Navigate#
  • Raw
  • Download
1[/===========================================================================
2 Copyright (c) 2017 Steven Ross, Francisco Tapia, Orson Peters
3
4
5 Distributed under the Boost Software License, Version 1.0
6 See accompanying file LICENSE_1_0.txt or copy at
7 http://www.boost.org/LICENSE_1_0.txt
8=============================================================================/]
9
10[section:linux_parallel 3.4- Linux Benchmarks]
11[:
12The following results are from complex benchmarks, not include in the library because they use non free software.
13(If you are interested in them, contact fjtapia@gmail.com)
14
15There are 3 types of benchmarks,
16[:
17*64 bits integers
18
19*strings
20
21*objects of several sizes.
22
23The objects are arrays of integers.  The heavy comparison sums all the elements in each, and the light comparison uses only the first number in the array.
24
25These were run on an Intel(R) Core(TM) i7-5820K CPU @ 3.30GHz, with 6 cores and 2 threads by core, and 15M of cache. The compiler used was a GCC 6.3 64 bits
26
27]
28
29
30[h4[_100 000 000 NUMBERS OF 64 BITS  RANDOMLY FILLED]]
31
32[*[teletype]
33``
34                             |         |             |
35                             |  Time   |   Maximum   |
36                             |  secs   | Memory Used |
37    -------------------------+---------+-------------+
38    OMP parallel sort        | 1.1990  |   1564 MB   |
39    TBB parallel_sort        | 1.6411  |    789 MB   |
40    block_indirect_sort      | 0.9270  |    790 MB   |
41                             |         |             |
42    OMP parallel stable sort | 1.5814  |   1972 MB   |
43    TBB parallel stable sort | 1.1745  |   1570 MB   |
44    sample sort              | 1.2872  |   1566 MB   |
45    parallel stable sort     | 1.7158  |   1176 MB   |
46                             |         |             |
47
48``
49]
50
51[h4[_10 000 000   STRINGS  RANDOMLY FILLED]]
52[*[teletype]
53``
54                             |         |             |
55                             |  Time   |   Maximum   |
56                             |  secs   | Memory Used |
57    -------------------------+---------+-------------+
58    OMP parallel sort        | 1.5738  |   2023 MB   |
59    TBB parallel_sort        | 1.8626  |    826 MB   |
60    block_indirect_sort      | 1.2411  |    825 MB   |
61                             |         |             |
62    OMP parallel stable sort | 2.3214  |   2024 MB   |
63    TBB parallel stable sort | 1.4383  |   1143 MB   |
64    sample sort              | 1.5097  |   1135 MB   |
65    parallel stable sort     | 2.0970  |    978 MB   |
66                             |         |             |
67``
68]
69
70
71[h4[_OBJECTS RANDOMLY FILLED]]
72
73[:
74The objects are arrays of 64 bits numbers
75
76They are compared in two ways :
77[:
78     (H) Heavy : The comparison is the sum of all the numbers of the array.
79
80     (L) Light : The comparison is using only the first element of the array,
81                 as a key
82]
83]
84[*[teletype]
85``
86
87
88                        |           |           |           |           |           |           |             |
89                        | 100000000 |  50000000 |  25000000 |  12500000 |   6250000 |   1562500 |             |
90                        | objects of| objects of|objects of |objects of |objects of |objects of |  Maximum    |
91                        |  8 bytes  | 16 bytes  | 32 bytes  | 64 bytes  | 128 bytes | 512 bytes |  Memory     |
92                        |           |           |           |           |           |           |  Used       |
93                        |  H     L  |  H     L  |  H     L  |  H     L  |  H     L  |  H     L  |             |
94    --------------------+-----------+-----------+-----------+-----------+-----------+-----------+-------------+
95    OMP parallel sort   | 1.18  1.17| 0.73  0.66| 0.51  0.45| 0.43  0.39| 0.41  0.37| 0.37  0.32|   1569 MB   |
96    TBB parallel_sort   | 1.71  1.59| 0.85  0.81| 0.56  0.54| 0.51  0.42| 0.45  0.39| 0.36  0.32|    788 MB   |
97    block_indirect_sort | 1.11  1.08| 0.66  0.63| 0.49  0.46| 0.43  0.39| 0.40  0.37| 0.37  0.33|    794 MB   |
98                        |           |           |           |           |           |           |             |
99    OMP par stable sort | 1.55  1.55| 1.38  1.35| 1.23  1.22| 1.17  1.17| 1.09  1.08| 0.97  0.97|   1980 MB   |
100    TBB par stable sort | 1.23  1.23| 0.89  0.85| 0.74  0.72| 0.71  0.69| 0.69  0.69| 0.68  0.68|   1573 MB   |
101                        |           |           |           |           |           |           |             |
102    sample sort         | 1.32  1.32| 0.84  0.78| 0.66  0.63| 0.63  0.62| 0.62  0.61| 0.60  0.60|   1568 MB   |
103    parallel stable sort| 1.80  1.90| 1.17  1.07| 0.83  0.75| 0.76  0.71| 0.72  0.70| 0.70  0.69|   1177 MB   |
104                        |           |           |           |           |           |           |             |
105
106
107
108``
109]
110]
111[endsect]
112
113
114
115