Lines Matching +full:- +full:s
24 [launch]: https://pytorch.org/docs/stable/distributed.html#launch-utility
29 -----------------------------------
31 -----------------------------------
37 --- nvidia-smi topo -m ---
40 … NV1 NV1 NV2 NV2 SYS SYS SYS SYS PIX SYS PHB 0-19,40-59
41 … X NV2 NV1 SYS NV2 SYS SYS SYS PIX SYS PHB 0-19,40-59
42 … NV2 X NV2 SYS SYS NV1 SYS SYS PHB SYS PIX 0-19,40-59
43 … NV1 NV2 X SYS SYS SYS NV1 SYS PHB SYS PIX 0-19,40-59
44 … SYS SYS SYS X NV1 NV1 NV2 PIX SYS PHB SYS 0-19,40-59
45 … NV2 SYS SYS NV1 X NV2 NV1 PIX SYS PHB SYS 0-19,40-59
46 … SYS NV1 SYS NV1 NV2 X NV2 PHB SYS PIX SYS 0-19,40-59
47 … SYS SYS NV1 NV2 NV1 NV2 X PHB SYS PIX SYS 0-19,40-59
63 --------------------------
69 …1 GPUs -- no ddp: p50: 0.097s 329/s p75: 0.097s 329/s p90: 0.097s 329/s p95: …
70 …1 GPUs -- 1M/1G: p50: 0.100s 319/s p75: 0.100s 318/s p90: 0.100s 318/s p95: …
71 …2 GPUs -- 1M/2G: p50: 0.103s 310/s p75: 0.103s 310/s p90: 0.103s 310/s p95: …
72 …4 GPUs -- 1M/4G: p50: 0.103s 310/s p75: 0.103s 310/s p90: 0.103s 310/s p95: …
73 …8 GPUs -- 1M/8G: p50: 0.104s 307/s p75: 0.104s 307/s p90: 0.104s 306/s p95: …
74 …16 GPUs -- 2M/8G: p50: 0.104s 306/s p75: 0.104s 306/s p90: 0.104s 306/s p95:…
79 …1 GPUs -- no ddp: p50: 0.162s 197/s p75: 0.162s 197/s p90: 0.162s 197/s p95: …
80 …1 GPUs -- 1M/1G: p50: 0.171s 187/s p75: 0.171s 186/s p90: 0.171s 186/s p95: …
81 …2 GPUs -- 1M/2G: p50: 0.176s 182/s p75: 0.176s 181/s p90: 0.176s 181/s p95: …
82 …4 GPUs -- 1M/4G: p50: 0.176s 182/s p75: 0.176s 181/s p90: 0.176s 181/s p95: …
83 …8 GPUs -- 1M/8G: p50: 0.179s 179/s p75: 0.179s 178/s p90: 0.180s 178/s p95: …
84 …16 GPUs -- 2M/8G: p50: 0.179s 178/s p75: 0.180s 177/s p90: 0.183s 174/s p95:…
89 …1 GPUs -- no ddp: p50: 0.145s 220/s p75: 0.145s 220/s p90: 0.145s 220/s p95: …
90 …1 GPUs -- 1M/1G: p50: 0.147s 217/s p75: 0.147s 217/s p90: 0.148s 216/s p95: …
91 …2 GPUs -- 1M/2G: p50: 0.153s 209/s p75: 0.153s 209/s p90: 0.153s 209/s p95: …
92 …4 GPUs -- 1M/4G: p50: 0.153s 208/s p75: 0.153s 208/s p90: 0.154s 208/s p95: …
93 …8 GPUs -- 1M/8G: p50: 0.157s 204/s p75: 0.157s 204/s p90: 0.157s 203/s p95: …
94 …16 GPUs -- 2M/8G: p50: 0.157s 203/s p75: 0.157s 203/s p90: 0.158s 203/s p95:…
99 …1 GPUs -- no ddp: p50: 0.415s 77/s p75: 0.415s 77/s p90: 0.416s 76/s p95: …
100 …1 GPUs -- 1M/1G: p50: 0.425s 75/s p75: 0.426s 75/s p90: 0.426s 75/s p95: …
101 …2 GPUs -- 1M/2G: p50: 0.438s 73/s p75: 0.439s 72/s p90: 0.439s 72/s p95: …
102 …4 GPUs -- 1M/4G: p50: 0.439s 72/s p75: 0.439s 72/s p90: 0.440s 72/s p95: …
103 …8 GPUs -- 1M/8G: p50: 0.447s 71/s p75: 0.447s 71/s p90: 0.448s 71/s p95: …
104 …16 GPUs -- 2M/8G: p50: 0.450s 71/s p75: 0.451s 70/s p90: 0.451s 70/s p95:…
109 Run the benchmark with the `--json PATH_TO_REPORT_FILE` argument to
117 -------------------- --------------------
126 1 GPUs: p75: 0.101s 317/s -0.3% p95: 0.101s 317/s -0.4%
127 2 GPUs: p75: 0.104s 306/s -1.0% p95: 0.104s 306/s -1.0%
128 4 GPUs: p75: 0.105s 305/s -1.6% p95: 0.105s 304/s -1.8%
129 8 GPUs: p75: 0.107s 299/s -2.6% p95: 0.107s 298/s -2.7%
130 16 GPUs: p75: 0.108s 294/s -3.8% p95: 0.122s 262/s -16.4%
135 1 GPUs: p75: 0.172s 185/s -1.2% p95: 0.172s 185/s -1.3%
136 2 GPUs: p75: 0.179s 178/s -2.1% p95: 0.179s 178/s -2.0%
137 4 GPUs: p75: 0.180s 177/s -2.6% p95: 0.180s 177/s -2.6%
138 8 GPUs: p75: 0.184s 173/s -3.5% p95: 0.184s 173/s -3.5%
139 16 GPUs: p75: 0.187s 170/s -0.1% p95: 0.204s 157/s -7.9%
144 1 GPUs: p75: 0.149s 214/s -1.0% p95: 0.149s 214/s -0.9%
145 2 GPUs: p75: 0.156s 205/s -1.5% p95: 0.156s 205/s -1.6%
146 4 GPUs: p75: 0.156s 204/s -1.6% p95: 0.157s 204/s -1.8%
147 8 GPUs: p75: 0.159s 200/s -1.5% p95: 0.159s 200/s -1.5%
148 16 GPUs: p75: 0.161s 198/s -1.9% p95: 0.162s 197/s -2.3%
153 1 GPUs: p75: 0.427s 74/s -0.8% p95: 0.428s 74/s -0.7%
154 2 GPUs: p75: 0.444s 72/s -1.3% p95: 0.445s 71/s -0.7%
155 4 GPUs: p75: 0.444s 72/s -1.1% p95: 0.445s 71/s -0.8%
156 8 GPUs: p75: 0.452s 70/s -1.3% p95: 0.452s 70/s -1.3%
157 16 GPUs: p75: 0.455s 70/s -0.7% p95: 0.456s 70/s -0.6%