... | ... | @@ -20,6 +20,9 @@ Dt = 1.0 seconds |
|
|
|
|
|
![base-5](uploads/89bf03de5326dbbd56ab78c0548284bc/base-5.png)
|
|
|
|
|
|
- Broadcast takes about 60 to 230 ms
|
|
|
- Reduce takes from 180 ms to 380 ms
|
|
|
- Time per iteration is 650 ms
|
|
|
|
|
|
Data transfer is done all at once through one single blocking MPI instruction.
|
|
|
Dt = 1.0 s
|
... | ... | @@ -27,6 +30,10 @@ Dt = 1.0 s |
|
|
![all-at-once-5](uploads/afd3253f7a71005bf376d39c6e3d68c6/all-at-once-5.png)
|
|
|
![base-5.code_legend](uploads/56f3e0f7bc8eb219a295bd16106f9008/base-5.code_legend.png)
|
|
|
|
|
|
- Broadcast takes about 95 ms
|
|
|
- Reduce takes about 380 ms
|
|
|
- Time per iteration is 650 ms
|
|
|
|
|
|
## Using 8 workers and 1 server
|
|
|
|
|
|
**One worker is computing half a token sequence (32 tokens / worker)**
|
... | ... | |