... | ... | @@ -20,6 +20,9 @@ Dt = 1.0 seconds |
|
|
|
|
|

|
|
|
|
|
|
- Broadcast takes about 60 to 230 ms
|
|
|
- Reduce takes from 180 ms to 380 ms
|
|
|
- Time per iteration is 650 ms
|
|
|
|
|
|
Data transfer is done all at once through one single blocking MPI instruction.
|
|
|
Dt = 1.0 s
|
... | ... | @@ -27,6 +30,10 @@ Dt = 1.0 s |
|
|

|
|
|

|
|
|
|
|
|
- Broadcast takes about 95 ms
|
|
|
- Reduce takes about 380 ms
|
|
|
- Time per iteration is 650 ms
|
|
|
|
|
|
## Using 8 workers and 1 server
|
|
|
|
|
|
**One worker is computing half a token sequence (32 tokens / worker)**
|
... | ... | |