Skip to content
GitLab
Projects Groups Topics Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Register
  • Sign in
  • L llm.c - GPT2
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributor statistics
    • Graph
    • Compare revisions
  • Issues 4
    • Issues 4
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Container Registry
    • Terraform modules
  • Monitor
    • Monitor
    • Metrics
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • tchatela
  • llm.c - GPT2
  • Wiki
  • Runtime and performances

Runtime and performances · Changes

Page history
Update Runtime and performances authored Jul 01, 2024 by tchatela's avatar tchatela
Show whitespace changes
Inline Side-by-side
Runtime-and-performances.md
View page @ b983adcd
......@@ -61,3 +61,18 @@ As we see, the time taken for each iteration is 7 seconds on average. This same
However, something else stands out. On another test, which runtime is shown just below, we see that using one single CPU (but still using OpenMP), increases our average runtime up to 41 seconds per iteration. This is unexpected, as in theory the runtime for 2 cpu should be twice the runtime for 1 cpu (at most).
![test-5-openmp-tinyshakespeare-mean](uploads/d600df70538473235c9037630724a5a2/test-5-openmp-tinyshakespeare-mean.png)
## General comparison
This board shows the mean runtime taken for one iteration. The sequential versions are using one thread, the parallels ones are using 2 threads.
| setup \\ version | seq. without -fopenmp | seq with -fopenmp | openMP | openmp/nOS-V |
|------------------|-----------------------|-------------------|--------|--------------|
| without using numactl | 29 756 ms | 31 250 ms | 6830 ms | 10 449 ms |
| using numactl | 30 093 ms | ERROR | 6694 ms | ERROR |
Two tests are giving the following error :
`ERROR numa_sched_setaffinity_v2_int() failed: Invalid argument` \
`sched_setaffinity: Invalid argument`
\ No newline at end of file
Clone repository
  • Distributed Model
  • Fork Join Model
  • GPT2 Parallelization and porting
  • Metrics
  • Runtime and performances
  • Task Based Model
  • Various informations
  • _sidebar
  • Home