sac-group / sac2c / Merge requests / !63

introduce further CUDA memcpy mechanisms (pinned, registered, and managed memory models)

Merged Hans-Nikolai Viessmann requested to merge hans/sac2c:hans-cuda-transmechs into develop Aug 18, 2018
  • Overview 0
  • Commits 30
  • Changes 40

This is to get my extra CUDA transfer mechanisms merged in. Additionally, we now also support generating CUDA modules. There are also some additions to the HWLOC code for pinning to the cores nearest the I/O bus, etc.
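
For reference, the three memory models named in the title correspond roughly to the following CUDA runtime calls. This is a minimal sketch and not the code sac2c generates: the kernel `add_one` and the functions `run_pinned`, `run_registered`, and `run_managed` are hypothetical names made up for illustration, and `n` is assumed to be positive.

```cuda
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: add 1.0f to every element.
__global__ void add_one(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1.0f;
}

static void fill(float *p, int n) {
    for (int i = 0; i < n; ++i) p[i] = (float)i;
}

static bool check(const float *p, int n) {
    for (int i = 0; i < n; ++i)
        if (p[i] != (float)i + 1.0f) return false;
    return true;
}

// Explicit-copy path shared by the pinned and registered variants:
// host -> device, run the kernel, device -> host, verify.
static bool copy_run_check(float *host, int n) {
    size_t bytes = (size_t)n * sizeof(float);
    float *dev = NULL;
    if (cudaMalloc((void **)&dev, bytes) != cudaSuccess) return false;
    fill(host, n);
    bool ok = cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        add_one<<<(n + 255) / 256, 256>>>(dev, n);
        // This copy is ordered after the kernel on the default stream and
        // returns only once the data has arrived in host memory.
        ok = cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(dev);
    return ok && check(host, n);
}

// Pinned: the host buffer is allocated page-locked by the CUDA runtime.
bool run_pinned(int n) {
    float *host = NULL;
    if (cudaMallocHost((void **)&host, (size_t)n * sizeof(float)) != cudaSuccess)
        return false;
    bool ok = copy_run_check(host, n);
    cudaFreeHost(host);
    return ok;
}

// Registered: an existing malloc'd buffer is page-locked after the fact.
bool run_registered(int n) {
    size_t bytes = (size_t)n * sizeof(float);
    float *host = (float *)malloc(bytes);
    if (host == NULL) return false;
    bool ok = cudaHostRegister(host, bytes, cudaHostRegisterDefault) == cudaSuccess;
    if (ok) {
        ok = copy_run_check(host, n);
        cudaHostUnregister(host);
    }
    free(host);
    return ok;
}

// Managed: one pointer is valid on host and device; no explicit cudaMemcpy.
bool run_managed(int n) {
    float *data = NULL;
    if (cudaMallocManaged((void **)&data, (size_t)n * sizeof(float)) != cudaSuccess)
        return false;
    fill(data, n);
    add_one<<<(n + 255) / 256, 256>>>(data, n);
    // The host must wait for the kernel before touching managed memory again.
    bool ok = cudaDeviceSynchronize() == cudaSuccess && check(data, n);
    cudaFree(data);
    return ok;
}
```

Each function returns `true` if the round trip succeeded, so calling them from a small `main` and compiling with `nvcc` on a machine with a CUDA device is enough to try them out. The pinned and registered variants differ only in how the host buffer becomes page-locked, which is why they share the explicit-copy helper.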

Some work is still needed regarding headers, as most of the work was done before the sac.h split. There are several commits marked for this purpose (not anymore).

As far as I can tell, everything works. I did notice, though, that nvcc seems to pull in libstdc++ symbols when building object files, meaning that we must either:

  • get nvcc to not pull in anything from the C++ libs, or
  • use C++ for the linking stage iff we are compiling CUDA code (this is what we do now)

TODO:

  • fix header messiness
  • make sure hwloc extensions still work
  • extend cmake to build cuda-based runtime libs
  • fix/improve documentation
  • house-keeping, things need tidying...
  • tests!!!!
Edited Oct 08, 2018 by Hans-Nikolai Viessmann
Source branch: hans-cuda-transmechs