Paper 1 Groups and Systems

The following are the group and system type assignments for each group (please note your group number as it is used in documents you will edit):

  • Group 1: GPU (GPGPU): Daniel, Aarushi, Ziming

  • Group 2: Cluster supercomputer: Ivan, Ibrahem, Brian, Rodas

  • Group 3: MPP supercomputer: Kevin, Leo, Zakir, Derrick

  • Group 4: Green Supercomputer: Mandie, Tommy, Henry

  • Group 5: Cluster supercomputer: Sasha, Dzineon, Yatin, Jonathan

  • Group 6: Cloud systems: Abdel, Tony, Sean

Picking a Specific System

Your group will pick a specific system of your assigned type to present to the class. Take an look at some options and pick a system early and quickly, and then move on to investigating your system and planing your presentation.

In Systems Suggestions for each Group I’ve includes some suggestions of specific systems for each group to consider. It is important that each group picks a unique system to present. Thus, if your group picks a machine from outside my list of suggestions you should coordinate with the other group(s) that are doing a similar system to ensure that there are no duplicate systems selected (groups 2-5 should talk to each other about the specific systems they plan to present, and the specific in-depth focus to avoid duplication). You may not change the system type of your assigned group.

In addition, groups 2, 3, 5 should pick a system from the June 2023 Top 500 list, and Group 4 should pick a system from near the top of the June 2023 Green 500 list.

You can find machines of different types by:

  • The architecture is listed with each machine.

  • Use the sublist generator (under Statistics menu, choose SublistGenerator) to generate a list of only specific architectures (choose MPP or cluster).

Usually systems at national labs, universities or supercomputing centers have more on-line information than systems at companies. Also, there is often more public information and articles about machines at the top of the list.

If you click on a system on the list, you will get a page with details that usually includes a link to the organization’s webpage about the system.

Systems Suggestions for each Group

Group 1: GPUs (GPGPU focus)

The GPU Architecture and GPGPU Programming Model should be the focus of your presentation. Do not present GPUs in the context of their use for graphics processing. Instead, focus on GPUs for general purpose parallel programming.

Look at Nvidia’s site for documentation about GPU architecture. CUDA is Nvidia’s language for General-Purpose programming on a GPU (GPGPU). You may include a high-level discussion of programming language for GPGPU computing, but focus more of your talk on the GPU architecture and system for supporting GPGPU programming and not on the details of languages (like CUDA) for GPGPU programming; the GPU architecture is very interesting.

You could also talk about other accelerator computing devices, but focus on GPUs. See Chapter 15.1 of Dive into Systems for a high-level overview. Search the web for GPGPU. There is a lot out there.

Groups 2 and 5: Clusters

If you pick a machine near the top of the list, you may find more information about it. Your two groups should coordinate to make sure you select different systems, and DO NOT pick systems that are Amazon EC2 cluster instances:

Search for Architecture: Cluster on the top 500 sublist generator

Here are some suggestions (please note the caviats with some systems):

  4.  Leonardo - BullSequana XH2000, Xeon.
      EuroHPC/CINECA, Italy

  5.  Summit - IBM Power System AC922.
      DOE/SC/Oak Ridge National Laboratory, US
      https://www.ornl.gov/directorate/ccsd
      (CAVIAT: only pick one of Summit and Seirra across the two groups)

  6. Sierra - IBM Power System AC922.
      DOE/NNSA/LLNL, US
      https://hpc.llnl.gov/hardware/platforms/sierra
      (CAVIAT: only pick one of Summit and Seirra across the two groups)

  9. Selene - Nvidia, AMD.
     Nvidia Corp, US
     https://www.nvidia.com/en-us/data-center/dgx-superpod/

 10. Tianhe-2A - TH-IVB-FEP Cluster, Intel Xeon ...
    National Super Computer Center in Guangzhou, China
    http://en.nscc-gz.cn/
    (also Wikipedia and google search for articles)
    here is one article about Tianhe-2:
    http://www.netlib.org/utk/people/JackDongarra/PAPERS/tianhe-2-dongarra-report.pdf

 21. Frotera - Dell C6420 ...
    Intel Omni-Path , Dell EMC
    Texas Advanced Computing Center/Univ. of Texas, US
    https://www.tacc.utexas.edu/

And there are many more options in the Top500.

Group 3: MPP Systems

Search for Architecture: MPP on the top 500 sublist generator

Check with Group 4 that you are not selecting the same system.

Here are some suggestions from the Top500 list (and look on the list for others):

 1. Frontier - HPE Cray EX235a, AMD
    DOE/SC/Oak Ridge National Laboratory, US
    **The first exaflop machine!**

 2. Supercomputer Fugaku - Supercomputer Fugaku, A64FX 48C
     RIKEN Center for Computational Science, Japan

 3. LUMI - HPE Cray EX235a, AMD
    EuroHPC/CSC Finland

 7. Sunway TaihuLight - Sunway MPP,
    National Supercomputing Center in Wuxi, China
    http://www.nsccwx.cn/  (upper-right link to English)

 8. Perlmutter - HPE Cray EX235n, AMD
    DOE/SC/LBNL/NERSC, US

... and lots more

If you pick a system with a Cray architecture, looking at Cray’s website may be useful too.

Group 4: Green Supercomputer

You should define and discuss briefly what green computing means, and the criteria for ordering machines on the Green500 list. Then pick one or two systems near the top to discuss some details of how they achieve power efficiency. If the system is near the top of the green500 list, there is likely some documentation on their webpage promoting it and also some articles about it. Start by doing some searches for some of these on the web to help you find a good machine or two to discuss in detail. It is fine to just cover one machine, but if information is sparse, you may want to add another that is different in a significant way.

Many of these systems use NVIDA systems, don’t focus too much on the GPU architecture itself as another group is presenting on this, but you can briefly talk about it if it fits into your talk.

Check with Group 3 that you are not selecting the same system.

Some suggestions of a few from the Green 500 list that may have a bit more information include:

  1. Henri - ThinkSystem SR670 V2
     Flatiron Institute, United States
     (NOTE: this might be the best if you can find enough info about it)

  2. Frontier TDS - HPE Cray EX235a
     DOE/SC/Oak Ridge National Laboratory, United Statesi
     (NOTE: only pick this one if Group 3 does not pick a Frontier system)

  3. Adastra - HPE Cray EX235a
     GENCI-CINES, France

  4. Setonix – GPU - HPE Cray EX235a
     Kensington, Australia

  5. Dardel GPU - HPE Cray EX235a
     KTH - Royal Institute of Technology, Sweden

  6.  Frontier - HPE Cray EX235a
      DOE/SC/ORNL, US
      (NOTE: only pick this one if Group 3 does not pick a Frontier system)

  7. LUMI - HPE Cray EX235a
     EuroHPC/CSC, Finland
     (NOTE: only pick this one if Group 3 does not)

  8. amplitUDE (GPU Partition) - MEGWARE D50DNP, Xeon Platinum
     University of Duisburg-Essen, Germany

Others further down the list are okay too, but try to pick one in the top 8 if possible.

Group 6: Cloud System

Part of this presentation should involve definition(s) of Cloud computing (what is it? what are common features? what are goals of cloud computing? what are the features of the system you are presenting?) The wikipedia cloud computing page may be a good place to start. Also, Chapter 15.3 of Dive into Systems has a high-level overview of cloud computing. You could also define and talk about some or all of IaaS, Paas, SaaS.

The other part of your presentation should involve either picking a Cloud system to present, or presenting a specific software system for cloud management or cloud application development. If you do the later, you should still include the definitions of a cloud system in your presentation.

Some commercial cloud systems may not have a lot of public information about the details of their system. Try to pick one that has a reasonable amount, and try to search for articles about the system. If there isn’t much out there, pick a different system with more documentation about it.

Here are few suggestions, you are free to find others:

Some Example commercial cloud systems:

  Amazon EC2, AWS
  Microsoft Azure
  Google App Engine
  IBM Cloud Services

  Amazon EC2 Instance Cluster (#64 on Top500)
  https://aws.amazon.com/ec2/instance-types/
  (this may not be as good a fit for a cloud presentation as it
   designed as an instance used as a supercomputer, but it may be
   worth mentioning...it is impressively high on the Top500)

Some open SW for cloud management or cloud application development:

  OpenStack (software for cloud system): openstack.org
  Eucalyptus
  Rackspace
  Salesforce