Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Research Computing Seminars

...

These sessions cooperate with a larger body of seminars hosted by Library and Technology Services. Of particular interest are the Python and R seminars, since the skills that you learn in these classes can translate into much larger problems that require high-performance computing.

Check this space soon for dates, times, and registration instructions!

DateTitle
January 31
Expand
titleResearch Computing Resources at Lehigh

Description: This seminar provides an overview of the Research Computing resources available to the Lehigh research community. We introduce high-performance computing (HPC) and provide a guide for gaining access to research computing resources, specifically the hardware, software, and training required to support new research. We also provide an overview of external computing resources provided by the National Science Foundation (NSF).

Instructor: Ryan Bradley

TBD
Expand
titleLinux Basics for Research Computing

Description: Linux is a free and open source operating system that is the operating system of choice of the world's leading supercomputers as well as LTS high-performance computing (HPC) clusters and some computer labs at Lehigh. This session will provide an introduction to the Linux environment, using the command line, logging into remote systems, and transferring data, and the use of terminal-based text editors. This seminar is designed for researchers who want to use Linux resources to support their work.

Instructor: Ryan Bradley

TBD
Expand
titleUsing the SLURM Scheduler on Lehigh’s HPC Cluster

Description: Lehigh uses the SLURM scheduler to ensure that many researchers can effectively share our high-performance computing (HPC) resources. This session provides a hands-on introduction to SLURM so that participants can learn to translate their research questions into hardware requests on an HPC platform by formulating, submitting, and monitoring batch jobs. This session may also be useful for researchers who also leverage external HPC platforms in their work.

Instructor: Ryan Bradley

TBD
Expand
titleIntroduction to the Open OnDemand Portal

Description: Open OnDemand (OOD) is an NSF-funded open-source high-performance computing (HPC) portal developed by the Ohio Supercomputing Center. The goal of OOD is to provide an easy way for system administrators to provide web access to their HPC resources. This seminar introduces OOD to access interactive applications such as MATLAB, Jupyter Lab/Notebooks, and RStudio on Lehigh's HPC cluster. The portal provides a more fully-featured user interface to the cluster, complementing the standard, terminal-based method for remotely accessing the cluster.

Instructor: Ryan Bradley

TBD
Expand
titleBring Your Own Software: Containers on HPC Resources

Description: Container systems provide the tools for researchers to more easily build and migrate their software to diverse computing platforms. Singularity is an open-source container engine designed to bring operating system-level virtualization to scientific and high-performance computing. In this seminar, we’ll provide an overview of Singularity and teach you to build your own software in a container system for use on Sol. These techniques can improve the durability and reproducibility of your research.

Instructor: Ryan Bradley

TBD
Expand
titleAdvanced Python on HPC Resources

Description: This session will provide a tour of advanced Python methods that can be useful on the cluster. Python often serves as a glue for binding high-performance computation to complex data structures and intricate workflows. This seminar will briefly survey the NumPy and SciPy mathematical libraries, using HDF5 to generate complex data files, and the method for extending Python with C extension modules.

Instructor: Ryan Bradley

Spring 2022 HPC Seminars

When: Thursday @ 2PM

...

DateTitle

February 3


Expand
titleResearch Computing resources at Lehigh

Description: This seminar provides an overview of Research Computing resources available to the Lehigh research community.

Instructor: Alex Pacheco


SlidesRecordings
February 10


Expand
titleLinux: Basic Commands & Environment

Description: Linux is a free and open source operating system that is the OS of choice of the world's leading supercomputers as well as LTS HPC clusters and some computer labs at Lehigh. This session will provide an introduction to the linux/unix environment, command line basics, logging in to remote system, transferring data, vi/emacs editors etc to get started with using a Linux/Unix based computer. This seminar is geared towards researchers who want to learn or need to learn how to use a linux/unix based resource.

Instructor: Sachin Joshi


SlidesRecordings
February 17


Expand
titleUsing SLURM scheduler on Lehigh's HPC cluster

Description: This seminar provides a hands on introduction to using the SLURM scheduler to submit and monitor jobs. SLURM is the scheduler on Lehigh's HPC resources, Sol and Hawk, and national supercomputing resources including XSEDE and NERSC.

Prerequisites: An HPC account or an account on national supercomputing resources (XSEDE, DOE, etc) that uses SLURM.

Familiarity with Linux/Unix environment, basic command and *nix editors such as vi or emacs is mandatory.

Instructor: Alex Pacheco


SlidesRecordings
February 24



Expand
titlePython Programming

Description: In this seminar, you will learn the basics of Python, including language fundamentals and basic programming.

Prerequisites: Programming background is beneficial but not required.

Instructor: Sachin Joshi


SlidesRecordings
March 3


Expand
titleR Programming

Description: In this seminar, you will learn the basics of R, including language fundamentals, data types, functions and basic programming including File I/O.

Prerequisites: Programming background is beneficial but not required.

Instructor: Jeremy Mack


SlidesRecordings
March 10


Expand
titleIntroduction to Open OnDemand

DescriptionOpen OnDemand (OOD) is an NSF-funded open-source HPC portal developed by the Ohio Supercomputing Center. The goal of OOD is to provide an easy way for system administrators to provide web access to their HPC resources. This seminar introduces OOD to access interactive applications such as MATLAB, Jupyter Lab/Notebooks, and RStudio on Lehigh's HPC cluster.

Prerequisites: An HPC account with an active allocation and a web browser. A Lehigh IP or VPN required.

Instructor: Alex Pacheco


SlidesRecordings
March 17


Expand
titleData Visualization with Python

Description: This seminar provides a hands on introduction to Data Visualization using the Python programming language.

Prerequisites: Programming background in Python is required.

Instructor: Sachin Joshi


SlidesRecordings
March 24


Expand
titleData Visualization with R

Description: This seminar provides a hands on introduction to Data Visualization using the R programming language

Prerequisites: Programming background in R is required.

Instructor: Jeremy Mack


SlidesRecordings
March 31


Expand
titleBring Your Own Software: Containers on HPC Resources

Description: Singularity is an open-source container engine designed to bring operating system-level virtualization to scientific and high-performance computing. In this seminar, we’ll provide an overview of Singularity and how you can Build Your Own Software in a singularity container for use in your research on Sol.

Prerequisites: A Linux system with singularity installed (check your package manager if your distribution provides singularity or see installation instructions).  

Instructor: Alex Pacheco


SlidesRecordings
April 7


Expand
titleObject-Oriented Programming with Python

Description:  Object Oriented Programming or OOP is a programming paradigm which provides a means of structuring programs (combining data and functionality) so that properties and behaviors are bundled into individual objects. In this seminar, you’ll learn the basic concepts of OOP in Python: Python Classes, Object Instances, Defining and Working with Methods and OOP Inheritance.

Prerequisites:  Programming background in Python is required. Knowledge of OOP’s concepts is beneficial but not required.

Instructor: Sachin Joshi


SlidesRecordings
April 14


Expand
titleShiny Apps in R

Description: Shiny Apps, interactive web applications built using R programming language, have grown in popularity. This session will provide attendees experience using the Shiny package and the idea of reactive programming, which forms the basis of building an interactive web application.

Prerequisites: Programming background in R is required.

Instructor: Jeremy Mack


SlidesRecordings

...

TitleDownloads


Expand
titleResearch Computing resources at Lehigh

Description: This training provides an overview of Research Computing resources available to the Lehigh research community.


SlidesRecordings

Writing an Allocation Proposal

SlidesRecordings


Expand
titleUsing SLURM scheduler on Sol

Description: This training provides a hands on introduction to using the SLURM scheduler to submit and monitor jobs. SLURM is the scheduler on Lehigh's HPC resource, Sol, and national supercomputing resources including XSEDE and NERSC.

Prerequisites: An account on Sol or an account on national supercomputing resources (XSEDE, DOE, etc) that uses SLURM.

Familiarity with Linux/Unix environment, basic command and *nix editors such as vi or emacs is mandatory.


SlidesRecordings


Expand
titleIntroduction to Open OnDemand

DescriptionOpen OnDemand (OOD) is an NSF-funded open-source HPC portal developed by the Ohio Supercomputing Center. The goal of OOD is to provide an easy way for system administrators to provide web access to their HPC resources. This tutorial introduces OOD to access Lehigh's Sol cluster.

Prerequisites: An account on Sol with an active allocation and a web browser. A Lehigh IP or VPN required.


SlidesRecordings


Expand
titleBring Your Own Software

Description: This seminar is geared towards users who wish to bring or build their own software stack to use on Sol and Hawk (or even local linux systems). Topics to be covered include best practices for installing packages using make, cmake and configure, SPACK package manager, and Singularity. 

Prerequisites: Some familiarity with compilers, and linux environment is required.


Slides
A Brief Introduction to LinuxSlidesRecordings


Expand
titleLinux: Basic Commands & Environment

Description: Linux is a free and open source operating system that is the OS of choice of the world's leading supercomputers as well as LTS HPC clusters and some computer labs at Lehigh. This session will provide an introduction to the linux/unix environment, command line basics, logging in to remote system, transferring data, vi/emacs editors etc to get started with using a Linux/Unix based computer. This training is geared towards researchers who want to learn or need to learn how to use a linux/unix based resource.


SlidesRecordings
Basic Shell ScriptingSlidesRecordings
Advanced Shell ScriptingSlidesRecordings


Expand
titleR Programming

Description: In this tutorial, you will learn the basics of R, including language fundamentals, data types, functions and basic programming including File I/O.

Prerequisites: Programming background is beneficial but not required.


SlidesRecordings


Expand
titleData Visualization with R

Description: This tutorial provides a hands on introduction to Data Visualization using the R programming language

Prerequisites: Programming background in R is required.


SlidesRecordings


Expand
titleShiny Apps in R

Description: Shiny Apps, interactive web applications built using R programming language, have grown in popularity. This session will provide attendees experience using the Shiny package and the idea of reactive programming, which forms the basis of building an interactive web application.

Prerequisites: Programming background in R is required.


SlidesRecordings


Expand
titlePython Programming

Description: In this seminar, you will learn the basics of Python, including language fundamentals and basic programming.

Prerequisites: Programming background is beneficial but not required.


SlidesRecordings


Expand
titlePython Data Structures

Description: Data Structures are the fundamental constructs around which you build your programs. Python is a high-level, interpreted, interactive and object-oriented scripting language using which we can study the fundamentals of data structure in a simpler way as compared to other programming languages. In this seminar we are going to study a short overview of some frequently used data structures in general and how they are related to some specific python data types.

Prerequisites: Programming background in Python is required. Knowledge of data structure concepts is beneficial but not required.


SlidesRecordings


Expand
titleData Visualization with Python

Description: This seminar provides a hands on introduction to Data Visualization using the Python programming language.

Prerequisites: Programming background in Python is required.


SlidesRecordings


Expand
titleObject-Oriented Programming with Python

Description:  Object Oriented Programming or OOP is a programming paradigm which provides a means of structuring programs (combining data and functionality) so that properties and behaviors are bundled into individual objects. In this seminar, you’ll learn the basic concepts of OOP in Python: Python Classes, Object Instances, Defining and Working with Methods and OOP Inheritance.

Prerequisites:  Programming background in Python is required. Knowledge of OOP’s concepts is beneficial but not required.


SlidesRecordings


Expand
titleMachine Learning

Description: Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence (AI) based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention. In this two part seminar you’ll learn the basic machine learning concepts: supervised and unsupervised learning, classification and regression algorithms. This is a two part seminar.

Prerequisites: Basic knowledge of general CS concepts such as data structures, linear algebra and design of algorithm is required.


Slides (Part I, Part II)Recordings (Part I, Part II)


Expand
titleText Mining 

Description: Text mining, also known as Text Analytics is the process of deriving high-quality information from text. It is the process of examining large collections of written resources to generate new information and to transform the unstructured text into structured data for use in further analysis. In this seminar you’ll learn the basic introduction to text analysis, different steps of text analysis: gathering textual data, cleaning and preparing textual data , analyzing the data (ETL) and visualizing textual data using tools from the HathiTrust Research Center (HTRC).

Prerequisites: Programming background is required. Basic knowledge of general CS concepts such as data structures, linear algebra and design of algorithms is required.


SlidesRecordings


Expand
titleMATLAB

Description: MATLAB is a programming platform designed specifically for engineers and scientists. MATLAB is a special-purpose language that is an excellent choice for writing moderate-size programs that solve problems involving the manipulation of numbers. MATLAB is easy to learn, versatile and the design of the language makes it possible to write a powerful program in a few lines. Using MATLAB, you can: Analyze data, Develop algorithms and Create models and applications. In this seminar you’ll learn the basics of MATLAB, including language fundamentals and basic programming that will help you achieve the above functionalities.

Prerequisites: Programming background is beneficial but not required.


Slides
Version Control with GITSlides
Document Creation with LaTeXSlides
Using Virtualized Software at Lehigh

Storage Options at LehighSlides
Research Data ManagementSlides
Enhancing Research ImpactSlides

...