RedLine Performance Solutions

AWS Cloud SME

  • Job Type: Full Time
  • Industry Type: IT Sector
  • Industry Location: Remote
  • Experience: NA
  • No. of Positions: 1
  • Primary Skills: Amazon Web Services (AWS), CloudFormation, S3, Lambda
  • Secondary Skills: Configuration management, Optimization, Job scheduling, Systems engineering
  • Job Location: Remote
  • Posted Date: Posted today
Job Description

RedLine Performance Solutions (RedLine) has been in the HPC solutions engineering services business for over 22 years and consistently keeps the "bar of excellence" high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. We offer services ranging from full life cycle HPC systems engineering to remote managed services to HPC program analysis. We are located in the Washington, DC area and are looking for the right candidate to join as an AWS Cloud Subject Matter Expert (SME).

 

We are seeking an AWS Cloud SME to work on a cloud contract supporting the National Environmental Satellite, Data, and Information Service (NESDIS), a Line Office under NOAA. Working with customers, from C-level executives to technology leaders, you will have opportunities to apply and grow your expertise in HPC workflow optimization and HPC systems architectures on AWS. The AWS Cloud SME will provide expertise in integrating scientific algorithms into a cloud-based high performance computing (HPC) environment.

 

This role will help design, optimize, build, and operate HPC systems on AWS. It requires the candidate to master best practices for running HPC applications on highly elastic, dynamically provisioned HPC infrastructure and to optimize the performance and cost of the customer's HPC workflows. The Cloud SME will work with scientists at NESDIS to understand each algorithm's design, apply his or her HPC knowledge to help the scientists optimize the algorithm for an HPC environment (scalability, parallelization), and then integrate the algorithm into the Product Generation (PG) Cloud HPC Framework.

 

US citizenship and the ability to obtain a Public Trust security clearance are mandatory requirements for this position. This position can be remote with occasional onsite support in Silver Spring, MD.

 

Duties and Responsibilities:

  • Work with scientists to design algorithms that run in an HPC framework
  • Provide expertise to scientists to help them optimize the algorithms for parallel processing and scalability
  • Architect, design, and deploy HPC data processing pipelines, including scheduling
  • Operate and manage an HPC cluster framework that can run on any Cloud Service Provider (e.g., AWS, Azure, GCP) at the architecture level
  • Design and build operational framework for HPC cluster environment
  • Must have in-depth knowledge of HPC software
  • Familiarity with running on multiple cores and an understanding of scalability and high availability
  • Familiarity with science data formats such as NetCDF, BUFR, and HDF5
  • Participate in, and potentially lead, technical presentations on the work
  • Participate in team meetings and interact with funding clients
  • Optimize the HPC framework using cloud-native services (serverless)

Requirements:

 

Education:

  • Bachelor of Science (BS) + 5 Years Experience or Master of Science (MS) + 3 Years Experience or Doctorate of Philosophy (PhD) + 1 Year Experience
  • Area of Study: Computer Science; Computer Engineering; Mathematics; Physics

 

Technical Skills:

  • 5+ years professional Linux experience
  • 5+ years cluster management experience
  • 5+ years storage experience
  • Experience with parallel and multiprocessing programming interfaces including OpenACC, MPI, and / or OpenMP
  • Experience with common scientific software and libraries such as LAMMPS, FFTW, NAMD, R, and MATLAB
  • Experience with one or more HPC job schedulers: Grid Engine, LSF, PBS, SLURM, Torque, HTCondor, Symphony, GRID Server, Windows HPC Pack
  • Strong scripting skills in shell and / or Python
  • Demonstrated use of configuration management tools (Puppet, Ansible, or similar technologies)
  • Ability to use, and build ways to use, automation frameworks
  • Extensive experience implementing HPC solutions in a commercial cloud environment, specifically with Amazon Web Services (AWS)
  • Excellent written and verbal communication skills
  • Ability to obtain a Public Trust clearance
  • US citizenship and continuous residence in the United States for the last two years

 

Preferred Skills:

 

  • Extensive experience implementing HPC solutions in a commercial cloud environment, specifically with Amazon Web Services (AWS)
  • 5+ years of experience working with PBS Pro for optimization, job scheduling, and workload management in HPC environments
  • Extensive experience with satellite data processing
  • Extensive experience with Docker containers
  • Ph.D. in Computer Science, Computer Engineering, Mathematics, or a closely related technical area

To learn more about RedLine please visit our website at www.redlineperf.com
