EPAM Systems
Senior AWS HPC Engineer
EPAM Systems | Latvia | 16 days ago
Full-time | Remote Friendly | Information Technology, Engineering

We are seeking a Senior AWS HPC Engineer to join our team focused on high-performance computing solutions. You will be responsible for deploying and managing Open OnDemand portals integrated with AWS services to optimize HPC workloads. Join us to contribute to cutting-edge cloud and HPC projects and enhance client computing capabilities.

 

Feel free to work remotely from anywhere across Latvia or connect with colleagues at our Riga office.

 

 

Responsibilities

  • Deploy, configure, and customize the Open OnDemand web portal for HPC workloads
  • Integrate Open OnDemand with AWS services such as EC2, S3, FSx, ParallelCluster, and IAM
  • Collaborate with AWS and HPC teams to ensure seamless user access and workload execution
  • Implement best practices for performance, scalability, and cost optimization
  • Develop automation scripts and infrastructure as code using Terraform, CloudFormation, or Ansible
  • Document architecture, configurations, and operational processes
  • Maintain security standards through Identity and Access Management integration
  • Monitor HPC environment performance and troubleshoot issues
  • Optimize Linux system configurations for HPC workloads
  • Coordinate with cross-functional teams to support client requirements
  • Evaluate new technologies and recommend improvements for HPC infrastructure

 

Requirements

  • 3+ years of experience managing HPC environments
  • Proven hands-on experience deploying and maintaining Open OnDemand
  • Strong knowledge of Linux systems administration
  • Experience with AWS HPC services and cloud infrastructure
  • Proficiency in scripting languages such as Bash, Python, or Ruby
  • Familiarity with authentication and user management systems like LDAP, SSO, or Keycloak
  • Ability to implement Identity and Access Management (IAM) policies
  • Experience developing infrastructure as code (Terraform, CloudFormation, or Ansible)
  • Strong problem-solving and communication skills
  • English language proficiency at B2 level or higher

 

Nice to have

  • AWS certification such as Solutions Architect, SysOps, or DevOps
  • Experience with AWS ParallelCluster or similar HPC orchestration tools
  • Knowledge of Slurm or other workload managers

 

We offer

  • Engineering Heritage: Best-in-class experts sharing a culture of engineering excellence and tackling complex engineering challenges for over 30 years.
  • Advanced Tech Stack: Innovative projects where you can apply or enhance your expertise in Cloud, Data, AI, and other emerging technologies.
  • World-Class Clients: Work closely with 295+ of the Forbes Global 2000 on creating disruptive solutions that make a global impact.
  • Professional Growth: Exceptional support for career development with comprehensive resources for upskilling or reskilling in pioneering practices.
  • GenAI Community: Strong AI competencies with 600+ experts across 55+ locations driving GenAI-enabled transformation journeys.
  • Entrepreneurial Culture: If you're passionate and dedicated to improving business transformation, we provide the support you need to bring your ideas to life.
  • Hybrid Setup: The flexibility to work from any location in Latvia, whether it's your home or our office in Riga.
  • Other Benefits: Additional vacation and trust days, private health insurance, Employee Stock Purchase Plan and more.

 

About EPAM

EPAM is a leading global provider of digital platform engineering and development services. For over 30 years, our team has helped leading brands navigate the waves of digital transformation, building solutions that help them stay competitive through constant market disruption. With offices in 55+ countries, EPAM has grown in Latvia to more than 150 talented innovators in 3 years. We foster creativity and unconventional ways of doing things, welcoming like-minded professionals to join us.

 

Salary range: €3.8K-€5.1K gross, based on your experience and interview results.

 
