Linux Administrator – High Performance Computing

About the Role

Location
India
Haryana
Gurgaon
Company
Siemens Energy Industrial Turbomachinery India Private Limited
Organization
EVP Global Functions
Business Unit
Digital Core
Full Time / Part Time
Full-time
Experience Level
Mid-level Professional

A Snapshot of Your Day

Your day begins with a deep dive into the state of the Linux-based High Performance Computing (HPC) environment. You review system metrics, load, I/O performance, logs, and alerts to detect potential bottlenecks early and ensure stable daily operations. 

You then focus on enabling your colleagues to work smoothly. This includes handling incidents and service requests, supporting users with Linux OS, filesystem, scheduler, or access-related issues, and ensuring systems and shared resources perform reliably. System-level troubleshooting, such as analyzing filesystem performance, scheduler behavior, network latency, or unexpected application behavior, is a regular part of your role. Alongside daily operations, you contribute to the HPC development roadmap by exploring new technologies, bringing in new ideas, and continuously learning to support evolving requirements. Automation, scripting, documentation, and collaboration with internal and global teams are central to ensuring a secure, efficient, and future-ready HPC environment.

How You’ll Make an Impact

HPC Operations & Service Delivery 

  • Provide sustained Linux and HPC operational support for daily operations, incidents, and service requests across compute, storage, and user environments. 
  • Perform Linux system administration tasks, including installation, configuration, patching, updates, and maintenance of Linux servers supporting HPC workloads. 
  • Troubleshoot issues across Linux OS, HPC compute nodes, schedulers, filesystems, networks, and applications, ensuring minimal disruption to users. 
  • Clarify and resolve vague or incomplete user requests by working closely with end users and applying strong analytical and problem-solving skills.
  • Ensure operational stability, performance, and user satisfaction of HPC platforms. 
  • Collaborate with internal teams and global stakeholders to support operational needs and ongoing improvements. 

What You Bring

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical discipline. 
  • Linux certification (RHCSA/RHCE) is a plus.
  • Minimum of 3 years of relevant professional experience in HPC-focused Linux administration and automation.
  • Hands-on experience supporting engineering simulation workloads in HPC environments. 

Skills

  • Strong Linux system administration skills with hands-on experience on Red Hat Enterprise Linux (RHEL) and compatible distributions.  
  • Experience with installation, configuration, maintenance, troubleshooting, and performance tuning of Linux servers supporting HPC workloads.  
  • Knowledge of HPC cluster concepts, including compute nodes, login nodes, storage, and shared environments.  
  • Familiarity with cluster management and job scheduling systems (e.g., PBS or similar) and understanding of batch-based workload execution.
  • Exposure to parallel computing technologies such as MPI, OpenMP, or CUDA, and an understanding of how applications scale on HPC systems.  
  • Strong scripting and automation skills using Bash, Python, and Ansible to support operational efficiency, reliability, and repeatable system management. 
  • Understanding of high-performance storage and networking, including filesystems (NFS, parallel filesystems) and concepts like InfiniBand or high-speed Ethernet.
  • Experience working with monitoring, logging, and troubleshooting tools to ensure system stability, performance, and availability in an HPC environment.

Process & Methodology 

  • Understanding of ITIL-based incident and service management. 
  • Ability to work with structured processes while remaining flexible and solution-oriented. 

Personal Attributes 

  • Good communication skills. 
  • Team player with a positive attitude.
  • High degree of ownership, adaptability, and collaborative mindset. 
  • Ability to work effectively across time zones. 

About the Team

Who is Siemens Energy?

At Siemens Energy, we are more than just an energy technology company. We meet the growing energy demand across 90+ countries while ensuring our climate is protected. With more than 92,000 dedicated employees, we not only generate electricity for over 16% of the global community, but we’re also using our technology to help protect people and the environment.

Our global team is committed to making balanced, reliable, and affordable energy a reality by pushing the boundaries of what is possible. We uphold a 150-year legacy of innovation that encourages our search for people who will support our focus on decarbonization, new technologies, and energy transformation.

"Let’s make tomorrow different today" is our genuine dedication at Siemens Energy to all customers and employees on the way to a balanced future.

Our Commitment to Diversity

Check out this video to learn more about Siemens Energy: https://www.siemens-energy.com/employeevideo

Lucky for us, we are not all the same. Through diversity, we generate power. We run on inclusion and our combined creative energy is fueled by over 130 nationalities. Siemens Energy celebrates character – no matter what ethnic background, gender, age, religion, identity, or disability. We energize society, all of society, and we do not discriminate based on our differences.

Rewards

  • We offer options to work flexibly, especially after successful onboarding, whether through remote work, flexible working hours, or a combination of both
  • Working with a distributed team
  • Opportunities to work on and lead a variety of innovative projects
  • Supportive work culture

https://jobs.siemens-energy.com/jobs