HPC Systems Engineer

Location: Spring, TX, US, 77389

Company: ExxonMobil Global Services Company

About Houston

ExxonMobil's state-of-the-art campus north of Houston serves as home to its Upstream, Product Solutions and Low Carbon Solutions businesses and their associated service groups. The facility opened in 2014 and accommodates more than 10,000 employees and visitors.
 

By bringing many global functional groups together, the campus provides employees with the tools and capabilities needed today, and in the future, to achieve business objectives and accelerate the discovery of new resources, technologies and products. It was designed to foster improved collaboration, creativity and innovation and enhance the company’s ability to attract, develop and retain the top talent in the industry.
 

The campus is located in Spring, Texas, on 385 wooded acres immediately to the west of Interstate Highway 45 (I-45), at the intersection of I-45 and the Hardy Toll Road, approximately 25 miles from the cultural vibrancy of downtown Houston.
 

The campus was constructed to the highest standards of energy efficiency and environmental stewardship. Its design draws on research into best practices in building and workplace design, including extensive benchmarking of the world’s top academic, research, and corporate facilities.
 

Learn more about what we do in Houston here.

What role you will play in our team

The HPC Systems Engineer works within a small team of highly skilled HPC specialists to provide a performant, reliable, and secure high-performance computing (HPC) environment. The HPC Systems Engineer will be involved in designing and engineering our HPC system and will be responsible for day-to-day operations and maintenance activities, including but not limited to: troubleshooting issues as they arise, monitoring overall system health, performing system maintenance tasks, and evaluating new hardware and system software.

What you will do

  • Establish strategies for overall support of the systems portfolio (4 supercomputers and large parallel file systems across the domain)
  • Evaluate new hardware and software and understand their potential benefits and impacts in the scientific computing and seismic imaging environments
  • Perform software installations and upgrades, including the operating system, in a world-class HPC environment
  • Monitor overall system performance and health and partner with 3rd parties on hardware support as needed
  • Provide support for the management of data in the environment (140+ Petabytes of parallel filesystem)
  • Provide advanced consulting to users to resolve problems and ensure they can effectively utilize the systems
  • Interact with both business customers and technical teams that are globally distributed across varied time zones
  • Engage with vendors to resolve problems with existing infrastructure and to discuss roadmaps and new technologies for evaluation
  • Foster a supportive work environment and maintain open, productive interactions within the team and across organizations
  • Build and maintain cross-organizational contacts to facilitate execution of work

About you

Skills and Qualifications 

 

  • B.E./B.Tech in Computer Science or a related degree area (e.g., Computer Engineering, Information Systems), or equivalent skills and work experience
  • Excellent technical, analytical, and communication skills
  • A minimum of 10 years of hands-on Linux experience (e.g., RHEL, CentOS) and production infrastructure support (e.g., networking, storage, monitoring, compute, installation, configuration, maintenance, upgrade, retirement)
  • A minimum of 5 years' experience in HPC technologies (e.g., installation, configuration, maintenance, upgrade, retirement, problem resolution) such as parallel/distributed file systems (e.g., Lustre, GPFS), high-speed interconnect fabrics (e.g., InfiniBand, Omni-Path), and HPC batch scheduling software suites (e.g., PBS Pro, Slurm)
  • Proficiency in technical writing and documentation of solutions
  • Solid understanding of data center operations fundamentals in networking, cooling, and power
  • Works well in a team environment
  • Self-motivated

 

Preferred Knowledge/Skills/Experience

 

  • Strong IT skills in infrastructure and applications
  • Experience with supporting large scale production environments
  • Experience in implementing changes and security controls in a global framework
  • Understanding of data center operations fundamentals in networking, cooling, and power
  • Knowledge and experience with installing/compiling vendor and open-source software
  • Knowledge and experience with application/infrastructure deployment and support in one or more of the major cloud environments

Your benefits

An ExxonMobil career is one designed to last. Our commitment to you runs deep: our employees grow personally and professionally, with benefits built on our core categories of health, security, finance, and life.
 

We offer you:
 

  • Pension Plan: Enrollment is automatic and at no cost to you. The basic benefit is a monthly annuity to be paid to you in retirement for the rest of your life. 
  • Savings Plan: You can contribute between 6% and 20% of your pay and are encouraged to enroll right away. If you contribute at least 6% to your savings plan, the Company will contribute a 7% match. 
  • Workplace Flexibility: We have several programs such as “Flex your Day”, providing ad-hoc flexibility around when and where you work, as well as longer-term programs such as leaves of absence and part-time work.
  • Comprehensive medical, dental, and vision plans. 
  • Culture of Health: Programs and resources to support your wellbeing. 
  • Employee Health Advisory Program: Provides confidential professional counseling for you and your family, including tools and resources promoting mental health and resiliency at no additional cost to you. 
  • Disability Plan: Income replacement for when you cannot work due to illness or injury occurring on or off the job. Enrollment is automatic and at no cost to you.

    More information on our Company’s benefits can be found at  www.exxonmobilfamily.com.


    Please note benefits may be changed from time to time without notice, subject to applicable law.
     

Stay connected with us

Learn more at our website
Follow us on LinkedIn and Instagram
Like us on Facebook
Subscribe to our channel on YouTube

EEO Statement


ExxonMobil is an Equal Opportunity Employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, sexual orientation, gender identity, national origin, citizenship status, protected veteran status, genetic information, or physical or mental disability.

Job Group Capability

Data Science, Digital & Analytics

Job Group

Computational & Data Sciences

Functional Skills

High Performance Computational (HPC) Architecture
High Performance Scientific Computing


Nearest Major Market: Houston

Job Segment: Cloud, Systems Engineer, Open Source, Computer Science, Linux, Technology, Engineering