View All Jobs 4159

Sr. Devops Engineer (HPC)

Manage and optimize HPC clusters supporting engineering and scientific computing at SpaceX
Hawthorne, California, United States
Senior
$160,000 – 220,000 USD / year
yesterday
SpaceX

SpaceX

An aerospace manufacturer and space transport services company known for its Falcon rockets and Dragon spacecraft.

Sr. DevOps Engineer (HPC)

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SR. DEVOPS ENGINEER (HPC)

SpaceX is looking for a Sr. DevOps Engineer with strong knowledge and experience in a world class engineering organization. This employee will be a member of the HPC team and will support SpaceX personnel and proprietary systems. The ideal candidate will be flexible and flourish in a fast paced and challenging environment. They should be a self-starter, self-motivator and possess ingenuity to excel at this position.

RESPONSIBILITIES:

  • Administer and manage HPC clusters, storage systems, and high-speed networks.
  • Provide application support to SpaceX employees across engineering disciplines.
  • Install and integrate Linux-based compute clusters.
  • Write instructional documentation and convey highly technical ideas in non-technical terms.

BASIC QUALIFICATIONS:

  • 5+ years of hands-on experience with client and server hardware/software, management tools, enterprise networking, virtualization, and security technologies.
  • Bachelor's degree in computer science, engineering, math, or scientific discipline and 5+ years of systems engineering experience; OR 7+ years of professional experience building software in lieu of a degree.
  • Experience with Linux.

PREFERRED SKILLS AND EXPERIENCE:

  • 5+ years of professional experience building, deploying and troubleshooting Linux systems.
  • Experience with a scripting language (Bash, Python) to automate and solve reoccurring tasks.
  • Experience building, deploying and troubleshooting HPC clusters.
  • Familiarity with cluster resource managers (Slurm, PBS, LSF).
  • Experience with monitoring and alerting technologies (Prometheus, Grafana, Nagios).
  • Familiarity with scientific and engineering computing (CFD, FEA).
  • Familiarity with ML frameworks (PyTorch, Tensorflow).
  • Familiarity with GPU usage in a compute cluster and Cuda.
  • Experience with containers (Docker, Podman, Singularity).
  • Experience deploying and maintaining automated configuration management software (Puppet, Ansible).
  • Comfortable working with mission critical and sensitive systems, with a sense of urgency appropriate to the responsibilities.
  • Eligibility for access to classified material up to TS/SCI with Polygraph.

ADDITIONAL REQUIREMENTS:

  • Must be willing to work extended hours and weekends as needed.

COMPENSATION AND BENEFITS:

Pay Range: Sr. DevOps Engineer: $160,000.00-$220,000.00/per year

Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience.

Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short and long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks. You may also accrue 3 weeks of paid vacation and will be eligible for 10 or more paid holidays per year. Employees accrue paid sick leave pursuant to Company policy which satisfies or exceeds the accrual, carryover, and use requirements of the law.

+ Show Original Job Post
























Sr. Devops Engineer (HPC)
Hawthorne, California, United States
$160,000 – 220,000 USD / year
Software
About SpaceX
An aerospace manufacturer and space transport services company known for its Falcon rockets and Dragon spacecraft.