✨ About The Role
- Design and implement improvements to Zoox's cutting-edge HPC infrastructure and storage technologies for machine learning workloads
- Investigate new distributed system paradigms and technologies to meet Zoox's computational and storage needs
- Create production-grade web service APIs, SDKs, and tools to enhance developer experience for all Zoox software teams
- Design systems that optimize cloud and datacenter storage technologies for performance, reliability, and efficiency
- Collaborate across all Zoox software divisions, from data engineering to computer vision perception to simulation, to support computational needs
⚡ Requirements
- Experienced software engineer with a background in distributed systems and proficiency in Python, Java, or other managed languages
- Skilled in designing and implementing improvements to high-performance computing infrastructure and optimizing storage technologies for machine learning workloads
- Comfortable working on end-to-end responsibilities including distributed system design, algorithmic job scheduling, and cloud scaling
- Strong knowledge of cloud computing platforms such as AWS, GCP, or Azure
- Bachelor's degree in computer science or a related field is required; experience with workload management systems and machine learning is a bonus