✨ About The Role
- Develop a highly available service for ML model serving and enhance Ray Serve and other libraries to simplify the development of ML applications in production
- Improve autoscaling capabilities to increase performance and reduce costs
- Optimize latency and throughput for both single- and multi-model serving scenarios
- Collaborate with open-source users and customers to build and maintain world-class systems for serving ML models in production
- Contribute directly to the Anyscale platform used by customers for critical applications
⚡ Requirements
- Experienced software engineer with a strong background in algorithms, data structures, and system design
- Proficient in modern machine learning tooling such as PyTorch, TensorFlow, and JAX
- At least 2 years of relevant work experience building and maintaining open-source projects or machine learning infrastructure in production
- Skilled in developing highly available serving systems and optimizing latency and throughput for ML model serving
- Ability to collaborate with diverse teams and customers, ranging from startups to industry-leading companies