Google→
Software Engineer, Next Generation AI/ML Infrastructure
Entry LevelOn-site
Location
Sunnyvale, CA
Salary
$147k–$211k/yr
Experience
Not specified
Posted
1 month ago
Skills
c++distributed processinglarge-scale computingmachine learning life-cyclemodel serving systemsml pipelinestraining data generationinference servicesonline serving systemsdata processing internalsflumebeamsparkflink
Job Description
Summary: Google is a leading technology company that develops next-generation technologies impacting billions of users. They are seeking a Software Engineer to work on critical projects involving the development and scaling of AI/ML infrastructure, focusing on large-scale computing and data processing.
Responsibilities:
- Build FeatureML to streamline the user journey for Data Engineers and AI Practitioners, accelerating the creation, deployment, and management of features for production-scale ML models
- Architect and build an available and elastic inference service designed to manage massive traffic demands
- Develop a unified processing and storage platform, tailoring it specifically for DeepMind demanding data needs
Required Qualifications:
- Bachelor's degree or equivalent practical experience
- 2 years of experience with coding in C++, or 1 year of experience with an advanced degree
- 1 year of experience with distributed processing
- 1 year of experience with large-scale computing
Preferred Qualifications:
- Master's degree or PhD in Computer Science, or a related technical field
- Experience building and scaling components of the Machine Learning life-cycle (e.g., feature stores, model serving systems, ML pipelines, training data generation)
- Experience developing, optimizing, and operating inference services or online serving systems at petascale
- Experience with data processing internals or managing major systems like Flume, Beam, Spark, or Flink
- Ability to drive projects from ideation to production deployment and long-term maintenance
- Ability to collaborate effectively with researchers, applied scientists, and product teams to translate abstract requirements into production infrastructure
Required Skills: C++, Distributed Processing, Large-scale Computing, Machine Learning Life-cycle, Model Serving Systems, ML Pipelines, Training Data Generation, Inference Services, Online Serving Systems, Data Processing Internals, Flume, Beam, Spark, Flink
Benefits: Bonus, Equity, Benefits
Benefits
Bonus
Equity
Benefits