AI Engineer - Machine Learning (Contract)
Please note that this is a contract role providing services to Microsoft through external staffing partners of Allegis Global Solutions. If you are selected for this role, you will be employed by AGS and will not be an employee of Microsoft.
Summary
Our team seeks a Machine Learning Engineer to advance the development and improvement of our software foundation and tools vital for training state-of-the-art AI models. Your role will be centered on creating strong, scalable, and efficient training infrastructures and frameworks facilitating the full spectrum of the machine learning process, from handling data to deploying models. In collaboration with researchers and software engineers, you'll ensure that training systems are smoothly integrated and functioning, expanding the limits of AI's capabilities, especially in practical robotics scenarios. Additionally, you will investigate innovative methods to effectively utilize diverse datasets within our training framework.
Job Responsibilities
- Create and uphold efficient, scalable, and distributed training systems—including data preprocessing, training orchestration, and model assessment—for training large-scale AI models.
- Enhance the efficiency of training procedures to improve performance and use of resources, while maintaining scalability and dependability.
- Collaborate with researchers to create training and evaluation pipelines for state-of-the-art algorithms.
- Develop and design benchmarks for evaluating ML models.
- Perform training and and fine-tuning of foundation models for robotic applications .
- Monitor and analyze pipelines, identifying bottlenecks and proposing solutions to improve efficiency and performance.
- Ensure the robustness and reliability of the training infrastructure, including automated testing and continuous integration.
Requirements
- BS, MS or higher degree in Computer Science, Robotics, Engineering or a related field, or equivalent practical experience.
- Proficiency in Python, C++, or similar and at least one deep learning library such as PyTorch, TensorFlow, JAX, etc.
- Strong background in distributed computing, parallel processing techniques, handling large-scale datasets and data preprocessing.
- Deep understanding of state-of-the-art machine learning techniques and models.
- Experience with cloud-based training environments (AWS, Google Cloud, Azure).
- Experience in developing and maintaining software tooling and infrastructure for machine learning.
- Deep understanding and practical experience with software engineering principles, including algorithms, data structures, and system design.
- Experience with continuous integration and automated testing frameworks.
Additional Details
- Location: Redmond, WA (Hybrid: 3 days / week onsite)
- Duration: 12 Months
- Pay Range: $66 - 75 per hour
- Weekly Schedule: 40 hours
AGS is an Equal Opportunity/Affirmative Action Employer (M/F/Disability/Veterans). We will consider all applications without regard to race, gender, sexual orientation, gender identity, age, color, religion, national origin, veteran status, disability, genetic information or any other status protected by applicable law.