Senior Machine Learning / Data Engineer - Evaluation

As a Senior Software Engineer – Evaluation, you will design and implement systems that measure and monitor the performance of our computer vision, automatic speech recognition (ASR), and small language model (SLM) systems. You will develop evaluation methodologies, benchmarking pipelines, and monitoring tools that ensure our AI systems perform reliably in real-world environments. You will help establish the evaluation standards and performance benchmarks that guide the development of all AI systems at the company.


You will work closely with machine learning and data engineering teams to evaluate model performance, identify failure modes, and guide improvements to data collection and model training, helping bring AI-powered aviation tools to market.


This role is ideal for someone who enjoys designing robust evaluation systems and uncovering insights from model performance data, and wants to have a direct impact on the quality and reliability of our AI systems.


LOCATION

Work arrangement: On-site

Primary location: Seattle, WA or Bozeman, MT


About VTI Aerospace

VTI Aerospace builds AI-powered perception and pilot assist technologies to unlock the future of aviation. With offices in Bozeman, MT and Seattle, WA, we are a team of engineers and technologists from Boeing, Airbus, Aurora, and beyond. We are passionate about pushing the boundaries of autonomy and aviation safety. Our company is dedicated to advancing the field of aviation with cutting-edge solutions.


What You'll Do

  • Define and implement evaluation methodologies for computer vision, ASR, and language systems

  • Identify and track key performance indicators (KPIs) that measure system and model effectiveness

  • Perform dataset coverage analysis to understand strengths, gaps, and biases in training and evaluation data

  • Identify model deficiencies and collaborate with ML engineers to improve training data and model performance

  • Build scalable model evaluation pipelines in Python for automated benchmarking and regression testing

  • Design and maintain systems for monitoring model performance and drift in production environments

  • Architect data models and storage systems for evaluation results using relational and time-series databases

  • Build dashboards and reporting tools to visualize model performance and evaluation metrics

  • Collaborate with ML and data engineering teams to improve evaluation workflows and data pipelines

  • Contribute to ground-truth data pipelines and processes that support reliable evaluation


What We're Looking For

  • 5+ years of experience developing production software systems in Python

  • Strong experience designing and implementing data processing or analysis pipelines

  • Experience building systems for machine learning evaluation, experimentation, or benchmarking

  • Experience working with large datasets and building scalable data workflows

  • Familiarity with statistical methods, experimental design, and model performance analysis

  • Experience building automated pipelines using tools such as Airflow or similar orchestration frameworks

  • Experience working with evaluation or experiment tracking tools such as MLflow or similar systems

  • Experience designing data models and working with relational databases and time-series data

  • Strong collaboration skills and ability to work closely with machine learning and data engineering teams


Bonus Points

  • Experience working with computer vision, ASR, or language models in production environments

  • Experience using PyTorch or other deep learning frameworks

  • Experience evaluating multimodal or large-scale AI systems

  • Experience building monitoring systems for ML models in production

  • Experience designing dataset analysis or data quality tooling

  • Experience building dashboards or monitoring tools using Grafana or similar platforms


Why You'll Love Working Here

  • Opportunity to work on meaningful problems in aviation and deliver products to industry at a rapid pace

  • Small team with significant impact on company direction

  • Collaborative and innovative environment

  • Competitive compensation and benefits

  • Opportunities for growth and leadership


VTI offers a fast-paced and dynamic work environment that focuses on technical excellence and collaboration. We also foster a culture of work-life balance to prevent burnout.


Compensation packages consist of salary and incentive stock options. Salaries may range from $80,000 to $175,000 per year, depending on the role and the individual's experience. VTI also offers a 401(k) with a 4% match, an unlimited PTO policy, and health care reimbursement up to the IRS allowable limit through a QSEHRA.


VTI is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. All job offers are contingent upon the candidate passing background and reference checks.


At this time, we are unable to provide visa sponsorship or consider candidates who require visa transfers. Applicants must be authorized to work in the United States without the need for visa sponsorship now or in the future.

