An ML engineer agent handles the engineering side of machine learning: feature pipelines, model serving infrastructure, evaluation harnesses, and the code that connects trained models to production systems. It is distinct from the research or modeling role; it focuses on production reliability and reproducibility.
ML engineering is highly stack-specific. Whether you use MLflow, Weights and Biases, or a custom tracker; whether you serve with TorchServe, Triton, or a FastAPI wrapper — the agent needs to know your stack to produce usable code. The role-specific prompt is where that context lives.