Is the Platform Engineer, Model Shaping role remote?

It's hybrid — Together AI expects some on-site time in San Francisco .

What's the salary range?

Together AI lists $200,000–$290,000 for this role.

How much experience is required?

At least 3 years of relevant experience for this Platform Engineer, Model Shaping role.

Where is the role based?

Together AI is hiring for this position in San Francisco .

What's the tech stack?

Joblaze extracted these technologies from the posting: Terraform, AWS, Grafana, Prometheus, GitHub Actions, ArgoCD.

What seniority level is this role?

Together AI targets mid-level candidates for this position.

Is this full-time or contract?

Full-time for this Platform Engineer, Model Shaping role at Together AI.

Platform Engineer, Model Shaping at Together AI

Joblaze summary

In the role of Platform Engineer for Model Shaping at Together AI, the individual will focus on developing and maintaining the backend infrastructure that supports model customization and evaluation. Key skills include expertise in Linux environments, Kubernetes, and programming in Python or Go, along with experience in infrastructure automation and cloud administration. This position is ideal for someone with over three years of experience in backend engineering, particularly in high-reliability production systems. Together AI emphasizes collaboration across teams, fostering a research-driven environment that values innovation in AI infrastructure.

Joblaze insights

Listed today on Joblaze — tracked directly from Together AI's career page.
Together AI has 59 other open roles including Research Intern RL & Post-Training Systems, Turbo (Fall 2026), Workplace Coordinator, Senior Software Engineer(Amsterdam).
2452 active Python roles on Joblaze right now.
1234 active AWS roles on Joblaze right now.
1126 active Kubernetes roles on Joblaze right now.
Salary band is above the typical range for DevOps/SRE roles (median ~$160,000).

Quick facts

Is the Platform Engineer, Model Shaping role remote?: It's hybrid — Together AI expects some on-site time in San Francisco .
What's the salary range?: Together AI lists $200,000–$290,000 for this role.
How much experience is required?: At least 3 years of relevant experience for this Platform Engineer, Model Shaping role.
Where is the role based?: Together AI is hiring for this position in San Francisco .
What's the tech stack?: Joblaze extracted these technologies from the posting: Terraform, AWS, Grafana, Prometheus, GitHub Actions, ArgoCD.
What seniority level is this role?: Together AI targets mid-level candidates for this position.
Is this full-time or contract?: Full-time for this Platform Engineer, Model Shaping role at Together AI.

From the original posting

About the Role

The Model Shaping team at Together AI works on products and research for tailoring open foundation models to downstream applications. We build services that allow machine learning developers to choose the best models for their tasks and further improve these models using domain-specific data. In addition to that, we develop new methods for more efficient model training and evaluation, drawing inspiration from a broad spectrum of ideas across machine learning, natural language processing, and ML systems.

As a Platform Engineer in Model Shaping, you will work at the intersection of backend engineering and infrastructure, building the foundational layers of Together’s platform for model customization and evaluation. You will design, develop, and operate both the backend services and the underlying systems that enable us to sustainably and reliably scale production workflows launched by our users, as well as internal research experiments.

You will operate in a cross-functional environment, collaborating with other engineers and researchers in the team to improve the infrastructure based on the needs of projects they work on. You will also interact with other engineering teams at Together (such as Commerce, Data Engineering, and Cloud Infrastructure) to integrate the services developed by Model Shaping with systems developed by those teams.

Responsibilities

Design and build Together’s systems and infrastructure for model customization, including user-facing features and internal improvements
Contribute to reliability improvements for the platform, participating in an on-call rotation and improving processes for incident response
Create and improve internal tooling for deployment, continuous integration, and observability
Build a job orchestration platform spanning multiple datacenters, supporting a highly heterogeneous hardware landscape
Partner with teams developing internal services, co-designing these services and incorporating them in systems built within Together

Requirements

3+ years of experience in building infrastructure or backend components of production services
Extensive experience designing, operating, and troubleshooting production Linux environments and Kubernetes-based platforms
Strong software engineering background in Python or Go
Experienced with infrastructure automation tools (Terraform, Ansible), monitoring/observability stacks (Prometheus, Grafana), and CI/CD pipelines (GitHub Actions, ArgoCD)
Cloud environment (e.g., AWS/GCP/Azure) administration experience, preferably with a hybrid bare-metal/cloud environment
Strong communication skills, be willing to document systems and processes and collaborate with peers of varying technical expertise
Comfortable operating across the stack, from cluster operations and infrastructure automation to backend service development

Experience in any of the following will make you stand out:

Developing large-scale production systems with high reliability requirements
Pipeline orchestration frameworks (e.g., Kubeflow, Argo Workflows, Flyte)
Managing GPU workloads on HPC clusters, ideally with hands-on experience in operating NVIDIA’s networking stack (e.g., NCCL, Mellanox firmware, GPUDirect RDMA)
Deployment of services for AI training or inference
Networking fundamentals, including TCP/IP, DNS, routing, load balancing, TLS, and network debugging tools
Maintaining or contributing to open-source projects

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, RedPajama, SWARM Parallelism, and SpecExec. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is $200,000 - $290,000. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at https://www.together.ai/privacy