job
laze
Jobs
Companies
Remote
Sign in
Create alert
Jobs
Companies
Remote
← Back to jobs
GRPO
2 jobs tagged GRPO
I
Member of Technical Staff – Model Training
Remote
Visa
Inflection AI
RLAIF
DeepSpeed
DPO
PyTorch
GRPO
USA
$175k–$350k/yr
Published 9 months ago
Apply →
T
AI Researcher, Core ML (Turbo)
Together AI
RLAIF
speculative decoding
SGLang
vLLM
reward modeling
San Francisco
$200k–$280k/yr
Indexed 1 day ago
Apply →