The LLM Algorithm Engineer will advance post-training of large language models, manage context protocols, optimize distributed training, and build evaluation pipelines.
Job Responsibilities:
1. Advanced post-training of large language models (e.g. SFT, RLHF/RLAIF, continual pretraining).
2. Aligning models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Experience in distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, mixed precision, and performance tuning on vLLM or Triton clusters.
5. Build offline and live eval pipelines for alignment, factuality, grounding, and hallucinations.
Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.
Top Skills
Deepspeed
Fsdp
Lora
Python
PyTorch
Qlora
TensorFlow
Triton
Vllm
Similar Jobs
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Lead a software engineering team to develop scalable insurance solutions, focusing on the transition to Google Cloud Platform and continuous improvement.
Top Skills:
.NetC#Cloud SqlCSSGoGoogle Cloud PlatformHTMLJavaScriptMs SqlMsmqPostgresPubsubRabbitMQReact
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Manage software engineering teams, ensuring delivery of scalable solutions, addressing technical issues, and coaching team members for development.
Top Skills:
.NetAngular 15+AngularjsAsp.NetC#JavaScriptSQL
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
The Software Engineer will design and implement technical solutions, collaborate in an Agile environment, build APIs, manage cloud infrastructure, and ensure secure access and integration with third-party platforms.
Top Skills:
Agile ScrumAngularApigeeC#CSSGoGoogle Cloud StorageHelmHTMLIamJavaScriptKubernetesMicroservicesReactRestful ApisSQLTemporal WorkflowsTerraformTypescriptVault
What you need to know about the Melbourne Tech Scene
Home to 650 biotech companies, 10 major research institutes and nine universities, Melbourne is among one of the top cities for biotech. In fact, some of the greatest medical advancements were conceptualized and developed here, including Symex Lab's "lab-on-a-chip" solution that monitors hormones to predict ovulation for conception, and Denteric's vaccine for periodontal gum disease. Yet, the thousands of people working in the city's healthtech sector are just getting started, to say nothing of the tech advancements across all other sectors.