Senior Data Scientist - LLM Training & Fine-tuning
About the job
Job Title: Senior Data Scientist - LLM Training & Fine-tuning (Indian Languages, Tool Calling, Speed)
Location: Bangalore

About the Role
We're looking for a hands-on Data Scientist / Research Scientist who can fine-tune and train open-source LLMs end-to-end, not just run LoRA scripts. You'll own model improvement for Indian languages and code-switching (Hinglish, etc.), instruction following, and reliable tool/function calling, with a strong focus on latency, throughput, and production deployability. This is a builder role: you'll take models from research → experiments → evals → production.

What You'll Do (Responsibilities)
• Train and fine-tune open LLMs (continued pretraining, SFT, preference optimization like DPO/IPO/ORPO, reward modeling if needed) for:
  - Indian languages + multilingual / code-switching
  - Strong instruction following
  - Reliable tool/function calling (structured JSON, function schemas, deterministic outputs)
• Build data pipelines for high-quality training corpora:
  - Instruction datasets, tool-call traces, multilingual data, synthetic data generation
  - De-duplication, contamination control, quality filtering, safety filtering
• Develop evaluation frameworks and dashboards:
  - Offline + online evals, regression testing
  - Tool-calling accuracy, format validity, multilingual benchmarks, latency/cost metrics
• Optimize models for speed and serving:
  - Quantization (AWQ/GPTQ/bnb), distillation, speculative decoding, KV-cache optimizations
  - Serve via vLLM/TGI/TensorRT-LLM/ONNX where appropriate
• Improve alignment and reliability:
  - Reduce hallucinations, improve refusal behavior, enforce structured outputs
  - Prompting + training strategies for robust compliance and guardrails
• Collaborate with engineering to ship: model packaging, CI for evals, A/B testing, monitoring drift and quality
• Contribute research: read papers, propose experiments, publish internal notes, and turn ideas into measurable gains
Requirements
- LLM Training
- Fine-tuning
- Python
- PyTorch
Qualifications
- 4-6 years of experience in ML/DS
- Direct LLM training/fine-tuning experience
Preferred Technologies
- LLM Training
- Fine-tuning
- Python
- PyTorch