A

Senior Data Scientist— LLM Training & Fine-tuning

Apna
Bangalore Not disclosed
Yesterday
On-Site
Apply to Job

About the job

Job Title Senior Data Scientist— LLM Training & Fine-tuning (Indian Languages, Tool Calling, Speed) Location: Bangalore About the Role We’re looking for a hands-on Data Scientist / Research Scientist who can fine-tune and train open-source LLMs end-to-end —not just run LoRA scripts. You’ll own model improvement for Indian languages + code-switching (Hinglish, etc.) , instruction following , and reliable tool/function calling , with a strong focus on latency, throughput, and production deployability. This is a builder role: you’ll take models from research → experiments → evals → production. What You’ll Do (Responsibilities) • Train and fine-tune open LLMs (continued pretraining, SFT, preference optimization like DPO/IPO/ORPO, reward modeling if needed) for: - Indian languages + multilingual / code-switching - Strong instruction following - Reliable tool/function calling (structured JSON, function schemas, deterministic outputs) • Build data pipelines for high-quality training corpora: Instruction datasets, tool-call traces, multilingual data, synthetic data generation - De-duplication, contamination control, quality filtering, safety filtering • Develop evaluation frameworks and dashboards: - Offline + online evals, regression testing - Tool-calling accuracy, format validity, multilingual benchmarks, latency/cost metrics • Optimize models for speed and serving : - Quantization (AWQ/GPTQ/bnb), distillation, speculative decoding, KV-cache optimizations - Serve via vLLM/TGI/TensorRT-LLM/ONNX where appropriate • Improve alignment and reliability : - Reduce hallucinations, improve refusal behavior, enforce structured outputs - Prompting + training strategies for robust compliance and guardrails • Collaborate with engineering to ship: Model packaging, CI for evals, A/B testing, monitoring drift and quality • Contribute research: Read papers, propose experiments, publish internal notes, and turn ideas into measurable gains.

Requirements

  • LLM Training
  • Fine-tuning
  • Python
  • PyTorch

Qualifications

  • 4 - 6 years in ML/DS
  • Direct LLM training/fine-tuning experience

Preferred Technologies

  • LLM Training
  • Fine-tuning
  • Python
  • PyTorch

Similar Jobs

Zorba Consulting

Senior Data Science Consultant

Zorba Consulting

NoidaNot disclosed
11 hours agoOn-Site
VMC Soft Technologies, Inc

Senior AI / Data Scientist

VMC Soft Technologies, Inc

JunagadhNot disclosed
2 days agoOn-Site
VMC Soft Technologies, Inc

Senior AI / Data Scientist

VMC Soft Technologies, Inc

ErodeNot disclosed
3 days agoRemote