null

Syncarp

Nashik • Not disclosed

Yesterday

On-Site

About the job

Hiring for a Global IT service provider, in AI Distinguished Forward Deployed Engineers. Experience : 8+ Years, based out of Chennai Role Summary : You represent the pinnacle of Applied AI engineering. You are not just using APIs; you are optimizing the models themselves. You understand the mathematics behind the attention mechanism, you know how to squeeze performance out of GPUs, and you can customize models for specific domains. You provide the high-level technical vision and handle the most difficult edge cases. Key Responsibilities : Model Fine-Tuning : Implement PEFT (Parameter-Efficient Fine-Tuning), LoRA, and QLoRA to adapt open-source models (Llama 3, Mistral) to specific client domains. Optimization & Quantization : Perform model quantization to reduce inference costs and latency without sacrificing quality. Manage Dense Vectors and embedding optimizations. State-of-the-Art Exploration : Continuously research and implement the latest advancements (e.g., State Space Models, Long-Context optimizations) into client deliverables. Strategic Consulting : Act as a trusted advisor to C-level client executives, defining the "Art of the Possible" and guiding long-term AI roadmaps. Technical Requirements : Deep Learning : PyTorch / TensorFlow, Transformers architecture internals, Attention mechanisms. Model Ops : Serving custom models (vLLM, TGI), GPU memory management, Quantization techniques (GGUF, AWQ). Advanced Data : Training data curation, synthetic data generation, RLHF concepts. Leadership : Ability to define the technical culture and set standards for the entire FDE organization.

Requirements

Applied AI Engineering
Model Optimization
Deep Learning

Qualifications

NIT / IIT Education

Preferred Technologies

Applied AI Engineering
Model Optimization
Deep Learning

About the company

null

Similar Jobs

null

Wipro

Hyderabad•Not disclosed

Yesterday•On-Site

null

MPC Cloud Consulting Pvt Ltd

Gurugram•Not disclosed

2 weeks ago•On-Site

null

Infosys

India•Not disclosed

3 weeks ago•On-Site