About the job
Key Responsibilities • Collaborate directly with the founding team on core AI model development and strategy • Fine-tune and customize LLMs for specific use cases and performance requirements • Evaluate and benchmark different language models to determine optimal solutions • Optimize model performance using LoRA, QLoRA, and other parameter-efficient methods • Implement and experiment with RLHF workflows • Design and execute training pipelines for custom model development • Research and implement cutting-edge techniques in model optimization and efficiency • Create new AI solutions that solve real-world problems at scale • Lead technical initiatives and mentor junior team members as the team grows Requirements • Demonstrate deep understanding of LLM architectures such as transformer models and attention mechanisms • Apply hands-on experience fine-tuning LLMs like GPT, LLaMA, and Mistral in production environments • Show strong knowledge of training and fine-tuning processes including data preparation, hyperparameter optimization, and evaluation • Use LoRA, QLoRA, and parameter-efficient fine-tuning techniques effectively • Implement RLHF and human preference learning in practice • Program at an expert level in Python with deep learning frameworks like PyTorch and Transformers • Understand distributed training and model parallelization techniques • Hold a bachelor’s degree from IIT, NIT, or BITS in computer science or a related field • Demonstrate strong computer science fundamentals in algorithms, data structures, and system design • Apply a solid mathematical foundation in linear algebra, statistics, and optimization • Possess 0–1 years of hands-on experience with deep learning and LLMs • Prove successful fine-tuning and deployment of models in production • Use model evaluation frameworks and benchmarking methodologies • Work with MLOps tools and model deployment pipelines • Optimize GPU performance and ensure efficient model serving • Work independently and drive projects to completion as a self-starter • Stay motivated and passionate about pushing AI boundaries • Solve complex technical challenges and thrive in ambiguous environments • Debug and optimize complex systems with strong problem-solving skills • Collaborate effectively with excellent communication skills • Stay research-oriented and up to date with the latest AI developments Why This Role Is Special • Work directly with founders on core product decisions and technical strategy • Shape the AI architecture from inception to production scale • Lead research initiatives and influence the direction of AI capabilities • Access cutting-edge research and implement the latest techniques • Work with state-of-the-art hardware and computational resources • Collaborate with brilliant minds and learn from industry experts • Join as a founding team member with significant equity and growth potential • Take on leadership opportunities as the team expands • Gain industry recognition through publications and open-source contributions • Work primarily in-office at the CBD Bangalore location for maximum collaboration
About the company
Icecreamlabs is an AI venture studio dedicated to building AI-first startups that tackle complex enterprise challenges. Founded by IIT and Stanford alumni, we are a fast-moving team focused on developing next-generation AI agents for enterprise applications. Our passion lies in solving intricate problems, experimenting with cutting-edge AI technologies, and shaping the future of work through intelligent automation. We value curiosity, speed, and creativity, thriving in the dynamic environment of an early-stage startup.
Similar Jobs
AI Engineer
Infiswift Technologies
AI Engineer
Vista Applied Solutions Group Inc
AI Engineer
JRD Systems