About the job
About Us DataWeave is a SaaS-based digital commerce analytics platform that empowers retailers with competitive intelligence and equips consumer brands with digital shelf analytics globally. Using proprietary AI technology, DataWeave analyzes over 500+ billion data points across 400,000+ brands, 4,000+ websites, and 20+ industry verticals. Our clients include Nordstrom, Overstock, The Home Depot, Mars, Bush Brothers, Mondelez, Pernod Ricard, and more. We are a globally distributed team of 220+ engineers, product managers, and eCommerce experts with technology offices in Bangalore. What We Offer • Opportunities to work on cutting-edge AI research in NLP, Computer Vision, and Large Language Models (LLMs) • Immediate impact on product and business decisions in the retail/eCommerce domain • End-to-end ownership of projects from ideation to deployment • Culture of openness, collaboration, and mentorship • Flexible work environment and continuous learning opportunities • Competitive rewards and fast-paced career growth Role Overview The Lead / Sr Data Scientist will drive AI innovation and solve complex business problems in the retail domain. The role involves developing production-ready ML / DL models, leading research initiatives, mentoring junior team members, and ensuring AI solutions align with product strategy. Responsibilities • Build robust ML models using state-of-the-art architectures for NLP, Computer Vision, and Deep Learning • Solve complex retail problems such as product matching, attribute extraction, and price optimization • Optimize models for scalability, efficiency, and deployment with MLOps best practices • Take end-to-end ownership of AI projects, from research to production • Mentor and guide junior team members, fostering a culture of innovation and collaboration • Collaborate with cross-functional teams to translate business problems into AI solutions Required Qualifications • Bachelor’s degree in Computer Science, Data Science, Mathematics, or a related field • 6+ years of hands-on experience in AI / ML development (3+ years acceptable with exceptional expertise in GenAI / LLMs) • Expert-level Python proficiency with experience in PyTorch or TensorFlow • Strong experience in Generative AI, LLMs, vision-language models, and multimodal systems • Hands-on experience with NLP and CV libraries: SpaCy, NLTK, HuggingFace Transformers, OpenCV • Experience in model training, fine-tuning, quantization, evaluation, and deployment of transformer-based models (BERT, GPT, T5, LLaMA, etc.) • Familiarity with model optimization and scalability techniques (quantization, distillation, pruning, ONNX, TensorRT-LLM, DeepSpeed, etc.) • Strong understanding of LLM ecosystems including OpenAI, Anthropic, Meta, Google, Mistral, AWS Bedrock • Proven ability to lead projects and mentor teams in a high-velocity product environment Preferred / Good to Have • Master’s or PhD in Computer Science, AI / ML, Applied Math, or related fields • Experience in startups or high-growth environments with ownership mindset • Building full MLOps pipelines (MLFlow, Kubeflow, Airflow, SageMaker, Vertex AI) • LLM fine-tuning and parameter-efficient training (PEFT: LoRA, QLoRA, DoRA, Adapters, etc.) • Experience with LangChain, LangGraph, LlamaIndex, and multi-agent workflows • Building Retrieval-Augmented Generation (RAG) pipelines using vector DBs like Pinecone, Chroma, Qdrant, Weaviate, or FAISS • Practical experience in evaluating LLM applications using Ragas, DeepEval, Promptfoo, or custom frameworks • Knowledge of modern research in Transformer optimizations, self-supervised learning, agentic AI, and efficient training frameworks • Contributions to open-source ML / AI projects, publications, or active participation in research communities
Requirements
- AI
- ML
- Python
- NLP
- Computer Vision
- Deep Learning
Qualifications
- Bachelor’s degree
- Master’s degree
- PhD
Preferred Technologies
- AI
- ML
- Python
- NLP
- Computer Vision
- Deep Learning
About the company
DataWeave is a SaaS-based digital commerce analytics platform that empowers retailers with competitive intelligence and equips consumer brands with digital shelf analytics globally. Using proprietary AI technology, DataWeave analyzes over 500+ billion data points across 400,000+ brands, 4,000+ websites, and 20+ industry verticals. Our clients include Nordstrom, Overstock, The Home Depot, Mars, Bush Brothers, Mondelez, Pernod Ricard, and more.
Similar Jobs
Lead Data Scientist
Whatjobs
Senior / Lead Data Scientist
Tiger Analytics
Senior / Lead Data Scientist
Tiger Analytics