Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a leading technology services and consulting company focused on building innovative solutions that address clients’ most complex digital transformation needs. Leveraging our holistic portfolio of capabilities in consulting, design, engineering, and operations, we help clients realize their boldest ambitions and build future-ready, sustainable businesses. With over 230,000 employees and business partners across 65 countries, we deliver on the promise of helping our customers, colleagues, and communities thrive in an ever-changing world. For additional information, visit us at www.wipro.com.
Job Description
Job Title: AI Researcher (SFT, RLHF, RL Environments & Model Evaluation)
About the RoleWe are seeking an AI Researcher with strong hands-on experience in Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), RL environments (gyms), and model evaluation. The role focuses on training, aligning, and evaluating models—particularly for STEM, coding, robotics, reasoning, and real-world problem-solving capabilities.You will help build systems that not only perform well on benchmarks, but also reason effectively, generalize to real-world scenarios, and align with human intent.Key ResponsibilitiesDesign and implement SFT pipelines for training models on STEM subjects, coding tasks, robotics concepts, logical reasoning, and real-world problem-solvingDevelop and execute RLHF workflows, including preference data collection, reward modeling, and policy optimizationCreate and maintain RL environments / gyms for reasoning tasks, coding challenges, robotics simulations, and applied real-world scenariosTrain models to improve step-by-step reasoning, tool use, and structured problem solvingDesign and run model evaluation frameworks covering:STEM and mathematical reasoningCode correctness, efficiency, and robustnessRobotics task success and planningReal-world decision-making and generalizationPerform error analysis to identify reasoning failures, hallucinations, or misalignmentCollaborate with engineers, educators, and domain experts to curate high-quality training and evaluation datasetsTranslate research insights into scalable, production-ready training and evaluation systemsDocument experiments, results, and best practices with strong reproducibility standardsRequired QualificationsStrong background in machine learning, reinforcement learning, or AI researchHands-on experience with SFT and RLHF, especially for reasoning-intensive tasksExperience building or using RL gyms / environments, including task-driven or simulation-based setupsSolid understanding of model evaluation, including automated metrics and human-in-the-loop evaluationProficiency in Python and ML frameworks such as PyTorchAbility to reason deeply about model behavior, generalization, and alignmentExperience training or evaluating models on STEM, coding, or real-world problem domainsPreferred / Nice-to-HaveExperience with LLMs, multimodal models, or foundation modelsBackground in robotics, simulation environments, or embodied AIFamiliarity with program synthesis, code evaluation, or formal reasoningExperience with large-scale or distributed trainingInterest or experience in AI safety, alignment, or robustnessPublications, open-source contributions, or applied research experienceWhat We OfferOpportunity to work on cutting-edge AI reasoning and alignment challengesDirect impact on real-world AI capabilities in STEM, coding, and roboticsCollaborative, research-driven environmentCompetitive compensation and benefits
͏
DO:
- At least 15+ years of experience in selling IT Services in Tier-1 or Tier-2 competitive organizations.
- Strong knowledge of global delivery model (GDM) and methodologies. Should be familiar with cross selling various service lines for customers
- Ability to present and interact at all levels, and have consultative sales capability.
- Ability to work and collaborate across other teams in various service lines and anchor together for the account.
- Exposure to delivery, sales or pre-sales roles will be required
- Should have managed a multi-million USD account, across various geos.
- Strong Account Management - building and managing client relationships at the all levels.
- Carry targets on revenue, bookings and OM.
- Get involved in resolving any people management issue within Wipro teams
- Generating leads by interacting with the customers in various lines of business to expand our footprint.
- Presenting and publishing the proposals (proactive ones as well as responses to RFP/RFIs)
- Interacting with Procurement and Supplier relationship team from customer organization and maintain smoother flow of contracts, invoices and payments.
- Work closely with senior customer team (CIO, VPs and Directors) to suggest, advice, evaluate, and prime business growth
ÃÂ
͏
͏
͏
Expected annual pay for this role ranges from $200,000.00 to $280,000.00. Based on the position, the role is also eligible for Wipro’s standard benefits including a full range of medical and dental benefits options, disability insurance, paid time off (inclusive of sick leave), other paid and unpaid leave options
Reinvent your world. We are building a modern Wipro. We are an end-to-end digital transformation partner with the boldest ambitions. To realize them, we need people inspired by reinvention. Of yourself, your career, and your skills. We want to see the constant evolution of our business and our industry. It has always been in our DNA - as the world around us changes, so do we. Join a business powered by purpose and a place that empowers you to design your own reinvention.
Applications from people with disabilities are explicitly welcome.