Introduction
The Responsible Tech team's research is focused on the intersection of technology and society. They study and devise approaches to mitigate technology risks across research and development processes, and foster innovations that expand societal benefits of technology.
Your role and responsibilities
We are seeking a motivated intern with a background in computer science, artificial intelligence, applied mathematics, computational linguistics, or a related field to leverage state-of-the-art reinforcement learning techniques for language model alignment. In this role, you will have the freedom to explore multiple research directions: developing sophisticated reward models that capture human values by effectively decomposing complex value alignment into learnable components; creating synthetic training data through policy-guided rejection sampling; and implementing RL for alignment. We're particularly interested in approaches that can leverage human feedback efficiently, scale to large language models, and provide verifiable alignment guarantees. Candidates currently enrolled in graduate programs are encouraged to apply.
Required education
Bachelor's Degree
Required technical and professional expertise
- Strong experience in deep RL
- Familiarity with language models and alignment challenges
- Machine Learning
- Advanced experience with Python, PyTorch, TensorFlow
- Experience with reward modeling and synthetic data generation is highly valued
Preferred technical and professional experience
- Cloud-based computation
- Hands-on experience with fine-tuning of large language models
- Experience working with Hugging Face models and data
ABOUT BUSINESS UNIT
IBM Research is the organic growth engine of IBM and an innovation engine for our customers and partners. As part of this mission, IBM Research anticipates and examines 'What's Next in Computing' to ultimately create and integrate the technologies the world relies upon to solve big challenges and unlock new opportunities. We create and pioneer new markets for IBM, our partners and customers as exemplified in our ongoing quest to reach practical and large-scale quantum computing. Across IBM Research, we realize the power and potential to accelerate discovery with our partners and clients by combining the power of high performance computing, AI, and Quantum, all integrated through the hybrid cloud.
YOUR LIFE @ IBM
In a world where technology never stands still, we understand that dedication to our clients' success, innovation that matters, and trust and personal responsibility in all our relationships live in what we do as IBMers as we strive to be the catalyst that makes the world work better.
Being an IBMer means you'll be able to learn and develop yourself and your career. You'll be encouraged to be courageous and experiment every day, all whilst having continuous trust and support in an environment where everyone can thrive, whatever their personal or professional background.
Our IBMers are growth-minded, always staying curious, open to feedback, and learning new information and skills to constantly transform themselves and our company. They are trusted to provide ongoing feedback to help other IBMers grow, as well as collaborate with colleagues, keeping in mind a team-focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions every day is essential to IBM becoming the catalyst for progress, always embracing challenges with the resources they have to hand, a can-do attitude, and always striving for an outcome-focused approach within everything that they do.
Are you ready to be an IBMer?
ABOUT IBM
IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.
Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world but also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.
At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.
OTHER RELEVANT JOB DETAILS
For additional information about location requirements, please discuss with the recruiter following submission of your application.