Introduction
Want to be a part of preparing and governing data for IBM's Granite models? We are a group of scientists, engineers, and designers working on the state-of-the-art Data and Model Factory that produces all of IBM's Granite models. Our work enables and accelerates the entire data pipeline, from data clearance and acquisition to engineering. This data is used in pre-training, fine-tuning, instruction-tuning, and RAG solutions powered by IBM Granite. We thrive on open-source innovation, responsible use of data and AI, and collaboration across disciplines, including backend engineering, data science, distributed computing, and natural language processing, among others.
Your Role and Responsibilities
This is a 2025 summer internship with the following dates: May - August, or June - September for students at quarter-system schools.
During your internship, you can expect to work on challenging research problems and produce cutting-edge solutions in a diverse and nurturing research environment. You'll learn and practice how to define problems, build prototypes, test hypotheses, and deploy results. In the past, interns have also authored papers, filed patents, and contributed to open-source projects.
Required Technical and Professional Expertise
- Applicants should be enrolled in a Master's program and have a background in a science, technology, engineering, or mathematics discipline.
- Your areas of experience should include foundation models, machine learning, AI, natural language processing, data engineering, cloud computing, database management, and other computer science and engineering topics.
Preferred Technical and Professional Expertise
- Applicants should be enrolled in a Doctoral program and have a background in a science, technology, engineering, or mathematics discipline.