Foundation Models for Data Research Intern: 2025

AT IBM

Albany, NY

Introduction
IBM Research Scientists are charting the future of Artificial Intelligence, creating breakthroughs in quantum computing, discovering how blockchain will reshape the enterprise, and much more. Join a team that is dedicated to applying science to some of today's most complex challenges, whether it's discovering a new way for doctors to help patients, teaming with environmentalists to clean up our waterways or enabling retailers to personalize customer service.

Your Role and Responsibilities

This is a 2025 summer internship running from May to August, or from June to September for students at quarter-system schools.

We are broadly interested in further improving the capabilities of foundation models (FMs) for a range of data management tasks such as data discovery, metadata enrichment, data access and retrieval with querying, and automated data-driven insights.

Topics of interest include research on interactive orchestration of data workflows, such as natural language to data insights spanning multiple tools and functions; knowledge-driven data discovery and querying with graphs and multi-modal FMs; step-by-step planning and reasoning for complex data workflows; and low-computational-cost inference techniques for FMs to efficiently automate data tasks or assist users with them.
We are looking for interns with skills and interests in the following areas:

  • [LLM for code generation] Research on the effective use of foundation models in code generation pipelines specific to data tasks, such as SQL for data retrieval
  • [Agents and Reasoning] Research on developing novel autonomous agentic systems to compete on public Text-to-SQL leaderboards such as BIRD and Spider 2.0
  • [Knowledge Graphs, Multi-Modal FMs] Research on novel ways to combine foundation models, knowledge graphs, and multi-modal data to improve tasks such as data discovery and automated Text-to-SQL
  • [FM Inference] Research on improving foundation model inference in terms of both answer generation and computational cost

Required Technical and Professional Expertise

  • Applicants should be PhD or MS students pursuing graduate studies in computer science or a related field.
  • At least one research publication at a top AI conference.
  • Familiarity and working experience with large language models.

Preferred Technical and Professional Expertise

  • Familiarity with knowledge graphs, RAG, and agentic frameworks.
  • Familiarity with reinforcement learning, knowledge distillation and prompt optimization.
  • Familiarity with SQL.

Client-provided location(s): Albany, NY, USA; San Jose, CA, USA; Cambridge, MA, USA; Yorktown Heights, NY 10598, USA
Job ID: IBM-21087648
Employment Type: Intern
