Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Research Scientist Intern, Gen AI Language for Llama Data (PhD)

AT Meta
Meta

Research Scientist Intern, Gen AI Language for Llama Data (PhD)

Menlo Park, CA

Meta was built to help people connect and share, and over the last decade our tools have played a critical part in changing how people around the world communicate with one another. With over a billion people using the service and more than fifty offices around the globe, a career at Meta offers countless ways to make an impact in a fast growing organization.Meta is seeking Research Interns to join the Llama pre-training data team to advance the state of the art of our Generative AI efforts . We are committed to advancing the field of artificial intelligence by making fundamental advances in technologies to help interact with and understand our world. We are seeking individuals passionate in areas such as deep learning, computer vision, audio and speech processing, natural language processing, machine learning, reinforcement learning, computational statistics, and applied mathematics. Our interns have an opportunity to make core algorithmic advances and apply their ideas at an unprecedented scale.Our internships are twelve (12) to twenty-four (24) weeks long and we have various start dates throughout the year.

Want more jobs like this?

Get jobs in Menlo Park, CA delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


Research Scientist Intern, Gen AI Language for Llama Data (PhD) Responsibilities:
  • Develop novel state-of-the-art methods and algorithms to automatically curate pre-training data for training Large Language Models
  • Help analyze and improve safety and robustness of existing methods and algorithms for model-based data curation
  • Perform research to advance the science and technology of model-based data curation
  • Collaborate with researchers from Llama Data and Llama Pre-training teams and cross-functional partners including communicating research plans, progress, and results
  • Disseminate research results
  • Publish research results and contribute to research that can be applied to Meta product development
Minimum Qualifications:
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Computer Vision, Audio Processing, Artificial Intelligence, Generative AI, or relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Research experience in machine learning, deep learning, computer vision and/or natural language processing
  • Experience with Python, C++, C, Java or other related languages
  • Experience with deep learning frameworks such as Pytorch or Tensorflow
Preferred Qualifications:
  • Intent to return to the degree program after the completion of the internship/co-op
  • Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as first-authored publications at leading workshops or conferences such as NeurIPS, ICLR, AAAI, RecSys, KDD, IJCAI, CVPR, ECCV, ACL, NAACL, EACL, ICASSP, or similar
  • Experience working and communicating cross functionally in a team environment
  • Publications or experience in machine learning, AI, computer vision, optimization, computer science, statistics, applied mathematics, or data science
  • Experience solving analytical problems using quantitative approaches
  • Experience setting up ML experiments and analyzing their results
  • Experience manipulating and analyzing complex, large scale, high-dimensionality data from varying sources
  • Experience in utilizing theoretical and empirical research to solve problems
  • Demonstrated software engineer experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. GitHub)
  • Experience with data curation for pre-training data necessary to train Large Language Models
  • Experience with prompting and evaluating Large Language Models
  • Experience with PySpark for processing large amounts of text data
About Meta:

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.

$7,800/month to $11,293/month + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.

Client-provided location(s): Menlo Park, CA, USA
Job ID: a1KDp00000E2KNqMAN
Employment Type: Intern

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA
    • FSA With Employer Contribution
    • HSA
    • HSA With Employer Contribution
    • Fitness Subsidies
    • On-Site Gym
    • Mental Health Benefits
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
  • Work Flexibility

    • Flexible Work Hours
    • Remote Work Opportunities
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Commuter Benefits Program
    • Casual Dress
    • Happy Hours
    • Snacks
    • Some Meals Provided
    • Company Outings
    • On-Site Cafeteria
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Unlimited Paid Time Off
    • Paid Holidays
    • Personal/Sick Days
    • Sabbatical
    • Leave of Absence
  • Financial and Retirement

    • 401(K)
    • 401(K) With Company Matching
    • Pension
    • Company Equity
    • Performance Bonus
    • Relocation Assistance
    • Financial Counseling
  • Professional Development

    • Learning and Development Stipend
    • Promote From Within
    • Mentor Program
    • Shadowing Opportunities
    • Access to Online Courses
    • Lunch and Learns
    • Internship Program
  • Diversity and Inclusion

    • Diversity, Equity, and Inclusion Program
    • Employee Resource Groups (ERG)
    • Founder led

Company Videos

Hear directly from employees about what it is like to work at Meta.