Meta is seeking a Research Scientist to join its Fundamental AI Research (FAIR) team, a research organization focused on making significant advances in AI. We publish groundbreaking papers and release frameworks and libraries that are widely used in the open-source community. The team is working on the industrial leading research on building foundation models for audio understanding and audio generation. We are also closely working with vision research teams on pushing the frontier of multimodality (audio, video, language) research. Individuals in this role will work with an interdisciplinary team of scientists, engineers, and cross-functional partners with a broad range of experiences, perspectives, approaches, and backgrounds, and access cutting-edge technology, resources, and research facilities.
Want more jobs like this?
Get Science and Engineering jobs in Menlo Park, CA delivered to your inbox every week.
Research Scientist, Speech & Audio - FAIR (PhD) Responsibilities:
- Develop algorithms based on state-of-the-art machine learning and neural network methodologies.
- Work with and create large datasets.
- Conduct research to advance the science and technology of intelligent machines.
- Conduct research towards long-term ambitious research goals while identifying intermediate milestones.
- Conduct research that enables learning the semantics of data across multiple modalities (speech, audio, images, video, text, and other modalities).
- Open source high quality code and reproducible results for the community.
- Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta.
- Currently has or is in the process of obtaining a PhD in the field of Speech, Audio, Language, Machine Learning, a related field, or equivalent practical experience. Degree must be completed prior to joining Meta.
- Research and/or hands-on experience in one or more of the following areas: audio (speech, sound, or music) generation, text-to-speech (TTS) synthesis, text-to-music generation, text-to-sound generation, speech recognition, speech / audio representation learning, vision perception, image / video generation, video-to-audio generation, audio-visual learning, audio language models, lip sync, lip movement generation / correction, lip reading, etc.
- Experience with Python and PyTorch.
- Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.
- Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as publications at leading workshops, journals or conferences such as ICML, NeuRIPS, ICLR, ICASSP, Interspeech, ACL, EMNLP, CVPR, and other similar venues.
- Demonstrated software engineer experience via an internship, work experience, coding competitions, or used contributions in open source repositories (e.g. GitHub)
- Experience solving complex problems and comparing alternative solutions, tradeoffs, and different perspectives to determine a path forward
- Experienced in large-scale data processing
- Experience communicating research findings to public audiences of peers.
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.
$117,000/year to $173,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.