Responsibilities
Trust and Safety RD team is fast growing and responsible for building machine learning models and systems to identify and defend internet abuse and fraud on our platform. Our mission is to protect billions of users and publishers across the globe every day. We embrace state-of-the-art machine learning technologies and scale them to detect and improve the tremendous amount of data generated on the platform. With the continuous efforts from our team, TikTok is able to provide the best user experience and bring joy to everyone in the world.
We are looking for researchers in the Large Language Model (LLM) domain who are going to conduct research on single modality and multi modality LLM pretraining and applications including in-context learning (ICL), supervised fine-tuning (SFT) and reinforcement learning based alignment. We are looking forward to applying LLM in trust and safety business scenarios so that we can protect our users and creators with the best moderation quality and cost efficiency. There are no doubt a lot of unsolved problems in the LLM domain which could have a huge impact on industry and academia. In the Trust & Safety team, we have real applications, resources and patience for technology incubation.
Want more jobs like this?
Get Data and Analytics jobs in San Jose, CA delivered to your inbox every week.
Your main responsibilities will be
- Lead the incubation of next-generation, high-capacity LLM solutions for the Trust & Safety business
- Identify research problems and dive deep for innovative solutions
- Work closely with cross-functional teams to plan and implement projects harnessing LLMs for diverse purposes and vertical domains
- Extend the insights and impact from industry to academia
Qualifications
Minimum Qualifications
- Ph.D in Computer Science, Data Science, Artificial Intelligence, or a related field
- Strong understanding of cutting-edge LLM research (e.g., long context, multi modality, alignment research, agent ecosystem, etc.) and possess practical expertise in effectively implementing these advanced systems as a plus
- Proficiency in programming languages such as Python, Rust, or C++ and a track record of working with deep learning frameworks (e.g., pytorch, deepspeed, megatron, vllm, etc.).
- Strong understanding of distributed computing framework & performance tuning and verification for training/finetuning/inference; Being familiar with PEFT, RL, MoE, CoT or Langchain is a plus.
Preferred Qualification
- Excellent problem-solving skills and a creative mindset to address complex AI challenges. Demonstrated ability to drive research projects from idea to implementation, producing tangible outcomes.
- Published research papers or contributions to the LLM community would be a significant plus.
- Experience with inference tuning and Inference acceleration. Have a deep understanding of GPU and/or other AI accelerators, experience with large scale AI networks, pytorch 2.0 and similar technologies.
- Experience with evaluation of AI systems, LLM application & agent development is desirable.