Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Technical AI Ethicist / AI Red Teamer

AT Salesforce
Salesforce

Technical AI Ethicist / AI Red Teamer

Seattle, WA

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category
Data

Job Details

About Salesforce

We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good - you've come to the right place.

Want more jobs like this?

Get Data and Analytics jobs delivered to your inbox every week.

Select a location
By signing up, you agree to our Terms of Service & Privacy Policy.


Senior/Lead Technical AI Ethicist / [AI Red Teamer]

Salesforce's Office of Ethical and Humane Use is seeking an experienced responsible AI data scientist with an adversarial mindset and experience conducting ethical red teaming to contribute to our ethical red teaming practice. In this role, you will help us gain a deep understanding of how our models and products may be leveraged by malign actors or through unanticipated use to cause harm. In addition to adversarial testing, you will analyze current safety trends, and develop solutions to detect and mitigate risk, while working cross-functionally with security, engineering, data science, and AI Research teams. You will bring technical depth to the assessment of AI products, models, and applications, in order to identify the best technical mitigations to identified risks.

The ideal candidate will have technical experience in generative as well as predictive artificial intelligence.

Responsibilities:

  • Adversarial Testing

    • Provide technical leadership in designing, prototyping, and implementing comprehensive adversarial testing strategies, including both automated and manual adversarial testing approaches.
    • Mentor and guide stakeholder teams on adversarial testing best practices, helping them develop the skills to conduct their own testing effectively.
    • Collaborate with cross-functional teams to integrate OEHU adversarial testing frameworks into the AI development lifecycle.
  • Safety and Robustness

    • Contribute to the development of detection models, safety guardrails, and other proactive measures to prevent and mitigate risks posed by bad actors.
    • Research and implement state-of-the-art techniques for enhancing AI safety and robustness, drawing from both open-source and internal tools.
    • Collaborate with Salesforce's AI Research team on novel approaches to model safety.
  • Technical Research and Implementation

    • Write clean, efficient, and well-documented code (primarily in Python) to support research efforts and facilitate the evaluation of AI systems.
    • Develop and maintain a repository of reusable code modules and libraries to streamline adversarial testing processes.
  • Testing Execution and Collaboration

    • Participate in scoping, documenting, and executing tests with partner teams, including the implementation of mitigations identified during testing.
    • Test for technical vulnerabilities, model vulnerabilities, and harm/abuse including but not limited to bias, toxicity, and inaccuracy.
    • Participate in labeling test data in partnership with OEHU and partner teams
  • Reporting, Documentation, and Continuous Learning

    • Write reports covering the goals and outcomes of testing operations, including significant observations and recommendations.
    • Continuously monitor and analyze emerging threats and vulnerabilities to inform the development of adaptive safety measures.
    • Continue to grow expertise in model safety by keeping up with research in sociotechnical systems, privacy, interpretability/explainability, robustness, alignment, and responsible AI

Qualifications:

  • Several years of experience in responsible/ethical AI practice
  • 5-7 years of relevant experience in Data Science, Software Engineering, AI ethics, AI research, or similar roles
  • Experience designing and conducting ethical adversarial testing to identify harms such as bias/fairness, toxicity, mis and disinformation, data leakage and privacy violations, and hallucinations.
  • Experience creating heuristic-based detection logic and rules for identifying anomalous or suspicious activity in systems and networks (e.g. log analysis, user behavior analytics.)
  • Experience using SQL and relational databases. Ability to use Python, R, or other scripting languages to perform data analysis at scale.
  • Experience with problem-solving and troubleshooting complex issues with an emphasis on root-cause analysis.
  • Experience in analyzing complex, large-scale data sets and presenting findings to technical and non-technical audiences.
  • Proven organizational and execution skills within a fast-paced, multi-stakeholder environment.
  • Experience working in a technical environment with a broad, cross-functional team to drive results, define requirements, coordinate resources from other groups (design, legal, etc.), and deliver key milestones
  • Excellent written and oral communication skills, as well as interpersonal skills, including the ability to articulate technical concepts to both technical and non-technical audiences.
  • Works well under pressure, and is comfortable working in a fast-paced, ever-changing environment.

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement

At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at www.equality.com and explore our company benefits at www.salesforcebenefits.com.

Salesforce is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce does not accept unsolicited headhunter and agency resumes. Salesforce will not pay any third-party agency or company that does not have a signed agreement with Salesforce.

Salesforce welcomes all.

Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.

For New York-based roles, the base salary hiring range for this position is $165,600 to $227,700.

For Washington-based roles, the base salary hiring range for this position is $151,800 to $208,800.

For California-based roles, the base salary hiring range for this position is $165,600 to $227,700.

Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, benefits. More details about our company benefits can be found at the following link: https://www.salesforcebenefits.com.

Client-provided location(s): Seattle, WA, USA; San Francisco, CA, USA; New York, NY, USA; Palo Alto, CA, USA; Bellevue, WA, USA; Hillsboro, OR, USA
Job ID: Salesforce-JR260806
Employment Type: Full Time

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA
    • FSA With Employer Contribution
    • HSA
    • HSA With Employer Contribution
    • Fitness Subsidies
    • On-Site Gym
    • Mental Health Benefits
  • Parental Benefits

    • Adoption Leave
    • Return-to-Work Program
    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
  • Work Flexibility

    • Flexible Work Hours
    • Remote Work Opportunities
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Casual Dress
    • Happy Hours
    • Snacks
    • Some Meals Provided
    • Company Outings
  • Vacation and Time Off

    • Paid Vacation
    • Unlimited Paid Time Off
    • Paid Holidays
    • Personal/Sick Days
    • Leave of Absence
    • Sabbatical
    • Volunteer Time Off
  • Financial and Retirement

    • 401(K)
    • 401(K) With Company Matching
    • Company Equity
    • Stock Purchase Program
    • Performance Bonus
    • Relocation Assistance
    • Financial Counseling
  • Professional Development

    • Tuition Reimbursement
    • Learning and Development Stipend
    • Promote From Within
    • Mentor Program
    • Shadowing Opportunities
    • Access to Online Courses
    • Lunch and Learns
    • Internship Program
    • Leadership Training Program
    • Professional Coaching
    • Work Visa Sponsorship
  • Diversity and Inclusion

    • Employee Resource Groups (ERG)
    • Unconscious Bias Training
    • Diversity, Equity, and Inclusion Program

Company Videos

Hear directly from employees about what it is like to work at Salesforce.