Sr. Data Scientist, Responsible AI

3 hours, 32 minutes ago
Full-time
Senior
Data Science and Analytics
Pinterest

Pinterest

Pinterest is the world's first visual discovery engine, offering a vast dataset of ideas with over 200 billion recipes, home hacks, and style inspiration. With a mission to inspire everyone to create a life they love, Pinterest empowers its employees t...

Internet Software & Services
5K-10K
Founded 2010

Description

  • Design and develop automated adversarial testing methodologies for Pinterest's generative AI products, including single-turn, multi-turn, and multimodal attack strategies.
  • Build and calibrate hybrid evaluation pipelines using LLM-based judges, classifiers, and rule-based systems to detect safety violations, policy breaches, bias, and representational harms.
  • Develop and operationalize harm taxonomies aligned with industry standards and Pinterest's Responsible AI and Trust & Safety threat models.
  • Create adaptive refinement loops that use attack outcomes to discover deeper and previously unknown vulnerabilities.
  • Apply scientific rigor and statistical methods to AI safety evaluation, including benchmark dataset construction, calibration, and success-metric definition.
  • Partner cross-functionally with ML engineers, Trust & Safety specialists, policy teams, product managers, and legal partners to support safe product launches.
  • Communicate key findings proactively and help translate safety insights into product, policy, and engineering actions.
  • Mentor junior data scientists and up-level cross-functional partners on adversarial evaluation and responsible AI practices.

Requirements

  • 5+ years of experience analyzing data in a fast-paced, data-driven environment with applied scientific methods on web-scale data.
  • Hands-on experience in one or more of the following areas: AI safety, adversarial machine learning, red teaming, responsible AI, or trust & safety.
  • Deep familiarity with large language models and generative AI systems, including failure modes such as prompt injection, jailbreaks, bias, and safety violations.
  • Experience designing and calibrating AI evaluation frameworks, including LLM-as-judge, classifier-based evaluation, and benchmark dataset construction.
  • Strong quantitative programming skills in Python and data manipulation skills in SQL and Spark.
  • Experience with ML pipelines and large-scale experimentation.
  • Familiarity with AI safety taxonomies and frameworks such as OWASP LLM Top 10 or MITRE ATLAS, preferred.
  • Ability to work independently, drive ambiguous projects end-to-end, and operate with high ownership.
  • Excellent written and verbal communication skills for technical and non-technical audiences.
  • Ability to collaborate across Responsible AI, Trust & Safety, Product, Engineering, Policy, and Legal teams.

Benefits

  • Base salary range of $139,764 to $287,749 USD for US-based applicants.
  • Eligible for equity.
  • Benefits information is available through Pinterest's careers site.
  • Flexible work culture with PinFlex, emphasizing the flexibility to do your best work.
  • Relocation assistance is not available for this position.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Binance Accelerator Program - Data Scientist (LLM & Trading)

Binance 5K-10K Capital Markets

Binance’s Accelerator Program is seeking an early-career data scientist in Hong Kong/Taipei to help develop a Web3 AI-powered financial risk control system and related AI workflow projects.

Blockchain Go Python System Design
2 hours, 27 minutes ago

Senior Data Scientist

Multi Media 51-250 Internet Software & Services

Multi Media LLC is seeking a Senior Data Scientist to join its AI and Machine Learning team, where the work focuses on analyzing complex product and business challenges for Chaturbate’s large-scale live streaming platform.

Machine Learning Python SQL Statistics
2 hours, 36 minutes ago

AI Engineer | Data Scientist - (WFH) #34955

This role leads the architecture and delivery of agentic AI systems at a company building production AI solutions for real-world operational use cases.

AWS Docker IoT LLM Machine Learning MLOps Python Reinforcement Learning
4 hours, 25 minutes ago

Data Scientist II, Marketplace

Thumbtack 1K-5K Construction & Engineering

Thumbtack is seeking a Data Scientist to support its Fulfillment pillar by using data to improve marketplace matching, pricing, and demand fulfillment for pros and homeowners.

SQL Statistics
4 hours, 55 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers