Prompt Engineer (LLM Systems, Evals & Safety)

9 hours, 1 minute ago
Mid Level
Software Development
Webook

WEbook.com is a writing community platform empowering writers, shaping future literary stars, and transforming the publishing world.

Media
1-10
Founded 2007

Description

  • Author, refactor, and chain prompts for a range of LLM tasks.
  • Design and maintain system, tool, and policy instructions.
  • Create offline and online evaluation harnesses, including rubrics, golden sets, and metrics.
  • Build and manage prompt libraries with versioning, A/B testing, and telemetry.
  • Reduce hallucinations through verification, constrained decoding, and tool use.
  • Implement safety checks for jailbreaks, prompt injection, content policy, and PII handling.
  • Partner with engineers to integrate prompts into production features.
  • Own evaluation and continuous improvement for LLM feature quality, safety, and cost effectiveness.

Requirements

  • Demonstrated prompt design experience across multiple task types and models.
  • Experience building evaluation datasets and automated scoring for accuracy, faithfulness, utility, cost, and latency.
  • Familiarity with retrieval-augmented generation concepts and tool/function calling.
  • Strong scripting skills in Python or TypeScript for data preparation, evaluation, and analysis.
  • Ability to translate business goals into measurable prompt specifications.
  • Experience with LangChain, LLM orchestration, vector stores, and rerankers is a plus.
  • Knowledge of safety tooling and red-teaming techniques is a plus.
  • Experience with experimentation tooling (feature flags, A/B tests) and analytics is a plus.
