Unstructured

Unstructured

Unstructured transforms natural language data for AI by providing open source tools for easy preprocessing and connection to large language models.

IT Services
1-10
Founded 2022
$25M raised

Description

  • Design and implement production-grade RAG pipelines and agentic workflows in Python.
  • Build systems for unstructured and multimodal data, including PDFs, scanned documents, images, and full motion video.
  • Own the lifecycle of AI solutions from initial research through AWS deployment.
  • Evaluate and prototype new models, including LLMs, embedding models, and object detection models.
  • Run experiments to validate approaches for SBIR and government deliverables.
  • Document architectures and contribute to technical reports for contract work.
  • Participate in pre-sales calls to help architect solutions for complex client needs.
  • Translate high-level government requirements into technical roadmaps.
  • Deploy, fine-tune, and optimize open-source models in restricted or air-gapped environments.
  • Work on scalable, performant systems where latency, cost, and accuracy are treated as first-class concerns.

Requirements

  • Proven experience deploying production RAG pipelines on real-world, messy datasets.
  • Deep expertise in agentic system design, including tool use and multi-agent orchestration.
  • Strong Python engineering skills with clean, scalable, maintainable code.
  • Experience operating in AWS and GovCloud environments.
  • Ability to work in restricted or air-gapped environments.
  • Experience deploying, fine-tuning, and optimizing open-source models when commercial APIs are unavailable.
  • Comfort translating high-level government requirements into technical roadmaps.
  • Familiarity with LangChain, LangGraph, CrewAI, or similar orchestration frameworks.
  • Experience fine-tuning NLP or object detection models is preferred.
  • Familiarity with LLM evaluation frameworks such as hallucination detection and drift monitoring is preferred.
  • Knowledge of government security standards and work across classification environments and on-prem environments is preferred.
  • Existing Secret or TS clearance, or clearance eligibility, is a significant plus.
  • Expert-level Python and SQL skills.
  • Experience with vector databases, graph databases, Elasticsearch, BM25, sentence transformers, and related RAG components.
  • Experience with AWS services such as SageMaker, Bedrock, S3, Lambda, Docker, and FastAPI.

Benefits

  • Competitive compensation package.
  • Stock options.
  • Remote full-time work arrangement.
  • Opportunity to work on cutting-edge machine learning projects.
  • Collaborative and innovative work environment.
  • Focus on learning and growth.
  • Opportunity to shape the company’s direction.
  • Impactful work on unstructured data processing for public sector clients.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Lead AI Engineer

Dreamix 51-250 Internet Software & Services

Dreamix is seeking a Lead AI Engineer to guide the development, deployment, and optimization of advanced AI and GenAI solutions for enterprise clients, working with cross-functional teams to deliver business value and better user experiences.

AWS BERT Docker Generative AI Kubernetes Machine Learning MLOps NumPy Pandas Python PyTorch Scikit-learn spaCy TensorFlow
11 minutes ago

Director, AI & Data Science

LRN 251-1K Professional Services

LRN is seeking a Director of AI & Data Science to lead applied AI strategy and delivery for its ethics and compliance SaaS platform, translating business and customer needs into production-ready ML and LLM solutions.

CI/CD Git LLM Machine Learning MLOps Node.js Python PyTorch Scikit-learn System Design
11 minutes ago

Senior Generative AI (Gen AI) Engineer

Innovation Team 51-250 Internet Software & Services

InnovationTeam is seeking a Senior Generative AI Engineer to design, develop, and deploy production-ready AI systems for enterprise and government use cases.

AWS Azure Deep Learning Docker GCP Generative AI Java Kubernetes LLM Machine Learning Microservices MLOps NLP OpenSearch Python PyTorch REST API Ruby TensorFlow
11 minutes ago

Senior Google AI Engineer

Credence Independent 1K-5K Internet Software & Services

Credence is hiring a Senior Google AI Engineer to design, build, and operationalize secure production AI/ML systems on Google Cloud for Department of Defense programs.

CI/CD GCP Generative AI Go LLM Looker MLOps Python SAP Secrets Management TypeScript Vertex AI
11 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers