Unstructured

Unstructured

Unstructured transforms natural language data for AI by providing open source tools for easy preprocessing and connection to large language models.

IT Services
1-10
Founded 2022
$25M raised

Description

  • Design and implement production-grade RAG pipelines and agentic workflows in Python.
  • Build systems for unstructured and multimodal data, including PDFs, scanned documents, images, and full motion video.
  • Own the lifecycle of AI solutions from initial research through AWS deployment.
  • Evaluate and prototype new models, including LLMs, embedding models, and object detection models.
  • Run experiments to validate approaches for SBIR and government deliverables.
  • Document architectures and contribute to technical reports for contract work.
  • Participate in pre-sales calls to help architect solutions for complex client needs.
  • Translate high-level government requirements into technical roadmaps.
  • Deploy, fine-tune, and optimize open-source models in restricted or air-gapped environments.
  • Work on scalable, performant systems where latency, cost, and accuracy are treated as first-class concerns.

Requirements

  • Proven experience deploying production RAG pipelines on real-world, messy datasets.
  • Deep expertise in agentic system design, including tool use and multi-agent orchestration.
  • Strong Python engineering skills with clean, scalable, maintainable code.
  • Experience operating in AWS and GovCloud environments.
  • Ability to work in restricted or air-gapped environments.
  • Experience deploying, fine-tuning, and optimizing open-source models when commercial APIs are unavailable.
  • Comfort translating high-level government requirements into technical roadmaps.
  • Familiarity with LangChain, LangGraph, CrewAI, or similar orchestration frameworks.
  • Experience fine-tuning NLP or object detection models is preferred.
  • Familiarity with LLM evaluation frameworks such as hallucination detection and drift monitoring is preferred.
  • Knowledge of government security standards and work across classification environments and on-prem environments is preferred.
  • Existing Secret or TS clearance, or clearance eligibility, is a significant plus.
  • Expert-level Python and SQL skills.
  • Experience with vector databases, graph databases, Elasticsearch, BM25, sentence transformers, and related RAG components.
  • Experience with AWS services such as SageMaker, Bedrock, S3, Lambda, Docker, and FastAPI.

Benefits

  • Competitive compensation package.
  • Stock options.
  • Remote full-time work arrangement.
  • Opportunity to work on cutting-edge machine learning projects.
  • Collaborative and innovative work environment.
  • Focus on learning and growth.
  • Opportunity to shape the company’s direction.
  • Impactful work on unstructured data processing for public sector clients.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Staff Software Engineer - Back End Ai (Gurugram based)

Agoda 10K-50K Consumer Services

Agoda is hiring an experienced Software Engineer to design and build mission-critical backend APIs and distributed systems that support millions of daily search requests on its global travel platform.

ActiveMQ Agile Apache Spark C# Cassandra Git Hadoop Java Kafka MongoDB Play Framework Puppet RabbitMQ Scala Scrum SQL TeamCity
1 hour, 3 minutes ago

Applied AI Engineer

Future 251-1K Hotels, Restaurants & Leisure

Future is hiring an Applied AI Engineer to build and ship production AI features for its digital personal training platform, improving the product experience and business outcomes.

AWS AWS CDK Datadog LLM OpenTelemetry Python Terraform
1 hour, 16 minutes ago

Intermediate Software Engineer - Artificial Intelligence (AI)

Tucows 251-1K Diversified Telecommunication Services

Tucows Domains is hiring a remote Intermediate Software Engineer specializing in Artificial Intelligence to help build AI-powered systems for domain services and related tools.

Go Hugging Face LLM Machine Learning Python REST API TensorFlow
2 hours, 34 minutes ago

AI Full Stack Engineer - KS001

An AI engineer at an Amazon brand management company will build and scale production AI infrastructure and workflows across communication, sales intelligence, content quality, lead qualification, and executive assistant functions.

Linux LLM Node.js OAuth PostgreSQL React REST API SSH TypeScript
3 hours, 18 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers