Fireworks AI

Fireworks AI

Fireworks AI is a company that provides a generative AI platform as a service. They use state-of-the-art LLMs and image models to deliver fast and efficient AI solutions. Partnering with leading AI researchers, Fireworks offers models for various use c...

Internet Software & Services
1-10
Founded 2022

Description

  • Architect and build scalable, resilient backend infrastructure for distributed training, inference, and data processing pipelines.
  • Design and implement core backend services with a focus on efficiency and low latency.
  • Lead technical design discussions, mentor engineers, and establish best practices for large-scale machine learning systems.
  • Drive infrastructure optimization for compute cost, storage lifecycle management, and network performance.
  • Collaborate with machine learning, DevOps, and product teams to turn research and product requirements into infrastructure solutions.
  • Evaluate and integrate cloud-native and open-source tools such as Kubernetes, Ray, Kubeflow, and MLFlow.
  • Own systems end to end from design through deployment, with an emphasis on reliability, fault tolerance, and operational excellence.

Requirements

  • Bachelor's degree or equivalent in Computer Science or a related field plus 4 years of experience in software engineering or a related role.
  • 4 years of experience designing, building, and optimizing large-scale backend infrastructure and distributed data systems in cloud environments.
  • Experience with distributed data systems such as PostgreSQL, MySQL, DynamoDB, Apache Spark, Apache Flink, or Apache Kafka.
  • Experience with cloud environments such as AWS, GCP, Azure, or equivalent, including cloud-native platforms and core infrastructure components.
  • 4 years of experience with major server-side programming languages and frameworks such as Python, C++, Go, or TypeScript.
  • 4 years of experience writing technical design documentation and leading cross-functional projects.
  • 3 years of experience developing and maintaining data processing and API systems, including gRPC or Thrift.
  • 3 years of experience conducting A/B testing and scientific experimentation using tools such as Statsig, Meta Deltoid, or Optimizely.
  • 3 years of experience conducting coding interviews and providing structured feedback to engineering candidates.
  • 2 years of experience with cloud-native tools and infrastructure such as Docker and Kubernetes.
  • 2 years of experience defining and implementing data-driven metrics to support company or team goals.

Benefits

  • Base salary range of $175,000 to $220,000 USD.
  • Meaningful equity in a fast-growing startup.
  • Competitive salary.
  • Comprehensive benefits package.
  • Opportunity to work with cutting-edge AI infrastructure and large-scale model serving systems.
  • High ownership and direct impact in a fast-growing, low-bureaucracy environment.
  • Opportunity to collaborate with world-class engineers and AI researchers.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Infrastructure Engineer

Accenture 100K+ Professional Services

Accenture Federal Services is hiring an Infrastructure Engineer to manage Linux and Windows server environments and support cloud infrastructure and monitoring for US federal government missions.

Linux PowerShell Python Shell Scripting Splunk Windows Server
7 hours, 11 minutes ago

Data Science AI/ML Lead | Fulltime | Bangalore/Remote

TWO95 International 51-250 Internet Software & Services

Data Science AI/ML Lead at a Bangalore/remote company to lead enterprise-scale AI/ML solution architecture, development, and deployment across automation, data pipelines, and model-serving systems.

AWS AWS SES CI/CD Docker LLM MLOps Python PyTorch REST API Salesforce SAP Scikit-learn TensorFlow
7 hours, 26 minutes ago

Directeur.rice, Sécurité & Infrastructure

Workleap 251-1K Internet Software & Services

Workleap is hiring a Director, Security & Infrastructure to lead the platform, security operations, and governance functions that support Workleap and ShareGate as the company evolves its product delivery model.

Azure GCP Kubernetes
7 hours, 26 minutes ago

SWE Infrastructure Specialist (Java) – Freelance AI Trainer Project

Meridial Marketplace, by Invisible 501-1000 information technology & services

Freelance contract role at an AI training company focused on evaluating how advanced models reason about Java-based infrastructure, cloud, and distributed systems in enterprise environments.

Java Microservices
7 hours, 26 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers