AI Inference Engineer QVAC

1 day, 7 hours ago
Full-time
Senior
Software Development
ITRex

ITRex is a global technology consulting and software development company that transforms businesses with cutting-edge solutions in AI, IoT, Data Science, Cloud, and RPA, serving medium-sized companies and Fortune 500 giants across diverse industries.

Internet Software & Services
251-1K employees
Founded 2009

Description

  • Work on deploying machine learning models to edge devices using llama.cpp and ggml.
  • Optimize the inference runtime so models load faster, run leaner, and perform well across different hardware.
  • Collaborate closely with researchers to help code, train, and transition models from research into production.
  • Integrate AI features into existing products to add the latest machine learning capabilities.
  • Ensure the inference layer is stable and ready for integration with the rest of the stack.

Requirements

  • Strong programming skills in C++.
  • Experience with JavaScript is a bonus.
  • Strong experience with the llama.cpp inference engine and the ggml tensor library.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers, LLMs, and diffusion models.
  • Demonstrated ability to rapidly learn new technologies and techniques.
  • A degree in Computer Science, AI, Machine Learning, or a related field.
  • A solid track record in AI R&D.

Benefits

  • Remote flexibility with the ability to work where and how you work best.
  • Competitive salary plus benefits including medical and learning support.
  • Ownership opportunities to take initiative and solve meaningful problems.
  • AI-enhanced workflows designed to complement your abilities.
  • English classes and professional development support.
  • Clear career progression opportunities.
  • Responsive, supportive teammates and a collaborative culture.
  • Regular meetups and tech talks that build human connections.


