Space Inch

Space Inch

Space Inch is a digital innovation agency specializing in web, mobile, and augmented and virtual reality applications. They are known for creating a variety of software, with a focus on iPhone, iPad, and Mac apps. Space Inch has found success as mobile...

Internet Software & Services
11-50
Founded 2011

Description

  • Own the end-to-end delivery of production-ready LLM services, from design through rollout and iteration.
  • Build and operate core AI systems that convert complex data into fast, reliable, grounded recommendations at scale.
  • Develop backend services and APIs using Python, FastAPI, TypeScript, REST/GraphQL, and streaming patterns such as SSE or WebSocket.
  • Design and improve RAG and retrieval workflows, including embedding stores, chunking, re-ranking, hybrid search, prompts, and guardrails.
  • Implement observability and quality practices such as structured logging, tracing, metrics, telemetry, and A/B experimentation.
  • Build robust data ingestion and pipeline workflows for CSV/Sheets data, validation, PII handling, backfills, and scheduled jobs.
  • Apply agentic and tool-based patterns safely when appropriate, including constrained execution and planning.
  • Collaborate with Mobile, Backend, and Ops/DevLLM teammates on cross-functional delivery and support.
  • Make pragmatic technical decisions and maintain code quality, monitoring, and service reliability.
  • Contribute across multiple new projects as part of a growing AI team.

Requirements

  • 4-6+ years of software engineering experience in product environments.
  • At least 2+ years of hands-on experience shipping LLM/GenAI solutions to production.
  • Strong production experience with Python and FastAPI, including async patterns, dependency injection, and testing.
  • Comfort working with TypeScript/Node-based APIs and integrating across backend services.
  • Experience with REST/GraphQL APIs and at least one streaming pattern such as SSE or WebSocket.
  • Experience with LLM and RAG systems, including embedding stores such as pgvector or OpenSearch, chunking, re-ranking, hybrid search, prompt tooling, and guardrails.
  • Experience with observability and evaluation tooling, including structured logging, tracing, metrics, and offline/online A/B testing.
  • Experience with data pipelines, including CSV/Sheets ingestion, schema validation, PII handling, backfills, and scheduled jobs.
  • Familiarity with MCP and agent frameworks, tool design, constrained execution, and safe planning.
  • Ability to work effectively in CET timezone.
  • Nice to have: LLM serving optimization experience with vLLM or TensorRT-LLM, including quantization or LoRA.
  • Nice to have: experience with retrieval evaluation frameworks, cross-encoder rerankers, and response grading.
  • Nice to have: experience with cost controls, token budgeting, and prompt compression.
  • Nice to have: experience with Docker, Kubernetes, CI/CD, model gateways such as LiteLLM or vLLM, caching, object storage, and Kafka or equivalent message buses.
  • Nice to have: experience with tenant-aware access controls, secrets management, audit logs, and privacy or safety red-teaming basics.

Benefits

  • Monthly gross salary of 5,100-6,800 EUR for full-time B2B collaboration.
  • Remote-first working model with the option to work hybrid from Zagreb if nearby.
  • 23 days of PTO.
  • Sports membership or wellness subsidy.
  • Annual health checkup budget.
  • Education budget for learning and professional development.
  • Access to an executive coach as part of the company’s growth support.
  • Opportunity to work on long-term, end-to-end projects with a growing international team.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Senior Software Engineer II AI-Native, Messaging

Life360 251-1K Family Services

Life360 is hiring a senior backend engineer to own Kafka-based streaming infrastructure and production systems for real-time family safety messaging in a remote-first, AI-native environment.

AWS CI/CD Datadog DynamoDB Flink GitHub Actions Go Gradle Grafana gRPC Java Kafka Kubernetes Maven PagerDuty Prometheus Spring Boot Terraform
9 minutes ago

Forward Deployed AI Product Engineer

phData 251-1K IT Services

phData is hiring a Forward Deployed AI Product Engineer (Vector) to work directly inside strategic client accounts and build AI-native applications and agents that deliver measurable business impact on top of customer data and AI platforms.

AWS Azure CI/CD Databricks dbt GCP JavaScript Microservices PostgreSQL Python React Snowflake SQL TypeScript
9 minutes ago

Senior AI Engineer

Klaviyo 1K-5K IT Services

Klaviyo is hiring a Senior AI Engineer to design and build backend systems and experiences that power AI products and agent solutions at scale for its customer base.

Apache Spark AWS Celery CI/CD Django FastAPI Generative AI Hadoop Kafka Kubernetes Machine Learning Microservices Python RabbitMQ Redis Reinforcement Learning REST API SQLAlchemy
25 minutes ago

Agent UI - Full Stack - Senior Software Engineer II

Sumo Logic 251-1K Internet Software & Services

Sumo Logic is hiring a Senior Full Stack Software Engineer II to design and build agent-driven web experiences and the backend systems that power AI-enabled observability and security capabilities on its cloud-native platform.

AWS CI/CD Java Microservices Python React REST API SIEM TypeScript
55 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers