Jalasoft

Jalasoft is a world-class technology company with over 20 years of experience in nearshore software development and staff augmentation. They offer Xian Suite, a comprehensive systems management solution, and provide software solutions for small startup...

Internet Software & Services
1K-5K
Founded 2001

Description

  • Architect and operate production-scale data lakes and storage tiers on AWS.
  • Build and maintain real-time ingestion, observability, and log analytics pipelines.
  • Own vector search and embedding layers that support RAG systems and autonomous agents.
  • Design and manage event streaming and data ingestion from production systems.
  • Develop and optimize OpenSearch and Elasticsearch workloads for lexical and vector search.
  • Build and maintain AI/LLM data pipelines using Amazon Bedrock and foundation models.
  • Implement software engineering components and API ingestion services for data workflows.
  • Support ETL/ELT pipelines for structured and unstructured data from internal and external sources.
  • Collaborate on data enrichment, classification, semantic parsing, and indexing workflows.

Requirements

  • 7+ years of experience in Data Engineering, Distributed Systems, or Data Architecture.
  • 4+ years architecting production-scale data lakes, storage tiers, and event streaming solutions.
  • 2+ years building RAG systems, managing embeddings, and orchestrating foundation models.
  • Strong proficiency in AWS data lake architecture and storage.
  • Strong proficiency in real-time observability and log analytics.
  • Strong proficiency in Elasticsearch and OpenSearch optimization, vectorization, and embeddings.
  • Strong proficiency in Amazon Bedrock and generative AI pipelines.
  • Strong proficiency in software engineering and API ingestion.
  • Production-level proficiency in one or more of the following: C# (.NET Core), Java, Python, or Node.js.
  • Experience with Amazon S3 partitioning strategies, lifecycle policies, and columnar file and table formats such as Parquet and Apache Iceberg is preferred.
  • Experience with AWS Glue Data Catalog and Lake Formation for multi-tenant, fine-grained access control is preferred.
  • Experience optimizing queries over petabyte-scale datasets using Amazon Athena and Redshift Spectrum is preferred.
  • Experience configuring distributed OpenTelemetry (OTel) collectors to route logs, traces, and metrics into S3 is preferred.
  • Experience with high-volume streaming of system logs, Datadog captures, and raw server events into S3 is preferred.
  • Experience with real-time CDC from PostgreSQL using Debezium or AWS DMS is preferred.
  • Experience with Amazon OpenSearch clusters using simultaneous lexical and high-dimensional vector search is preferred.
  • Experience with OpenSearch index lifecycle management, sharding strategies, and dynamic mappings at scale is preferred.
  • Experience with Amazon Bedrock foundation model APIs, using models such as Claude and Titan for enrichment, classification, and semantic parsing, is preferred.
  • Experience with Knowledge Bases for Amazon Bedrock for automatic chunking, metadata extraction, and vector index syncs from S3 is preferred.
  • Experience building ETL/ELT pipelines that ingest unstructured event data from SaaS APIs such as Pendo, Hotjar, and Google Analytics is preferred.
  • Experience with MCP server development to expose data lake context and utilities to AI agents is preferred.

Benefits

  • Remote work.
  • 13 floating holidays.
  • 15 vacation days per year.
  • Equal opportunity employer with non-discriminatory hiring practices.
  • Good working environment.
