Binance

Binance

Binance operates as a leading blockchain ecosystem and digital asset exchange, integrating digital technology with financial services to facilitate the trading and management of cryptocurrencies.

Capital Markets
5K-10K
Founded 2017
$10M raised

Description

  • Design, develop, and optimize data processing and retrieval pipelines for enterprise generative tasks and model training applications.
  • Build and improve embedding, reranking, context engineering, and query rewriting models.
  • Research and evaluate advanced AI-native retrieval methods such as low-latency retrieval, multimodal retrieval, hierarchical retrieval, and GraphRAG.
  • Collaborate with infrastructure and application teams to integrate RAG pipelines into production systems.
  • Develop and optimize indexing, vector search, retrieval scoring, and reranking pipelines.
  • Support LLM training and RAG systems across pretraining, SFT, and reinforcement learning stages.
  • Apply NLP, computer vision, and multimodal methods to analyze user-generated content.
  • Design and implement robust evaluation methodologies for retrieval and generation systems.
  • Teach models to interact with external tools, APIs, and code interpreters.
  • Build agents and multi-agent systems to address complex real-world problems.

Requirements

  • Master’s degree in Information Retrieval, NLP, Machine Learning, Computer Vision, Multimodal Learning, or a related field.
  • Proficiency with PyTorch and strong coding ability in Python or C++.
  • Strong theoretical foundation in information retrieval, NLP, and deep learning.
  • Hands-on experience with RAG, vector databases, multimodal or graph retrieval, or large-scale AI systems.
  • Strong engineering ability to translate research into scalable, production-level systems.
  • Ability to own projects end-to-end from design through implementation to deployment.
  • Strong communication skills, intellectual curiosity, and a passion for lifelong learning.
  • Experience with embeddings, reranking, and query understanding is preferred.
  • Publications in top-tier venues such as NeurIPS, ICML, ACL, CVPR, SIGIR, KDD, or WWW are a plus.
  • Awards in ACM/ICPC or similar competitions are preferred.

Benefits

  • Competitive salary and company benefits.
  • Work-from-home arrangement, depending on the business team and role requirements.
  • Opportunity to work with world-class talent in a global, user-centric organization.
  • Autonomy to tackle unique, fast-paced projects in an innovative environment.
  • Career growth opportunities and continuous learning.
  • Equal opportunity employer committed to workforce diversity.

Interested in this position?

Apply directly on the company website

Apply Now

Similar Roles

Data Scientist (Contract)

Epoch AI 1-10 Professional Services

Epoch AI is hiring a part-time remote Data Scientist (Contract) to gather and analyze technical information on AI models, data centers, and companies for its research and policy-focused knowledge products.

Machine Learning
3 minutes ago

Senior Staff Biostatistician

N-Power Medicine 11-50 Biotechnology

N-Power Medicine is hiring a Senior Staff Biostatistician to lead statistical design and analysis for oncology clinical development projects using real-world data, external control arms, and innovative trial methods.

Python R Statistics
3 minutes ago

AI-First Data Scientist

CSC Generation 251-1K Internet Software & Services

CSC Generation is hiring a remote AI-First Data Scientist to build and deploy production machine learning systems that directly support decision-making across its retail brands and shared services platform.

AWS CI/CD Feature Engineering Generative AI Git Jupyter Machine Learning MLOps Python R Reinforcement Learning SageMaker SQL
18 minutes ago

Senior Data Scientist (Remote, Global)

Teramind is hiring a Senior Data Scientist to help build a new data science function that turns behavioral signals into predictive analytics, behavior models, and ML-driven decision-making for workforce intelligence and insider risk management.

Apache Spark GCP Python PyTorch Scikit-learn SQL Statistics TensorFlow XGBoost
18 minutes ago

You're on a roll! Sign up now to keep applying.

Sign Up

Already have an account? Log in

Used by 14,729+ remote workers