Remote (Global)

Reddit is hiring a Senior Machine Learning Engineer, Dev Platform Data and Discovery

About the Role

Design and implement machine learning solutions that improve data discovery, indexing, and developer experience within the platform infrastructure.

Responsibilities

  • Develop scalable ML models to enhance data retrieval and search relevance
  • Collaborate with data engineers to build robust data pipelines
  • Optimize model performance and latency for real-time inference
  • Integrate ML capabilities into developer-facing tools and APIs
  • Work closely with platform teams to identify high-impact use cases
  • Improve metadata tagging and classification systems
  • Evaluate model accuracy using statistical methods and A/B testing
  • Maintain and iterate on existing ML-powered features
  • Contribute to architectural decisions for ML infrastructure
  • Ensure models are explainable, fair, and monitorable
  • Drive automation in data labeling and feature engineering
  • Support deployment, monitoring, and alerting for ML services
  • Partner with product teams to align technical roadmap with business goals
  • Document designs, experiments, and system behavior
  • Mentor engineers on best practices in machine learning and data systems
  • Stay current with advancements in ML frameworks and techniques
  • Troubleshoot production issues related to data quality or model drift
  • Implement privacy-preserving methods where applicable
  • Scale systems to handle growing data volumes and query loads
  • Apply natural language processing to improve code and data understanding
  • Use embeddings and semantic search to enhance discovery
  • Design evaluation frameworks for ranking and recommendation systems
  • Ensure compliance with data governance policies
  • Balance innovation with system reliability and maintainability
  • Contribute to cross-team initiatives on data standardization

Nice to Have

  • Master’s or PhD in Computer Science or related technical field
  • Experience with developer tools or internal platforms
  • Prior work on code search or software repository analysis
  • Contributions to open-source ML projects
  • Publications or presentations in ML or data systems conferences
  • Experience with graph-based ML models
  • Background in building data cataloging solutions
  • Familiarity with privacy-aware machine learning techniques

Compensation

Competitive salary and equity package

Work Arrangement

Hybrid or remote options available

Team

Part of the Developer Platform team focused on data systems and discovery infrastructure

Why This Role Matters

This position plays a critical role in shaping how developers interact with data across the platform. By improving discovery and automation, the work directly impacts developer productivity and system intelligence.

What You’ll Build

You’ll design and deploy machine learning models that power intelligent search, automate metadata generation, and surface relevant data assets to internal teams. Your systems will handle diverse data types and scale with platform growth.

Available for qualified candidates

Required Skills
Machine Learning, Python, PyTorch, TensorFlow, Distributed Systems, Data Pipelines, A/B Testing, MLOps, Statistical Analysis, Model Deployment
About the Company
Reddit
Reddit is a community of communities built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and approximately 121 million daily active unique visitors, Reddit is one of the internet's largest sources of information.
Job Details
Category: Data
Posted 7 months ago