Design, test, and ship clustering and attribution heuristics, and measure them with real precision/coverage metrics rather than vibes.
Own your data end to end — pull, clean, join, and model large on-chain datasets without depending on a separate team for every query.
Build and maintain the pipelines that take a heuristic from notebook to production, including backfills, incremental runs, and validation.
Investigate edge cases (mixers, bridges, exchange hot wallets, consolidation patterns) and translate findings into repeatable logic.
Partner with investigations and product to define what "correct" looks like and benchmark against ground truth.
Prototype quickly, then harden what works.

4+ years building data science or data engineering systems that actually shipped (not just notebooks)
Strong Python and SQL; comfortable with large datasets and the gotchas of joins, dedup, and skew at scale
Solid grasp of clustering, graph/network analysis, or entity resolution — and a habit of validating results, not just producing them
Ability to reason about precision vs. coverage trade-offs and defend your metrics
Self-directed: you can scope an ambiguous problem, get the data yourself, and drive it to a result

Remote (Worldwide) — New York, Singapore, Bangalore, London

Merkle Science is hiring a Data Scientist — Blockchain Intelligence