Socure is seeking a Data Engineer-II to join the Identity Graph team. In this role, you will design, develop, and optimize core identity verification and fraud detection data services, directly influencing the performance, scalability, and reliability of our foundational platform. Our mission is to verify 100% of good identities in real time and eliminate identity fraud from the internet.
What You'll Do
- Design and build scalable, secure data pipelines for both batch and real-time processing.
- Support data systems that power machine learning, model inference, and analytics.
- Write clean, production-ready code in Java, Scala, or Python.
- Work with tools like Apache Spark, Kafka, Flink, Airflow, AWS EMR, and other AWS-native services.
- Use graph data modeling and graph databases such as Neo4j or Amazon Neptune.
- Optimize data architecture for performance, cost, and ease of maintenance.
- Work closely with Data Science, Product, and Security teams to turn business needs into data solutions.
- Participate in system design and code reviews, and share best practices with peers.
What We're Looking For
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related technical field.
- 3 to 5 years of experience building and supporting complex data systems and applications in cloud environments.
- Strong proficiency in Java, Scala, or Python.
- Deep knowledge of distributed data processing frameworks (e.g., Spark, Kafka, Flink).
- Hands-on experience with cloud services (AWS) and containerized environments (Docker, Kubernetes).
- Understanding of software design patterns, data structures, and DevOps/CI-CD best practices.
- Experience working with Airflow or other data pipeline orchestration services.
- Familiarity with building ML data pipelines (e.g., with Databricks, SageMaker, or similar platforms).
- Experience developing and using scalable, high-performance APIs.
Nice to Have
- Experience with graph databases and graph algorithms.
Technical Stack
- Languages: Java, Scala, Python
- Frameworks: Apache Spark, Kafka, Flink, Airflow
- Cloud & Infrastructure: AWS (including EMR), Docker, Kubernetes
- Databases: Neo4j, Amazon Neptune
- ML Platforms: Databricks, SageMaker
Team & Environment
You will join the Identity Graph team, collaborating closely with Data Science, Product, and Security teams to deliver core data services.
Socure is an equal opportunity employer and values diversity of all kinds at our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.


