SquarePeg.ai is looking for an AI/NLP Engineer to serve as the technical backbone for our core matching algorithms. As a key member of our data and machine learning team, you will improve the AI systems that connect candidates with jobs and own the end-to-end machine learning lifecycle at a fast-growing HR Tech startup.
What You'll Do
- Build and maintain taxonomies for candidate and job attributes; bootstrap gold datasets and evaluation pipelines.
- Extract and normalize entities from resumes and job descriptions; craft and optimize prompts and fine-tuned models.
- Develop and refine retrieval, ranking, and scoring using embedding-based methods and LLMs.
- Refine proprietary scoring algorithms that evaluate candidate-job compatibility.
- Conduct deep-dive analyses to identify patterns in successful hires and optimize the recommendation engine.
- Implement innovative NLP solutions that understand context, intent, and nuance in hiring language.
- Design and implement robust data pipelines that can handle massive volumes of resume and job posting data.
- Build sophisticated entity resolution systems to normalize and deduplicate candidate profiles across multiple data sources.
- Create scalable data architectures that power real-time matching.
- Collaborate directly with the product team to translate business requirements into technical solutions.
- Own the end-to-end ML lifecycle from experimentation to production deployment.
- Continuously iterate on algorithms based on customer feedback and performance metrics.
What We're Looking For
- Deep understanding of machine learning algorithms, particularly in recommendation systems or ranking problems.
- Experience with prompt engineering, prompt chaining, and LLM fine-tuning.
- Knowledge of vector databases and semantic search technologies.
- Familiarity with A/B testing and experimental design.
- 3+ years of hands-on experience with Python, SQL, and modern ML frameworks (PyTorch, TensorFlow, scikit-learn).
- Proven track record in NLP and working with large language models (OpenAI, Anthropic, open-source LLMs).
- Experience with data engineering tools and cloud platforms (AWS, GCP).
- Strong background in entity resolution, data matching, or similar deduplication challenges.
- Experience building and maintaining ontologies.
- Experience building datasets and evaluation pipelines.
- Ability to choose among methods by weighing tradeoffs in cost, latency, and accuracy.
- Opinionated, data-driven, and intellectually curious.
- Ability to thrive in an environment where you experiment and move quickly.
- Strong sense of ownership; can work autonomously.
Technical Stack
- Languages: Python, SQL
- ML Frameworks: PyTorch, TensorFlow, scikit-learn
- LLMs: OpenAI, Anthropic
- Cloud: AWS, GCP
- Data: Vector databases
Team & Environment
You'll join a founding team of 12 and work directly with its members on a product that solves a real market problem. This is a fast-growing HR Tech startup where you will have a direct impact on the product and the company's success.
Benefits & Compensation
- Competitive salary
- Equity
- Comprehensive benefits
Work Mode
This is a remote position.


