GT is hiring a Data Engineer to develop, maintain, and scale the pipelines that power BizMap’s unified data platform. You will be responsible for ingesting, transforming, and enriching firmographic data from multiple third-party and proprietary sources.
What You'll Do
- Design and maintain robust, scalable ETL/ELT pipelines to ingest and process third-party and first-party datasets.
- Apply transformation, normalization, and enrichment rules to ensure data consistency and usability.
- Work with product managers, data architects, and content experts to align data structure with business needs.
- Support the implementation of data matching and entity resolution processes using AI/ML tools and proprietary frameworks.
- Build alerts, logs, and metrics to ensure data flows remain healthy and issues are identified and resolved quickly.
- Contribute to documentation, code quality standards, and internal best practices to ensure maintainability.
What We're Looking For
- 4–8 years in data engineering, with experience building production-grade data pipelines.
- Proficient in SQL, Python, and experience with Spark, Airflow, Snowflake, and Azure Data Lake or similar technologies.
- Familiarity with Azure (preferred) or other major cloud platforms.
- Understanding of data modeling, version control, CI/CD, and data governance principles.
- Proactive, detail-oriented, and eager to take ownership of projects and continuously improve systems.
- Comfortable working in a cross-functional environment and open to learning from and supporting teammates.
Technical Stack
- SQL
- Python
- Spark
- Airflow
- Snowflake
- Azure Data Lake
Team & Environment
Work alongside data architects, product managers, and analysts.
Benefits & Compensation
- Join a fast-growing, high-impact team.
- Contribute to an ambitious effort to create the highest quality, most comprehensive business directory in the world.
- Be part of a startup-style group within the company that’s redefining how they deliver consulting through productization and data innovation.
- Work with cutting-edge data tools, including AI/ML enrichment, semantic matching, and modern cloud-based infrastructure.






