Responsibilities
- Link Balancing Mechanism units from Elexon to associated power plants, substations, and fuel categories using API outputs, public databases, and manual verification
- Associate substations with ETYS regions and grid supply points
- Create and update master reference datasets that unify identifiers from multiple sources such as Elexon, National Grid ESO, and TEC register
- Record data mappings, underlying assumptions, and current limitations in clear documentation for downstream teams
- Align outdated data structures with modern formats, including reconciling historical operational records stored under varying schemas or time resolutions
- Maintain alignment across Elexon message types by understanding market data architecture and addressing inconsistencies between BOALF, BOD, and DISBSAD
- Identify and resolve conflicts between source systems to establish trusted reference values
- Process time-series energy data by identifying anomalies like price surges or meter inaccuracies, filling missing intervals, and removing timestamp duplicates
- Construct modular Python scripts for automated data cleaning applicable across multiple datasets
- Analyze root causes of data quality problems such as settlement revisions, delayed filings, or schema updates rather than only correcting surface errors
- Develop and support Python-based data extractors for energy market APIs
- Design dbt models to convert raw inputs into structured, analysis-ready tables
- Manage pipeline execution using GitHub Actions for workflow automation
- Define PostgreSQL database schemas that accurately model energy domain relationships
Compensation
Not specified
Work Arrangement
Not specified
Team
Not specified
Not specified