Bangalore, India (Hybrid) Hybrid Full-time

ABBYY is hiring a Senior Machine Learning Engineer, Synthetic Data & Document Understanding

Responsibilities

  • Create and manage data processing workflows that analyze real-world documents to guide the creation of high-quality synthetic data
  • Develop systems that generate documents in various formats, layouts, and subject areas
  • Build assessment methods to verify synthetic data reflects accurate statistical patterns and variety
  • Investigate and deploy generative modeling approaches tailored for document AI training
  • Detect and resolve data quality problems to ensure synthetic outputs effectively train downstream models
  • Work with modeling teams to assess how synthetic data influences model accuracy and performance
  • Lead the synthetic data pipeline from design through validation, ensuring technical and quality standards
  • Make key architectural choices that balance data quality, diversity, scale, and cost efficiency
  • Establish and track data quality indicators and maintain real-time generation monitoring dashboards
  • Coordinate with annotation teams to align synthetic data with downstream processing needs
  • Support strategic planning in collaboration with senior technical leadership
  • Develop high-throughput systems capable of producing millions of synthetic training samples
  • Implement filtering, post-processing, and validation steps to eliminate poor-quality synthetic outputs
  • Design cost-effective generation workflows that optimize computational resources, output quality, and speed
  • Build monitoring tools to identify changes in data distribution or declining quality over time
  • Partner with platform engineering teams on compute resource management, storage, and job scheduling
About company
ABBYY
Love how you work, what you work on and whom you work with. Join ABBYY and be part of a team that celebrates your unique work style. With flexible work options, a supportive team, and rewards that reflect your value, you can focus on what matters most – driving your growth while fuelling ours. Our commitment to respect, transparency, and simplicity means you can trust us to always choose to do the right thing. Over 10,000 customers trust ABBYY, including many Fortune 500 ones, with names such as DHL, Johnson & Johnson, FDA, DMV, PWC, KeyBank, Spotify, and H&R BLOCK in our client portfolio. As a trusted partner for intelligent automation, we solve highly complex problems for our enterprise customers and put their information to work to transform the way they do business. With a focus on customer-centric thinking, we're not just another vendor – we're a transformative force in the industry. Join us and be part of a team that's changing the world, one solution at a time. For more information about life at ABBYY visit abbyy.com/careers
All jobs at ABBYY Visit website
Job Details
Department Document AI Data team
Category data
Posted 6 days ago