Remote (Country) Full-time

Granica is hiring a Software Engineer – Foundational Data Systems for AI - India, remote

About the Role

Granica is looking for a Software Engineer – Foundational Data Systems for AI to build the metadata substrate, adaptive engines, and intelligent data layouts that power efficient, structured AI at exabyte scale. You will translate research algorithms into production-grade implementations, working at the intersection of fundamental research and enterprise impact.

What You'll Do

  • Design and implement the global metadata substrate supporting time-travel, schema evolution, and atomic consistency across massive tabular datasets.
  • Build adaptive engines that reorganize data autonomously, learning from access patterns and workloads to maintain efficiency with minimal manual tuning.
  • Develop and refine bit-level encodings, compression, and layout strategies to extract maximum signal per byte read.
  • Contribute to distributed compute systems that scale predictively and adapt to dynamic load.
  • Translate new algorithms in compression and representation from research into production-grade implementations.
  • Design and optimize data paths to minimize time between question and insight, enabling faster learning for both models and humans.

What We're Looking For

  • Foundational understanding of distributed systems: partitioning, replication, and fault tolerance.
  • Experience or curiosity with columnar formats such as Parquet or ORC and low-level data encoding.
  • Familiarity with metadata-driven architectures or data query planning.
  • Exposure to or hands-on use of Spark, Flink, or similar distributed engines on cloud storage.
  • Proficiency in Java, Rust, Go, or C++ and commitment to clean, reliable code.
  • Curiosity about how compression, entropy, and representation shape system efficiency and learning.
  • A builder’s mindset—eager to learn, improve, and deliver features end-to-end with growing autonomy.

Nice to Have

  • Familiarity with Iceberg, Delta Lake, or Hudi.
  • Contributions to open-source projects or research in compression, indexing, or distributed systems.
  • Interest in how data representation influences AI training dynamics and reasoning efficiency.

Technical Stack

  • Parquet, ORC, Spark, Flink, Java, Rust, Go, C++, Iceberg, Delta Lake, Hudi

Team & Environment

You will partner closely with the Granica Research group led by Prof. Andrea Montanari (Stanford). The environment emphasizes deep technical work, minimal bureaucracy, and a shared mission.

Benefits & Compensation

  • Flexible time off
  • Comprehensive health coverage for you and your family
  • Support for research, publication, and deep technical exploration
  • Competitive salary + meaningful equity

Work Mode

This is a remote position based in India.

Granica is an equal opportunity employer.

Required Skills
ParquetORCSparkFlinkJavaRustGoC++IcebergDelta LakeData SystemsDistributed SystemsPerformance Optimization
Visa expiring soon?

Extend or switch without leaving Thailand

Running out of time on your current visa? SVBL identifies your best option — extension, category switch, or long-term visa — and handles the entire process.

Visa extensions & category switches
LTR & DTV visa applications
90-day reporting managed
Overstay prevention
Check your options
Prevent overstay issues
About company
Granica

Granica is an AI research and systems company building infrastructure for intelligence that is structured, efficient, and deeply integrated with data, operating at exabyte scale and processing petabytes of data daily for prominent enterprises.

Visit website
Job Details
Category data
Posted 5 months ago