Cartesia is looking for a Researcher: Model Architecture to lead foundational research in neural network design. You will join an in-person team to advance the state-of-the-art in alternative architectures, creating systems for diverse deployment environments.
What You'll Do
- Conduct groundbreaking research in neural network architecture design to advance the state-of-the-art.
- Design novel architectures that improve model quality, inference efficiency, and adaptability across diverse deployment environments.
- Explore and develop capabilities such as statefulness, long-range memory, and innovative conditioning mechanisms.
- Investigate how architectural decisions impact model trade-offs, including scalability, robustness, latency, and energy efficiency.
- Develop new frameworks and tools to evaluate architectural innovations, benchmarking performance across research and production settings.
- Collaborate with cross-functional teams to translate architectural research into scalable and impactful systems for real-world applications.
What We're Looking For
- Deep expertise in architecture design, with experience in researching or deploying advanced architectures.
- Strong understanding of how architectures interact with system constraints, including deployment in cloud environments or on-device.
- Proficiency in designing architectures that balance quality, efficiency, and adaptability across different use cases and modalities.
- Familiarity with generative modeling paradigms and designing capabilities such as statefulness and conditioning in deep learning models.
- A proven research track record in top-tier ML/AI venues or demonstrable contributions to state-of-the-art architectures.
- Exceptional analytical and problem-solving skills, with a focus on experimentation and iterative refinement.
- Strong programming skills in deep learning frameworks such as PyTorch or TensorFlow, and experience with profiling tools.
Nice to Have
- Prior research or publications in state space models, efficient Transformers or other alternative architectures.
- Research or practical experience in designing architectures for multi-modal systems.
- Early-stage startup experience or a track record of rapid innovation in R&D environments.
Technical Stack
- PyTorch
- TensorFlow
Team & Environment
You will collaborate with cross-functional teams in a culture where we ship fast and support each other.
Benefits & Compensation
- Lunch, dinner and snacks at the office.
- Fully covered medical, dental, and vision insurance for employees.
- Pension Plan.
- Relocation and immigration support.
Work Mode
This is an onsite position located in London, UK. We are an in-person team.
Cartesia is an equal opportunity employer.




