Innodata Inc. is looking for a curious, hands-on Software Engineering Associate who is eager to build real-world GenAI systems from the ground up. In this role, you will work across engineering, research, and product teams to turn advanced AI into production-ready solutions.
What You'll Do
- Prototype LLM and retrieval pipelines with integrated safety and filtering.
- Operate knowledge graphs and vector databases (such as Pinecone and Weaviate), and manage embeddings.
- Build FastAPI services for search, recommendations, and memory.
- Design resilient systems with caching, retries, and observability.
- Run data pipelines for large-scale indexing and embedding generation.
- Capture personalization signals from user interactions such as searches, chats, and purchases.
- Optimize for low-latency APIs and high-throughput data pipelines.
- Collaborate with research and product teams on evaluation and UX.
What We're Looking For
- Strong Python skills with experience in FastAPI, async/await, Redis, and PostgreSQL.
- 1–3 years of hands-on experience with LLM prompting, RAG, embeddings, and vector search.
- Comfort with APIs, distributed systems, caching, and observability.
- Familiarity with GCP, AWS, or Azure cloud services, Docker, Git, and CI/CD.
- Excellent communication, presentation, and collaboration skills.
- Strong time management and organizational abilities.
- Self-starter attitude with a customer-centric mindset.
- Ability to work effectively in a multicultural and fast-paced environment.
- Bachelor’s degree or higher in Computer Science, Artificial Intelligence, Machine Learning, Linguistics, Localization, or a related field.
Nice to Have
- Experience with TypeScript/Node.js (NestJS), React/Next.js, or Streamlit.
- Exposure to recommender systems.
- Skills in evaluating embedding models.
Technical Stack
- Languages & Frameworks: Python, FastAPI, TypeScript, Node.js, NestJS, React, Next.js, Streamlit
- Data & Infrastructure: Redis, PostgreSQL, LLMs, RAG, Embeddings, Vector Search, Pinecone, Weaviate
- Tools & Platforms: GCP/AWS/Azure, Docker, Git, CI/CD
Team & Environment
You will work across engineering, research, and product teams to deliver integrated solutions.
Work Mode
This role is open to candidates located in the US or Canada.
We are an equal opportunity employer committed to fostering an inclusive, respectful, and diverse workplace.




