About the Role
Role details below.
Responsibilities
- Architect the system and mentor the team while spending significant time hands-on in the codebase (Python/PyTorch)
- Own the core chat loop: optimize context windows, memory/RAG retrieval, and inference latency
- Drive strategy for SFT (Supervised Fine-Tuning) and RLHF/DPO (Preference Optimization)
- Decide when to prompt, when to fine-tune, and when to architect a new RAG pipeline
- Manage the 'Data Engine': oversee sourcing, labeling, and cleaning of diverse datasets
- Improve model steerability and multicultural performance
- Design and train custom classifiers to detect and filter non-consensual or illegal content
- Create nuanced, context-aware moderation systems beyond binary 'safe/unsafe' flags
Benefits
- Contract flexibility: B2B preferred, but open to other arrangements with long-term commitment
- Fully remote work: choose where you do your best work
- 4 weeks (20 working days) of paid time off per year
- Yearly in-person meetup for team connection and celebration
- Monthly allowance of 100 USD for health insurance expenses
- Unlimited 1:1 sessions with psychologists and lifestyle experts through OpenUp (available for up to three family members)
- Co-working space budget: up to twice per month (35 EUR / 40 USD per visit)
- Learning budget for courses, books, conferences, events, or certifications
- Company laptop provided
- Monitor budget up to 250 USD for workspace setup
- Premium access to AI tools: ChatGPT, Cursor, Hugging Face, Claude Code, and any other needed tools
Work Arrangement
Remote (Country)
Additional Information
- Operates in the uncensored/NSFW space with unique challenges in alignment, moderation, and steerability
- Platform processes 80 million tokens per day and growing
- Culture of high autonomy, low bureaucracy, and direct line to the CTO
- Fully remote elite team from Tier 1 tech companies
- External referral program offering up to 2,500 USD bonus for successful hires