Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
We are a stealth-mode startup building new infrastructure for the AI industry. Our mission is to make advanced language models deployable, customizable, and secure across diverse environments.
Our platform leverages an existing SaaS codebase for authentication, billing, and user management, and we are extending it with AI-specific features including runtime orchestration, dashboards, and secure communication layers.
Role
We are seeking a Backend Engineer (Node.js/NestJS), based in Ukraine, to extend our platform using our existing codebase. You’ll build the proxy backend that interacts with our custom inference runtime and extend dashboards.
This role requires strong backend engineering skills, an ability to integrate existing systems, and comfort working closely with C++ engineers who are building low-level runtime features.
Responsibilities
Proxy Backend for Inference Runtime
- Accepts inference requests from the frontend.
- Schedules and serializes prompts.
- Manages QKV cache load/unload (API hooks from the C++ runtime).
- Provides APIs to manage LoRA adapters.
- Integrate with authentication, RBAC, and logging already provided by the existing stack.
- Expose metrics and logs for monitoring inference usage and performance.
- Extend existing Dashboard: Dataset upload, training job view, model management, inference usage, request history, and adapter selection.
- Reuse auth, billing, and user management code (Auth0, Stripe).
- Add necessary backend endpoints to support new UI flows.
- Develop using NestJS as the main backend framework.
- Work with PostgreSQL, Redis, and HashiCorp Vault for persistence, caching, and secrets.
- Use Socket.IO for real-time updates (job status, inference progress).
- Ensure secure integration with Stripe (billing) and Auth0 (identity). Collaborate with DevOps on deployment pipelines (Proxmox, Docker, CI/CD).
- Strong experience with Node.js and NestJS framework.
- Proficiency in PostgreSQL and Redis for persistence and caching.
- Hands-on experience with Socket.IO or other WebSocket libraries.
- Experience with secure configuration and secrets management (HashiCorp Vault preferred).
- Comfortable working with microservices and integrating with existing codebases.
- Strong debugging and systems thinking able to reason about scheduling, state management, and concurrency.
- Experience integrating with AI runtimes (gRPC/REST backends for inference).
- Familiarity with C++ service APIs (FFI, REST, or gRPC bindings).
- Experience with authentication/authorization frameworks (Auth0, JWT, RBAC).
- Familiarity with Stripe API or similar billing systems.
- Contributions to backend open-source projects.
- Extend a proven SaaS foundation into a new AI runtime platform.
- Work directly with a C++ systems team building custom inference features.
- Build real products (dashboards + runtime APIs) used by vendors and customers.
- Competitive compensation, equity potential.
Key Skills
Ranked by relevanceReady to apply?
Join Baasi and take your career to the next level!
Application takes less than 5 minutes