Patsnap logo

Senior AI Engineer

Patsnap
2 days ago
Full-time
On-site
Singapore, Singapore

Role Overview

This role will be the technical foundation builder for the company’s AI transformation. You will design and build the company-wide knowledge infrastructure and context layer that powers future AI applications. This is a highly hands-on role requiring strong backend engineering capability, LLM application experience, product sense, and the ability to operate independently in a fast-moving, ambiguous environment.

Key Responsibilities

  • Design and build the company-wide AI knowledge infrastructure, including company wiki, internal knowledge base, retrieval layer, and context management system.
  • Develop scalable LLM application architecture, including RAG pipelines, vector database integration, prompt workflows, API services, monitoring, and deployment.
  • Own the end-to-end technical delivery of internal AI tools, from backend architecture and basic frontend integration to deployment, testing, and monitoring.
  • Work closely with business, brand, PR, IR, and leadership stakeholders to translate ambiguous business needs into practical AI systems and technical roadmaps.
  • Optimize system performance, including token efficiency, latency, caching strategy, retrieval quality, data architecture, and model inference flow.
  • Evaluate and integrate AI coding tools, LLM frameworks, vector databases, and third-party APIs to improve development efficiency and product quality.
  • Mentor junior engineers or interns when needed, and help establish technical standards, documentation practices, and reusable engineering workflows.

Requirements

  • 4–7 years of backend engineering experience, with at least 2 years of hands-on LLM application development experience.
  • Strong backend development skills in Python; experience with Node.js or Go is a plus.
  • Solid computer science fundamentals, including algorithms, system design, database design, API architecture, distributed systems, caching, and performance optimization.
  • Production-level LLM application experience, not limited to demos or prototypes. Experience should include prompt engineering at scale, model selection, inference pipeline design, or RAG architecture.
  • Hands-on experience with RAG and vector databases such as Pinecone, Weaviate, Chroma, or similar tools.
  • Experience owning full engineering delivery, including backend services, basic frontend integration, API deployment, monitoring, and troubleshooting.
  • Heavy user of AI coding tools such as Cursor, Claude Code, GitHub Copilot, or similar tools.
  • Mandarin fluency is required; English working proficiency is required.
  • Able to work independently under ambiguous instructions and make sound technical decisions without waiting for detailed specifications.