Position Overview
Mid-Level AI Engineer with strong hands-on experience in end-to-end AI system development, specializing in on-premises and private cloud AI deployments. This role focuses on building production-grade AI solutions using primarily local models, optimizing GPU-based training and fine-tuning pipelines, and developing scalable RAG, agentic, multimodal, and speech-enabled systems.
Key Responsibilities
- Analyze requirements and contribute to technical AI solution design.
- Strong programming skills in Python (mandatory).
- Experience in backend development (APIs, microservices, model serving).
- Hands-on experience with local LLM deployment and optimization.
- Experience in RAG system design and vector databases.
- Practical experience in model fine-tuning (LoRA / PEFT).
- Solid understanding of: Transformer architecture, quantization techniques, and embeddings with retrieval.
- Experience working in Linux environments.
- Participate in peer code reviews and team planning activities.
- Provide task estimations and deliver within approved timelines.
- Apply secure coding practices and participate in bug resolution with quality metrics in mind.
Qualifications
- Bachelor's degree in computer science, Artificial Intelligence, Data Science, or related discipline.
- 2-5 years of experience in AI Engineering.
- Proficiency in C#, .NET Core and Python.
- Experience with Structured and Unstructured databases.
- Experience with Angular or other frontend technologies is a strong plus.
- Working knowledge of relational databases (i.e., SQL Server) and non-relational databases (i.e., Qdrant)
- Familiarity with Git, RESTful services, and Agile development methodologies