Senior Data Scientist

Presight Capital

Abu Dhabi, United Arab Emirates

5-7 Years

Save

Posted 9 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Senior Data Scientist – Large Language Models | Abu Dhabi, UAE

Position Overview

We are seeking a Senior Data Scientist with strong expertise in large language models (LLMs), agentic workflows, and applied machine learning. This role involves leading the design, training, and deployment of next-generation AI solutions that power intelligent agents, natural language interfaces, and knowledge-driven applications across domains. You will collaborate with cross-functional teams to deliver scalable, high-performance AI systems that unlock value from unstructured and structured data.

Key Responsibilities

Design, build, and optimize LLM-based solutions for tasks such as text understanding, summarization, semantic search, dialogue systems, and reasoning.
Develop and orchestrate agentic workflows, integrating LLMs with tools, APIs, and data pipelines for autonomous task execution.
Fine-tune and align LLMs on domain-specific datasets, leveraging techniques such as RLHF, prompt engineering, retrieval-augmented generation (RAG), and knowledge grounding.
Architect multi-modal AI pipelines combining LLMs with computer vision, knowledge graphs, and structured data.
Ensure scalability, efficiency, and reliability of deployed models on cloud and edge environments.
Stay updated with the latest research in GenAI, agents, and applied AI, translating findings into innovative product features.
Collaborate with software engineers, data engineers, and product managers to integrate AI models into production systems.
Document methodologies, publish internal best practices, and present findings to stakeholders.

Requirements

Qualifications

• Master's or Ph.D. in Computer Science, Data Science, Machine Learning, Applied Mathematics, or related field.

• Advanced research and practical experience with large-scale machine learning models.

• 5+ years of hands-on experience in AI, machine learning, or intelligent agent system development.

Experience

• Proven experience in developing and deploying LLM-powered solutions.

• Strong background in natural language processing (NLP), applied machine learning, and GenAI projects.

• Experience with orchestration frameworks (LangChain, LlamaIndex, Haystack, or equivalent).

Skills

Proficiency in Python and ML frameworks (TensorFlow or PyTorch, Hugging Face).
Hands-on with LLM fine-tuning, evaluation, and optimization.
Knowledge of RAG pipelines, embeddings, and vector databases (e.g., FAISS, Pinecone, Milvus).
Familiarity with agentic architectures and workflow automation frameworks.
Familiarity with backend frameworks (e.g., FastAPI), containerization (Docker), and databases (PostgreSQL, Redis).
Strong analytical, problem-solving, and research skills.
Excellent communication skills to explain technical concepts to diverse audiences.
Ability to thrive in fast-paced, collaborative environments.

Preferred

Experience with multi-agent systems, autonomous reasoning, and planning frameworks.
Experience with vector databases (FAISS, Pinecone, Milvus) and orchestration frameworks (LangChain, LlamaIndex, Haystack).
Knowledge of Model Context Protocol (MCP) and reasoning models is a plus.
Familiarity with responsible AI, bias mitigation, and data privacy in LLM systems.
Knowledge of real-time applications (e.g., chatbots, copilots, AI agents). Knowledge of AI coding tools (Cursor, Claude code, OpenAI Codex, github copilot