About ThinkTech:
ThinkTech is an Egyptian IT company specializing in providing professional services within the fintech and banking industry. With a strong client base across the region, we are committed to fostering growth and innovation.
About the Role
We are seeking an Artificial Intelligence Engineer to join our AI team and contribute to cutting-edge multi-modal models for real-world applications. In this role, you will work on designing, developing, and optimizing deep learning models for image and video analysis. You will collaborate closely with machine learning researchers, software engineers, and domain experts to bring AI-powered solutions to production in agriculture domain for smart farms.
Responsibilities:
- Develop, implement, and optimize state-of-the-art computer vision models for object detection, segmentation, tracking, and scene understanding.
- Develop and train state-of-the-art multi-modal networks, LLMs, VLMs, and ViTs.
- Train and fine-tune deep learning models using frameworks like PyTorch and TensorFlow.
- Conduct research and stay up-to-date with the latest advancements in computer vision and deep learning (e.g., Transformer-based architectures, self-supervised learning, diffusion models, generative models, foundation models, VLMs and CoT models).
- Improve the efficiency and scalability of models for deployment on edge devices, cloud platforms, or embedded systems.
- Work with large-scale datasets, preprocess images and videos, and apply data curation, data augmentation and synthetic data generation techniques.
- Collaborate with software engineers to integrate vision models into production pipelines and real-time applications.
- Optimize inference speed and memory usage using techniques like model quantization, pruning, and distillation.
- Debug, profile, and analyze performance bottlenecks in ML pipelines, design and develop MLOPs frameworks
- Participate in code reviews and contribute to best practices in ML model development and deployment.
Required Qualifications:
- 4+ years of experience in deep learning and computer vision, with a strong track record of designing and deploying state-of-the-art models.
- Bachelor's or Master's degree in computer science, electrical engineering, AI, or a related field.
- Strong proficiency in Python and experience with deep learning frameworks such as PyTorch or TensorFlow.
- Solid understanding of computer vision fundamentals (image processing, feature extraction).
- Experience with CNNs, Transformers (ViTs), diffusion models, CoT or other modern architectures/techniques.
- Experience with knowledge distillation, self-supervised learning, domain adaptation and other recent techniques.
- Hands-on experience in training and fine-tuning large-scale deep learning models on GPU-based environments.
- Experience with large databases and datalakes for data management, versioning and control.
- Proficiency in software development best practices, version control (Git), CI/CD, and working in a Linux environment.
- Hands-on experience deploying models to edge devices, embedded systems, or cloud-based ML services.
- Hands-on experience with AWS, including model training, deployment, and scaling on ECS, EC2, S3, Lambda, DataLake and Athena.
- Familiarity with containers like Docker and Kubernetes, and knowledge of C++ and CUDA for low-level optimization of vision models.