Data Analyst I

Nexus Consulting

United Arab Emirates

Fresher

Posted 7 hours ago

Job Description

Role: Data Analyst I
Location: 100% Remote (Global)
Compensation: USD 7085 per hour

About the Role

We are seeking a detail-oriented Data Analyst I to support large-scale data curation and evaluation initiatives for advanced generative AI systems. This role focuses on improving model quality across key dimensions such as visual fidelity, prompt adherence, identity preservation, naturalness, and text generation within images.

You will work closely with engineers and research teams to manage data labeling workflows, maintain high-volume data pipelines, audit annotations, and analyze model outputs to identify quality gaps.

This is an onsite role requiring hands-on collaboration with technical teams.

Key Responsibilities

Data Curation & Labeling Operations

Manage end-to-end data labeling workflows
Enqueue datasets for labeling and maintain labeling interfaces
Extract structured labels for modeling teams
Manually annotate training data when required
Audit and correct human-labeled data

Data Engineering & Pipelines

Maintain and optimize large-scale data processing pipelines (billions of images)
Support data sourcing and content understanding using ML models
Leverage LLMs to clean, annotate, and evaluate data
Assist in building efficient ETL workflows

Data Governance

Maintain the dataset portfolio with proper access controls
Ensure compliance with data retention and privacy standards
Support governance and documentation practices

Analysis & Model Evaluation

Identify model quality gaps using structured evaluation protocols
Collaborate with engineers to summarize findings and recommend improvements
Mine and prepare new datasets for iterative model training
Scale validated evaluation frameworks across product teams

Required Qualifications

Associate's degree or equivalent training in Computer Science, Engineering, Physics, Bioinformatics, or another STEM field
Basic knowledge of Python and SQL
Foundational understanding of computer vision and generative AI models
Experience with data ETL workflows or pipelines
Familiarity with using LLMs for data labeling or evaluation tasks
Strong attention to detail and analytical thinking

Preferred Qualifications

Prior industry experience in software development, QA, or research
Exposure to human-computer interaction or ML evaluation work
Experience working in large-scale technology environments
Strong written and verbal communication skills

Work Environment

Onsite collaboration with engineering teams in Menlo Park, CA
Fast-paced, research-driven environment
High-impact role supporting next-generation AI systems

Equal Opportunity Statement

We are an equal opportunity employer and consider all qualified applicants without regard to legally protected characteristics. Qualified applicants with arrest and conviction records will be considered in accordance with applicable laws. Reasonable accommodations are available upon request.
