Data First Jobs

BayOne Solutions

Agentic AI Data Engineer (Scientific Data)

Contract · In Office · USA

Posted Jun 16, 2026

Work Options
Cloud Stack
Industry
Positions
Job Type
Position Group
  • Job Title: Agentic AI Data Engineer (Scientific Data)
  • Location: Remote
  • Duration: 12+ Months - contract to hire
  • Pay Rate: $70-80/hr on W2/C2C
  • Key Responsibilities
  • Build an agentic data ingestion pipeline.
  • Triage and prioritize incoming requests to ingest specific datasets.
  • Clean and organize the data. Build the first pass cleaning and organization steps into the agentic flow.
  • Validate cross-modal linkage. Add automated checks that catch when ingested data does not connect correctly and flag low quality or mismatched records.
  • Version every dataset. Retain and make prior versions addressable.
  • Preserve raw data and provenance. Make agent workflows log validation and transformation steps so lineage is traceable.
  • Make agents usable across teams. Move beyond bespoke steps towards agents that teams can reliably use as a shared, deployed service.
  • Collaborate with AI, software engineering, and computational biology groups to co-define data standards and conventions.
  • Qualifications & Requirements
  • Core (Required)
  • Agentic AI engineering: Demonstrated experience building multi-agent workflows or LLM workflows using tools/frameworks such as LangGraph or LlamaIndex, including tool/function calling and asynchronous task execution.
  • Python data engineering: Strong Python for data manipulation, working with APIs and databases, and handling heterogeneous data formats.
  • Data versioning and provenance: Familiarity with dataset versioning approaches (e.g. DVC, lakeFS, or equivalent).
  • Working knowledge of scientific data structures: Comfortable or willingness to learn common omics data formats like AnnData, H5AD, TileDB.
  • Basic understanding of omics: No deep bioinformatics expertise required; just a basic understanding of different modalities (e.g. what is RNA-seq vs scRNA-seq vs WES; genomics vs transcriptomics vs proteomics vs metabolomics).
  • Unit testing: Comfortable writing unit and functional tests to ensure data processing workflows are reliable and reproducible.
  • Education: Degree in a technical field or equivalent practical experience.
  • Nice to have
  • Experience deploying agent workflows as a shared service (e.g., FastAPI or MCP endpoints).
  • Exposure to cloud (AWS, GCP) and containerization (Docker).
  • Familiarity with workflow managers such as Nextflow or Snakemake.

BayOne is an Equal Opportunity Employer and does not discriminate against any employee or applicant for employment because of race, color, sex, age, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any federal, state, or local protected class. This job posting represents the general duties and requirements necessary to perform this position and is not an exhaustive statement of all responsibilities, duties, and skills required. Management reserves the right to revise or alter this job description.

Mention you found this on Data First Jobs — it helps us bring you more roles like this.

Agentic AI Data Engineer (Scientific Data)

BayOne Solutions

Like this role? Get carefully selected jobs like it, twice a week, straight to your inbox.

Free, no spam. Unsubscribe anytime.