Maincode

Data Scientist

Reposted 21 Days Ago
In-Office
Melbourne, Victoria
Entry level
The Data Scientist will handle large-scale data to support AI model training, analyzing datasets and ensuring data quality and performance. Collaboration with engineers and researchers is key, along with designing data workflows, pipelines, and tools for dataset evaluation.
Overview

Maincode is building Australian-made AI models from the ground up. We train foundation models from scratch, design new reasoning architectures, and deploy them on our own state-of-the-art GPU clusters. Our data and infrastructure are entirely homegrown, from curation to large-scale training, to ensure independence, transparency, and excellence in model performance.

We’re looking for a Data Scientist who thrives at the intersection of data engineering, machine learning, and creative experimentation. You’ll help shape the datasets and data systems that power the next generation of models.

This role bridges deep technical execution (pipelines, validation, distributed data processing) with the curiosity and innovation needed to push data science into new territory. You’ll work closely with researchers and engineers to make data the backbone of Australia’s AI capability.


What you’ll do
  • Engineer and innovate with data: Design scalable data workflows that handle massive, heterogeneous datasets (text, code, multimodal, structured).

  • Prototype novel data science approaches: Apply advanced techniques for dataset synthesis, filtering, augmentation, and generation to improve downstream model reasoning.

  • Build production-grade pipelines: Automate ingestion, cleaning, transformation, and validation of large-scale data for model training.

  • Develop intelligent metrics: Build tools and metrics for assessing dataset quality, diversity, and performance impact.

  • Collaborate across disciplines: Work with AI researchers to shape training corpora aligned with emerging model architectures and objectives.

  • Continuously refine systems: Improve how data flows through the entire training stack, from curation to evaluation.

  • Champion data quality and ethics: Help define standards for responsible, high-integrity data use in AI.


Who you are
  • Strong foundation in Python, data processing frameworks (Pandas, PySpark, Dask, or Ray), and large-scale data systems.

  • Skilled in data analysis, feature engineering, and statistical reasoning.

  • Experienced working with multi-terabyte or distributed datasets in production environments.

  • Familiar with or curious about deep learning, data-centric AI, and model training pipelines.

  • Eager to experiment, combining scientific rigour with creative problem-solving.

  • Motivated to help shape Australia’s independent AI capability through world-class data infrastructure.


Why Maincode

Maincode is a small, highly technical team operating at the frontier of AI research and infrastructure. We build, train, and deploy foundation models from scratch rather than fine-tuning existing ones, and the data you work on will directly shape model behavior at scale.

You’ll join a team that:

  • Treats data as a core differentiator of AI progress.

  • Values experimentation and scientific precision in equal measure.

  • Builds clean, transparent, and scalable systems from first principles.

  • Aims to make Australia a leader in independent AI innovation.

Top Skills

Dask
Pandas
PySpark
Python
Ray
HQ

Maincode Melbourne, Victoria, AUS Office

Melbourne, VIC, Australia, 3000


