Cargill

Senior Data Engineer

Posted 18 Hours Ago

In-Office

Bengaluru, Bengaluru Urban, Karnataka

Senior level

Designs, builds, and operates scalable AWS-based batch and streaming data pipelines and platforms (Lakehouse and warehouse). Owns architecture, performance, security, and cost optimization; implements transformations (SQL/dbt, PySpark), Kafka ingestion, Airflow orchestration, and CI/CD/Terraform. Partners with product and platform teams, mentors engineers, and drives data modeling, schema evolution, and operational excellence.

The summary above was generated by AI

Job Purpose and Impact

The Senior Data Engineer designs, builds, and operates scalable, reliable data products and platforms that power analytics, reporting, and downstream applications. This role owns end‑to‑end delivery of batch and streaming data pipelines on a modern AWS‑based cloud data platform, applying strong engineering patterns to ensure performance, security, observability, and cost efficiency.
With minimal supervision, the role partners closely with product, analytics, and platform teams to translate business requirements into robust technical solutions across a Lakehouse (Iceberg) and approved warehousing platforms (e.g., Snowflake). The Senior Data Engineer also mentors other engineers, drives code quality, and raises the engineering bar across the organization.

Key Accountabilities

Data & Analytical Solutions
- Designs and delivers scalable data products using standard cloud and data engineering architectures.
- Owns technical decisions (batch vs. streaming, Lakehouse vs. warehouse) and ensures solutions meet reliability, security, governance, latency, and cost requirements.
- Reviews designs and contributes reusable components, templates, and standards.

Data Pipelines
- Builds and operates end‑to‑end batch and streaming pipelines.
- Implements transformations using SQL/dbt and PySpark as needed.
- Integrates real‑time or event‑driven ingestion using Kafka.
- Orchestrates workflows with Airflow; establishes SLAs/SLOs and CI/CD‑based deployments.

Data Systems & Architecture
- Optimizes data architectures for performance, scalability, and cost.
- Applies best practices for Iceberg table design, incremental processing, and query optimization across Hive, Impala, Snowflake, and RDBMS.
- Diagnoses systemic issues and drives remediation with platform teams.

Data Infrastructure (AWS)
- Leads technical readiness across dev/test/prod environments.
- Works hands‑on with AWS services including S3, Glue, Lambda, IAM, and SageMaker.
- Partners with governance and platform teams on access control, tagging, and operational support.

Data Modeling & Formats
- Leads modeling across RAW, CURATED, and SERVING layers.
- Applies dimensional or normalized models for correctness, performance, and usability.
- Implements efficient formats (Parquet + Iceberg) with clear schema evolution strategies.

DevOps & CI/CD
- Designs and improves Git‑based CI/CD pipelines and infrastructure‑as‑code using Terraform.
- Ensures quality gates, auditability, and compliance with governance requirements.

Stakeholder & Engineering Leadership
- Partners with product, analytics, and platform teams to align on requirements, data contracts, and SLAs.
- Communicates complex technical topics clearly and leads technical discussions.
- Coaches engineers and raises engineering standards through reviews and documentation.

AI‑First & Product Mindset
- Uses GenAI‑assisted development responsibly to accelerate delivery.
- Builds products, not just pipelines, focusing on usability, adoption, reliability, and lifecycle ownership.
- Designs systems end‑to‑end and continuously optimizes cost‑performance trade‑offs using metrics.

Qualifications

- 8+ years of total experience with 6+ years of Data Engineering experience.
- Strong expertise in AWS‑based data engineering and scalable cloud architectures.
- Proven experience building end‑to‑end batch and streaming pipelines, including Kafka.
- Advanced proficiency in SQL, Hive, Impala, and PostgreSQL / RDBMS.
- Strong programming skills in Python and PySpark.
- Hands‑on experience with AWS Glue, Lambda, S3, IAM, and SageMaker.
- Experience with Snowflake and modern data warehousing.
- Expertise in CI/CD, Terraform, and DevOps practices.
- Proficiency in Airflow for workflow orchestration.
- Experience with Power BI for data visualization and reporting.
- Strong foundation in data modeling, performance optimization, and large‑scale data systems.
