Database & Infrastructure Engineer (Full Stack)

Data Engineer, Full Time, Remote

Location: United States

Posted: 4 days ago

Salary: $150K - $250K / year

PostgreSQL, Python, TypeScript, S3, Deno, Indexing, Partitioning, Performance Tuning, ETL, CI/CD

Job Description

ABOUT KLED

Kled is building the largest opt-in human data network in the world.

We are not a labeling firm. We are not a task marketplace.

We are a consumer application where people upload their real photos, videos, and documents and get paid continuously.

We then filter, standardize, and license that data to frontier AI labs and enterprises that need fresh, rights-aware training data.

Since launching our mobile app in 2026, we have:

• Reached #1 on the App Store (Finance) with zero paid marketing
• Scaled to 200,000+ active data contributors
• Processed 1.5–3M uploads per day
• Raised $5M+ from investors behind SpaceX, Airbnb, Coinbase, xAI, OpenAI, Anthropic, Spotify, Lyft, Uber, and more

Our mission is to let anyone download the app and earn a real living wage from uploading their data.

ABOUT THE ROLE

Database & Infrastructure Engineer (Full-Stack Systems)

We process millions of files per day and store hundreds of millions of media records.

Your job is to make our data layer world-class.

You will:

• Optimize and scale our PostgreSQL (Supabase) infrastructure
• Design indexing, partitioning, and query strategies for large-scale media datasets
• Improve performance across ingestion, enrichment, and retrieval pipelines
• Build internal tools for querying and auditing large datasets
• Create customer-ready dataset sample packs
• Design and automate dataset exports and delivery pipelines (S3, secure transfers, custom formats)
• Work across backend, ML, and product teams to support new features

This is not just DBA work.

You’ll help design the systems that move and package the data powering frontier AI labs.

WE’RE LOOKING FOR

• Strong PostgreSQL expertise (indexing, partitioning, performance tuning)
• Experience working with large datasets (100M+ records preferred)
• Deep understanding of storage systems (S3 or similar object storage)
• Strong backend experience (TypeScript, Python, or similar)
• Comfort building internal tooling and automation scripts
• Ability to move between database, backend, and infrastructure work

Bonus:

• Experience with data pipelines (ETL, transformation layers)
• Experience with vector databases (pgvector, FAISS, Pinecone)
• Experience delivering structured datasets to enterprise customers
• DevOps experience (CI/CD, infra automation)
• Experience working with media-heavy systems

CURRENT STACK

Backend
• PostgreSQL (Supabase) — 188M+ media files
• S3 storage
• Deno / TypeScript edge functions
• Python ML pipelines

Frontend
• SwiftUI (migrating to Flutter)

COMPENSATION

• Base salary: $150,000 – $250,000
• Equity: $150,000 – $350,000
• Benefits
• Relocation support
• SF HQ (SoMa) or remote

We move fast and work hard (9–9 culture).

If you're excited to build the world’s largest consumer app, let’s talk!

GROWTH OPPORTUNITY

You’ll join a team operating at the frontier of applied AI data infrastructure. We move fast and work 7 days a week.

In this role, you’ll have the opportunity to:

• Own core systems that power one of the largest human data networks in the world
• Design infrastructure that directly influences what data trains next-generation AI models
• Build at real scale: millions of uploads per day, adversarial environments, global contributors
• Ship alongside a team that has built marketplaces, AI systems, and products used by millions

If you’re excited to move fast, build systems that matter, and help define how human data powers frontier AI, let’s talk.
