Senior AI Research Engineer - Data & Infrastructure
Location
United States + 144 moreAll locations: United States, Canada, Brazil, Colombia, Argentina, Chile, Venezuela, Bolivarian Republic Of, Bolivia, Plurinational State Of, Ecuador, French Guiana, Guyana, Paraguay, Peru, Suriname, Uruguay, Mexico, Costa Rica, El Salvador, Guatemala, Honduras, Nicaragua, Panama, Dominican Republic, Puerto Rico, Bahamas, Guadeloupe, Haiti, Jamaica, Martinique, Montserrat, United Kingdom, Germany, France, Estonia, Portugal, Hungary, Poland, Ukraine, Romania, Bulgaria, Czech Republic, Slovakia, Belarus, Moldova, Republic Of, Sweden, Greece, Belgium, Italy, Ireland, Switzerland, Netherlands, Finland, Malta, Denmark, Lithuania, Croatia, Spain, Austria, Bosnia And Herzegovina, Iceland, Luxembourg, Macedonia, The Former Yugoslav Republic Of, Montenegro, Norway, Serbia, Slovenia, Albania, Cyprus, Latvia, Monaco, South Africa, Egypt, Algeria, Angola, Benin, Botswana, Burkina Faso, Burundi, Cameroon, Cape Verde, Central African Republic, Chad, Congo, Côte D'ivoire, Congo, The Democratic Republic Of The, Equatorial Guinea, Eritrea, Ethiopia, Gabon, Gambia, Ghana, Guinea, Guinea-bissau, Kenya, Lesotho, Liberia, Libyan Arab Jamahiriya, Madagascar, Malawi, Mali, Mauritania, Mauritius, Mayotte, Morocco, Mozambique, Namibia, Niger, Nigeria, Réunion, Rwanda, Senegal, Seychelles, Sierra Leone, Somalia, Sudan, Swaziland, Tanzania, United Republic Of, Togo, Tunisia, Uganda, Zambia, Zimbabwe, Georgia, Turkey, Israel, United Arab Emirates, Armenia, Azerbaijan, Bahrain, Iraq, Jordan, Kuwait, Lebanon, Oman, Qatar, Saudi Arabia, Palestinian Territory, Occupied, Yemen
Posted
6 days ago
Salary
Not specified
Job Description
Role Description
We're seeking experienced AI infrastructure Engineers to design and implement robust, scalable pipelines for massive data workloads. Join Tether’s applied research team, where you’ll contribute to high-impact projects that run across thousands of GPUs and drive cutting-edge video generation foundation development.
Responsibilities
- Build and scale high-throughput data infrastructure optimized for video and multimodal content processing across large GPU clusters (e.g., H100/H200).
- Design core preprocessing algorithms for video, audio, text, and image modalities, enabling efficient extraction, synchronization, and normalization of temporal data.
- Build automated acquisition pipelines for sourcing large-scale video datasets, handling diverse formats, frame rates, annotations, and embedded audio.
- Architect robust systems for scalable evaluation and annotation, including prompt-based scoring, perceptual metrics, caption generation, and retrieval-based diagnostics.
- Collaborate with model researchers to co-design video model architectures (e.g. DiTs, VAEs, spatio-temporal transformers) and training schedules across pretraining and fine-tuning stages.
- Optimize distributed data loading and pipeline throughput for training at scale, ensuring robustness across model variants and modality combinations.
- Manage infrastructure to support experiment tracking, model versioning, and cross-team deployment workflows, integrating with production and research platforms.
- Support backend engineering across research, product, and creative teams to ensure seamless integration of data and model workflows from prototyping to inference.
Qualifications
- Proficient in Python with strong programming skills across backend, infrastructure, and data tooling domains.
- Strong software engineering experience, including 2+ years working with petabyte-scale data pipelines and systems across thousands of GPUs.
- Proven ability to architect and maintain large-scale distributed systems for data processing and delivery.
- Deep expertise in orchestration frameworks such as Kubernetes and SLURM with hands-on experience deploying and managing high-throughput workloads.
Preferred Qualifications
- Practical experience on building pipelines and infrastructure with visual and multimodal datasets, including image/video pipelines.
- Experience in building video foundation infrastructure pipelines and workflows with collaboration of LLM and/or video foundation research and engineering teams is a strong advantage.
Important information for candidates
- Apply only through our official channels.
- We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page: https://tether.recruitee.com/
- Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles.
- Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS.
- Double-check email addresses. All communication from us will come from emails ending in @tether.to or @tether.io.
- We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam.
Job Requirements
- Proficient in Python with strong programming skills across backend, infrastructure, and data tooling domains.
- Strong software engineering experience, including 2+ years working with petabyte-scale data pipelines and systems across thousands of GPUs.
- Proven ability to architect and maintain large-scale distributed systems for data processing and delivery.
- Deep expertise in orchestration frameworks such as Kubernetes and SLURM with hands-on experience deploying and managing high-throughput workloads.
- Preferred Qualifications
- Practical experience on building pipelines and infrastructure with visual and multimodal datasets, including image/video pipelines.
- Experience in building video foundation infrastructure pipelines and workflows with collaboration of LLM and/or video foundation research and engineering teams is a strong advantage.
- Important information for candidates
- Apply only through our official channels.
- We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page: https://tether.recruitee.com/
- Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles.
- Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS.
- Double-check email addresses. All communication from us will come from emails ending in @tether.to or @tether.io.
- We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam.
Related Guides
Related Job Pages
More AI Research Scientist Jobs
German Language Specialist
Invisible AgencyEmployment type: Freelance / Contract Workplace type: Remote Seniority level: Mid‑Senior Level
Are you a German language expert eager to shape the future of AI? Large‑scale language models are evolving from clever chatbots into powerful engines of linguistic discovery. With high‑quality training data, tomorrow’s AI can democratize world‑class education, keep pace with cutt...
Research Director - Genetics
Private Health ManagementPrivate Health Management's team of clinical, research and care management experts provide unparalleled access to the best doctors, diagnostics and treatments to find the best of what’s possible in medicine. From routine and preventative care to serious or complex conditions, our integrated clinical approach and proprietary resources deliver more accurate diagnoses, optimal treatments and better outcomes that consistently improve, extend, and save lives. Our deep and trusted relationships allow us to gain access and expertly navigate the complex healthcare system on behalf of our clients. By maintaining independence from all providers and payors, we keep our members’ interests at the heart of every care decision. We serve as a committed healthcare champion working directly with individuals and families and with businesses that provide our services to their employees as a premium benefit
The Research Director will lead end-to-end research strategy for complex and rare disease cases, delivering evidence-based treatment pathways and identifying diagnostic gaps. Responsibilities include evaluating standard-of-care and investigational therapies globally and translating complex medical literature into actionable, patient-friendly reports.
The Senior Engineer, AI is responsible for building AI and Machine learning models and pipelines, with a focus on generative AI, LLMs and predictive modelling. The successful candidate will work closely with data scientists and product leads to implement AI models, build required...
Generative AI Specialist - Humanities (English and Russian)
Innodata IncInnodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years.
At Innodata, we’re partnering with the world’s leading technology companies to build the future of generative AI and large language models (LLMs). We’re on the lookout for smart, savvy, and curious Generative AI Specialist to join our global contributor community as part of our S...