Data Scientist, Applied AI - Remote

Data Scientist, Full Time, Remote, Team 79, Since 2016

Location

All locations: United States, Brazil, Mexico, Colombia, Argentina

Posted

28 days ago

Salary

Not specified

Bachelor Degree, 9 yrs exp, English, Airflow, AWS, Azure, CI/CD, Docker, GCP, JIRA, Kubeflow, Kubernetes, MLflow, NumPy, pandas, PlantUML, Python, PyTorch, Spark, TFX

Job Description

Azumo is currently looking for a highly motivated Data Scientist / Machine Learning Engineer to develop and enhance our data and analytics infrastructure. The position is FULLY REMOTE, based in Latin America. Professional English proficiency (B2/C1) is required.

This position will provide you with the opportunity to collaborate with a dynamic team and talented data scientists in the field of big data analytics and applied AI. If you have a passion for designing and implementing advanced machine learning and deep learning models, particularly in the Generative AI space, this role is perfect for you. We are seeking a skilled professional with expertise in Python for production-level projects, proficiency in machine learning and deep learning techniques such as CNNs and Transformers, and hands-on experience working with PyTorch.

We’re looking for a versatile Machine Learning Engineer / Data Scientist to join our big-data analytics team. In this hybrid role you’ll not only design and prototype novel ML/DL models, but also productionize them end-to-end, integrating your solutions into our data pipelines and services. You’ll work closely with data engineers, software developers and product owners to ensure high-quality, scalable, maintainable systems.

Key Responsibilities

Model Development & Productionization
Design, train, and validate supervised and unsupervised models (e.g., anomaly detection, classification, forecasting).
Architect and implement deep learning solutions (CNNs, Transformers) with PyTorch.
Develop and fine-tune Large Language Models (LLMs) and build LLM-driven applications.
Implement Retrieval-Augmented Generation (RAG) pipelines and integrate with vector databases.
Build robust pipelines to deploy models at scale (Docker, Kubernetes, CI/CD).

Data Engineering & MLOps
Ingest, clean and transform large datasets using libraries like pandas, NumPy, and Spark.
Automate training and serving workflows with Airflow or similar orchestration tools.
Monitor model performance in production; iterate on drift detection and retraining strategies.
Implement LLMOps practices for automated testing, evaluation, and monitoring of LLMs.

Software Development Best Practices
Write production-grade Python code following SOLID principles, with unit tests and code reviews.
Collaborate in Agile (Scrum) ceremonies; track work in JIRA.
Document architecture and workflows using PlantUML or comparable tools.

Cross-Functional Collaboration
Communicate analysis, design and results clearly in English.
Partner with DevOps, data engineering and product teams to align on requirements and SLAs.

About Azumo

Based in San Francisco, California, Azumo is an innovative software development firm specializing in AI software development services. We help companies of all sizes build intelligent applications by combining expertise in data, cloud, and AI. Our talented AI developers are trusted to deliver Top AI Development services in Generative AI, intelligent automation, and custom machine learning solutions.

At Azumo, we believe in professional and personal growth. As a recognized AI Development company, we support our engineers in mastering the latest technologies and delivering Top AI Development services worldwide. Our culture emphasizes collaboration, continuous learning, and solving complex problems with modern AI solutions. We believe in giving back to our community and will volunteer our time to philanthropy, open-source initiatives and sharing our knowledge.

If you are qualified for the opportunity and looking for a challenge, please apply online at Azumo/join-our-team or connect with us at people@azumo.co

Requirements

Minimum Qualifications
Bachelor’s or Master’s in Computer Science, Data Science or related field.
5+ years of professional experience with Python in production environments.
Solid background in machine learning & deep learning (CNNs, Transformers, LLMs).
Hands-on experience with PyTorch or similar frameworks (training, custom modules, optimization).
Proven track record deploying ML solutions.
Expert in pandas, NumPy and scikit-learn.
Familiarity with Agile/Scrum practices and tooling (JIRA, Confluence).
Strong foundation in statistics and experimental design.
Excellent written and spoken English.

Preferred Qualifications
Experience with cloud platforms (AWS, GCP, or Azure) and their AI-specific services like Amazon SageMaker, Google Vertex AI, or Azure Machine Learning.
Familiarity with big-data ecosystems (Spark, Hadoop).
Practice in CI/CD & container orchestration (Jenkins/GitLab CI, Docker, Kubernetes).
Exposure to MLOps/LLMOps tools (MLflow, Kubeflow, TFX).
Experience with Large Language Models, Generative AI, prompt engineering, and RAG pipelines.
Hands-on experience with vector databases (e.g., Pinecone, FAISS).
Experience building AI Agents and using frameworks like Hugging Face Transformers, LangChain or LangGraph.
Documentation skills using PlantUML or similar.

Benefits
Paid time off (PTO)
U.S. Holidays
Training
Udemy free Premium access
Mentored career development
Profit Sharing
$US Remuneration
