Data Entry Question and Answer

~~$150~~ $49

Buy Now

Remote Question and Answer

~~$150~~ $49

Buy Now

Virtual Assistant Question and Answer

~~$150~~ $49

Buy Now

Content Moderator Question and Answer

~~$150~~ $49

Buy Now

Customer Service Question and Answer

~~$150~~ $49

Buy Now

Video Game Tester Question and Answer

~~$150~~ $49

Buy Now

ML Engineer – Experimentation Platform

June 2, 2026

Other Jobs To Apply

No other job posts for this day.

Job Title: ML Engineer – Experimentation Platform Experience: 3 – 4 Years Location: Remote Notice Period: Immediate Joiners Only About the Role We are looking for a highly skilled ML Engineer to join our Test & Learn Platform team. In this role, you will build and scale experimentation and causal inference services that enable business teams to make data-driven decisions globally. You will work across statistical modeling, API development, cloud-native infrastructure, and large-scale data processing to deliver reliable and production-ready ML solutions. Key Responsibilities <ul> <li>Develop and maintain statistical and machine learning modules for: </li> <ul> <li>Difference-in-Differences (DID) </li> <li>Synthetic Control </li> <li>A/B Testing </li> <li>Multi-Treatment Effects </li> </ul> <li>Build and extend RESTful APIs using FastAPI and integrate them with web applications through SDK wrappers </li> <li>Design and optimize large-scale data pipelines using PySpark, Delta Lake, and Azure Data Lake </li> <li>Diagnose and resolve Out-of-Memory (OOM) issues in PySpark workloads by optimizing: </li> <ul> <li>Memory allocation </li> <li>Partitioning </li> <li>Broadcast joins </li> <li>Caching strategies </li> <li>Spark configurations </li> </ul> <li>Deploy and manage Databricks workloads including notebooks, job clusters, and Delta Lake tables </li> <li>Containerize and deploy services using Docker, Kubernetes, and CI/CD pipelines </li> <li>Ensure code quality, testing, and security using PyTest, SonarCloud, and Snyk </li> <li>Collaborate closely with Data Scientists and Product teams to convert research concepts into scalable production systems </li> <li> </li> </ul> Mandatory Skills <ul> <li>Strong experience in Python (3.9+) </li> <li>Hands-on expertise in: </li> <ul> <li>PySpark & Spark Internals </li> <li>Databricks </li> <li>FastAPI / API Development </li> <li>Azure Cloud Platform </li> <li>Kubernetes & Docker </li> <li>PyTest </li> </ul> <li>Strong understanding of: </li> <ul> <li>DID </li> <li>Synthetic Control </li> <li>A/B Testing </li> <li>Hypothesis Testing </li> <li>Panel Data Methods </li> </ul> <li>Expertise in statistical and ML libraries: </li> <ul> <li>statsmodels </li> <li>scikit-learn </li> <li>SciPy </li> <li>Pandas </li> <li>NumPy </li> </ul> </ul> Technical Requirements PySpark & Spark Internals <ul> <li>Strong understanding of Spark memory model </li> <li>Executor tuning and shuffle optimization </li> <li>Diagnosing and resolving OOM errors </li> <li>Experience with: </li> <ul> <li>Broadcast thresholds </li> <li>Partition skew handling </li> <li>Spill-to-disk optimization </li> <li> </li> <li>GC tuning </li> </ul> </ul> Databricks <ul> <li>Hands-on experience with: </li> <ul> <li>Job orchestration </li> <li>Cluster configuration </li> <li>Notebook workflows </li> <li>Delta Lake optimization </li> <li>Z-ordering, compaction, and caching </li> </ul> </ul> Cloud & DevOps <ul> <li>Azure Storage, Azure ML, and Azure Data Lake </li> <li>Docker-based containerization </li> <li>Kubernetes orchestration for ML workloads </li> <li>CI/CD pipeline integration </li> </ul> Testing & Quality <ul> <li>Unit and integration testing using PyTest </li> <li>Familiarity with SonarCloud, Snyk, and GitHub Actions </li> </ul> Good-to-Have Skills <ul> <li>Experience with Celery and Redis for async task orchestration </li> <li>Familiarity with Polars, PyArrow, or SQLAlchemy </li> <li>Background in econometrics or experimental design </li> <li>Experience with Spark UI profiling and performance benchmarking </li> <li>Knowledge of advanced CI/CD tooling and automation practices </li> </ul> Preferred Candidate Profile <ul> <li>Strong analytical and problem-solving abilities </li> <li>Ability to work independently in a remote setup </li> <li>Excellent collaboration and communication skills </li> <li>Passion for building scalable ML and experimentation platforms </li> </ul> Tech Stack Languages & Libraries: Python, Pandas, NumPy, SciPy, statsmodels, scikit-learn Big Data: PySpark, Spark Internals, Delta Lake Cloud & Platforms: Azure, Databricks, Azure Data Lake APIs & Backend: FastAPI DevOps: Docker, Kubernetes, GitHub Actions Testing & Security: PyTest, SonarCloud, Snyk