Job Location : New York,NY, USA
Our client is actively seeking to hire a Data Scientist for a Greenfield RTB System, where the individual will build the system from scratch. The role involves optimizing estimation models to enhance the performance of bidding algorithms and maximize outcomes for customers. Responsibilities include planning, executing, and analyzing experiments to understand the dynamics of the real-time bidding marketplace, exploring data to drive business growth, and utilizing strong Python skills (including numpy, scipy, and scikit-learn) to analyze data and prototype models for high-volume distributed systems.
The candidate will build predictive and forecasting models using techniques such as ARIMA, ARIMAX, and SARIMAX, and will turn prototypes into production-grade models using Spark, managing job dependencies with Airflow. Monitoring and measuring model performance with tools like Prometheus and Grafana, and quickly building dashboards and alerts to ensure model effectiveness, are also key responsibilities. Additionally, the role involves working with batch and online learning systems, developing real-time learning and training systems capable of processing billions of data points daily.
Minimum requirements include a Bachelor's Degree or equivalent in Statistics, Mathematics, Data Science, or a related field, along with at least 3 years of experience in data science. The candidate should have 3 years of experience with Python, R, Java 8, Spark, Airflow, DataBricks, MySQL, Redshift, MongoDB, S3/Parquet, and cloud platforms such as AWS or GCP. Experience in machine learning modeling, A/B testing, building data pipelines, ETL processes, and natural language processing or information retrieval is also required.
#J-18808-Ljbffr