Dow Jones is a global leader in news and business information. For over 130 years, it has delivered unrivaled quality content across multiple formats, including print, digital, mobile, and live events.
About the Role:
We are seeking an experienced Lead Data Scientist to strengthen our core data science capabilities across the AI Engineering and Commercial Tech teams. You will own the design, construction, and maintenance of machine learning pipelines for multiple AI applications, with a focus on Natural Language Processing.
You will oversee the data science project development lifecycle, from data analysis and model selection to developing proofs of concept and refining pipelines.
You Will:
* Lead collaboration within the AI Engineering Team to maintain and optimize robust data pipelines supporting multiple ML models, with a focus on information retrieval and AdTech applications.
* Engineer scalable, high-performance machine learning solutions, using modern modeling techniques, rigorous testing methodologies, and early validation on targeted user cohorts.
* Guide strategic decision-making by drawing meaningful insights from large datasets, defining key metrics, and developing relevant solutions to address business challenges.
* Partner with cross-functional teams to translate business requirements into technical solutions, ensuring alignment with strategic objectives.
* Lead the integration and deployment of diverse ML models into production systems, ensuring interoperability, performance optimization, and real-world effectiveness.
* Mentor junior team members, promoting collaboration, knowledge sharing, and professional development, while contributing to strategic decision-making to enhance the AI capabilities of the team.
* Stay ahead of AI, ML, and NLP advancements, incorporating new trends to enhance processes and model performance.
* Identify and scope new data programs, asking thoughtful questions, diagnosing the primary business problems, and developing long-term, impactful solutions.
You Have:
* 7+ years of industry experience in data science or machine learning engineering, with experience leading projects and mentoring teams.
* Expert-level programming skills in Python or another high-level language commonly used in machine learning.
* Expertise in NLP and Machine Learning frameworks and libraries (e.g., PyTorch, HuggingFace, LangChain, spaCy, NLTK, scikit-learn).
* Experience with information retrieval techniques and structured data extraction from unstructured sources.
* Hands-on experience using LLM APIs, including pre-processing data, fine-tuning models, and deploying them on cloud infrastructure (AWS, GCP).
* Proficiency in containerization and orchestration technologies such as Docker and Kubernetes for scalable ML solutions.
* Demonstrated ability to drive impact in AI-driven applications such as document analysis, classification, summarization, translation, personalization, and chatbots.
Benefits:
* Comprehensive benefits package including healthcare plans, extra paid time off, remote work options, a meal benefit, retirement plans, family care benefits, subscription discounts, and an employee referral program.