Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster.
We are committed to making a positive impact on our customers, employees, and communities.
The Role:
Veeva Link supports the life sciences industry to connect with key people to improve research and care.
You will work on Veeva Link's next-gen Data Platform, improving our current environment with features, refactoring, and innovation.
As a data engineer, you will focus on our data pipelines and take responsibility for a major part of the Link data processing platform.
Your Responsibilities:
* Work on Veeva Link's next-gen Data Platform
* Improve our current environment with features, refactoring, and innovation
* Work with JVM-based languages or Python on Spark-based data pipelines
* Operate ML models in close cooperation with our data science team
* Experiment in your domain to improve precision, recall, or cost savings
Requirements:
* Expert skills in Java or Python
* Experience with Apache Spark or PySpark
* Experience writing software for the cloud (AWS or GCP)
* Speaking and writing in English enables you to take part in day-to-day conversations in the team and contribute to deep technical discussions
Nice to Have:
* Experience with operating machine learning models (e.g., MLFlow)
* Experience with Data Lakes, Lakehouses, and Warehouses (e.g., DeltaLake, Redshift)
* DevOps skills, including terraform and general CI/CD experience
* Previously worked in agile environments
* Experience with expert systems
Perks & Benefits:
* Comprehensive benefits package
* Annual allocations for continuous learning, development & charitable contributions
* Fitness reimbursement
* Veeva Work-Anywhere