
Vector Data Engineer (Madrid)

Madrid
Johnson & Johnson
Posted on 1 November
Description

This job is with Johnson & Johnson, an inclusive employer and a member of myGwork – the largest general platform for the LGBTQ+ business community. Please do not contact the recruiter directly.

At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to innovate across the full spectrum of healthcare solutions today to deliver the breakthroughs of tomorrow, and profoundly impact health for humanity. Learn more at

Job Function: Data Analytics & Computational Sciences

Job Sub Function: Data Science

Job Category: Scientific / Technology

All Job Posting Locations: Cornellà de Llobregat, Barcelona, Spain; Madrid, Spain

Job Description: Johnson & Johnson Innovative Medicine (J&J IM), a pharmaceutical company of Johnson & Johnson, is recruiting for a Vector Data Engineer. This position has a primary location of Barcelona, Spain. The secondary location is Madrid. This is a hybrid role. Our expertise in Innovative Medicine is informed and inspired by patients, whose insights fuel our science-based advancements. Visionaries like you work in teams that save lives by developing the medicines of tomorrow. Join us in developing treatments, finding cures, and pioneering the path from lab to life while championing patients every step of the way. Learn more at

Position Summary: The Vector Data Engineer designs and implements the embedding and semantic-search infrastructure that connects discovery, translational, and clinical data into AI-ready knowledge representations. This role bridges multi-omics data engineering and machine-learning infrastructure, enabling scientists and agentic tools to discover biological insights through vector-based search and reasoning.

Key Responsibilities

- Develop scalable pipelines that convert multi-omics and clinical data (e.g., proteomics, transcriptomics, spatial omics, biomarkers) into vectorized embeddings for AI and semantic retrieval.
- Build and maintain vector databases and hybrid data stores using technologies such as TileDB, Weaviate, Snowflake, Cortex, and other vector/database systems.
- Collaborate with the Data Transformation Engineers to design standardized data formats suitable for embedding generation and cross-modality mapping.
- Integrate metadata, ontology terms, and provenance into vector representations to ensure traceability and governance compliance.
- Partner with the AI / ML team to deploy embeddings supporting agentic reasoning, semantic similarity, and cross-dataset query.
- Optimize indexing, retrieval, and inference performance across large-scale multi-omics data collections.
- Evaluate and incorporate emerging representation-learning and knowledge-graph techniques to improve data discoverability and model interoperability.

Qualifications

- MS / PhD in Computer Science, Computational Biology, Data Science, or related field.
- 3+ years of experience building or maintaining vector or semantic-retrieval infrastructure.
- Hands‑on experience with multi-omics or biomedical data integration (e.g., RNA‑seq, proteomics, clinical endpoints).
- Proficiency in Python and frameworks such as LangChain, Transformers, or sentence‑embedding models.
- Familiarity with TileDB, Snowflake, Weaviate, FAISS, or other vector/array database systems.
- Understanding of metadata modeling, ontologies (e.g., OBO, UMLS), and FAIR data practices.
- Strong ability to collaborate across solution architecture, data science, and AI / ML teams.

Strategic Impact

Multi-omics and clinical data assets are transformed into interoperable, vectorized embeddings that support scientific AI applications. AI can perform semantic queries and reasoning over governed datasets. Vector database infrastructure scales efficiently and complies with governance and lineage standards, accelerating insight generation across discovery, translational, and clinical domains.

