Data Engineer for Language Technologies (RE1) (Barcelona)

Barcelona
Barcelona Supercomputing Center (BSC)
Published on 16 May
Description

**Job Reference**:

- 606_25_LS_LT_RE1

**Position**:

- Data Engineer for Language Technologies (RE1)

**Closing Date**:

- Saturday, 18 October, 2025

**About BSC**
- The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, was a founding and hosting member of the former European HPC infrastructure PRACE (Partnership for Advanced Computing in Europe), and is now a hosting entity for EuroHPC JU, the Joint Undertaking that leads large-scale investments and HPC provision in Europe. The mission of BSC is to research, develop and manage information technologies in order to facilitate scientific progress. BSC combines HPC service provision and R&D into both computer and computational science (life, earth and engineering sciences) under one roof, and currently has over 1000 staff from 60 countries.

We promote Equity, Diversity and Inclusion, fostering an environment where each and every one of us is appreciated for who we are, regardless of our differences.

**Context And Mission**
- The Language Technologies Laboratory at BSC has consolidated experience in several NLP areas, such as massive language model building, biomedical text mining, machine translation and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and the Catalan governments with the mission to develop fundamental open-source resources and technologies for Spanish and Catalan. In connection with this, the LT Lab is currently in charge of two flagship projects at the national and regional level: the ALIA project, funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence, and the AINA project, aimed at developing AI resources for Catalan, funded by the Catalan Digitalisation Department. In addition, the Lab participates in various EU-funded international projects.
- The researcher will implement innovative techniques for language modeling and evaluation in the HPC environment.
- This contract is funded by the project "Despliegue de la familia de Modelos ALIA en castellano y lenguas cooficiales" (Deployment of the ALIA family of models in Spanish and the co-official languages), external reference 2024EtL00019, promoted by the Secretaría de Estado de Digitalización e Inteligencia Artificial (SEDIA), with funds from the Ministerio para la Transformación Digital y de la Función Pública, financed by the European Union-NextGenerationEU.

**Key Duties**
- Work, in collaboration with the group members, on the design and development of the solutions needed to achieve the goals of the group’s research projects.
- Interact with relevant stakeholders of the group’s research projects to understand their problems and the available data to formulate valuable solutions.
- Ensure the long-term acquisition, management and accessibility of language data by designing and implementing scalable storage solutions, structured data systems and processing tools.
- Collaborate with the members of the group in the generation and evaluation of language models using Deep Learning techniques (Transformers, Recurrent Neural Networks, and other neural network architectures).

**Requirements**:

- Education
  - Degree in Applied Linguistics, Computer Science or related disciplines with a very strong linguistic background.
- Essential Knowledge and Professional Experience
  - Native speaker of Spanish.
  - Good knowledge of Python.
  - Good knowledge of Linux.
  - Knowledge of Deep Learning.
  - Experience in Machine Learning techniques applied to NLP.
  - Experience or knowledge in corpus annotation and generation of linguistic resources.
  - Understanding of data administration and management functions (transfer, storage, analysis, distribution, exploration, etc.).
- Additional Knowledge and Professional Experience
  - Broad theoretical knowledge of AI techniques.
  - Knowledge of HPC workload managers such as Slurm.
  - Knowledge of Continuous Integration/Delivery/Deployment, including tools such as (or similar to) GitLab CI, GitHub, Docker and/or Ansible.
  - Experience in machine learning and data mining, including knowledge of PyTorch, TensorFlow, OpenCV, Pandas, Scikit-learn and/or NumPy.
  - Basic knowledge of GPU-based computing.
  - Fluency in spoken and written English.
  - Experience in web/data scraping.
  - Expertise in building and maintaining data-curation pipelines.
- Competences
  - Capacity to explore new research lines.
  - Ability to work independently and collaboratively within multidisciplinary teams.
  - Proactive, detail-oriented mindset, capable of problem-solving in complex data contexts.
  - Good communication and presentation skills.
  - Commitment to deadlines and quality research output.

**Conditions**
- The position will be located at B
