Research Engineer for Language Technologies (RE1)
Position
Research Engineer for Language Technologies (RE1)
Closing Date
Tuesday, 20 May 2025
About BSC
The Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS) is the leading supercomputing center in Spain. It houses MareNostrum, one of the most powerful supercomputers in Europe, and is involved in European HPC initiatives. BSC's mission is to research, develop, and manage information technologies to facilitate scientific progress, combining HPC services and R&D across various scientific disciplines.
Research Context
The Language Technologies Unit at BSC specializes in NLP areas such as language model building, biomedical text mining, machine translation, and unsupervised learning for under-resourced languages. It manages national and regional projects like ALIA and AINA, and participates in EU-funded international projects.
Role Description
The ideal candidate will have experience in Deep Learning and NLP and will work in a high-performance computing environment to develop language modeling techniques and evaluation methods.
Key Duties
1. Design and develop solutions for research projects in collaboration with team members.
2. Generate and evaluate language models using Deep Learning techniques such as Transformers and RNNs.
3. Focus on research and deployment of post-training strategies, including instruction processing, algorithm design, and evaluation of instructed LLMs.
Requirements
Education
* Degree in Computer Science, Mathematics, or related fields.
* Master's Degree in Deep Learning, NLP, or related areas.
Essential Knowledge and Experience
* Proficiency in Python and Linux.
* Experience with Deep Learning techniques.
Additional Knowledge and Experience
* Published research in NLP, especially on Large Language Models.
* Knowledge of mathematics and statistics for Machine Learning.
* Understanding of AI techniques.
* Experience with HPC workload managers like Slurm.
* Familiarity with CI/CD tools such as GitLab CI, GitHub, Docker, or Ansible.
* Basic knowledge of C++, JavaScript/Node.js, C#, MATLAB, or Java.
* Experience with PyTorch, TensorFlow, pandas, scikit-learn, or NumPy.
* Knowledge of GPU computing and multi-GPU parallelization.
* Experience in fine-tuning Large Language Models and post-training techniques.
* Fluency in Spanish and English; knowledge of other Spanish co-official languages is a plus.
Competencies
* Research exploration skills.
* Good communication and presentation skills.
* Teamwork and pair programming abilities.
Conditions
* Location: BSC, Life Sciences Department.
* Full-time contract, flexible hours, training, benefits, and relocation support.
* Start date: 1 June 2025.
Application Process
Apply through the BSC website with a CV, a motivation letter, references, and any optional supporting documents. The selection process includes a CV review and interviews assessing technical and personal competencies. The panel is gender-balanced, and feedback is provided to all candidates.
We promote diversity and inclusion and are committed to equal opportunity employment.