Context And Mission
The Language Technologies Laboratory at BSC has consolidated experience in several NLP areas, such as building massive language models, biomedical text mining, machine translation, and unsupervised learning for under-resourced languages and domains. It has been entrusted by the Spanish and Catalan governments to develop fundamental open-source resources and technologies for Spanish and Catalan. In this context, the LT Laboratory is currently in charge of two flagship projects at the national and regional levels: the ALIA Plan, funded by the Spanish Secretariat of Digitalisation and Artificial Intelligence, and the AINA project, funded by the Catalan Digitalisation Department and aimed at developing AI resources for Catalan. In addition, the Unit participates in various EU-funded international projects.
The LT Laboratory at BSC is looking for a Full-stack Machine Learning Engineer/Developer with experience in Language Technologies, specifically in deploying and fine-tuning Machine Learning models and Large Language Models, as well as in front-end and back-end web development. The role will contribute to the Laboratory's efforts to drive widespread adoption of modern AI technologies for Iberian languages.
The successful candidate will work in a highly sophisticated HPC environment, have access to state-of-the-art systems and computational infrastructures, and establish collaborations with experts in different areas at the local and international levels.
Key Duties
* Design and implement proof-of-concept NLP applications using LLMs and other AI resources.
* Prepare model training and fine-tuning on HPC clusters.
* Curate training and benchmarking datasets to measure application performance.
* Evaluate the quality of the models and datasets.
* Document and publish the models on open platforms.
* Manage projects in collaboration with external partners to showcase our technologies.
Requirements
* Education
o A degree in Computer Science, Telecommunications, Applied Linguistics or related disciplines.
* Essential Knowledge and Professional Experience
o At least 3 years of demonstrated experience with deep learning or ML frameworks and in the relevant area(s).
o Demonstrated experience coding NLP applications in Python and other programming languages.
o Knowledge of version control tools and best practices.
o Native or good level of spoken and written English.
* Additional Knowledge and Professional Experience
o Demonstrated experience in developing open-source software and resources.
o Demonstrated experience in working in dynamic ML teams.
o Native or good level of spoken and written Catalan and/or Spanish.
o Familiarity with Transformers, Gensim, Gradio, the Hugging Face ecosystem, PyTorch, LangChain and other NLP, UI and ML libraries.
o Strong understanding of linguistic concepts.
* Competences
o Ability to work independently and in a team to complete tasks on schedule.
o Ability to work under set deadlines.
Conditions
* The position will be located at BSC within the Life Sciences Department
* We offer a full-time contract, a good and highly stimulating working environment with state-of-the-art infrastructure, flexible working hours, an extensive training plan, restaurant tickets, private health insurance, and support with relocation procedures
* Duration: Open-ended contract due to technical and scientific activities linked to the project and budget duration
* Holidays: 23 paid vacation days plus the 24th and 31st of December, per our collective agreement
* Salary: we offer a competitive salary commensurate with the qualifications and experience of the candidate and according to the cost of living in Barcelona
* Starting date: as soon as possible