Image
Online since 2025-03-12

Position as Data Scientist (m/f/d) – Pilot Project Lignin Chemistry

Image
Deadline
31.03.2025
Image
Start of work
as soon as possible
Image
Working time model
Full time

About us

The Center for the Transformation of Chemistry (CTC) aims to reimagine the chemical industry as sustainable and resilient. Divided into 5 interconnected thematic areas; Automation & Standardization, Data-driven Chemistry, Recycling, Renewable Feedstocks, and Societal-Environmental-Economic Metrics (SEEM); the CTC has identified strong pillars to guide its scientific efforts towards the transformation of chemistry.

The CTC Science Team is currently planning multiple pilot projects and looking for ambitious people taking the lead.  

We are seeking a highly motivated data scientist to lead the development of a comprehensive database of lignin-derived compounds. As a Data Scientist (m/f/d) – Pilot Project Lignin Chemistry, you will leverage machine learning and large language models to automate data extraction, curation, and prediction of lignin-derived compounds. This role offers a unique opportunity to conduct pioneering research at the intersection of computational science, data management, and sustainable chemistry. The successful candidate will collaborate closely with an interdisciplinary team of chemists, data scientists, and engineers to accelerate lignin valorization and drive innovation in the biorefinery sector.

As part of CTC’s Science team, you’ll be located in Leuna in the greater Leipzig/Halle area.

Your tasks and qualifications

  • Harness large language models to extract and synthesize lignin-related information from scientific literature (e.g., reaction conditions, yields, and catalysts).
  • Develop and apply machine learning models for data curation, predictive analytics, and structure-property relationship modeling.
  • Design and implement an open-access database cataloging lignin-derived compounds, including chemical properties, reaction pathways, and potential applications.
  • Independently analyze scientific results, troubleshoot computational approaches, and design innovative workflows.
  • Interact iteratively with the CTC’s scientists to define the users’ requirements and develop tailored solutions according to the peculiar features of the generated data.
  • Present research findings at conferences and publish in scientific journals.

Your profile

  • Successfully completed degree (at least a Master’s, ideally PhD) in Chemistry, Chemical Engineering, Data Science, Computational Chemistry, or a related field.
  • Strong experience in programming languages such as Python.
  • Proven experience in machine learning (e.g., random forests, neural networks, and gradient boosting) and natural language processing.
  • Experience with cheminformatics tools (e.g., RDKit, PubChem) and database management systems (e.g., SQL, NoSQL).
  • Knowledge of computational tools for reaction network visualization (e.g., Cytoscape, Gephi).
  • Hands-on experience in biomass valorization is a plus.
  • Willingness to learn new tools and technologies as required.
  • Excellent communication, collaboration, and leadership skills to engage with internal and external stakeholders.

What we offer

  • The opportunity to perform your research at what will be the largest research center for chemical research in Europe.
  • Benefit from a large network: Meet and work with top experts from academia and industry.
  • Competitive compensation according to TVöD-Bund up to E13 depending on the candidate’s qualifications.
  • Partial compensation of Job Commute Ticket.
  • Comprehensive social benefits, including 30 days of vacation per year, annual special payment, and additional pension scheme (VBL).
  • Term-limited position until the 31st of December, 2025 allowing for fresh perspectives in terms of contract extension after 2025.
Back to overview E-mail application Online application