Research Data Steward / Data Architect | Software Engineer (m/w/d)
Eberhard Karls Universität Tübingen
Job Summary
This role involves building and managing a modern High-Performance Computing (HPC) architecture dedicated to large-scale scientific research and foundation models within the Machine Learning Science Cloud at the University of Tübingen. You will be responsible for the entire data lifecycle, from project planning and curation to building distributed data loading pipelines and ensuring long-term archiving. A primary focus is implementing a FAIR data strategy to support diverse research projects in fields like climate science and large language models. You will work in a highly collaborative, international environment, acting as a bridge between researchers and technical infrastructure. The position offers the unique opportunity to support cutting-edge AI experiments by designing scalable, high-performance storage strategies and data governance policies. It is an ideal role for a data professional who enjoys both technical implementation and strategic advisory within a vibrant academic ecosystem.
Required Skills
Education
Master's degree in Information Technology, Applied Computer Science, Computer Engineering, or a comparable degree.
Experience
- Professional experience in data engineering and designing data pipelines
- Experience working with High-Performance Computing (HPC) clusters
- Experience with Linux operating systems and advanced shell scripting
- Proven experience with research data management and the application of FAIR principles
- Technical experience with automation, configuration management, and containerization technologies
- Experience in data governance, metadata schema creation, and lifecycle management
Languages
Additional
- The position is a fixed-term contract until December 31, 2032. Candidates must be able to coordinate with legal and compliance departments regarding data protection requirements.