Author Image

Hi, I am Lukas

Lukas Schäfer

PhD Student at University of Edinburgh

I’ll be on the job market for post-doc and research scientist positions in summer/ fall 2024. Please reach out if you think I am a good fit / know of opportunities!

I am a Data Science and Artificial Intelligence PhD student from Germany working on multi-agent reinforcement learning at the University of Edinburgh, where I am supervised by Stefano Albrecht and Amos Storkey.

Stefano, Filippos and myself wrote a textbook on multi-agent reinforcement learning. The book will be published with MIT Press and is already available at www.marl-book.com!

Reinforcement learning is often sample inefficient, and prone to overfitting, leading to behaviour which often does not generalise across tasks. These challenges are further exacerbated in multi-agent systems in which multiple agents interact with each other in a shared environment. Guided by these challenges of efficiency and generalisation, my PhD research focuses on efficient and generalisable reinforcement learning in multi-agent systems.

Previously, I interned at at Microsoft Research Cambridge, where I worked with the Game Intelligence lab under the supervision of Sam Devlin, at Huawei Noah’s Ark Lab and Dematic. At MSR, I researched visual encoders for imitation learning in video games, including vision foundation models. At Huawei, I was part of the multi-agent team researching novel exploration techniques for multi-agent reinforcement learning under the supervision of David Mguni. At Dematic, I researched and developed AI solutions for large-scale warehouse automation.

Contact: l.schaefer [at] ed.ac.uk

Research
Reinforcement Learning
Deep Learning
Teaching
Python
SE

News

Dec 06, 2023

Nov 01, 2023

📢 It is DONE! Our textbook “Multi-Agent Reinforcement Learning: Foundations and Modern Approaches” is now with MIT Press and available on the official webpage www.marl-book.com! The print release is scheduled for late 2024.

Oct 30, 2023

📃 Our work Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning will be presented at the Workshop on Generalization in Planning at the Conference on Neural Information Processing Systems (NeurIPS) 2023!

May 29, 2023

📢 I`m very excited to announce that the first pre-print non-final PDF of our book “Multi-Agent Reinforcement Learning: Foundations and Modern Approaches” is now released and available on the official webpage www.marl-book.com!

Apr 14, 2023

📃 Our works, Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning and Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments, will be presented at the Adaptive and Learning Agents (ALA) Workshop at the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) 2023!

Apr 03, 2023

📢 I’m very excited to start a research internship with the Game Intelligence lab at Microsoft Research Cambridge!

Sep 14, 2022

📃 Our work, Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning, has been accepted at the Neural Information Processing Systems Conference (NeurIPS) 2022!

Aug 27, 2022

📢 Excited to announce that I was selected to attend the upcoming 9th Heidelberg Laureate Forum!

Publications

Stefano V. Albrecht, Filippos Christianos, Lukas Schäfer (equal authorship) (2024)
To be published by MIT Press (print scheduled for fall 2024)
Lukas Schäfer, Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, Dave Bignell, Siddhartha Sen, Andrea Treviño Gavito, Sam Devlin (2023)
arXiv
Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht (2023)
Workshop on Generalization in Planning in the Conference on Neural Information Processing Systems (NeurIPS), 2023
Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni (2023)
Adaptive and Learning Agents (ALA) Workshop in the International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2023
Alain Andres, Lukas Schäfer, Esther Villar-Rodriguez, Stefano V.Albrecht, Javier Del Ser (2023)
Adaptive and Learning Agents (ALA) Workshop in the International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2023
Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht (2023)
Transactions on Machine Learning Research (TMLR) Journal
Rujie Zhong, Duohan Zhang, Lukas Schäfer, Stefano V. Albrecht, Josiah P. Hanna (2022)
Conference on Neural Information Processing Systems (NeurIPS), 2022
Lukas Schäfer (2022)
Doctoral Consortium at the International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2022
Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht (2022)
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2022
Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht (2022)
AI Communications Special Issue on Multi-Agent Systems Research in the UK
Aleksandar Krnjaic, Raul D. Steleac, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Andrew Wing Keung To, Kuan-Ho Lao, Murat Cubuktepe, Matthew Haley, Peter Börsting, Stefano V. Albrecht (2023)
arXiv
Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht (2021)
Conference on Neural Information Processing Systems (NeurIPS), 2021 - Datasets and Benchmarks track
Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht (2021)
Unsupervised Reinforcement Learning (URL) Workshop in the International Conference on Machine Learning, 2021
Rujie Zhong, Josiah P. Hanna, Lukas Schäfer, Stefano V. Albrecht (2021)
Workshop on Offline Reinforcement Learning in the Conference on Neural Information Processing Systems, 2021
Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht (2021)
Workshop on Adaptive and Learning Agents in the International Conference on Autonomous Agents and Multiagent Systems, 2021
Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht (2021)
arXiv
Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht (2020)
Conference on Neural Information Processing Systems (NeurIPS), 2020

Experiences

1
Research Scientist Intern
Microsoft Research

Apr 2023 - Oct 2023, Cambridge

Responsibilities:
  • Researching visual encoders for imitation learning in video games under the supervision of Sam Devlin.

Young Research Attendee
Heidelberg Laureate Forum

Sep 2022 - Sep 2022, Heidelberg

The Heidelberg Laureate Forum brings together the most exceptional mathematicians and computer scientists of their generations. Each year, the recipients of the most prestigious awards in mathematics and computer science, the Abel Prize, ACM A.M. Turing Award, ACM Prize in Computing, Fields Medal, IMU Abacus Medal and Nevanlinna Prize, meet 200 selected young researchers from all over the world. Participants spend a week interacting and networking in a relaxed atmosphere designed to encourage scientific exchange.

2

3
Research Scientist Intern
Huawei Noah's Ark Lab

Jul 2022 - Dec 2022, London

The Noah’s Ark Lab is the AI research center for Huawei Technologies, working towards significant contributions to both the company and society by innovating in artificial intelligence, data mining and related fields.

Responsibilities:
  • Researched ensemble models for exploration in multi-agent reinforcement learning with the RL and multi-agent team under the supervision of David Mguni.
  • Submitted a publication as the result of the internship to a top-tier machine learning conference (under review). A preprint is available on arXiv (https://arxiv.org/abs/2302.03439).

Research Intern
Dematic

Nov 2020 - Mar 2021, Remote

Dematic is global player focused on design and implementation of automated system solutions for warehouses, distribution centres and production facilities.

Responsibilities:
  • Applying state-of-the-art AI technology to enable a prototype for automation of large-scale robotic warehouse logistics.
4

5
HYPED

Sep 2018 - Aug 2020, Edinburgh

HYPED is a team of students at the University of Edinburgh dedicated to developing the Hyperloop concept and inspiring future generations about engineering. HYPED has received awards from SpaceX, Virgin Hyperloop One and Institution of Civil Engineers.

Navigation Advisor

Sep 2019 - Aug 2020

  • Advising navigation team on the adaptation and implementation of improved sensor and filtering techniques
Navigation Engineer

Sep 2018 - Aug 2019

  • Developing navigation system of “The Flying Podsman” Hyperloop prototype using sensor filtering, processing and control techniques to estimate location, orientation and speed of the pod
  • Finalist for the SpaceX 2019 Hyperloop competition in California in Summer 2019

Education

University of Edinburgh
Ph.D in Data Science and Artificial Intelligence
Project: Sample Efficiency and Generalisation in Multi-Agent Reinforcement Learning
Supervisors: Stefano V. Albrecht (primary) and Amos Storkey (secondary)
Funding: Principal's Career Development Scholarship from the University of Edinburgh
Key Areas: Reinforcement Learning, Multi-Agent Systems, Generalisation, Exploration, Intrinsic Rewards
2019-Present
University of Edinburgh
M.Sc. in Informatics
CGPA: 77.28%
Funding: DAAD (German Academic Exchange Service) graduate scholarship & Stevenson Exchange Scholarship
Taken Courses
Course NameObtained Credit
Reinforcement Learning10 (82%)
Algorithmic Game Theory and its Applications10 (98%)
Machine Learning and Pattern Recognition20 (64%)
Probabilistic Modelling and Reasoning20 (75%)
Decision Making in Robots and Autonomous Agents10 (86%)
Robotics: Science and Systems20 (87%)
Natural Computing10 (84%)
Informatics Project Proposal10 (73%)
Informatics Research Review10 (72%)
Extracurricular Activities
  • Active position as navigation engineer for HYPED.
  • Participation in GEAS roleplaying society.
  • Participation in EUKC - Edinburgh University Kendo Club.
2018-2019
Saarland University
B.Sc. in Informatics
German scale: 1.2
Taken Courses
Course NameObtained Credit
Automated Planning9 (1.0)
Admissible Search Enhancements7 (1.0)
Information Retrieval and Data Mining9 (1.7)
Neural Networks: Implementation and Application6 (2.0)
Artificial Intelligence9 (1.7)
Software Engineering9 (1.3)
Modern Imperative Programming Languages5 (1.3)
Concurrent Programming6 (2.7)
Fundamentals of Data Structures and Algorithms6 (1.7)
Information Systems6 (1.3)
Introduction to Theoretical Computer Science9 (1.0)
System Architecture9 (1.0)
Mathematics for Computer Scientists I9 (1.0)
Mathematics for Computer Scientists II9 (2.3)
Mathematics for Computer Scientists III9 (1.7)
Programming I9 (1.0)
Programming II9 (1.0)
Japanese Foundations - Shokyu I6 (1.3)
Japanese Foundations - Shokyu II6 (2.0)
Japanese Applied Geography5 (1.0)
Japanese History II5 (1.0)
Extracurricular Activities
  • Japanese language and cultural studies as minor subject.
2015-2018
Warndtgymnasium
Higher Secondary School Certificate
German scale: 1.0
Awards:
  • School year's best student award
  • Computer Science award of Saarland University
  • Mathematics award of Saarland University
  • History award of Historic Society for the Saar-Region
2008-2015

Teaching Experience

Teaching Assistant

Oct 2019 - June 2022, School of Informatics, University of Edinburgh

Teaching assistant, demonstrator and marker for three iterations of the Reinforcement Learning lecture at the University of Edinburgh under Dr. Stefano V. Albrecht

  • Holding lectures on implementation of RL systems and Deep RL
  • Designing RL project covering wide range of topics including dynamic programming, single- and multi-agent RL as well as deep RL
  • Marking project and exam for reinforcement learning course
  • Advising students on various challenges regarding lecture material and content

Visiting Student Co-Supervision

Jun 2022 - Present, School of Informatics, University of Edinburgh

Co-supervised visiting PhD student project at the University of Edinburgh

  • Supervision through regular meetings discussing research project and ideating novel solutions
  • Assisted project towards a successful workshop publication at the ALA workshop at AAMAS 2023

M.Sc. Student Supervision

Feb 2021 - Aug 2021, School of Informatics, University of Edinburgh

Co-supervised final Masters students’ projects at the University of Edinburgh

  • Co-supervised two M.Sc. students through project proposal, refinement and execution towards final thesis
  • Assisted M.Sc. student from their thesis towards a successful workshop publication at NeurIPS 2021, and a successful main conference publication at NeurIPS 2022.

Lecturer and Coach

Sep 2017 - Oct 2017, Mathematics Preparation Course, Saarland University

Voluntary lecturer and coach for the mathematics preparation course preparing upcoming computer science undergraduate students for their studies

  • Assisted the organisation of the mathematics preparation course for upcoming computer science students aiming to introduce them to foundational mathematical concepts, the university and student life as a whole
  • Introduced ∼250 participants to the importance of mathematics for computer science, formal languages and predicate logic in daily lectures of the first week
  • Supervised two groups to provide feedback and further assistance in daily coaching-sessions
  • The course received the BESTE-award for special student commitment 2017 at Saarland University

Teaching Assistant

Oct 2016 - Mar 2017, Dependable Systems and Software Chair, Saarland University

Tutor for the Programming 1 lecture about functional programming at the Dependable Systems and Software Group chair of Saarland University under Prof. Dr. Holger Hermanns

  • Taught first-year students fundamental concepts of functional programming, basic complexity theory and inductive correctness proofs in weekly tutorials and office hours
  • Corrected weekly tests as well as mid- and endterm exams
  • Collectively created learning materials and discussed student progress as part of the whole teaching team

Reviewing

Conferences
  • 2023: NeurIPS, NeurIPS Datasets and Benchmark Track, ICML, AAMAS
  • 2022: NeurIPS, NeurIPS Datasets and Benchmark Track, ICML (top 10% outstanding reviewer award), AAMAS
  • 2021: NeurIPS
Workshops
Pre-registration experiment workshop at NeurIPS 2020

Skills

Recent Posts