Gideon Stein, M.Sc.

Address: | Computer Vision Group Department of Mathematics and Computer Science Friedrich Schiller University of Jena Ernst-Abbe-Platz 2 07743 Jena Germany |
Phone: | +49 (0) 3641 9 46425 |
E-mail: | gideon (dot) stein (at) uni-jena (dot) de |
Room: | 1221 |
Links: |
Curriculum Vitae
Since 2021 | PhD student, Friedrich Schiller University of Jena and iDiv Research topic: “Causal Reasoning and Deep Learning for Understanding Changes in the Soil-Plant-Climate Interactions” |
---|---|
2020 | Master Thesis: “Transformer based action sequence generation in reinforcement learning settings” at ITMO University |
2018-2020 | M.Sc. in Machine Learning & Data Analysis at the ITMO University, St. Petersburg, Russia with focus on Machine Learning |
2018 | Bachelor Thesis: “Reinforcement learning and the provision of public goods” at University Bayreuth |
2014-2018 | B.A. in Philosophy and Economics at University Bayreuth |
Research Interests
- Deep Learning
- Causality
- Interdisciplinarity
- Language Generation
- Reinforcement Learning
Publications
2023
Yuanyuan Huang, Gideon Stein, Olaf Kolle, Karl Kuebler, Ernst-Detlef Schulze, Hui Dong, David Eichenberg, Gerd Gleixner, Anke Hildebrandt, Markus Lange, Christiane Roscher, Holger Schielzeth, Bernhard Schmid, Alexandra Weigelt, Wolfgang W. Weisser, Maha Shadaydeh, Joachim Denzler, Anne Ebeling, Nico Eisenhauer:
Plant diversity stabilizes soil temperature.
bioRxiv. pp. 2023-03. 2023.
[bibtex] [pdf]
Plant diversity stabilizes soil temperature.
bioRxiv. pp. 2023-03. 2023.
[bibtex] [pdf]
2022
Clemence Dubois, Jannik Jänichen, Maha Shadaydeh, Gideon Stein, Alexandra Katz, Daniel Klöpper, Joachim Denzler, Christiane Schmullius, Katja Last:
KI4KI: Neues Projekt zur regelmässigen Überwachung von Stauanlagen aus dem All.
Messtechnische Überwachung von Stauanlagen ; XII.Mittweidaer Talsperrentag. Pages 15-19. 2022.
[bibtex] [web] [doi] [abstract]
KI4KI: Neues Projekt zur regelmässigen Überwachung von Stauanlagen aus dem All.
Messtechnische Überwachung von Stauanlagen ; XII.Mittweidaer Talsperrentag. Pages 15-19. 2022.
[bibtex] [web] [doi] [abstract]
Die Überwachung von Staubauwerken stellt Stauanlagenbetreiber vor viele Herausforderungen. Insbesondere aufgrund der Kosten und des Zeitaufwandes werden Staubauwerke oft nur ein- bis zweimal im Jahr durch trigonometrische Messungen überwacht. Seit einigen Jahrzehnten liefern jedoch Radarsatellitendaten nützliche Informationen zum Infrastrukturmonitoring. Satellitendaten der Copernicus Sentinel-1 Mission erlauben es, mittels der Technik der Persistent Scatterer Interferometrie (PSI), Deformationsmessungen von Staubauwerken im Millimeterbereich mit einem zeitlichen Abstand von 6 bis 12 Tagen durchzuführen. In einem Verbundprojekt zwischen der Friedrich-Schiller-Universität Jena und dem Ruhrverband soll ein Dienst entwickelt werden, der bisherige Überwachungsstrategien der Anlagen durch Nutzung der PSI Technik verbessert. Zudem sollen neuartige Geräte genutzt werden, die die Sichtbarkeit der Stauanlagen im Satellitenbild erhöhen sowie Methoden der künstlichen Intelligenz genutzt werden, um Deformationen im Falle von Extremwetterereignissen besser vorhersagen zu können.
Sven Festag, Gideon Stein, Tim Büchner, Maha Shadaydeh, Joachim Denzler, Cord Spreckelsen:
Outcome Prediction and Murmur Detection in Sets of Phonocardiograms by a Deep Learning-Based Ensemble Approach.
Computing in Cardiology (CinC). 2022.
[bibtex] [web] [abstract]
Outcome Prediction and Murmur Detection in Sets of Phonocardiograms by a Deep Learning-Based Ensemble Approach.
Computing in Cardiology (CinC). 2022.
[bibtex] [web] [abstract]
We, the team UKJ-FSU, propose a deep learning system for the prediction of congenital heart diseases. Our method is able to predict the clinical outcomes (normal, abnormal) of patients as well as to identify heart murmur (present, absent, unclear) based on phonocardiograms recorded at different auscultation locations. The system we propose is an ensemble of four temporal convolutional networks with identical topologies, each specialized in identifying murmurs and predicting patient outcome from a phonocardiogram taken at one specific auscultation location. Their intermediate outputs are augmented by the manually ascertained patient features such as age group, sex, height and weight. The outputs of the four networks are combined to form a single final decision as demanded by the rules of the George B. Moody PhysioNet Challenge 2022. On the first task of this challenge, the murmur detection, our model reached a weighted accuracy of 0.567 with respect to the unknown validation set. On the outcome prediction task (second task) the ensemble led to a mean outcome cost of 10679. By focusing on the clinical outcome prediction and tuning some of the hyper-parameters only for this task, our model reached a validation score of 9031.
2020
Arip Asadulaev, Igor Kuznetsov, Gideon Stein, Andrey Filchenkov:
Exploring and Exploiting Conditioning of Reinforcement Learning Agents.
IEEE Access. 8 : pp. 211951-211960. 2020.
[bibtex] [abstract]
Exploring and Exploiting Conditioning of Reinforcement Learning Agents.
IEEE Access. 8 : pp. 211951-211960. 2020.
[bibtex] [abstract]
The outcome of Jacobian singular values regularization was studied for supervised learning problems. In supervised learning settings for linear and nonlinear networks, Jacobian regularization allows for faster learning. It also was shown that Jacobian conditioning regularization can help to avoid the “mode-collapse” problem in Generative Adversarial Networks. In this paper, we try to answer the following question: Can information about policy network Jacobian conditioning help to shape a more stable and general policy of reinforcement learning agents? To answer this question, we conduct a study of Jacobian conditioning behavior during policy optimization. We analyze the behavior of the agent conditioning on different policies under the different sets of hyperparameters and study a correspondence between the conditioning and the ratio of achieved rewards. Based on these observations, we propose a conditioning regularization technique. We apply it to Trust Region Policy Optimization and Proximal Policy Optimization (PPO) algorithms and compare their performance on 8 continuous control tasks. Models with the proposed regularization outperformed other models on most of the tasks. Also, we showed that the regularization improves the agent's generalization by comparing the PPO performance on CoinRun environments. Also, we propose an algorithm that uses the condition number of the agent to form a robust policy, which we call Jacobian Policy Optimization (JPO). It directly estimates the condition number of an agent's Jacobian and changes the policy trend. We compare it with PPO on several continuous control tasks in PyBullet environments and the proposed algorithm provides a more stable and efficient reward growth on a range of agents.