TY - JOUR A1 - Rübbelke, Dirk A1 - Vögele, Stefan A1 - Grajewski, Matthias A1 - Zobel, Luzy T1 - Cross border adjustment mechanism: Initial data for the assessment of hydrogen-based steel production JF - Data in Brief N2 - Ambitious climate targets affect the competitiveness of industries in the international market. To prevent such industries from moving to other countries in the wake of increased climate protection efforts, cost adjustments may become necessary. Their design requires knowledge of country-specific production costs. Here, we present country-specific cost figures for different production routes of steel, paying particular attention to transportation costs. The data can be used in floor price models aiming to assess the competitiveness of different steel production routes in different countries (Rübbelke, 2022). KW - Energy-intensive industry KW - Steel industry KW - Competitiveness KW - Floor prices KW - Cross border adjustment mechanism Y1 - 2023 U6 - https://doi.org/10.1016/j.dib.2023.108907 SN - 2352-3409 VL - 47 IS - Article 108907 SP - 1 EP - 5 PB - Elsevier CY - Amsterdam ER - TY - CHAP A1 - Engelmann, Ulrich M. A1 - Baumann, Martin ED - Herbig, Nicola ED - Poppelreuter, Stefan T1 - Moderationsexpertise für QMBs – die Methoden T2 - Qualitätsmanagement im Gesundheitswesen N2 - To moderate effectively and professionally, you should know the appropriate methods. With the right methods, you can lead discussions, resolve conflicts, motivate the participants, and ensure that the goals of the event are achieved. They also help you create a positive atmosphere and hold the participants' interest. In this second article of the multi-part series, you will learn the basic methods for running successful team sessions, working group meetings, kick-offs, and meetings. Y1 - 2023 SN - 978-3-8249-0714-4 SP - Kapitel 10815 PB - TÜV-Verlag CY - Köln ET - 60. 
Update ER - TY - JOUR A1 - Bialonski, Stephan A1 - Grieger, Niklas T1 - Der KI-Chatbot ChatGPT: Eine Herausforderung für die Hochschulen JF - Die neue Hochschule N2 - Essays, poems, program code: ChatGPT automatically generates texts at a previously unattained level of quality. This system and its successors will lastingly change more than just the academic world. Y1 - 2023 U6 - https://doi.org/10.5281/zenodo.7533758 SN - 0340-448X VL - 2023 IS - 1 SP - 24 EP - 27 PB - HLB CY - Bonn ER - TY - JOUR A1 - Gaigall, Daniel ED - AitSahlia, Farid T1 - Allocating and forecasting changes in risk JF - Journal of risk N2 - We consider time-dependent portfolios and discuss the allocation of changes in the risk of a portfolio to changes in the portfolio’s components. For this purpose we adopt established allocation principles. We also use our approach to obtain forecasts for changes in the risk of the portfolio’s components. To put the approach into practice we present an implementation based on the output of a simulation. Allocation is illustrated with an example portfolio in the context of Solvency II. The quality of the forecasts is investigated with an empirical study. KW - portfolio risk KW - allocation KW - forecast KW - covariance principle KW - conditional expectation principle Y1 - 2023 U6 - https://doi.org/10.21314/JOR.2022.048 SN - 1755-2842 SN - 1465-1211 VL - 25 IS - 3 SP - 1 EP - 24 PB - Infopro Digital Risk CY - London ER - TY - JOUR A1 - Gaigall, Daniel T1 - On the applicability of several tests to models with not identically distributed random effects JF - Statistics : A Journal of Theoretical and Applied Statistics N2 - We consider Kolmogorov–Smirnov and Cramér–von-Mises type tests for testing central symmetry, exchangeability, and independence. In the standard case, the tests are intended for the application to independent and identically distributed data with unknown distribution. 
The tests are available for multivariate data and bootstrap procedures are suitable to obtain critical values. We discuss the applicability of the tests to random effects models, where the random effects are independent but not necessarily identically distributed and with possibly unknown distributions. Theoretical results show the adequacy of the tests in this situation. The quality of the tests in models with random effects is investigated by simulations. Empirical results obtained confirm the theoretical findings. A real data example illustrates the application. KW - central symmetry test KW - exchangeability test KW - independence test KW - random effects KW - not identically distributed Y1 - 2023 SN - 0323-3944 U6 - https://doi.org/10.1080/02331888.2023.2193748 SN - 1029-4910 VL - 57 PB - Taylor & Francis CY - London ER - TY - CHAP A1 - Stollenwerk, Dominik A1 - Franzke, Till A1 - Maurer, Florian A1 - Reinkensmeier, Sebastian A1 - Kim, Franken A1 - Tambornino, Philipp A1 - Haas, Florian A1 - Rieke, Christian A1 - Hermanuz, Andreas A1 - Borchert, Jörg A1 - Ritz, Thomas A1 - Sander, Volker ED - Proff, Heike T1 - Smarte Ladesäulen : Netz- und Marktdienliches öffentliches Laden T2 - Towards the New Normal in Mobility : Technische und betriebswirtschaftliche Aspekte N2 - As of 1 January 2022, 618,460 electrically powered motor vehicles are registered in Germany. In total, 48,540,878 motor vehicles are currently registered, which corresponds to an electric mobility share of approx. 1.2 %. Currently, electric vehicles are connected to the power grid via charging stations or sockets and are usually charged at the full charging capacity of the connection until the vehicle's battery management system reduces the charging power depending on the battery's state of charge. 
Y1 - 2023 SN - 978-3-658-39437-0 (Print) SN - 978-3-658-39438-7 (Online) U6 - https://doi.org/10.1007/978-3-658-39438-7_18 SP - 287 EP - 304 PB - Springer Gabler CY - Wiesbaden ER - TY - JOUR A1 - Gaigall, Daniel A1 - Gerstenberg, Julian T1 - Cramér-von-Mises tests for the distribution of the excess over a confidence level JF - Journal of Nonparametric Statistics N2 - The Cramér-von-Mises distance is applied to the distribution of the excess over a confidence level. Asymptotics of related statistics are investigated, and it is seen that the obtained limit distributions differ from the classical ones. For that reason, quantiles of the new limit distributions are given and new bootstrap techniques for approximation purposes are introduced and justified. The results motivate new one-sample goodness-of-fit tests for the distribution of the excess over a confidence level and a new confidence interval for the related fitting error. Simulation studies investigate size and power of the tests as well as coverage probabilities of the confidence interval in the finite sample case. A practice-oriented application of the Cramér-von-Mises tests is the determination of an appropriate confidence level for the fitting approach. The adoption of the idea to the well-known problem of threshold detection in the context of peaks over threshold modelling is sketched and illustrated by data examples. 
KW - Cramér-von-Mises test KW - conditional excess distribution KW - confidence interval KW - goodness-of-fit test Y1 - 2023 U6 - https://doi.org/10.1080/10485252.2023.2173958 SN - 1048-5252 (Print) SN - 1029-0311 (Online) PB - Taylor & Francis ER - TY - JOUR A1 - Liphardt, Anna-Maria A1 - Fernandez-Gonzalo, Rodrigo A1 - Albracht, Kirsten A1 - Rittweger, Jörn A1 - Vico, Laurence T1 - Musculoskeletal research in human space flight – unmet needs for the success of crewed deep space exploration JF - npj Microgravity N2 - Based on the European Space Agency (ESA) Science in Space Environment (SciSpacE) community White Paper “Human Physiology – Musculoskeletal system”, this perspective highlights unmet needs and suggests new avenues for future studies in musculoskeletal research to enable crewed exploration missions. The musculoskeletal system is essential for sustaining physical function and energy metabolism, and the maintenance of health during exploration missions, and consequently mission success, will be tightly linked to musculoskeletal function. Data collection from current space missions from pre-, during-, and post-flight periods would provide important information to understand and ultimately offset musculoskeletal alterations during long-term spaceflight. In addition, understanding the kinetics of the different components of the musculoskeletal system in parallel with a detailed description of the molecular mechanisms driving these alterations appears to be the best approach to address potential musculoskeletal problems that future exploratory-mission crew will face. These research efforts should be accompanied by technical advances in molecular and phenotypic monitoring tools to provide in-flight real-time feedback. 
Y1 - 2023 U6 - https://doi.org/10.1038/s41526-023-00258-3 SN - 2373-8065 VL - 9 IS - Article number: 9 SP - 1 EP - 9 PB - Springer Nature ER - TY - JOUR A1 - Ringers, Christa A1 - Bialonski, Stephan A1 - Ege, Mert A1 - Solovev, Anton A1 - Hansen, Jan Niklas A1 - Jeong, Inyoung A1 - Friedrich, Benjamin M. A1 - Jurisch-Yaksi, Nathalie T1 - Novel analytical tools reveal that local synchronization of cilia coincides with tissue-scale metachronal waves in zebrafish multiciliated epithelia JF - eLife N2 - Motile cilia are hair-like cell extensions that beat periodically to generate fluid flow along various epithelial tissues within the body. In dense multiciliated carpets, cilia were shown to exhibit a remarkable coordination of their beat in the form of traveling metachronal waves, a phenomenon which supposedly enhances fluid transport. Yet, how cilia coordinate their regular beat in multiciliated epithelia to move fluids remains insufficiently understood, particularly due to lack of rigorous quantification. We combine experiments, novel analysis tools, and theory to address this knowledge gap. To investigate collective dynamics of cilia, we studied zebrafish multiciliated epithelia in the nose and the brain. We focused mainly on the zebrafish nose, due to its conserved properties with other ciliated tissues and its superior accessibility for non-invasive imaging. We revealed that cilia are synchronized only locally and that the size of local synchronization domains increases with the viscosity of the surrounding medium. Even though synchronization is local only, we observed global patterns of traveling metachronal waves across the zebrafish multiciliated epithelium. Intriguingly, these global wave direction patterns are conserved across individual fish, but different for left and right noses, unveiling a chiral asymmetry of metachronal coordination. 
To understand the implications of synchronization for fluid pumping, we used a computational model of a regular array of cilia. We found that local metachronal synchronization prevents steric collisions, i.e., cilia colliding with each other, and improves fluid pumping in dense cilia carpets, but hardly affects the direction of fluid flow. In conclusion, we show that local synchronization together with tissue-scale cilia alignment coincide and generate metachronal wave patterns in multiciliated epithelia, which enhance their physiological function of fluid pumping. Y1 - 2023 U6 - https://doi.org/10.7554/eLife.77701 SN - 2050-084X VL - 12 PB - eLife Sciences Publications ER - TY - CHAP A1 - Engelmann, Ulrich M. A1 - Baumann, Martin ED - Thomann, Hermann ED - Träger, Thomas T1 - Moderationsexpertise für QMBs – die Methoden T2 - Qualitätsmanagement in Dienstleistungsunternehmen N2 - To moderate effectively and professionally, you should know the appropriate methods. With the right methods, you can lead discussions, resolve conflicts, motivate the participants, and ensure that the goals of the event are achieved. They also help you create a positive atmosphere and hold the participants' interest. In this second article of the multi-part series, you will learn the basic methods for running successful team sessions, working group meetings, kick-offs, and meetings. Y1 - 2023 SN - 978-3-8249-0473-0 SP - Kapitel 08631 PB - TÜV-Verlag CY - Köln ER - TY - CHAP A1 - Engelmann, Ulrich M. A1 - Baumann, Martin ED - Lindinger, Markus ED - Bartsch, Oliver T1 - Moderationsexpertise – die Methoden T2 - IT-Servicemanagement N2 - To moderate effectively and professionally, you should know the appropriate methods. 
With the right methods, you can lead discussions, resolve conflicts, motivate the participants, and ensure that the goals of the event are achieved. They also help you create a positive atmosphere and hold the participants' interest. In this second article of the multi-part series, you will learn the basic methods for running successful team sessions, working group meetings, kick-offs, and meetings. Y1 - 2023 SN - 978-3-8249-1154-7 SP - Kapitel 05531 PB - TÜV-Verlag CY - Köln ET - 54. Update ER - TY - CHAP A1 - Engelmann, Ulrich M. A1 - Baumann, Martin ED - Herbig, Nicola ED - Poppelreuter, Stefan T1 - Moderationsexpertise für QMBs – Onlinemoderation T2 - Qualitätsmanagement im Gesundheitswesen N2 - To hold your own as a moderator in increasingly common online events, you should know what online moderation requires in particular. In this third part of the article series, you will learn why online differs from offline. The technical options are presented, along with how to use them. Finally, you will receive tips to keep in mind when speaking online. Y1 - 2023 SN - 978-3-8249-0714-4 SP - Kapitel 10816 PB - TÜV-Verlag CY - Köln ER - TY - CHAP A1 - Engelmann, Ulrich M. ED - Lindinger, Markus ED - Bartsch, Oliver T1 - Moderationsexpertise – Onlinemoderation T2 - IT-Servicemanagement N2 - To hold your own as a moderator in increasingly common online events, you should know what online moderation requires in particular. In this third part of the article series, you will learn why online differs from offline. The technical options are presented, along with how to use them. Finally, you will receive tips to keep in mind when speaking online. 
Y1 - 2023 SN - 978-3-8249-1154-7 SP - Kapitel 05532 PB - TÜV-Verlag CY - Köln ER - TY - JOUR A1 - Bertz, Morten A1 - Molinnus, Denise A1 - Schöning, Michael Josef A1 - Homma, Takayuki T1 - Real-time monitoring of H₂O₂ sterilization on individual Bacillus atrophaeus spores by optical sensing with trapping Raman spectroscopy JF - Chemosensors N2 - Hydrogen peroxide (H₂O₂), a strong oxidizer, is a commonly used sterilization agent employed during aseptic food processing and medical applications. To assess the sterilization efficiency with H₂O₂, bacterial spores are common microbial systems due to their remarkable robustness against a wide variety of decontamination strategies. Despite their widespread use, there is, however, only little information about the detailed time-resolved mechanism underlying the oxidative spore death by H₂O₂. In this work, we investigate chemical and morphological changes of individual Bacillus atrophaeus spores undergoing oxidative damage using optical sensing with trapping Raman microscopy in real-time. The time-resolved experiments reveal that spore death involves two distinct phases: (i) an initial phase dominated by the fast release of dipicolinic acid (DPA), a major spore biomarker, which indicates the rupture of the spore’s core; and (ii) the oxidation of the remaining spore material resulting in the subsequent fragmentation of the spores’ coat. Simultaneous observation of the spore morphology by optical microscopy corroborates these mechanisms. The dependence of the onset of DPA release and the time constant of spore fragmentation on H₂O₂ shows that the formation of reactive oxygen species from H₂O₂ is the rate-limiting factor of oxidative spore death. 
KW - DPA (dipicolinic acid) KW - sterilization KW - Bacillus atrophaeus spores KW - optical trapping KW - Raman spectroscopy KW - optical sensor setup Y1 - 2023 U6 - https://doi.org/10.3390/chemosensors11080445 SN - 2227-9040 N1 - This article belongs to the Special Issue "Biosensors and Chemical Sensors for Food and Healthcare Monitoring—Celebrating the 10th Anniversary" VL - 11 IS - 8 PB - MDPI CY - Basel ER - TY - JOUR A1 - Wendlandt, Tim A1 - Koch, Claudia A1 - Britz, Beate A1 - Liedek, Anke A1 - Schmidt, Nora A1 - Werner, Stefan A1 - Gleba, Yuri A1 - Vahidpour, Farnoosh A1 - Welden, Melanie A1 - Poghossian, Arshak A1 - Schöning, Michael Josef T1 - Facile Purification and Use of Tobamoviral Nanocarriers for Antibody-Mediated Display of a Two-Enzyme System JF - Viruses N2 - Immunosorbent turnip vein clearing virus (TVCV) particles displaying the IgG-binding domains D and E of Staphylococcus aureus protein A (PA) on every coat protein (CP) subunit (TVCVPA) were purified from plants via optimized and new protocols. The latter used polyethylene glycol (PEG) raw precipitates, from which virions were selectively re-solubilized in reverse PEG concentration gradients. This procedure improved the integrity of both TVCVPA and the wild-type subgroup 3 tobamovirus. TVCVPA could be loaded with more than 500 IgGs per virion, which mediated the immunocapture of fluorescent dyes, GFP, and active enzymes. Bi-enzyme ensembles of cooperating glucose oxidase and horseradish peroxidase were tethered together on the TVCVPA carriers via a single antibody type, with one enzyme conjugated chemically to its Fc region, and the other one bound as a target, yielding synthetic multi-enzyme complexes. In microtiter plates, the TVCVPA-displayed sugar-sensing system possessed a considerably increased reusability upon repeated testing, compared to the IgG-bound enzyme pair in the absence of the virus. 
A high coverage of the viral adapters was also achieved on Ta2O5 sensor chip surfaces coated with a polyelectrolyte interlayer, as a prerequisite for durable TVCVPA-assisted electrochemical biosensing via modularly IgG-assembled sensor enzymes. KW - biosensor KW - horseradish peroxidase (HRP) KW - glucose oxidase (GOx) KW - enzyme cascade KW - turnip vein clearing virus (TVCV) KW - tobacco mosaic virus (TMV) Y1 - 2023 U6 - https://doi.org/10.3390/v15091951 SN - 1999-4915 N1 - This article belongs to the Special Issue "Tobamoviruses 2023" VL - 15 IS - 9 PB - MDPI CY - Basel ER - TY - INPR A1 - Bornheim, Tobias A1 - Grieger, Niklas A1 - Blaneck, Patrick Gustav A1 - Bialonski, Stephan T1 - Preprint: Speaker attribution in German parliamentary debates with QLoRA-adapted large language models T2 - Journal for Language Technology and Computational Linguistics N2 - The growing body of political texts opens up new opportunities for rich insights into political dynamics and ideologies but also increases the workload for manual analysis. Automated speaker attribution, which detects who said what to whom in a speech event and is closely related to semantic role labeling, is an important processing step for computational text analysis. We study the potential of the large language model family Llama 2 to automate speaker attribution in German parliamentary debates from 2017-2021. We fine-tune Llama 2 with QLoRA, an efficient training strategy, and observe our approach to achieve competitive performance in the GermEval 2023 Shared Task On Speaker Attribution in German News Articles and Parliamentary Debates. Our results shed light on the capabilities of large language models in automating speaker attribution, revealing a promising avenue for computational analysis of political discourse and the development of semantic role labeling systems. 
Y1 - 2023 U6 - https://doi.org/10.48550/arXiv.2309.09902 N1 - Published version available at: https://doi.org/10.21248/jlcl.37.2024.244 ER - TY - JOUR A1 - Morais, Paulo V. A1 - Suman, Pedro H. A1 - Schöning, Michael Josef A1 - Siqueira Junior, José R. A1 - Orlandi, Marcelo O. T1 - Layer-by-layer film based on Sn₃O₄ nanobelts as sensing units to detect heavy metals using a capacitive field-effect sensor platform JF - Chemosensors N2 - Lead and nickel, as heavy metals, are still used in industrial processes, and are classified as “environmental health hazards” due to their toxicity and polluting potential. The detection of heavy metals can prevent environmental pollution at toxic levels that are critical to human health. In this sense, the electrolyte–insulator–semiconductor (EIS) field-effect sensor is an attractive sensing platform concerning the fabrication of reusable and robust sensors to detect such substances. This study is aimed to fabricate a sensing unit on an EIS device based on Sn₃O₄ nanobelts embedded in a polyelectrolyte matrix of polyvinylpyrrolidone (PVP) and polyacrylic acid (PAA) using the layer-by-layer (LbL) technique. The EIS-Sn₃O₄ sensor exhibited enhanced electrochemical performance for detecting Pb²⁺ and Ni²⁺ ions, revealing a higher affinity for Pb²⁺ ions, with sensitivities of ca. 25.8 mV/decade and 2.4 mV/decade, respectively. Such results indicate that Sn₃O₄ nanobelts can contemplate a feasible proof-of-concept capacitive field-effect sensor for heavy metal detection, envisaging other future studies focusing on environmental monitoring. 
KW - Sn₃O₄ KW - nanobelts KW - field-effect sensor KW - LbL films KW - heavy metals Y1 - 2023 U6 - https://doi.org/10.3390/chemosensors11080436 SN - 2227-9040 N1 - This article belongs to the Special Issue The Application of Electrochemical Sensors or Biosensors Based on Nanomaterials VL - 11 IS - 8 PB - MDPI CY - Basel ER - TY - THES A1 - Gaigall, Daniel T1 - On selected problems in multivariate analysis N2 - Selected problems in the field of multivariate statistical analysis are treated. Thereby, one focus is on the paired sample case. Among other things, statistical testing problems of marginal homogeneity are under consideration. In detail, properties of Hotelling’s T² test in a special parametric situation are obtained. Moreover, the nonparametric problem of marginal homogeneity is discussed on the basis of possibly incomplete data. In the bivariate data case, properties of the Hoeffding-Blum-Kiefer-Rosenblatt independence test statistic on the basis of partly not identically distributed data are investigated. Similar testing problems are treated within the scope of the application of a result for the empirical process of the concomitants for partly categorial data. Furthermore, testing changes in the modeled solvency capital requirement of an insurance company by means of a paired sample from an internal risk model is discussed. Beyond the paired sample case, a new asymptotic relative efficiency concept based on the expected volumes of multidimensional confidence regions is introduced. Besides, a new approach for the treatment of the multi-sample goodness-of-fit problem is presented. Finally, a consistent test for the treatment of the goodness-of-fit problem is developed for the background of huge or infinite dimensional data. 
KW - Paired sample KW - Marginal homogeneity KW - Incomplete data KW - Asymptotic relative efficiency KW - Volumes of confidence regions Y1 - 2023 U6 - https://doi.org/10.15488/14304 N1 - Gottfried Wilhelm Leibniz Universität Hannover ER - TY - CHAP A1 - Büsgen, André A1 - Klöser, Lars A1 - Kohl, Philipp A1 - Schmidts, Oliver A1 - Kraft, Bodo A1 - Zündorf, Albert ED - Cuzzocrea, Alfredo ED - Gusikhin, Oleg ED - Hammoudi, Slimane ED - Quix, Christoph T1 - From cracked accounts to fake IDs: user profiling on German telegram black market channels T2 - Data Management Technologies and Applications N2 - Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also be utilized as a platform for illegal activity. Telegram allows public groups with up to 200,000 participants. Criminals use these public groups for trading illegal commodities and services, which becomes a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of annotated text messages with entities and relations from four self-proclaimed black market chat rooms. Our pipeline approach aggregates the extracted product attributes from user messages to profiles and uses these with their sold products as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or fine-granular price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter is still superior for sequence labeling. 
KW - Clustering KW - Natural language processing KW - Information extraction KW - Profile extraction KW - Text mining Y1 - 2023 SN - 978-3-031-37889-8 (Print) SN - 978-3-031-37890-4 (Online) U6 - https://doi.org/10.1007/978-3-031-37890-4_9 N1 - 10th International Conference, DATA 2021, Virtual Event, July 6–8, 2021, and 11th International Conference, DATA 2022, Lisbon, Portugal, July 11-13, 2022 SP - 176 EP - 202 PB - Springer CY - Cham ER - TY - CHAP A1 - Kohl, Philipp A1 - Freyer, Nils A1 - Krämer, Yoka A1 - Werth, Henri A1 - Wolf, Steffen A1 - Kraft, Bodo A1 - Meinecke, Matthias A1 - Zündorf, Albert ED - Conte, Donatello ED - Fred, Ana ED - Gusikhin, Oleg ED - Sansone, Carlo T1 - ALE: a simulation-based active learning evaluation framework for the parameter-driven comparison of query strategies for NLP T2 - Deep Learning Theory and Applications N2 - Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual, and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data points to annotators they annotate next instead of a subsequent or random sample. This method is supposed to save annotation effort while maintaining model performance. However, practitioners face many AL strategies for different tasks and need an empirical basis to choose between them. Surveys categorize AL strategies into taxonomies without performance indications. Presentations of novel AL strategies compare the performance to a small subset of strategies. Our contribution addresses the empirical basis by introducing a reproducible active learning evaluation (ALE) framework for the comparative evaluation of AL strategies in NLP. The framework allows the implementation of AL strategies with low effort and a fair data-driven comparison through defining and tracking experiment parameters (e.g., initial dataset size, number of data points per query step, and the budget). 
ALE helps practitioners to make more informed decisions, and researchers can focus on developing new, effective AL strategies and deriving best practices for specific use cases. With best practices, practitioners can lower their annotation costs. We present a case study to illustrate how to use the framework. KW - Active learning KW - Query learning KW - Natural language processing KW - Deep learning KW - Reproducible research Y1 - 2023 SN - 978-3-031-39058-6 (Print) SN - 978-3-031-39059-3 (Online) U6 - https://doi.org/10.1007/978-3-031-39059-3_16 N1 - 4th International Conference, DeLTA 2023, Rome, Italy, July 13–14, 2023. SP - 235 EP - 253 PB - Springer CY - Cham ER -