TY - CHAP A1 - Freyer, Nils A1 - Thewes, Dustin A1 - Meinecke, Matthias ED - Gusikhin, Oleg ED - Hammoudi, Slimane ED - Cuzzocrea, Alfredo T1 - GUIDO: a hybrid approach to guideline discovery & ordering from natural language texts T2 - Proceedings of the 12th International Conference on Data Science, Technology and Applications DATA - Volume 1 N2 - Extracting workflow nets from textual descriptions can be used to simplify guidelines or formalize textual descriptions of formal processes like business processes and algorithms. The task of manually extracting processes, however, requires domain expertise and effort. While automatic process model extraction is desirable, annotating texts with formalized process models is expensive. Therefore, there are only a few machine-learning-based extraction approaches. Rule-based approaches, in turn, require domain specificity to work well and can rarely distinguish relevant and irrelevant information in textual descriptions. In this paper, we present GUIDO, a hybrid approach to the process model extraction task that first, classifies sentences regarding their relevance to the process model, using a BERT-based sentence classifier, and second, extracts a process model from the sentences classified as relevant, using dependency parsing. The presented approach achieves significantly better resul ts than a pure rule-based approach. GUIDO achieves an average behavioral similarity score of 0.93. Still, in comparison to purely machine-learning-based approaches, the annotation costs stay low. KW - Natural Language Processing KW - Text Mining KW - Process Model Extraction KW - Business Process Intelligence Y1 - 2023 SN - 978-989-758-664-4 U6 - https://doi.org/10.5220/0012084400003541 SN - 2184-285X N1 - 12th International Conference on Data Science, Technology and Applications, July 11-13, 2023, in Rome, Italy. SP - 335 EP - 342 ER - TY - CHAP A1 - Schulte, Jonas A1 - Schwager, Christian A1 - Noureldin, Kareem A1 - May, Martin A1 - Teixeira Boura, Cristiano José A1 - Herrmann, Ulf T1 - Gradient controlled startup procedure of a molten-salt power-to-heat energy storage plant based on dynamic process simulation T2 - SolarPACES: Solar Power & Chemical Energy Systems N2 - The integration of high temperature thermal energy storages into existing conventional power plants can help to reduce the CO2 emissions of those plants and lead to lower capital expenditures for building energy storage systems, due to the use of synergy effects [1]. One possibility to implement that, is a molten salt storage system with a powerful power-to-heat unit. This paper presents two possible control concepts for the startup of the charging system of such a facility. The procedures are implemented in a detailed dynamic process model. The performance and safety regarding the film temperatures at heat transmitting surfaces are investigated in the process simulations. To improve the accuracy in predicting the film temperatures, CFD simulations of the electrical heater are carried out and the results are merged with the dynamic model. The results show that both investigated control concepts are safe regarding the temperature limits. The gradient controlled startup performed better than the temperature-controlled startup. Nevertheless, there are several uncertainties that need to be investigated further. KW - Power plants KW - Energy storage KW - Associated liquids Y1 - 2023 SN - 978-0-7354-4623-6 U6 - https://doi.org/10.1063/5.0148741 SN - 1551-7616 (online) SN - 0094-243X (print) N1 - SolarPACES: SOLAR POWER & CHEMICAL ENERGY SYSTEMS: 27th International Conference on Concentrating Solar Power and Chemical Energy Systems, 27 September–1 October 2021, Online IS - 2815 / 1 PB - AIP conference proceedings / American Institute of Physics CY - Melville, NY ER - TY - CHAP A1 - Büsgen, André A1 - Klöser, Lars A1 - Kohl, Philipp A1 - Schmidts, Oliver A1 - Kraft, Bodo A1 - Zündorf, Albert ED - Cuzzocrea, Alfredo ED - Gusikhin, Oleg ED - Hammoudi, Slimane ED - Quix, Christoph T1 - From cracked accounts to fake IDs: user profiling on German telegram black market channels T2 - Data Management Technologies and Applications N2 - Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also be utilized as a platform for illegal activity. Telegram allows public groups with up to 200.000 participants. Criminals use these public groups for trading illegal commodities and services, which becomes a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of annotated text messages with entities and relations from four self-proclaimed black market chat rooms. Our pipeline approach aggregates the extracted product attributes from user messages to profiles and uses these with their sold products as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or fine-granular price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter is still superior for sequence labeling. KW - Clustering KW - Natural language processing KW - Information extraction KW - Profile extraction KW - Text mining Y1 - 2023 SN - 978-3-031-37889-8 (Print) SN - 978-3-031-37890-4 (Online) U6 - https://doi.org/10.1007/978-3-031-37890-4_9 N1 - 10th International Conference, DATA 2021, Virtual Event, July 6–8, 2021, and 11th International Conference, DATA 2022, Lisbon, Portugal, July 11-13, 2022 SP - 176 EP - 202 PB - Springer CY - Cham ER - TY - CHAP A1 - Klöser, Lars A1 - Büsgen, André A1 - Kohl, Philipp A1 - Kraft, Bodo A1 - Zündorf, Albert ED - Conte, Donatello ED - Fred, Ana ED - Gusikhin, Oleg ED - Sansone, Carlo T1 - Explaining relation classification models with semantic extents T2 - Deep Learning Theory and Applications N2 - In recent years, the development of large pretrained language models, such as BERT and GPT, significantly improved information extraction systems on various tasks, including relation classification. State-of-the-art systems are highly accurate on scientific benchmarks. A lack of explainability is currently a complicating factor in many real-world applications. Comprehensible systems are necessary to prevent biased, counterintuitive, or harmful decisions. We introduce semantic extents, a concept to analyze decision patterns for the relation classification task. Semantic extents are the most influential parts of texts concerning classification decisions. Our definition allows similar procedures to determine semantic extents for humans and models. We provide an annotation tool and a software framework to determine semantic extents for humans and models conveniently and reproducibly. Comparing both reveals that models tend to learn shortcut patterns from data. These patterns are hard to detect with current interpretability methods, such as input reductions. Our approach can help detect and eliminate spurious decision patterns during model development. Semantic extents can increase the reliability and security of natural language processing systems. Semantic extents are an essential step in enabling applications in critical areas like healthcare or finance. Moreover, our work opens new research directions for developing methods to explain deep learning models. KW - Relation classification KW - Natural language processing KW - Natural language understanding KW - Information extraction KW - Trustworthy artificial intelligence Y1 - 2023 SN - 978-3-031-39058-6 (Print) SN - 978-3-031-39059-3 (Online) U6 - https://doi.org/10.1007/978-3-031-39059-3_13 N1 - 4th International Conference, DeLTA 2023, Rome, Italy, July 13–14, 2023. SP - 189 EP - 208 PB - Springer CY - Cham ER - TY - CHAP A1 - Lahrs, Lennart A1 - Krisam, Pierre A1 - Herrmann, Ulf T1 - Envisioning a collaborative energy system planning platform for the energy transition at the district level T2 - ECOS 2023. The 36th International Conference on Efficiency, Cost, Optimization, Simulation and Environmental Impact of Energy Systems N2 - Residential and commercial buildings account for more than one-third of global energy-related greenhouse gas emissions. Integrated multi-energy systems at the district level are a promising way to reduce greenhouse gas emissions by exploiting economies of scale and synergies between energy sources. Planning district energy systems comes with many challenges in an ever-changing environment. Computational modelling established itself as the state-of-the-art method for district energy system planning. Unfortunately, it is still cumbersome to combine standalone models to generate insights that surpass their original purpose. Ideally, planning processes could be solved by using modular tools that easily incorporate the variety of competing and complementing computational models. Our contribution is a vision for a collaborative development and application platform for multi-energy system planning tools at the district level. We present challenges of district energy system planning identified in the literature and evaluate whether this platform can help to overcome these challenges. Further, we propose a toolkit that represents the core technical elements of the platform. Lastly, we discuss community management and its relevance for the success of projects with collaboration and knowledge sharing at their core. KW - Energy system planning KW - District energy planning platform KW - District data model KW - Renewable energy integration Y1 - 2023 U6 - https://doi.org/10.52202/069564-0284 N1 - ECOS 2023. The 36th International Conference on Efficiency, Cost, Optimization, Simulation and Environmental Impact of Energy Systems, 25-30 JUNE, 2023, Las Palmas de Gran Canaria, Spain SP - 3163 EP - 3170 PB - Procedings of ECOS 2023 ER - TY - CHAP A1 - Alhaskir, Mohamed A1 - Tschesche, Matteo A1 - Linke, Florian A1 - Schriewer, Elisabeth A1 - Weber, Yvonne A1 - Wolking, Stefan A1 - Röhrig, Rainer A1 - Koch, Henner A1 - Kutafina, Ekaterina ED - Röhrig, Rainer ED - Grabe, Niels ED - Haag, Martin ED - Hübner, Ursula ED - Sax, Ulrich ED - Schmidt, Carsten Oliver ED - Sedlmayr, Martin ED - Zapf, Antonia T1 - ECG matching: an approach to synchronize ECG datasets for data quality comparisons T2 - Proceedings of the 68th Annual Meeting of the German Association of Medical Informatics, Biometry, and Epidemiology e.V. (gmds) 2023 N2 - Clinical assessment of newly developed sensors is important for ensuring their validity. Comparing recordings of emerging electrocardiography (ECG) systems to a reference ECG system requires accurate synchronization of data from both devices. Current methods can be inefficient and prone to errors. To address this issue, three algorithms are presented to synchronize two ECG time series from different recording systems: Binned R-peak Correlation, R-R Interval Correlation, and Average R-peak Distance. These algorithms reduce ECG data to their cyclic features, mitigating inefficiencies and minimizing discrepancies between different recording systems. We evaluate the performance of these algorithms using high-quality data and then assess their robustness after manipulating the R-peaks. Our results show that R-R Interval Correlation was the most efficient, whereas the Average R-peak Distance and Binned R-peak Correlation were more robust against noisy data. KW - Electrocardiography KW - Wearable electronic device KW - Sensors comparison KW - Time-series synchronization Y1 - 2023 SN - 978-1-64368-428-4 (Print) SN - 978-1-64368-429-1 (Online) U6 - https://doi.org/10.3233/SHTI230718 N1 - Proceedings of the 68th Annual Meeting of the German Association of Medical Informatics, Biometry, and Epidemiology e.V. (gmds) 2023, 17. - 21.9.23, Heilbronn, Germany Part of the series: Studies in Health Technology and Informatics VL - 307 SP - 225 EP - 232 PB - IOS Press ER - TY - CHAP A1 - Neth, Jannik A1 - Schuba, Marko A1 - Brodkorb, Karsten A1 - Neugebauer, Georg A1 - Höner, Tim A1 - Hack, Sacha T1 - Digital forensics triage app for android T2 - ARES '23: Proceedings of the 18th International Conference on Availability, Reliability and Security N2 - Digital forensics of smartphones is of utmost importance in many criminal cases. As modern smartphones store chats, photos, videos etc. that can be relevant for investigations and as they can have storage capacities of hundreds of gigabytes, they are a primary target for forensic investigators. However, it is exactly this large amount of data that is causing problems: extracting and examining the data from multiple phones seized in the context of a case is taking more and more time. This bears the risk of wasting a lot of time with irrelevant phones while there is not enough time left to analyze a phone which is worth examination. Forensic triage can help in this case: Such a triage is a preselection step based on a subset of data and is performed before fully extracting all the data from the smartphone. Triage can accelerate subsequent investigations and is especially useful in cases where time is essential. The aim of this paper is to determine which and how much data from an Android smartphone can be made directly accessible to the forensic investigator – without tedious investigations. For this purpose, an app has been developed that can be used with extremely limited storage of data in the handset and which outputs the extracted data immediately to the forensic workstation in a human- and machine-readable format. KW - Android KW - Digital triage KW - Triage-app Y1 - 2023 SN - 9798400707728 U6 - https://doi.org/10.1145/3600160.3605017 N1 - ARES 2023: The 18th International Conference on Availability, Reliability and Security. August 29 - September 1, 2023. Benevento, Italy. PB - ACM ER - TY - CHAP A1 - Grund, Raphael M. A1 - Altherr, Lena ED - Reiff-Stephan, Jörg ED - Jäkel, Jens ED - Schwarz, André T1 - Development of an open source energy disaggregation tool for the home automation platform Home Assistant T2 - Tagungsband AALE 2023 : mit Automatisierung gegen den Klimawandel N2 - In order to reduce energy consumption of homes, it is important to make transparent which devices consume how much energy. However, power consumption is often only monitored aggregated at the house energy meter. Disaggregating this power consumption into the contributions of individual devices can be achieved using Machine Learning. Our work aims at making state of the art disaggregation algorithms accessibe for users of the open source home automation platform Home Assistant. KW - Home Automation Platform KW - Home Assistant KW - Open Source KW - Machine Learning KW - Energy Disaggregation Y1 - 2023 SN - 978-3-910103-01-6 U6 - https://doi.org/10.33968/2023.02 N1 - 19. AALE-Konferenz. Luxemburg, 08.03.-10.03.2023. BTS Connected Buildings & Cities Luxemburg (Tagungsband unter https://doi.org/10.33968/2023.01) SP - 11 EP - 20 PB - le-tex publishing services GmbH CY - Leipzig ER - TY - CHAP A1 - Tischbein, Franziska A1 - Kean, Kilian A1 - Vertgewall, Chris Martin A1 - Ulbig, Andreas A1 - Altherr, Lena T1 - Determination of the topology of low-voltage distribution grids using cluster methods T2 - 27th International Conference on Electricity Distribution (CIRED 2023) N2 - Due to the decarbonization of the energy sector, the electric distribution grids are undergoing a major transformation, which is expected to increase the load on the operating resources due to new electrical loads and distributed energy resources. Therefore, grid operators need to gradually move to active grid management in order to ensure safe and reliable grid operation. However, this requires knowledge of key grid variables, such as node voltages, which is why the mass integration of measurement technology (smart meters) is necessary. Another problem is the fact that a large part of the topology of the distribution grids is not sufficiently digitized and models are partly faulty, which means that active grid operation management today has to be carried out largely blindly. It is therefore part of current research to develop methods for determining unknown grid topologies based on measurement data. In this paper, different clustering algorithms are presented and their performance of topology detection of low voltage grids is compared. Furthermore, the influence of measurement uncertainties is investigated in the form of a sensitivity analysis. Y1 - 2023 SN - 978-1-83953-855-1 U6 - https://doi.org/10.1049/icp.2023.0478 N1 - 27th International Conference on Electricity Distribution (CIRED 2023), 12-15 June 2023, Rome, Italy. SP - 1 EP - 5 PB - IEEE ER - TY - CHAP A1 - Stark, Ralf A1 - Bartel, Sebastian A1 - Ditsche, Florian A1 - Esch, Thomas T1 - Design study of a 30kN LOX/LCH4 aerospike rocket engine for lunar lander application T2 - Aerospace Europe Conference 2023 - 10th EUCASS - 9th CEAS N2 - Based on lunar lander concept EL3, various LOX/CH4 aerospike engines were studied. A distinction was made between single and cluster configurations as well as ideal and non-ideal contour concepts. It could be shown that non-ideal aerospike engines promise a significant payload gain. Y1 - 2023 N1 - Lausanne, July 9-13, 2023 ER -