TY - CHAP A1 - Galdi, Chiara A1 - Hartung, Frank A1 - Dugelay, Jean-Luc T1 - Videos versus still images: Asymmetric sensor pattern noise comparison on mobile phones T2 - Electronic Imaging N2 - Nowadays, the most employed devices for recoding videos or capturing images are undoubtedly the smartphones. Our work investigates the application of source camera identification on mobile phones. We present a dataset entirely collected by mobile phones. The dataset contains both still images and videos collected by 67 different smartphones. Part of the images consists in photos of uniform backgrounds, especially collected for the computation of the RSPN. Identifying the source camera given a video is particularly challenging due to the strong video compression. The experiments reported in this paper, show the large variation in performance when testing an highly accurate technique on still images and videos. KW - Image Forensics KW - Mobile Phones KW - Image Database Y1 - 2017 U6 - https://doi.org/10.2352/ISSN.2470-1173.2017.7.MWSF-331 SN - 2470-1173 N1 - IS&T International Symposium on Electronic Imaging 2017 Media Watermarking, Security, and Forensics 2017 SP - 100 EP - 103 PB - Society for Imaging Science and Technology CY - Springfield, Virginia ER - TY - CHAP A1 - Büsgen, André A1 - Klöser, Lars A1 - Kohl, Philipp A1 - Schmidts, Oliver A1 - Kraft, Bodo A1 - Zündorf, Albert ED - Cuzzocrea, Alfredo ED - Gusikhin, Oleg ED - Hammoudi, Slimane ED - Quix, Christoph T1 - From cracked accounts to fake IDs: user profiling on German telegram black market channels T2 - Data Management Technologies and Applications N2 - Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also be utilized as a platform for illegal activity. Telegram allows public groups with up to 200.000 participants. Criminals use these public groups for trading illegal commodities and services, which becomes a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of annotated text messages with entities and relations from four self-proclaimed black market chat rooms. Our pipeline approach aggregates the extracted product attributes from user messages to profiles and uses these with their sold products as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or fine-granular price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter is still superior for sequence labeling. KW - Clustering KW - Natural language processing KW - Information extraction KW - Profile extraction KW - Text mining Y1 - 2023 SN - 978-3-031-37889-8 (Print) SN - 978-3-031-37890-4 (Online) U6 - https://doi.org/10.1007/978-3-031-37890-4_9 N1 - 10th International Conference, DATA 2021, Virtual Event, July 6–8, 2021, and 11th International Conference, DATA 2022, Lisbon, Portugal, July 11-13, 2022 SP - 176 EP - 202 PB - Springer CY - Cham ER - TY - CHAP A1 - Kohl, Philipp A1 - Freyer, Nils A1 - Krämer, Yoka A1 - Werth, Henri A1 - Wolf, Steffen A1 - Kraft, Bodo A1 - Meinecke, Matthias A1 - Zündorf, Albert ED - Conte, Donatello ED - Fred, Ana ED - Gusikhin, Oleg ED - Sansone, Carlo T1 - ALE: a simulation-based active learning evaluation framework for the parameter-driven comparison of query strategies for NLP T2 - Deep Learning Theory and Applications N2 - Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual, and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data points to annotators they annotate next instead of a subsequent or random sample. This method is supposed to save annotation effort while maintaining model performance. However, practitioners face many AL strategies for different tasks and need an empirical basis to choose between them. Surveys categorize AL strategies into taxonomies without performance indications. Presentations of novel AL strategies compare the performance to a small subset of strategies. Our contribution addresses the empirical basis by introducing a reproducible active learning evaluation (ALE) framework for the comparative evaluation of AL strategies in NLP. The framework allows the implementation of AL strategies with low effort and a fair data-driven comparison through defining and tracking experiment parameters (e.g., initial dataset size, number of data points per query step, and the budget). ALE helps practitioners to make more informed decisions, and researchers can focus on developing new, effective AL strategies and deriving best practices for specific use cases. With best practices, practitioners can lower their annotation costs. We present a case study to illustrate how to use the framework. KW - Active learning KW - Query learning KW - Natural language processing KW - Deep learning KW - Reproducible research Y1 - 2023 SN - 978-3-031-39058-6 (Print) SN - 978-3-031-39059-3 (Online) U6 - https://doi.org/10.1007/978-3-031-39059-3_16 N1 - 4th International Conference, DeLTA 2023, Rome, Italy, July 13–14, 2023. SP - 235 EP - 253 PB - Springer CY - Cham ER - TY - CHAP A1 - Chavez Bermudez, Victor Francisco A1 - Cruz Castanon, Victor Fernando A1 - Ruchay, Marco A1 - Wollert, Jörg ED - Leipzig, Hochschule für Technik, Wirtschaft und Kultur T1 - Rapid prototyping framework for automation applications based on IO-Link T2 - Tagungsband AALE 2022: Wissenstransfer im Spannungsfeld von Autonomisierung und Fachkräftemangel N2 - The development of protype applications with sensors and actuators in the automation industry requires tools that are independent of manufacturer, and are flexible enough to be modified or extended for any specific requirements. Currently, developing prototypes with industrial sensors and actuators is not straightforward. First of all, the exchange of information depends on the industrial protocol that these devices have. Second, a specific configuration and installation is done based on the hardware that is used, such as automation controllers or industrial gateways. This means that the development for a specific industrial protocol, highly depends on the hardware and the software that vendors provide. In this work we propose a rapid-prototyping framework based on Arduino to solve this problem. For this project we have focused to work with the IO-Link protocol. The framework consists of an Arduino shield that acts as the physical layer, and a software that implements the IO-Link Master protocol. The main advantage of such framework is that an application with industrial devices can be rapid-prototyped with ease as its vendor independent, open-source and can be ported easily to other Arduino compatible boards. In comparison, a typical approach requires proprietary hardware, is not easy to port to another system and is closed-source. KW - Rapid-prototyping KW - Arduino KW - IO-Link KW - Industrial Communication Y1 - 2022 SN - 978-3-910103-00-9 U6 - https://doi.org/10.33968/2022.28 N1 - 18. AALE-Konferenz. Pforzheim, 09.03.-11.03.2022 CY - Leipzig ER - TY - CHAP A1 - Ulmer, Jessica A1 - Mostafa, Youssef A1 - Wollert, Jörg T1 - Digital Twin Academy: From Zero to Hero through individual learning experiences T2 - Tagungsband AALE 2022: Wissenstransfer im Spannungsfeld von Autonomisierung und Fachkräftemangel N2 - Digital twins are seen as one of the key technologies of Industry 4.0. Although many research groups focus on digital twins and create meaningful outputs, the technology has not yet reached a broad application in the industry. The main reasons for this imbalance are the complexity of the topic, the lack of specialists, and the unawareness of the twin opportunities. The project "Digital Twin Academy" aims to overcome these barriers by focusing on three actions: Building a digital twin community for discussion and exchange, offering multi-stage training for various knowledge levels, and implementing realworld use cases for deeper insights and guidance. In this work, we focus on creating a flexible learning platform that allows the user to select a training path adjusted to personal knowledge and needs. Therefore, a mix of basic and advanced modules is created and expanded by individual feedback options. The usage of personas supports the selection of the appropriate modules. KW - Digital Twins KW - Knowledge Transfer KW - Training Y1 - 2022 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bsz:l189-qucosa2-776097 SN - 978-3-910103-00-9 N1 - 18. AALE-Konferenz. Pforzheim, 09.03.-11.03.2022 SP - 1 EP - 9 ER - TY - CHAP A1 - Freyer, Nils A1 - Thewes, Dustin A1 - Meinecke, Matthias ED - Gusikhin, Oleg ED - Hammoudi, Slimane ED - Cuzzocrea, Alfredo T1 - GUIDO: a hybrid approach to guideline discovery & ordering from natural language texts T2 - Proceedings of the 12th International Conference on Data Science, Technology and Applications DATA - Volume 1 N2 - Extracting workflow nets from textual descriptions can be used to simplify guidelines or formalize textual descriptions of formal processes like business processes and algorithms. The task of manually extracting processes, however, requires domain expertise and effort. While automatic process model extraction is desirable, annotating texts with formalized process models is expensive. Therefore, there are only a few machine-learning-based extraction approaches. Rule-based approaches, in turn, require domain specificity to work well and can rarely distinguish relevant and irrelevant information in textual descriptions. In this paper, we present GUIDO, a hybrid approach to the process model extraction task that first, classifies sentences regarding their relevance to the process model, using a BERT-based sentence classifier, and second, extracts a process model from the sentences classified as relevant, using dependency parsing. The presented approach achieves significantly better resul ts than a pure rule-based approach. GUIDO achieves an average behavioral similarity score of 0.93. Still, in comparison to purely machine-learning-based approaches, the annotation costs stay low. KW - Natural Language Processing KW - Text Mining KW - Process Model Extraction KW - Business Process Intelligence Y1 - 2023 SN - 978-989-758-664-4 U6 - https://doi.org/10.5220/0012084400003541 SN - 2184-285X N1 - 12th International Conference on Data Science, Technology and Applications, July 11-13, 2023, in Rome, Italy. SP - 335 EP - 342 ER - TY - CHAP A1 - Klöser, Lars A1 - Büsgen, André A1 - Kohl, Philipp A1 - Kraft, Bodo A1 - Zündorf, Albert ED - Conte, Donatello ED - Fred, Ana ED - Gusikhin, Oleg ED - Sansone, Carlo T1 - Explaining relation classification models with semantic extents T2 - Deep Learning Theory and Applications N2 - In recent years, the development of large pretrained language models, such as BERT and GPT, significantly improved information extraction systems on various tasks, including relation classification. State-of-the-art systems are highly accurate on scientific benchmarks. A lack of explainability is currently a complicating factor in many real-world applications. Comprehensible systems are necessary to prevent biased, counterintuitive, or harmful decisions. We introduce semantic extents, a concept to analyze decision patterns for the relation classification task. Semantic extents are the most influential parts of texts concerning classification decisions. Our definition allows similar procedures to determine semantic extents for humans and models. We provide an annotation tool and a software framework to determine semantic extents for humans and models conveniently and reproducibly. Comparing both reveals that models tend to learn shortcut patterns from data. These patterns are hard to detect with current interpretability methods, such as input reductions. Our approach can help detect and eliminate spurious decision patterns during model development. Semantic extents can increase the reliability and security of natural language processing systems. Semantic extents are an essential step in enabling applications in critical areas like healthcare or finance. Moreover, our work opens new research directions for developing methods to explain deep learning models. KW - Relation classification KW - Natural language processing KW - Natural language understanding KW - Information extraction KW - Trustworthy artificial intelligence Y1 - 2023 SN - 978-3-031-39058-6 (Print) SN - 978-3-031-39059-3 (Online) U6 - https://doi.org/10.1007/978-3-031-39059-3_13 N1 - 4th International Conference, DeLTA 2023, Rome, Italy, July 13–14, 2023. SP - 189 EP - 208 PB - Springer CY - Cham ER - TY - CHAP A1 - Ketelhut, Maike A1 - Göll, Fabian A1 - Braunstein, Bjoern A1 - Albracht, Kirsten A1 - Abel, Dirk T1 - Iterative learning control of an industrial robot for neuromuscular training T2 - 2019 IEEE Conference on Control Technology and Applications N2 - Effective training requires high muscle forces potentially leading to training-induced injuries. Thus, continuous monitoring and controlling of the loadings applied to the musculoskeletal system along the motion trajectory is required. In this paper, a norm-optimal iterative learning control algorithm for the robot-assisted training is developed. The algorithm aims at minimizing the external knee joint moment, which is commonly used to quantify the loading of the medial compartment. To estimate the external knee joint moment, a musculoskeletal lower extremity model is implemented in OpenSim and coupled with a model of an industrial robot and a force plate mounted at its end-effector. The algorithm is tested in simulation for patients with varus, normal and valgus alignment of the knee. The results show that the algorithm is able to minimize the external knee joint moment in all three cases and converges after less than seven iterations. KW - Knee KW - Training KW - Load modeling KW - Force KW - Iterative learning control Y1 - 2019 SN - 978-1-7281-2767-5 (ePub) SN - 978-1-7281-2766-8 (USB) SN - 978-1-7281-2768-2 (PoD) U6 - https://doi.org/10.1109/CCTA.2019.8920659 N1 - 2019 IEEE Conference on Control Technology and Applications (CCTA) Hong Kong, China, August 19-21, 2019 PB - IEEE CY - New York ER - TY - CHAP A1 - Maurer, Florian A1 - Miskiw, Kim K. A1 - Acosta, Rebeca Ramirez A1 - Harder, Nick A1 - Sander, Volker A1 - Lehnhoff, Sebastian ED - Jorgensen, Bo Norregaard ED - Pereira da Silva, Luiz Carlos ED - Ma, Zheng T1 - Market abstraction of energy markets and policies - application in an agent-based modeling toolbox T2 - EI.A 2023: Energy Informatics N2 - In light of emerging challenges in energy systems, markets are prone to changing dynamics and market design. Simulation models are commonly used to understand the changing dynamics of future electricity markets. However, existing market models were often created with specific use cases in mind, which limits their flexibility and usability. This can impose challenges for using a single model to compare different market designs. This paper introduces a new method of defining market designs for energy market simulations. The proposed concept makes it easy to incorporate different market designs into electricity market models by using relevant parameters derived from analyzing existing simulation tools, morphological categorization and ontologies. These parameters are then used to derive a market abstraction and integrate it into an agent-based simulation framework, allowing for a unified analysis of diverse market designs. Furthermore, we showcase the usability of integrating new types of long-term contracts and over-the-counter trading. To validate this approach, two case studies are demonstrated: a pay-as-clear market and a pay-as-bid long-term market. These examples demonstrate the capabilities of the proposed framework. KW - Energy market design KW - Agent-based simulation KW - Market modeling Y1 - 2023 SN - 978-3-031-48651-7 (Print) SN - 978-3-031-48652-4 (eBook) U6 - https://doi.org/10.1007/978-3-031-48652-4_10 N1 - Energy Informatics Academy Conference, 6-8 December 23, Campinas, Brazil. N1 - Part of the Lecture Notes in Computer Science book series (LNCS,volume 14468). SP - 139 EP - 157 PB - Springer CY - Cham ER - TY - CHAP A1 - Schulze-Buxloh, Lina A1 - Groß, Rolf Fritz T1 - Interdisciplinary Course Smart Building Engineering: A new approach of teaching freshmen in remote teamwork project under pandemic restrictions T2 - New Perspectives in Science Education - International Conference N2 - In the context of the Corona pandemic and its impact on teaching like digital lectures and exercises a new concept especially for freshmen in demanding courses of Smart Building Engineering became necessary. As there were hardly any face-to-face events at the university, the new teaching concept should enable a good start into engineering studies under pandemic conditions anyway and should also replace the written exam at the end. The students should become active themselves in small teams instead of listening passively to a lecture broadcast online with almost no personal contact. For this purpose, a role play was developed in which the freshmen had to work out a complete solution to the realistic problem of designing, construction planning and implementing a small guesthouse. Each student of the team had to take a certain role like architect, site manager, BIM-manager, electrician and the technitian for HVAC installations. Technical specifications must be complied with, as well as documentation, time planning and cost estimate. The final project folder had to contain technical documents like circuit diagrams for electrical components, circuit diagrams for water and heating, design calculations and components lists. On the other hand construction schedule, construction implementation plan, documentation of the construction progress and minutes of meetings between the various trades had to be submitted as well. In addition to the project folder, a model of the construction project must also be created either as a handmade model or as a digital 3D-model using Computer-aided design (CAD) software. The first steps in the field of Building information modelling (BIM) had also been taken by creating a digital model of the building showing the current planning status in real time as a digital twin. This project turned out to be an excellent training of important student competencies like teamwork, communication skills, and self -organisation and also increased motivation to work on complex technical questions. The aim of giving the student a first impression on the challenges and solutions in building projects with many different technical trades and their points of view was very well achieved and should be continued in the future. KW - Freshmen KW - roleplay KW - Smart Building Engineering KW - BIM KW - remote teamwork Y1 - 2021 N1 - New Perspectives in Science Education - 10th Edition, 18-19 March 2021, Fully Virtual Conference PB - Filodiritto CY - Bologna ER -