TY - CHAP A1 - Schmidts, Oliver A1 - Kraft, Bodo A1 - Winkens, Marvin A1 - Zündorf, Albert T1 - Catalog integration of heterogeneous and volatile product data T2 - DATA 2020: Data Management Technologies and Applications N2 - The integration of frequently changing, volatile product data from different manufacturers into a single catalog is a significant challenge for small and medium-sized e-commerce companies. They rely on timely integrating product data to present them aggregated in an online shop without knowing format specifications, concept understanding of manufacturers, and data quality. Furthermore, format, concepts, and data quality may change at any time. Consequently, integrating product catalogs into a single standardized catalog is often a laborious manual task. Current strategies to streamline or automate catalog integration use techniques based on machine learning, word vectorization, or semantic similarity. However, most approaches struggle with low-quality or real-world data. We propose Attribute Label Ranking (ALR) as a recommendation engine to simplify the integration process of previously unknown, proprietary tabular format into a standardized catalog for practitioners. We evaluate ALR by focusing on the impact of different neural network architectures, language features, and semantic similarity. Additionally, we consider metrics for industrial application and present the impact of ALR in production and its limitations. Y1 - 2021 SN - 978-3-030-83013-7 U6 - https://doi.org/10.1007/978-3-030-83014-4_7 N1 - International Conference on Data Management Technologies and Applications, DATA 2020, 7-9 July SP - 134 EP - 153 PB - Springer CY - Cham ER - TY - CHAP A1 - Goldmann, Jan-Peter A1 - Braunstein, Bjoern A1 - Heinrich, Kai A1 - Sanno, Maximilian A1 - Stäudle, Benjamin A1 - Ritzdorf, Wolfgang A1 - Brüggemann, Gert-Peter A1 - Albracht, Kirsten T1 - Joint work of the take-off leg during elite high jump T2 - Proceedings of the 33th International Conference on Biomechanics in Sports (ISBS) Y1 - 2015 ER - TY - CHAP A1 - Droszez, Anna A1 - Sanno, Maximilian A1 - Goldmann, Jan-Peter A1 - Albracht, Kirsten A1 - Brüggemann, Gerd-Peter A1 - Braunstein, Bjoern T1 - Differences between take-off behavior during vertical jumps and two artistic elements T2 - 34th International Conference of Biomechanics in Sport, Tsukuba, Japan, July 18-22, 2016 Y1 - 2016 SN - 1999-4168 SP - 577 EP - 580 ER - TY - CHAP A1 - Abel, Thomas A1 - Bonin, Dominik A1 - Albracht, Kirsten A1 - Zeller, Sebastian A1 - Brüggemann, Gert-Peter A1 - Burkett, Brendan A1 - Strüder, Heiko K. T1 - Kinematic profile of the elite handcyclist T2 - 28th International Conference on Biomechanics in Sports, Marquette, Michigan, USA, July 19 – 23, 2010 Y1 - 2017 SN - 1999-4168 SP - 140 EP - 141 ER - TY - CHAP A1 - Braunstein, Bjoern A1 - Goldmann, Jan-Peter A1 - Albracht, Kirsten A1 - Sanno, Maximilian A1 - Willwacher, Steffen A1 - Heinrich, Kai A1 - Herrmann, Volker A1 - Brüggemann, Gert-Peter T1 - Joint specific contribution of mechanical power and work during acceleration and top speed in elite sprinters T2 - 31 International Conference on Biomechanics in Sports, Taipei, Taiwan, July 07 - July 22, 2013 Y1 - 2013 SN - 1999-4168 ER - TY - CHAP A1 - Kolditz, Melanie A1 - Albracht, Kirsten A1 - Fasse, Alessandro A1 - Albin, Thivaharan A1 - Brüggemann, Gert-Peter A1 - Abel, Dirk T1 - Evaluation of an industrial robot as a leg press training device T2 - XV International Symposium on Computer Simulation in Biomechanics July 9th – 11th 2015, Edinburgh, UK Y1 - 2015 SP - 41 EP - 42 ER - TY - CHAP A1 - Kolditz, Melanie A1 - Albin, Thivaharan A1 - Fasse, Alessandro A1 - Brüggemann, Gert-Peter A1 - Abel, Dirk A1 - Albracht, Kirsten T1 - Simulative Analysis of Joint Loading During Leg Press Exercise for Control Applications T2 - IFAC-PapersOnLine Y1 - 2015 U6 - https://doi.org/10.1016/j.ifacol.2015.10.179 N1 - IFAC-PapersOnLine 48-20; Conference Paper Archive VL - 48 IS - 20 SP - 435 EP - 440 ER - TY - CHAP A1 - Behbahani, Mehdi A1 - Rible, Sebastian A1 - Moulinec, Charles A1 - Fournier, Yvan A1 - Nicolai, Mike A1 - Crosetto, Paolo T1 - Simulation of the FDA Centrifugal Blood Pump Using High Performance Computing T2 - World Academy of Science, Engineering and Technology International Journal of Mechanical and Mechatronics Engineering Y1 - 2015 VL - 9 IS - 5 ER - TY - CHAP A1 - Büsgen, André A1 - Klöser, Lars A1 - Kohl, Philipp A1 - Schmidts, Oliver A1 - Kraft, Bodo A1 - Zündorf, Albert ED - Cuzzocrea, Alfredo ED - Gusikhin, Oleg ED - Hammoudi, Slimane ED - Quix, Christoph T1 - From cracked accounts to fake IDs: user profiling on German telegram black market channels T2 - Data Management Technologies and Applications N2 - Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also be utilized as a platform for illegal activity. Telegram allows public groups with up to 200.000 participants. Criminals use these public groups for trading illegal commodities and services, which becomes a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of annotated text messages with entities and relations from four self-proclaimed black market chat rooms. Our pipeline approach aggregates the extracted product attributes from user messages to profiles and uses these with their sold products as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or fine-granular price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter is still superior for sequence labeling. KW - Clustering KW - Natural language processing KW - Information extraction KW - Profile extraction KW - Text mining Y1 - 2023 SN - 978-3-031-37889-8 (Print) SN - 978-3-031-37890-4 (Online) U6 - https://doi.org/10.1007/978-3-031-37890-4_9 N1 - 10th International Conference, DATA 2021, Virtual Event, July 6–8, 2021, and 11th International Conference, DATA 2022, Lisbon, Portugal, July 11-13, 2022 SP - 176 EP - 202 PB - Springer CY - Cham ER - TY - CHAP A1 - Kohl, Philipp A1 - Freyer, Nils A1 - Krämer, Yoka A1 - Werth, Henri A1 - Wolf, Steffen A1 - Kraft, Bodo A1 - Meinecke, Matthias A1 - Zündorf, Albert ED - Conte, Donatello ED - Fred, Ana ED - Gusikhin, Oleg ED - Sansone, Carlo T1 - ALE: a simulation-based active learning evaluation framework for the parameter-driven comparison of query strategies for NLP T2 - Deep Learning Theory and Applications N2 - Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual, and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data points to annotators they annotate next instead of a subsequent or random sample. This method is supposed to save annotation effort while maintaining model performance. However, practitioners face many AL strategies for different tasks and need an empirical basis to choose between them. Surveys categorize AL strategies into taxonomies without performance indications. Presentations of novel AL strategies compare the performance to a small subset of strategies. Our contribution addresses the empirical basis by introducing a reproducible active learning evaluation (ALE) framework for the comparative evaluation of AL strategies in NLP. The framework allows the implementation of AL strategies with low effort and a fair data-driven comparison through defining and tracking experiment parameters (e.g., initial dataset size, number of data points per query step, and the budget). ALE helps practitioners to make more informed decisions, and researchers can focus on developing new, effective AL strategies and deriving best practices for specific use cases. With best practices, practitioners can lower their annotation costs. We present a case study to illustrate how to use the framework. KW - Active learning KW - Query learning KW - Natural language processing KW - Deep learning KW - Reproducible research Y1 - 2023 SN - 978-3-031-39058-6 (Print) SN - 978-3-031-39059-3 (Online) U6 - https://doi.org/10.1007/978-3-031-39059-3_16 N1 - 4th International Conference, DeLTA 2023, Rome, Italy, July 13–14, 2023. SP - 235 EP - 253 PB - Springer CY - Cham ER -