OPUS 4 | Search

Schema Matching with Frequent Changes on Semi-Structured Input Files: A Machine Learning Approach on Biological Product Data (2019)

Oliver Schmidts ; Bodo Kraft ; Ines Siebigteroth ; Albert Zündorf

Continuously evaluated research projects in collaborative decoupled environments (2018)

Oliver Schmidts ; Bodo Kraft ; Marc Schreiber ; Albert Zündorf

Often, research results from collaboration projects are not transferred into productive environments even though the approaches are proven to work in demonstration prototypes. These demonstration prototypes are usually too fragile and error-prone to be transferred easily into productive environments. A lot of additional work is required. Inspired by the idea of an incremental delivery process, we introduce an architecture pattern that combines the approach of Metrics Driven Research Collaboration with microservices for ease of integration. It enables keeping track of project goals over the course of the collaboration while every party can focus on its own expertise: researchers on complex algorithms, practitioners on their business goals. Through the simplified integration, (intermediate) research results can be introduced into a productive environment, which enables early user feedback and allows for the early evaluation of different approaches. The practitioners’ business model benefits throughout the full project duration.

Multi-pedestrian tracking by moving Bluetooth-LE beacons and stationary receivers (2017)

Oliver Schmidts ; Maik Boltes ; Bodo Kraft ; Marc Schreiber

Software Stories Guide (2017)

Ulrich Nobisrath ; Albert Zündorf ; Tobias George ; Jubeh Ruben ; Bodo Kraft

Software Stories are a simple graphical notation for requirements analysis and design in agile software projects. Software Stories are based on example scenarios. Example scenarios facilitate the communication between lay people or domain experts and software experts.

Fujaba based Tool Development for eHome Systems (2004)

Bodo Kraft ; Ulrich Nobisrath ; Priit Salumaa ; Erhard Schultchen

Algorithm and Tool for Ontology Integration Based on Graph Rewriting (2008)

Bodo Kraft ; Thomas Heer ; Daniel Retkowitz

Incremental Ontology Integration (2008)

Bodo Kraft ; Thomas Heer ; Daniel Retkowitz

STAMP 4 NLP – an agile framework for rapid quality-driven NLP applications development (2021)

Philipp Kohl ; Oliver Schmidts ; Lars Klöser ; Henri Werth ; Bodo Kraft ; Albert Zündorf

The progress in natural language processing (NLP) research over the last years offers novel business opportunities for companies, such as automated user interaction or improved data analysis. Building sophisticated NLP applications requires dealing with modern machine learning (ML) technologies, which hinders enterprises in establishing successful NLP projects. Our experience in applied NLP research projects shows that the continuous integration of research prototypes in production-like environments with quality assurance builds trust in the software and demonstrates its convenience and usefulness regarding the business goal. We introduce STAMP 4 NLP as an iterative and incremental process model for developing NLP applications. With STAMP 4 NLP, we merge software engineering principles with best practices from data science. Instantiating our process model allows the efficient creation of prototypes by utilizing templates, conventions, and implementations, enabling developers and data scientists to focus on the business goals. Due to our iterative-incremental approach, businesses can deploy an enhanced version of the prototype to their software environment after every iteration, maximizing potential business value and trust early and avoiding the cost of successful yet never deployed experiments.

ALE: a simulation-based active learning evaluation framework for the parameter-driven comparison of query strategies for NLP (2023)

Philipp Kohl ; Nils Freyer ; Yoka Krämer ; Henri Werth ; Steffen Wolf ; Bodo Kraft ; Matthias Meinecke ; Albert Zündorf

Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data points for annotators to annotate next, rather than a sequential or random sample. This method is supposed to save annotation effort while maintaining model performance. However, practitioners face many AL strategies for different tasks and need an empirical basis to choose between them. Surveys categorize AL strategies into taxonomies without performance indications. Presentations of novel AL strategies compare their performance only to a small subset of strategies. Our contribution addresses the empirical basis by introducing a reproducible active learning evaluation (ALE) framework for the comparative evaluation of AL strategies in NLP. The framework allows the implementation of AL strategies with low effort and a fair data-driven comparison through defining and tracking experiment parameters (e.g., initial dataset size, number of data points per query step, and the budget). ALE helps practitioners to make more informed decisions, and researchers can focus on developing new, effective AL strategies and deriving best practices for specific use cases. With best practices, practitioners can lower their annotation costs. We present a case study to illustrate how to use the framework.

Multi-attribute relation extraction (MARE): simplifying the application of relation extraction (2021)

Lars Klöser ; Philipp Kohl ; Bodo Kraft ; Albert Zündorf

Relation extraction in natural language understanding enables innovative and promising business concepts and facilitates new digitalized decision-making processes. Current approaches allow the extraction of relations with a fixed number of entities as attributes. Extracting relations with an arbitrary number of attributes requires complex systems and costly relation-trigger annotations to assist these systems. We introduce multi-attribute relation extraction (MARE) as an assumption-less problem formulation with two approaches, facilitating an explicit mapping from business use cases to the data annotations. Avoiding elaborate annotation constraints simplifies the application of relation extraction approaches. The evaluation compares our models to current state-of-the-art event extraction and binary relation extraction methods. Our approaches show improvement over these methods on the extraction of general multi-attribute relations.


23 search hits