TY  - CHAP
A1  - Schreiber, Marc
A1  - Barkschat, Kai
A1  - Kraft, Bodo
T1  - Using Continuous Integration to organize and monitor the annotation process of domain specific corpora
T2  - 5th International Conference on Information and Communication Systems (ICICS) : 1-3 April 2014, Irbid, Jordanien
Y1  - 2014
SN  - 978-1-4799-3022-7
U6  - http://dx.doi.org/10.1109/IACS.2014.6841958
SP  - 1
EP  - 6
ER  - 
TY  - CHAP
A1  - Kohl, Philipp
A1  - Schmidts, Oliver
A1  - Klöser, Lars
A1  - Werth, Henri
A1  - Kraft, Bodo
A1  - Zündorf, Albert
T1  - STAMP 4 NLP – an agile framework for rapid quality-driven NLP applications development
T2  - Quality of Information and Communications Technology. QUATIC 2021
N2  - The progress in natural language processing (NLP) research over the last years, offers novel business opportunities for companies, as automated user interaction or improved data analysis. Building sophisticated NLP applications requires dealing with modern machine learning (ML) technologies, which impedes enterprises from establishing successful NLP projects. Our experience in applied NLP research projects shows that the continuous integration of research prototypes in production-like environments with quality assurance builds trust in the software and shows convenience and usefulness regarding the business goal. We introduce STAMP 4 NLP as an iterative and incremental process model for developing NLP applications. With STAMP 4 NLP, we merge software engineering principles with best practices from data science. Instantiating our process model allows efficiently creating prototypes by utilizing templates, conventions, and implementations, enabling developers and data scientists to focus on the business goals. Due to our iterative-incremental approach, businesses can deploy an enhanced version of the prototype to their software environment after every iteration, maximizing potential business value and trust early and avoiding the cost of successful yet never deployed experiments.
KW  - Machine learning
KW  - Process model
KW  - Natural language processing
Y1  - 2021
SN  - 978-3-030-85346-4
SN  - 978-3-030-85347-1
U6  - http://dx.doi.org/10.1007/978-3-030-85347-1_12
N1  - International Conference on the Quality of Information and Communications Technology, QUATIC 2021, 8-11 September, Algarve, Portugal
SP  - 156
EP  - 166
PB  - Springer
CY  - Cham
ER  - 
TY  - CHAP
A1  - Schreiber, Marc
A1  - Hirtbach, Stefan
A1  - Kraft, Bodo
A1  - Steinmetzler, Andreas
ED  - Kowalewski, Stefan
T1  - Software in the city: visual guidance through large scale software projects
T2  - Software Engineering 2013 : Fachtagung des GI-Fachbereichs Softwaretechnik, 26. Februar-1. März 2013 in Aachen. (GI-Edition ; 213)
Y1  - 2013
SN  - 978-3-88579-607-7 ; 978-3-88579-609-1
SP  - 213
EP  - 224
PB  - Ges. für Informatik
CY  - Bonn
ER  - 
TY  - CHAP
A1  - Schmidts, Oliver
A1  - Kraft, Bodo
A1  - Siebigteroth, Ines
A1  - Zündorf, Albert
T1  - Schema Matching with Frequent Changes on Semi-Structured Input Files: A Machine Learning Approach on Biological Product Data
T2  - Proceedings of the 21st International Conference on Enterprise Information Systems - Volume 1: ICEIS
Y1  - 2019
SN  - 978-989-758-372-8
U6  - http://dx.doi.org/10.5220/0007723602080215
SP  - 208
EP  - 215
ER  - 
TY  - CHAP
A1  - Schreiber, Marc
A1  - Kraft, Bodo
A1  - Zündorf, Albert
T1  - NLP Lean Programming Framework: Developing NLP Applications More Effectively
T2  - Proceedings of NAACL-HLT 2018: Demonstrations, New Orleans, Louisiana, June 2 - 4, 2018
N2  - This paper presents NLP Lean Programming
framework (NLPf), a new framework
for creating custom natural language processing
(NLP) models and pipelines by utilizing
common software development build systems.
This approach allows developers to train and
integrate domain-specific NLP pipelines into
their applications seamlessly. Additionally,
NLPf provides an annotation tool which improves
the annotation process significantly by
providing a well-designed GUI and sophisticated
way of using input devices. Due to
NLPf’s properties developers and domain experts
are able to build domain-specific NLP
applications more efficiently. NLPf is Opensource
software and available at https://
gitlab.com/schrieveslaach/NLPf.
Y1  - 2018
U6  - http://dx.doi.org/10.18653/v1/N18-5001 
ER  - 
TY  - CHAP
A1  - Schmidts, Oliver
A1  - Boltes, Maik
A1  - Kraft, Bodo
A1  - Schreiber, Marc
T1  - Multi-pedestrian tracking by moving Bluetooth-LE beacons and stationary receivers
T2  - 2017 International Conference on Indoor Positioning and Indoor Navigation (IPIN), 18-21 September 2017, Sapporo, Japan
Y1  - 2017
N1  - International Conference on Indoor Positioning and Indoor Navigation <8, 2017, Sapporo, Japan>
SP  - 1
EP  - 4
ER  - 
TY  - CHAP
A1  - Klöser, Lars
A1  - Kohl, Philipp
A1  - Kraft, Bodo
A1  - Zündorf, Albert
T1  - Multi-attribute relation extraction (MARE): simplifying the application of relation extraction
T2  - Proceedings of the 2nd International Conference on Deep Learning Theory and Applications - DeLTA
N2  - Natural language understanding’s relation extraction makes innovative and encouraging novel business concepts possible and facilitates new digitilized decision-making processes. Current approaches allow the extraction of relations with a fixed number of entities as attributes. Extracting relations with an arbitrary amount of attributes requires complex systems and costly relation-trigger annotations to assist these systems. We introduce multi-attribute relation extraction (MARE) as an assumption-less problem formulation with two approaches, facilitating an explicit mapping from business use cases to the data annotations. Avoiding elaborated annotation constraints simplifies the application of relation extraction approaches. The evaluation compares our models to current state-of-the-art event extraction and binary relation extraction methods. Our approaches show improvement compared to these on the extraction of general multi-attribute relations.
Y1  - 2021
SN  - 978-989-758-526-5
U6  - http://dx.doi.org/10.5220/0010559201480156
N1  - Proceedings of the 2nd International Conference on Deep Learning Theory and Applications, DeLTA2021, July 7-9, 2021
SP  - 148
EP  - 156
ER  - 
TY  - CHAP
A1  - Schreiber, Marc
A1  - Kraft, Bodo
A1  - Zündorf, Albert
ED  - Bilof, Randall
T1  - Metrics driven research collaboration: focusing on common project goals continuously
T2  - Proceedings : 2017 IEEE/ACM 4th International Workshop on Software Engineering Research and Industrial Practice : SER&IP 2017 : 21 May 2017 Buenos Aires, Argentina
Y1  - 2017
SN  - 978-1-5386-2797-6
U6  - http://dx.doi.org/10.1109/SER-IP.2017..6
SP  - 41
EP  - 47
PB  - IEEE Press
CY  - Piscataway, NJ
ER  - 
TY  - CHAP
A1  - Büsgen, André
A1  - Klöser, Lars
A1  - Kohl, Philipp
A1  - Schmidts, Oliver
A1  - Kraft, Bodo
A1  - Zündorf, Albert
ED  - Cuzzocrea, Alfredo
ED  - Gusikhin, Oleg
ED  - Hammoudi, Slimane
ED  - Quix, Christoph
T1  - From cracked accounts to fake IDs: user profiling on German telegram black market channels
T2  - Data Management Technologies and Applications
N2  - Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also be utilized as a platform for illegal activity. Telegram allows public groups with up to 200.000 participants. Criminals use these public groups for trading illegal commodities and services, which becomes a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of annotated text messages with entities and relations from four self-proclaimed black market chat rooms. Our pipeline approach aggregates the extracted product attributes from user messages to profiles and uses these with their sold products as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or fine-granular price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter is still superior for sequence labeling.
KW  - Clustering
KW  - Natural language processing
KW  - Information extraction
KW  - Profile extraction
KW  - Text mining
Y1  - 2023
SN  - 978-3-031-37889-8 (Print)
SN  - 978-3-031-37890-4 (Online)
U6  - http://dx.doi.org/10.1007/978-3-031-37890-4_9
N1  - 10th International Conference, DATA 2021, Virtual Event, July 6–8, 2021, and 11th International Conference, DATA 2022, Lisbon, Portugal, July 11-13, 2022
SP  - 176
EP  - 202
PB  - Springer
CY  - Cham
ER  - 
TY  - CHAP
A1  - Büsgen, André
A1  - Klöser, Lars
A1  - Kohl, Philipp
A1  - Schmidts, Oliver
A1  - Kraft, Bodo
A1  - Zündorf, Albert
T1  - Exploratory analysis of chat-based black market profiles with natural language processing
T2  - Proceedings of the 11th International Conference on Data Science, Technology and Applications
N2  - Messenger apps like WhatsApp or Telegram are an integral part of daily communication. Besides the various positive effects, those services extend the operating range of criminals. Open trading groups with many thousand participants emerged on Telegram. Law enforcement agencies monitor suspicious users in such chat rooms. This research shows that text analysis, based on natural language processing, facilitates this through a meaningful domain overview and detailed investigations. We crawled a corpus from such self-proclaimed black markets and annotated five attribute types products, money, payment methods, user names, and locations. Based on each message a user sends, we extract and group these attributes to build profiles. Then, we build features to cluster the profiles. Pretrained word vectors yield better unsupervised clustering results than current
state-of-the-art transformer models. The result is a semantically meaningful high-level overview of the user landscape of black market chatrooms. Additionally, the extracted structured information serves as a foundation for further data exploration, for example, the most active users or preferred payment methods.
KW  - Clustering
KW  - Natural Language Processing
KW  - Information Extraction
KW  - Profile Extraction
KW  - Text Mining
Y1  - 2022
SN  - 978-989-758-583-8
U6  - http://dx.doi.org/10.5220/0011271400003269
SN  - 2184-285X
SP  - 83
EP  - 94
ER  -