OPUS 4 | Search

From cracked accounts to fake IDs: user profiling on German telegram black market channels (2023)

André Büsgen ; Lars Klöser ; Philipp Kohl ; Oliver Schmidts ; Bodo Kraft ; Albert Zündorf

Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also be utilized as a platform for illegal activity. Telegram allows public groups with up to 200.000 participants. Criminals use these public groups for trading illegal commodities and services, which becomes a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of annotated text messages with entities and relations from four self-proclaimed black market chat rooms. Our pipeline approach aggregates the extracted product attributes from user messages to profiles and uses these with their sold products as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or fine-granular price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter is still superior for sequence labeling.

Fujaba based Tool Development for eHome Systems / Nobisrath, Ulrich ; Salumaa, Priit ; Schultchen, Erhard ; Kraft, Bodo (2004)

Bodo Kraft ; Ulrich Nobisrath ; Priit Salumaa ; Erhard Schultchen

Incremental Ontology Integration / Heer, Thomas ; Retkowitz, Daniel ; Kraft, Bodo (2008)

Bodo Kraft ; Thomas Heer ; Daniel Retkowitz

Metrics driven research collaboration: focusing on common project goals continuously (2017)

Marc Schreiber ; Bodo Kraft ; Albert Zündorf

Multi-attribute relation extraction (MARE): simplifying the application of relation extraction (2021)

Lars Klöser ; Philipp Kohl ; Bodo Kraft ; Albert Zündorf

Natural language understanding’s relation extraction makes innovative and encouraging novel business concepts possible and facilitates new digitilized decision-making processes. Current approaches allow the extraction of relations with a fixed number of entities as attributes. Extracting relations with an arbitrary amount of attributes requires complex systems and costly relation-trigger annotations to assist these systems. We introduce multi-attribute relation extraction (MARE) as an assumption-less problem formulation with two approaches, facilitating an explicit mapping from business use cases to the data annotations. Avoiding elaborated annotation constraints simplifies the application of relation extraction approaches. The evaluation compares our models to current state-of-the-art event extraction and binary relation extraction methods. Our approaches show improvement compared to these on the extraction of general multi-attribute relations.

Multi-pedestrian tracking by moving Bluetooth-LE beacons and stationary receivers (2017)

Oliver Schmidts ; Maik Boltes ; Bodo Kraft ; Marc Schreiber

NLP Lean Programming Framework: Developing NLP Applications More Effectively (2018)

Marc Schreiber ; Bodo Kraft ; Albert Zündorf

This paper presents NLP Lean Programming framework (NLPf), a new framework for creating custom natural language processing (NLP) models and pipelines by utilizing common software development build systems. This approach allows developers to train and integrate domain-specific NLP pipelines into their applications seamlessly. Additionally, NLPf provides an annotation tool which improves the annotation process significantly by providing a well-designed GUI and sophisticated way of using input devices. Due to NLPf’s properties developers and domain experts are able to build domain-specific NLP applications more efficiently. NLPf is Opensource software and available at https:// gitlab.com/schrieveslaach/NLPf.

Quick Pad Tagger : An Efficient Graphical User Interface for Building Annotated Corpora with Multiple Annotation Layers (2015)

Marc Schreiber ; Kai Barkschat ; Bodo Kraft ; Albert Zündorf

Schema Matching with Frequent Changes on Semi-Structured Input Files: A Machine Learning Approach on Biological Product Data (2019)

Oliver Schmidts ; Bodo Kraft ; Ines Siebigteroth ; Albert Zündorf

Software in the city: visual guidance through large scale software projects (2013)

Marc Schreiber ; Stefan Hirtbach ; Bodo Kraft ; Andreas Steinmetzler

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Keywords

Institute

23 search hits