
From cracked accounts to fake IDs: user profiling on German Telegram black market channels

Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also serve as platforms for illegal activity. Telegram allows public groups with up to 200,000 participants. Criminals use these public groups to trade illegal commodities and services, which is a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of text messages from four self-proclaimed black market chat rooms, annotated with entities and relations. Our pipeline approach aggregates the product attributes extracted from user messages into profiles and uses these profiles, together with the products sold, as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or performing fine-grained price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter are still superior for sequence labeling.
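The aggregation and clustering step described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the records in `EXTRACTED`, the function names, and the similarity threshold are hypothetical and do not come from the paper. Product-count vectors with cosine similarity stand in for the pretrained word vectors mentioned above, and a greedy single-pass grouping stands in for whatever clustering algorithm the authors used.

```python
from collections import Counter, defaultdict
from math import sqrt

# Hypothetical output of an entity/relation extraction step:
# one (user, product, price) triple per extracted offer. Made-up data.
EXTRACTED = [
    ("vendor_a", "netflix account", 5.0),
    ("vendor_a", "spotify account", 3.0),
    ("vendor_b", "netflix account", 4.0),
    ("vendor_b", "spotify account", 3.5),
    ("vendor_c", "fake id", 150.0),
    ("vendor_d", "fake id", 120.0),
    ("vendor_d", "driver license", 200.0),
]


def build_profiles(records):
    """Aggregate per-message extractions into one product-count profile per user."""
    profiles = defaultdict(Counter)
    for user, product, _price in records:
        profiles[user][product] += 1
    return dict(profiles)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_profiles(profiles, threshold=0.5):
    """Greedy single-pass clustering (a stand-in, not the paper's method).

    Each user joins the first cluster whose seed profile (its first member)
    is at least `threshold` similar; otherwise the user starts a new cluster.
    """
    clusters = []
    for user in sorted(profiles):
        for members in clusters:
            if cosine(profiles[user], profiles[members[0]]) >= threshold:
                members.append(user)
                break
        else:
            clusters.append([user])
    return clusters


if __name__ == "__main__":
    for members in cluster_profiles(build_profiles(EXTRACTED)):
        print(members)
```

With this toy data, the two account sellers end up in one group and the two document sellers in another. A real pipeline would replace the count vectors with embeddings of the extracted product mentions and would likely use an established clustering implementation.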

Metadata
Authors: André Büsgen, Lars Klöser, Philipp Kohl, Oliver Schmidts, Bodo Kraft, Albert Zündorf
DOI: https://doi.org/10.1007/978-3-031-37890-4_9
ISBN: 978-3-031-37889-8 (Print)
ISBN: 978-3-031-37890-4 (Online)
Title of parent work (English): Data Management Technologies and Applications
Publisher: Springer
Place of publication: Cham
Editors: Alfredo Cuzzocrea, Oleg Gusikhin, Slimane Hammoudi, Christoph Quix
Document type: Conference publication
Language: English
Year of publication: 2023
Date of publication (server): 27 July 2023
Free keywords / tags: Clustering; Information extraction; Natural language processing; Profile extraction; Text mining
First page: 176
Last page: 202
Note:
10th International Conference, DATA 2021, Virtual Event, July 6–8, 2021, and 11th International Conference, DATA 2022, Lisbon, Portugal, July 11–13, 2022
Link: https://doi.org/10.1007/978-3-031-37890-4_9
Access type: campus
Departments and institutions: FH Aachen / Fachbereich Medizintechnik und Technomathematik
Collections: Publisher / Springer