Messenger apps like WhatsApp and Telegram are frequently used for everyday communication, but they can also serve as platforms for illegal activity. Telegram allows public groups with up to 200,000 participants. Criminals use these public groups to trade illegal commodities and services, which is a concern for law enforcement agencies, who manually monitor suspicious activity in these chat rooms. This research demonstrates how natural language processing (NLP) can assist in analyzing these chat rooms, providing an explorative overview of the domain and facilitating purposeful analyses of user behavior. We provide a publicly available corpus of text messages, annotated with entities and relations, from four self-proclaimed black market chat rooms. Our pipeline approach aggregates the product attributes extracted from user messages into user profiles and uses these profiles, together with the products each user sells, as features for clustering. The extracted structured information is the foundation for further data exploration, such as identifying the top vendors or conducting fine-grained price analyses. Our evaluation shows that pretrained word vectors perform better for unsupervised clustering than state-of-the-art transformer models, while the latter are still superior for sequence labeling.
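The aggregation step described above can be illustrated with a minimal sketch. It assumes the extraction stage has already produced (user, product, price) triples. The records, the function names (`build_profiles`, `top_vendors`, `median_prices`), and the profile layout are hypothetical and chosen for illustration only. They are not the paper's implementation, and the clustering stage is omitted.

```python
from collections import Counter, defaultdict
from statistics import median

# Hypothetical extraction output: (user, product, price) triples.
# These records are invented for illustration and are not from the corpus.
extracted = [
    ("vendor_a", "item_x", 10.0),
    ("vendor_a", "item_y", 25.0),
    ("vendor_b", "item_x", 12.0),
    ("vendor_a", "item_x", 14.0),
    ("vendor_c", "item_z", 40.0),
]


def build_profiles(records):
    """Aggregate per-message extractions into one profile per user."""
    profiles = defaultdict(
        lambda: {"products": Counter(), "prices": defaultdict(list)}
    )
    for user, product, price in records:
        profile = profiles[user]
        profile["products"][product] += 1
        profile["prices"][product].append(price)
    return dict(profiles)


def top_vendors(profiles, n=3):
    """Rank users by their total number of extracted offers."""
    counts = Counter(
        {user: sum(p["products"].values()) for user, p in profiles.items()}
    )
    return counts.most_common(n)


def median_prices(records):
    """Compute the median offered price per product across all users."""
    by_product = defaultdict(list)
    for _, product, price in records:
        by_product[product].append(price)
    return {product: median(prices) for product, prices in by_product.items()}


profiles = build_profiles(extracted)
print(top_vendors(profiles, n=1))
print(median_prices(extracted))
```

In a fuller pipeline, each profile could be turned into a feature vector (for example, by averaging word vectors of the product names) before clustering. That step is left out here because the source does not specify its details.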