Conference Proceeding
Refine
Year of publication
Document Type
- Conference Proceeding (1016) (remove)
Language
- English (1016) (remove)
Has Fulltext
- no (1016) (remove)
Keywords
- Enterprise Architecture (5)
- Energy storage (4)
- Gamification (4)
- Natural language processing (4)
- Power plants (4)
- hydrogen (4)
- solar sail (4)
- Associated liquids (3)
- Concentrated solar power (3)
- Hybrid energy system (3)
- MASCOT (3)
- Out-of-plane load (3)
- earthquakes (3)
- Additive manufacturing (2)
- Adjacent buildings (2)
- Case Study (2)
- Clustering (2)
- Deep learning (2)
- Digital Twin (2)
- Diversity (2)
Institute
- Fachbereich Elektrotechnik und Informationstechnik (224)
- Fachbereich Luft- und Raumfahrttechnik (171)
- Fachbereich Energietechnik (158)
- Fachbereich Medizintechnik und Technomathematik (131)
- IfB - Institut für Bioengineering (109)
- Solar-Institut Jülich (108)
- Fachbereich Maschinenbau und Mechatronik (98)
- Fachbereich Bauingenieurwesen (70)
- ECSM European Center for Sustainable Mobility (50)
- Fachbereich Wirtschaftswissenschaften (42)
- MASKOR Institut für Mobile Autonome Systeme und Kognitive Robotik (42)
- INB - Institut für Nano- und Biotechnologien (33)
- Fachbereich Chemie und Biotechnologie (23)
- Kommission für Forschung und Entwicklung (16)
- Nowum-Energy (11)
- Fachbereich Architektur (7)
- Fachbereich Gestaltung (3)
- Institut fuer Angewandte Polymerchemie (2)
- ZHQ - Bereich Hochschuldidaktik und Evaluation (2)
- Arbeitsstelle fuer Hochschuldidaktik und Studienberatung (1)
Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual, and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data points to annotators they annotate next instead of a subsequent or random sample. This method is supposed to save annotation effort while maintaining model performance.
However, practitioners face many AL strategies for different tasks and need an empirical basis to choose between them. Surveys categorize AL strategies into taxonomies without performance indications. Presentations of novel AL strategies compare the performance to a small subset of strategies. Our contribution addresses the empirical basis by introducing a reproducible active learning evaluation (ALE) framework for the comparative evaluation of AL strategies in NLP.
The framework allows the implementation of AL strategies with low effort and a fair data-driven comparison through defining and tracking experiment parameters (e.g., initial dataset size, number of data points per query step, and the budget). ALE helps practitioners to make more informed decisions, and researchers can focus on developing new, effective AL strategies and deriving best practices for specific use cases. With best practices, practitioners can lower their annotation costs. We present a case study to illustrate how to use the framework.
The progress in natural language processing (NLP) research over the last years, offers novel business opportunities for companies, as automated user interaction or improved data analysis. Building sophisticated NLP applications requires dealing with modern machine learning (ML) technologies, which impedes enterprises from establishing successful NLP projects. Our experience in applied NLP research projects shows that the continuous integration of research prototypes in production-like environments with quality assurance builds trust in the software and shows convenience and usefulness regarding the business goal. We introduce STAMP 4 NLP as an iterative and incremental process model for developing NLP applications. With STAMP 4 NLP, we merge software engineering principles with best practices from data science. Instantiating our process model allows efficiently creating prototypes by utilizing templates, conventions, and implementations, enabling developers and data scientists to focus on the business goals. Due to our iterative-incremental approach, businesses can deploy an enhanced version of the prototype to their software environment after every iteration, maximizing potential business value and trust early and avoiding the cost of successful yet never deployed experiments.
Scientific questions
- How can a non-stationary heat offering in the commercial vehicle be used to reduce fuel consumption?
- Which potentials offer route and environmental information among with predicted speed and load trajectories to increase the efficiency of a ORC-System?
Methods
- Desktop bound holistic simulation model for a heavy duty truck incl. an ORC System
- Prediction of massflows, temperatures and mixture quality (AFR) of exhaust gas