MLKD logo   Machine Learning &
Knowledge Discovery Group
 
 

Research Topics

Ensemble methods

Development of algorithms for combining multiple predictive models in order to increase predictive performance compared to a single model. A specific topic we study in more detail, is ensemble pruning, the reduction of the size of an ensemble prior to combination in order to reduce complexity and increase predictive performance.

Learning from multi-label data

Development of algorithms that build accurate classification and ranking models from multi-label data and scale efficiently with respect to the number of examples and labels. More information is available here.

Text classification

We currently focus our research on textual data streams, such as e-mails and news feeds and the problem of concept drift which appears in these type of applications. In particular, we have developed an algorithm that considers the appearance of new predictive words over time. Due to its simplicity and effectiveness this algorithm is incorporated into an adaptive news reader named PersoNews. Furthermore, we have confronted the problem of recurring contexts in data streams. Finally, we have proposed a number of approaches for automated web service classification that consider textual and semantic features.

Reinforcement learning

We are concerned with the development of reinforcement learning methods for multiagent systems as well as the application of reinforcement learning in several domains like focused crawling and ensemble pruning. One particular topic of study within this framework is transfer-learning with the aim of speeding up the reinforcement learning process.

Knowledge discovery from biological data

Development of algorithms for building predictive models from biological data. Application of knowledge discovery approaches to biological data in order to gain new biological insights.
- Translation Initiation Site Prediction page.
- Polyadenylation Site Prediction page.

Distributed data mining

Development of distributed algorithms for knowledge discovery from physically distributed databases. Distributed construction of predictive models. Discovery of interesting knowledge with respect to similarities/differences of distributed databases. More information available here.

Learning for planning

Developing approaches for learning domain knowledge for selection among different planning methods based on data concerning the performance (speed and quality) of these methods. We have modelled the problem of planning method selection using various approaches and applied it for a) the automatic configuration of the planning parameters of planning systems and b) the selection among different planning systems.

Periodicity detection in temporal sequences

Developing algorithms for detecting multiple partial and approximate periodicities. Our approach is exploratory and follows a filter/refine paradigm. In the filter phase we introduce an autocorrelation-based algorithm that produces a set of candidate partial periodicities. The algorithm is extended to capture ap-proximate periodicities. In the refine phase we effectively prune invalid periodicities. We conducted a series of experiments with various real world data sets to test the performance and verify the quality of the results.

 

Valid XHTML 1.0 Transitional Valid CSS!