Loog Tax

Machine Learning for Biomedical Image Analysis

Wednesday, April 14, 08:30 - 12:15

Presented by

Presented by Marco Loog (Delft University of Technology, The Netherlands) and David Tax (Delft University of Technology, The Netherlands)

Description

Machine learning and pattern recognition techniques utilize exemplary data rather than models to solve certain decision tasks. They enable their users to exploit various sources of data (images, annotations, general patient data, etc.) even if prior knowledge on the nature of these data and their interrelationship is limited or weak. More specifically, supervised machine learning and pattern recognition methods aim to "learn" a potentially complex input / output relation of interest from given example input / output pairs.

Machine learning plays a role of increasing importance in many biomedical image analysis and processing challenges. This tutorial offers a brief, and hopefully nonstandard, introduction to the basics of machine learning together with an overview of the key components and main considerations that play a role in good machine learning practice. The focus in this is on classification problems. While the basics are the topic of the first block, the second takes these as a prerequisite and presents various extensions to this standard, and in a sense limited, view of machine learning. It provides an overview of several techniques that have received increased attention in the past years.

Various concepts are illustrated by means of research examples and biomedical applications.

Part I: Established and Standard Techniques

A condensed overview of the standard classification pipeline is given. Various classifiers are touched upon (think linear discriminants, support vector machines, boosting, classifier combinations, ensemble methods, nearest neighbor methods), and representation and evaluation issues are briefly considered.

The machine learning trinity
- Representation
- Generalization
- Evaluation
Bayes risk and density-based classifiers
- LDA
- QDA
Distance based classifiers
- kNN
- NMC
Similarity and dissimilarity approaches
- SVM
- kernels
- dissimilarities
- prototypes
- structural representations
Issues of complexity
- bias / variance
- learning curves
The next hot thing?

Part II: Emerging and Less Standard Approaches

Deviations from the standard setting (from Part I) are considered and more realistic and, potentially, more powerful methods are discussed. We make a case for research into these "emerging" and less standard directions. Topics are multiple instance learning, semi-supervised learning, active learning, and learning under non-standard cost.

One-class classification
Multiple instance learning
Semi-supervised learning
Active learning
Contextual classification
Optimization of other risks

A Rather Biased List of References

Arzhaeva, Tax, van Ginneken, “Dissimilarity-based classification in the absence of local ground truth: application to the diagnostic interpretation of chest radiographs”, Patt. Recog., 2009
Arzhaeva, Tax, van Ginneken, “Improving computer-aided diagnosis of interstitial disease in chest radiographs by combining one-class and two-class classifiers”, SPIE, 2006
Bishop, “Pattern Recognition and Machine Learning”, Springer, 2006
Chapelle, Schölkopf, Zien, “Semi-Supervised Learning”, MIT Press, 2006
Collier, “Priestess of Delphi”, 1891
Cortes, Vapnik, “Support vector networks”, Mach. Learn., 1995
Duda, Hart, Stork, “Pattern Classification”, John Willey & Sons, 2000
Freund, “Boosting a weak learning algorithm by majority”, Information and computation, 1995
van Ginneken, et al. “Comparing and combining algorithms for computer-aided detection of pulmonary nodules in computed tomography scans: the ANODE09 study”, Tech. Rep., 2010
van Ginneken, Stegmann, Loog, “Segmentation of anatomical structures in chest radiographs using supervised methods: a comparative study on a public database”, MedIA, 2006
van der Heijden, Duin, et al., “Classification, parameter estimation, and state estimation”, Wiley, 2004
Juszczak, Duin, “Selective sampling based on the variation in label assignments”, ICPR, 2004
Klein, Loog, et al., “Early diagnosis of dementia based on intersubject whole-brain dissimilarities”, ISBI, 2010
Li, “Markov Random Field Modeling in Image Analysis”, Springer, 2009
Loog, van Ginneken, “Supervised segmentation by iterated contextual pixel classification”, ICPR, 2002
Loog, van Ginneken, “Static posterior probability fusion for signal detection”, ICPR, 2004
Loog, van Ginneken, Duin, “Dimensionality reduction of image features using the canonical contextual correlation projection”, Patt. Recog., 2005
Loog, van Ginneken, “Segmentation of the posterior ribs in chest radiographs using iterated contextual pixel classification”, TMI, 2006
Loog, de Bruijne, “Discriminative shape alignment”, IPMI, 2009
Maron, Lozano-Pérez, “A framework for multiple-instance learning”, NIPS, 1998
Niemeijer, et al., “Comparative study of retinal vessel segmentation methods on a new publicly available database”, SPIE, 2004
Pekalska, Duin, “The Dissimilarity Representation for Pattern Recognition”, World Scientific, 2005
Raundahl, Loog, et al. “Automated effect-specific breast density measures”, TMI, 2008
Sánchez, Niemeijer, et al., “Active learning approach for detection of hard exudates, cotton wool spots and drusen in retinal images”, SPIE, 2009
Sánchez, Niemeijer, et al., “Improving hard exudate detection in retinal images through a combination of local and contextual information” , ISBI, 2010
Schölkopf, Smola, “Learning with kernels”, MIT Press, 2002
Settles “Active Learning Literature Survey”, Tech. Rep. 1648, 2009
Sindhwani et al., “Beyond the point cloud: from transductive to semi-supervised learning”, ICML, 2005
Smal, Loog, et al., “Quantitative comparison of spot detection methods in fluorescence microscopy”, TMI, 2010
Sørensen, Shaker, de Bruijne, “Quantitative analysis of pulmonary emphysema using local binary patterns”, TMI, 2010
Tax, Duin, “Support vector data description”, Mach. Learn., 2004
Tax, Duin, “Learning curves for the analysis of multiple instance classifiers”, S+SSPR, 2008
Tax, Loog, Duin, “Optimal mean-precision classifier”, MCS, 2009
Tax, Duin, Arzhaeva, “Linear model combining by optimizing the Area under the ROC curve”, ICPR, 2006

Speaker Biographies

David M.J. Tax received his Ph.D. in 2001 at the Delft University of Technology in the Pattern Recognition Group, under the supervision of R.P.W. Duin and based on his thesis 'One-Class Classification'. Currently, he is an assistent professor in the Pattern Recognition Laboratory at Delft University of Technology. His main research interests are the learning and development of detection algorithms, the representation of data, simple and elegant classifiers and the fair evaluation of such methods.

Marco Loog received a Ph.D. degree from the Image Sciences Institute, NL, for the development and improvement of statistical pattern recognition methods and their use in the processing and analysis of images. After this joyful event, he moved to Copenhagen, DK, were he acted as assistant and, eventually, associate professor. After several splendid years, Marco moved to Delft University of Technology where he now works as an assistant professor in the Pattern Recognition Laboratory. Currently, his ever-evolving research interests include multiscale image analysis, semi-supervised and multi-instance learning, saliency, the dissimilarity approach, and black math.