Novel Representative Sampling for Improved Active Learning accepted at MATHMOD 2022

Congratulations! Debangsha for the paper accepted @ MATHMOD 2022

Novel Representative Sampling for Improved Active Learning

Abstract Active learning solves machine learning problems where acquiring labels for the data is costly. It allows for the learner to select training samples by asking intelligent questions. Various sampling strategies exist for choosing the training set for pool-based active learning. However, the existing representative querying approaches for active learning do not attempt to capture the underlying data distribution, which we believe is an important part of representative sampling. To that end, we propose an adaptation of the sigma point sampling technique from unscented transformation (UT) for constructing a representative subset. UT has shown to be very effective in non-linear transformation modeling in object tracking and robotics. When combined with the Gaussian mixture model, sigma points can estimate the true statistics of an unknown distribution with very few samples. Sigma point sampling being parameterized gives better control over the sampling process. We use sigma points for representative subset construction and train the learner on them. We compare our results with other sampling techniques and improve test accuracy on the handwritten digit recognition dataset MNIST.

IDSL Lab News

Highlights - 09 February 2022
The Banff International Research Station will host the *Climate Change Scenarios and Financial Risk* Online workshop at the UBC Okanagan campus in Kelowna, BC, from May 1 to May 6, 2022


Our group is recruiting year-round for postdocs, MASc and PhD students, visiting students and undergraduate students.

All admitted students will receive a stipend.

Please see our opportunities page for more information.