Electronic Theses and Dissertations

A submodular optimization framework for never-ending learning : semi-supervised, online, and active learning.

Wael Emara, University of Louisville

Date on Master's Thesis/Doctoral Dissertation

12-2012

Document Type

Doctoral Dissertation

Degree Name

Ph. D.

Department

Computer Engineering and Computer Science

Committee Chair

Kantardzic, Mehmed

Author's Keywords

Machine learning; online learning; semi-supervised learning; Never-Ending Learning; submodular optimization; Support Vector Machines

Subject

Machine learning; Data mining

Abstract

The revolution in information technology and the explosion in the use of computing devices in people's everyday activities has forever changed the perspective of the data mining and machine learning fields. The enormous amounts of easily accessible, information rich data is pushing the data analysis community in general towards a shift of paradigm. In the new paradigm, data comes in the form a stream of billions of records received everyday. The dynamic nature of the data and its sheer size makes it impossible to use the traditional notion of offline learning where the whole data is accessible at any time point. Moreover, no amount of human resources is enough to get expert feedback on the data. In this work we have developed a unified optimization based learning framework that approaches many of the challenges mentioned earlier. Specifically, we developed a Never-Ending Learning framework which combines incremental/online, semi-supervised, and active learning under a unified optimization framework. The established framework is based on the class of submodular optimization methods. At the core of this work we provide a novel formulation of the Semi-Supervised Support Vector Machines (S3VM) in terms of submodular set functions. The new formulation overcomes the non-convexity issues of the S3VM and provides a state of the art solution that is orders of magnitude faster than the cutting edge algorithms in the literature. Next, we provide a stream summarization technique via exemplar selection. This technique makes it possible to keep a fixed size exemplar representation of a data stream that can be used by any label propagation based semi-supervised learning technique. The compact data steam representation allows a wide range of algorithms to be extended to incremental/online learning scenario. Under the same optimization framework, we provide an active learning algorithm that constitute the feedback between the learning machine and an oracle. Finally, the developed Never-Ending Learning framework is essentially transductive in nature. Therefore, our last contribution is an inductive incremental learning technique for incremental training of SVM using the properties of local kernels. We demonstrated through this work the importance and wide applicability of the proposed methodologies.

Recommended Citation

Emara, Wael, "A submodular optimization framework for never-ending learning : semi-supervised, online, and active learning." (2012). Electronic Theses and Dissertations. Paper 404.
https://doi.org/10.18297/etd/404

Download

COinS

ThinkIR: The University of Louisville's Institutional Repository

Electronic Theses and Dissertations

A submodular optimization framework for never-ending learning : semi-supervised, online, and active learning.

Date on Master's Thesis/Doctoral Dissertation

Document Type

Degree Name

Department

Committee Chair

Author's Keywords

Subject

Abstract

Recommended Citation

Search

Browse

Author Corner

Related Links

Contact:

ThinkIR: The University of Louisville's Institutional Repository

Electronic Theses and Dissertations

A submodular optimization framework for never-ending learning : semi-supervised, online, and active learning.

Author

Date on Master's Thesis/Doctoral Dissertation

Document Type

Degree Name

Department

Committee Chair

Author's Keywords

Subject

Abstract

Recommended Citation

Share

Search

Browse

Author Corner

Related Links

Contact: