Multiclass nonnegative matrix factorization for comprehensive feature pattern discovery

By National Research Council of Canada

DOI	https://doi.org/10.1109/TNNLS.2018.2849932
Author	Li, Yifeng¹; Pan, Youlian¹; Liu, Ziying¹
Affiliation	¹ National Research Council of Canada. Digital Technologies
Format	Text, Article
Subject	big data; cancer; feature pattern discovery; multiclass nonnegative matrix factorization (MC-NMF); stability selection
Abstract	In this big data era, interpretable machine learning models are strongly demanded for the comprehensive analytics of large-scale multiclass data. Characterizing all features from such data is a key but challenging step to understand the complexity. However, existing feature selection methods do not meet this need. In this paper, to address this problem, we propose a Bayesian multiclass nonnegative matrix factorization (MC-NMF) model with structured sparsity that is able to discover ubiquitous and class-specific features. Variational update rules were derived for efficient decomposition. In order to relieve the need of model selection and stably describe feature patterns, we further propose MC-NMF with stability selection, an ensemble method that collectively detects feature patterns from many runs of MC-NMF using different hyperparameter values and training subsets. We assessed our models on both simulated count data and multitumor ribonucleic acid-seq data. The experiments revealed that our models were able to recover predefined feature patterns from the simulated data and identify biologically meaningful patterns from the pan-cancer data.
Publication date	2018-07-16
Publisher	IEEE
In	IEEE Transactions on Neural Networks and Learning Systems 30, no. 2: 615–629.
Language	English
Peer reviewed	Yes
Record identifier	86a509b0-de54-4406-a3eb-a381708d6877
Record created	2019-06-06
Record modified	2020-03-16
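
The abstract describes two ideas: a factorization whose factors are either ubiquitous (shared by all classes) or class-specific, and a stability-selection ensemble that aggregates feature patterns over many runs on different training subsets and hyperparameter values. The Python sketch below illustrates both in a heavily simplified form and is not the authors' method: it swaps the Bayesian model and variational updates for plain Frobenius-norm NMF with Lee-Seung multiplicative updates, and it imposes the class structure with a fixed zero mask on the coefficient matrix. All names (`masked_nmf`, `feature_pattern`, `stability_selection`), the threshold `tau`, and the choice of varying only the number of class-specific factors are assumptions made for illustration. Class labels are assumed to be nonnegative integers.

```python
import numpy as np


def masked_nmf(X, labels, n_shared, n_per_class, n_iter=200, seed=0, eps=1e-9):
    """Approximate X (features x samples, nonnegative) by W @ H.

    Shared factors may be used by every sample; each class-specific factor
    may only be used by samples of its class (enforced by zeros in H).
    """
    rng = np.random.default_rng(seed)
    classes = np.unique(labels)
    # groups[k] is -1 for a shared factor, otherwise the class owning factor k.
    groups = np.array([-1] * n_shared + [c for c in classes for _ in range(n_per_class)])
    # mask[k, j] is True when sample j is allowed to use factor k.
    mask = (groups[:, None] == -1) | (groups[:, None] == labels[None, :])
    W = rng.random((X.shape[0], groups.size)) + eps
    H = (rng.random((groups.size, X.shape[1])) + eps) * mask
    for _ in range(n_iter):
        # Multiplicative updates keep entries nonnegative and keep zeros at zero,
        # so the masked structure of H survives every iteration.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H, groups, mask


def feature_pattern(W, H, groups, mask, group_ids, tau=0.5):
    """Boolean (features x groups) matrix: is a feature active in a group?"""
    # Mean contribution of each factor to each feature over its allowed samples.
    per_factor = W * (H.sum(axis=1) / mask.sum(axis=1))
    contrib = np.stack(
        [per_factor[:, groups == g].sum(axis=1) for g in group_ids], axis=1
    )
    top = contrib.max(axis=1, keepdims=True)
    # Active when the group carries at least a fraction tau of the largest share.
    return (contrib >= tau * top) & (contrib > 0)


def stability_selection(X, labels, n_runs=20, frac=0.8, seed=0):
    """Selection frequency of each (feature, group) pair over many runs."""
    rng = np.random.default_rng(seed)
    classes = np.unique(labels)
    group_ids = np.concatenate(([-1], classes))  # -1 denotes "ubiquitous"
    counts = np.zeros((X.shape[0], group_ids.size))
    for run in range(n_runs):
        # Draw a random subset of samples within each class.
        keep = np.concatenate([
            rng.choice(
                np.flatnonzero(labels == c),
                size=max(1, int(frac * np.sum(labels == c))),
                replace=False,
            )
            for c in classes
        ])
        # Vary one hyperparameter across runs (here: factors per class, 1 or 2).
        n_per_class = int(rng.integers(1, 3))
        W, H, groups, mask = masked_nmf(X[:, keep], labels[keep], 1, n_per_class, seed=run)
        counts += feature_pattern(W, H, groups, mask, group_ids)
    return counts / n_runs, group_ids
```

Calling `stability_selection(X, labels)` on a features-by-samples count matrix returns a frequency matrix whose first column corresponds to the ubiquitous group and whose remaining columns correspond to the classes in sorted order. Because this sketch has no sparsity prior, a ubiquitous feature can also be absorbed by class-specific factors, which is one of the ambiguities the structured sparsity mentioned in the abstract is meant to address.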

Date modified: 2024-07-27