Parametric t-distributed stochastic exemplar-centered embedding

From National Research Council Canada

Download	View final version: Parametric t-distributed stochastic exemplar-centered embedding (PDF, 6.6 MiB)
Author	Min, Martin Renqiang; Guo, Hongyu¹; Shen, Dinghan
Affiliation	National Research Council Canada. Digital Technologies
Format	Text, Article
Physical description	version 2
Abstract	Parametric embedding methods such as parametric t-SNE (pt-SNE) have been widely adopted for data visualization and out-of-sample data embedding without further computationally expensive optimization or approximation. However, the performance of pt-SNE is highly sensitive to the hyper-parameter batch size due to conflicting optimization goals, and often produces dramatically different embeddings with different choices of user-defined perplexities. To effectively solve these issues, we present parametric t-distributed stochastic exemplar-centered embedding methods. Our strategy learns embedding parameters by comparing given data only with precomputed exemplars, resulting in a cost function with linear computational and memory complexity, which is further reduced by noise contrastive samples. Moreover, we propose a shallow embedding network with high-order feature interactions for data visualization, which is much easier to tune but produces comparable performance in contrast to a deep neural network employed by pt-SNE. We empirically demonstrate, using several benchmark datasets, that our proposed methods significantly outperform pt-SNE in terms of robustness, visual effects, and quantitative evaluations.
Publication date	2017-11-01
Publisher	Cornell University Library
In	Computer Science, arXiv:1710.05128.
Language	English
Peer reviewed	No
NPARC number	23002468
Record identifier	cd03cba4-a957-41fa-9c40-15d5466274ea
Record created	2017-11-15
Record modified	2020-05-30

Date modified: 2024-11-08