Title: Novel Tensor Factorization Algorithms and Applications

Date: Thursday, July 2nd (7/2), 2026

Time: 12:00-2:00pm ET

Location: Coda C1315 (Grant Park)

Zoom: https://gatech.zoom.us/j/95337401341?pwd=UA3HGBasOpWtv4JIanQFoBKtyovP9y.1

 

Benjamin Cobb

Computer Science Ph.D. Candidate

School of Computational Science and Engineering

Georgia Institute of Technology

https://www.ben-cobb.com/

 

Committee:

Dr. Richard W. Vuduc (co-advisor), CSE, Georgia Institute of Technology

Dr. Haesun Park (co-advisor), CSE, Georgia Institute of Technology

Dr. Edmond Chow, CSE, Georgia Institute of Technology

Dr. Grey M. Ballard, CS, Wake Forest University

Dr. Ramakrishnan Kannan, Discrete Algorithms, Oak Ridge National Laboratory

 

Abstract:

As our capacity to collect and generate data continues to outpace our capacity to store and analyze said data, low-rank tensor factorizations offer a solution to alleviate this issue for tensorized data. The process of computing low-rank tensor factorizations is computationally expensive and requires efficiently leveraging computational resources to enable feasibility. This thesis proposes several novel algorithms for low-rank tensor factorizations to enable interpretable analysis and compression of massive multiway datasets. To this end, we focus on three overarching themes: efficiently leveraging computational resources to compute large-scale tensor decompositions, efficiently enforcing nonnegativity constraints to improve interpretability, and incorporating additional information into the factorization to improve solution quality.

 

We start by proposing the Fused In-place Sequentially Truncated Higher Order Singular Value Decomposition (FIST-HOSVD) algorithm as the first in-place method for computing the dense Tucker decomposition, increasing the problem size that can be factorized by up to 3x. We demonstrate the effectiveness of the proposed FIST-HOSVD algorithm by decreasing the auxiliary memory consumption by over 135x when computing the dense Tucker decomposition on two combustion simulation compression applications. We then provide an in-depth study of several state-of-the-art methods for Nonnegative Matrix Factorization (NMF). As part of this, we provide a comprehensive survey of Nonnegative Least Squares (NNLS) solvers used to enforce the nonnegativity of the factors. In doing so, we propose a Fast Active-Set Thresholding NNLS (FAST-NNLS) solver which outperforms existing NNLS methods for broad classes of problems. We then introduce a GPU-accelerated Hierarchical NMF K-Means initialization method for large-scale protein clustering on commodity-grade hardware. We then propose the Low Rank Approximations with Constraints at Exascale (LORACX) framework for computing large-scale distributed NMF. We demonstrate that LORACX yields unprecedented performance and scalability by achieving 0.67 exaflops in double-precision on 8,192 nodes of the Frontier supercomputer when computing NMF of a 2.1 petabyte matrix. We then extend the NMF objective function to incorporate additional multiway data, culminating in a Joint Nonnegative Coupled Matrix-Tensor Factorization (Joint-NCMTF) framework. We demonstrate that the proposed Joint-NCMTF method yields improved clustering quality and additional dimensions of insight relative to traditional matrix-based methods.