Algorithm 915, SuiteSparseQR (paper)

2011 · ACM Transactions on Mathematical Software · Cited by 264

Details

Published in (journal/conference)
ACM Transactions on Mathematical Software
Publication date
2011-11-01
Publication year
2011

Keywords

Parallel Computing and Optimization Techniques; VLSI and FPGA Design Techniques; Embedded Systems Design Techniques

Abstract

SuiteSparseQR is a sparse QR factorization package based on the multifrontal method. Within each frontal matrix, LAPACK and the multithreaded BLAS enable the method to obtain high performance on multicore architectures. Parallelism across different frontal matrices is handled with Intel's Threading Building Blocks library. The symbolic analysis and ordering phase pre-eliminates singletons by permuting the input matrix $A$ into the form $[R_{11}\ R_{12};\ 0\ A_{22}]$, where $R_{11}$ is upper triangular with diagonal entries above a given tolerance. Next, the fill-reducing ordering, column elimination tree, and frontal matrix structures are found without requiring the formation of the pattern of $A^T A$. Approximate rank-detection is performed within each frontal matrix using Heath's method. While Heath's method is not always exact, it has the advantage of not requiring column pivoting and thus does not interfere with the fill-reducing ordering. For sufficiently large problems, the resulting sparse QR factorization obtains a substantial fraction of the theoretical peak performance of a multicore computer.
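
The singleton pre-elimination step mentioned in the abstract can be illustrated with a small sketch. The code below is not SuiteSparseQR's implementation: it is a simplified, dense, pure-Python illustration, and the names `find_column_singletons` and `permute` are hypothetical. It assumes a column singleton is a column with exactly one nonzero among the rows not yet eliminated, with magnitude above `tol`. It repeatedly moves such a column and its row to the front, which yields the block form with an upper triangular leading block.

```python
def find_column_singletons(A, tol=0.0):
    """Return (row_perm, col_perm, k) for a dense matrix A (list of rows).

    After permutation, the leading k-by-k block is upper triangular with
    diagonal magnitudes above tol, and the block below it is zero.
    Illustrative sketch only; a real sparse code would use column counts
    and a work queue instead of rescanning dense columns.
    """
    m = len(A)
    n = len(A[0]) if m else 0
    live_rows = set(range(m))
    live_cols = set(range(n))
    row_order, col_order = [], []
    progress = True
    while progress:
        progress = False
        for j in sorted(live_cols):
            # Nonzeros of column j restricted to rows not yet eliminated.
            rows = [i for i in live_rows if A[i][j] != 0]
            if len(rows) == 1 and abs(A[rows[0]][j]) > tol:
                i = rows[0]
                row_order.append(i)
                col_order.append(j)
                live_rows.remove(i)
                live_cols.remove(j)
                progress = True  # removing row i may expose new singletons
    k = len(col_order)
    row_perm = row_order + sorted(live_rows)
    col_perm = col_order + sorted(live_cols)
    return row_perm, col_perm, k


def permute(A, row_perm, col_perm):
    """Apply the row and column permutations to a dense matrix."""
    return [[A[i][j] for j in col_perm] for i in row_perm]


A = [
    [4.0, 0.0, 0.0],
    [1.0, 3.0, 0.0],
    [5.0, 0.0, 0.0],
    [0.0, 1.0, 2.0],
]
row_perm, col_perm, k = find_column_singletons(A, tol=1e-12)
P = permute(A, row_perm, col_perm)
for row in P:
    print(row)
```

In this example two singletons are found, so the first two rows and columns of the permuted matrix hold $R_{11}$ and $R_{12}$, and the remaining 2-by-1 block is the $A_{22}$ that would still need a QR factorization.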
