spacer
Brett W. Bader Sandia Home
Papers | Talks
 


Publications

Please note that the copyrights for many of these papers are held by their respective publisher.


Temporal analysis of semantic graphs using ASALSAN
Brett W. Bader, Richard A. Harshman, and Tamara G. Kolda
In ICDM 2007: Proceedings of the 7th IEEE International Conference on Data Mining, October 2007.

ASALSAN is a new algorithm for computing three-way DEDICOM, which is a linear algebra model for analyzing intrinsically asymmetric relationships, such as trade among nations or the exchange of emails among individuals, that incorporates a third mode of the data, such as time. ASALSAN is unique because it enables computing the three-way DEDICOM model on large, sparse data. A nonnegative version of ASALSAN is described as well. When we apply these techniques to adjacency arrays arising from directed graphs with edges labeled by time, we obtain a smaller graph on latent semantic dimensions and gain additional information about their changing relationships over time. We demonstrate these techniques on international trade data and the Enron email corpus to uncover latent components and their transient behavior. The mixture of roles assigned to individuals by ASALSAN showed strong correspondence with known job classifications and revealed the patterns of communication between these roles. Changes in the communication pattern over time, e.g., between top executives and the legal department, were also apparent in the solutions.

Links:


Temporal analysis of social networks using three-way DEDICOM
Brett W. Bader, Richard Harshman, and Tamara G. Kolda
Technical Report SAND2006-2161, Sandia National Laboratories, Albuquerque, NM and Livermore, CA, April 2006. (PDF, 0.4 MB)

DEDICOM is an algebraic model for analyzing intrinsically asymmetric relationships, such as the balance of trade among nations or the flow of information among organizations or individuals. It provides information on latent components in the data that can be regarded as "properties" or "aspects" of the objects, and it finds a few patterns that can be combined to describe many relationships among these components. When we apply this technique to adjacency matrices arising from directed graphs, we obtain a smaller graph that gives an idealized description of its patterns. Three-way DEDICOM is a higher-order extension of the model that has certain uniqueness properties. It allows for a third mode of the data, such as time, and permits the analysis of semantic graphs. We present an improved algorithm for computing three-way DEDICOM on sparse data and demonstrate it by applying it to the adjacency tensor of a semantic graph with time-labeled edges. Our application uses the Enron email corpus, from which we construct a semantic graph corresponding to email exchanges among Enron personnel over a series of 44 months. Meaningful patterns are recovered in which the representation of asymmetries adds insight into the social networks at Enron.

The TOPHITS model for higher-order web link analysis
Tamara Kolda and Brett Bader
In Workshop on Link Analysis, Counterterrorism and Security, 2006. (http://www.cs.rit.edu/~amt/linkanalysis06/accepted/21.pdf) (PDF, 0.4 MB)

As the size of the web increases, it becomes more and more important to analyze link structure while also considering context. Multilinear algebra provides a novel tool for incorporating anchor text and other information into the authority computation used by link analysis methods such as HITS. Our recently proposed TOPHITS method uses a higher-order analogue of the matrix singular value decomposition called the PARAFAC model to analyze a three-way representation of web data. We compute hubs and authorities together with the terms that are used in the anchor text of the links between them. Adding a third dimension to the data greatly extends the applicability of HITS because the TOPHITS analysis can be performed in advance and offline. Like HITS, the TOPHITS model reveals latent groupings of pages, but TOPHITS also includes latent term information. In this paper, we describe a faster mathematical algorithm for computing the TOPHITS model on sparse data, and Web data is used to compare HITS and TOPHITS. We also discuss how the TOPHITS model can be used in queries, such as computing context-sensitive authorities and hubs. We describe different query response methodologies and present experimental results.

Higher-order web link analysis using multilinear algebra
Tamara G. Kolda, Brett W. Bader, and Joseph P. Kenny
In ICDM 2005: Proceedings of the 5th IEEE International Conference on Data Mining, pages 242–249, November 2005. (doi:10.1109/ICDM.2005.77) (PDF, 0.3 MB)

Linear algebra is a powerful and proven tool in web search. Techniques, such as the PageRank algorithm of Brin and Page and the HITS algorithm of Kleinberg, score web pages based on the principal eigenvector (or singular vector) of a particular non-negative matrix that captures the hyperlink structure of the web graph. We propose and test a new methodology that uses multilinear algebra to elicit more information from a higher-order representation of the hyperlink graph. We start by labeling the edges in our graph with the anchor text of the hyperlinks so that the associated linear algebra representation is a sparse, three-way tensor. The first two dimensions of the tensor represent the web pages while the third dimension adds the anchor text. We then use the rank-1 factors of a multilinear PARAFAC tensor decomposition, which are akin to singular vectors of the SVD, to automatically identify topics in the collection along with the associated authoritative web pages.

Links:

  • Previous version released as Sandia Technical Report SAND2005-4548. (July 2005) - PDF.

On the performance of tensor methods for solving ill-conditioned problems
Brett W. Bader and Robert B. Schnabel
SIAM J. Scientific Computing, 29(6):2329-2351, 2007.

This paper investigates the performance of tensor methods for solving small- and large-scale systems of nonlinear equations where the Jacobian matrix at the root is ill-conditioned or singular. This condition occurs on many classes of problems, such as identifying or approaching turning points in path following problems. The singular case has been studied more than the highly ill-conditioned case, for both Newton and tensor methods. It is known that Newton-based methods do not work well with singular problems because they converge linearly to the solution and, in some cases, with poor accuracy. On the other hand, direct tensor methods have performed well on singular problems and have superlinear convergence on such problems under certain conditions. This behavior originates from the use of a special, restricted form of the second-order term included in the local tensor model that provides information lacking in a (nearly) singular Jacobian. With several implementations available for large-scale problems, tensor methods now are capable of solving larger problems. We compare the performance of tensor methods and Newton-based methods for both small- and large-scale problems over a range of conditionings, from well-conditioned to ill-conditioned to singular. Previous studies with tensor methods only concerned the ends of this spectrum. Our results show that tensor methods are increasingly superior to Newton-based methods as the problem grows more ill-conditioned.

Links:


Tensor-Krylov methods for solving large-scale systems of nonlinear equations
Brett W. Bader
SIAM J. Numerical Analysis, 43(3):1321-1347, 2005.

This paper develops and investigates iterative tensor methods for solving large-scale systems of nonlinear equations. Direct tensor methods for nonlinear equations have performed especially well on small, dense problems where the Jacobian matrix at the solution is singular or ill-conditioned, which may occur when approaching turning points, for example. This research extends direct tensor methods to large-scale problems by developing three tensor-Krylov methods that base each iteration upon a linear model augmented with a limited second-order term, which provides information lacking in a (nearly) singular Jacobian. The advantage of the new tensor-Krylov methods over existing large-scale tensor methods is their ability to solve the local tensor model to a specified accuracy, which produces a more accurate tensor step. The performance of these methods in comparison to Newton-GMRES and tensor-GMRES is explored on three Navier--Stokes fluid flow problems. The numerical results provide evidence that tensor-Krylov methods are generally more robust and more efficient than Newton-GMRES on some important and difficult problems. In addition, the results show that the new tensor-Krylov methods and tensor-GMRES each perform better in certain situations.

Links:

  • Earlier version available as Sandia Technical Report SAND2004-1837 (August 2004)

Algorithm 862: MATLAB Tensor Classes for Fast Algorithm Prototyping
Brett W. Bader and Tamara G. Kolda
ACM Transactions on Mathematical Software, 32(4):635-653, 2006.

Tensors (also known as mutidimensional arrays or N-way arrays) are used in a variety of applications ranging from chemometrics to psychometrics. We describe four MATLAB classes for tensor manipulations that can be used for fast algorithm prototyping. The tensor class extends the functionality of MATLAB's multidimensional arrays by supporting additional operations such as tensor multiplication. The tensor_as_matrix class supports the "matricization" of a tensor, i.e., the conversion of a tensor to a matrix (and vice versa), a commonly used operation in many algorithms. Two additional classes represent tensors stored in decomposed formats: cp_tensor and tucker_tensor. We describe all of these classes and then demonstrate their use by showing how to implement several tensor algorithms that have appeared in the literature.

Links:

  • Webpage: MATLAB Tensor Toolbox
  • Software: released under SAND2004-5189 and is available in TGZ or ZIP format.
  • Technical Report SAND2004-5187 (October 2004) pdf
  • A previous version was released as Tech. Rep. SAND2004-3487 (July 2004)
  • Revision: July 20, 2005 (Paper in PDF, 0.3 MB) (Code in TGZ, 29 kb)

An Optimization approach to the problem of protein structure prediction
Elizabeth Eskow, Brett Bader, Richard Byrd, Vincent Lamberti, Silvia Crivelli, Teresa Head-Gordon, Robert Schnabel
Mathematical Programming, 101(3):497-514, 2004.

We describe a large-scale, stochastic-perturbation global optimization algorithm used for determining the structure of proteins. The method incorporates secondary structure predictions (which describe the more basic elements of the protein structure) into the starting structures, and thereafter minimizes using a purely physics-based energy model. Results show this method to be particularly successful on protein targets where structural information from similar proteins is unavailable, i.e., the most difficult targets for most protein structure prediction methods. Our best result to date is on a protein target containing over 4000 atoms and 12,000 cartesian coordinates.

Links:


Curvilinear linesearch for tensor methods
Brett W. Bader and Robert B. Schnabel
SIAM J. of Scientific Computing (Copper Mountain special issue), 25(2):604-622, 2003.

This paper presents a curvilinear linesearch for use when solving nonlinear equations with tensor methods. Standard tensor methods use a combination of the tensor step and Newton step in a linesearch, but they are handled separately and in an ad hoc manner. Our curvilinear linesearch combines the two directions in a single parametric step, which guarantees a monotonic decrease on the tensor model and also asymptotically approaches the Newton direction as the step length shrinks to zero, thus guaranteeing descent on the nonlinear equations. Numerical experiments on a set of 35 small-scale problems, drawn primarily from the CUTE collection, show an 18--23\% improvement (in terms of function evaluations) over previous tensor linesearches when using quadratic backtracking and a 41--83\% improvement when halving $\lambda$ before each trial. Our results also suggest that the curvilinear linesearch is more robust than linesearch-based Newton's method or the standard tensor linesearch, producing fewer failures than either method. Experiments on two large-scale fluid flow problems complement the small-scale results and give preliminary indication of the applicability of the tensor method and the efficiency of the curvilinear linesearch. The theoretical properties, coupled with the better performance, make this a desirable improvement over previous tensor method linesearches.

Links:


Fluid mechanics, cell distribution, and environment in CellCube bioreactors
John G. Aunins, Brett Bader, Anthony Caola, Janet Griffiths, Maayan Katz, Peter Licari, Kripa Ram, Colette S. Ranucci, and Weichang Zhou
Biotechnology Progress, 19(1):2-8, 2003.

Cultivation of MRC-5 cells and attenuated hepatitis A virus (HAV) for the production of VAQTA, an inactivated HAV vaccine (1), is performed in the CellCube reactor, a laminar flow fixed-bed bioreactor with an unusual diamond-shaped, diverging-converging flow geometry. These disposable bioreactors have found some popularity for the production of cells and gene therapy vectors at intermediate scales of operation (2, 3). Early testing of the CellCube revealed that the fluid mechanical environment played a significant role in nonuniform cell distribution patterns generated during the cell growth phase. Specifically, the reactor geometry and manufacturing artifacts, in combination with certain inoculum practices and circulation flow rates, can create cell growth behavior that is not simply explained. Via experimentation and computational fluid dynamics simulations we can account for practically all of the observed cell growth behavior, which appears to be due to a complex mixture of flow distribution, particle deposition under gravity, fluid shear, and possibly nutritional microenvironment.

Links:

  • Published online © 2003 by American Chemical Society and American Institute of Chemical Engineers - Online access

A Physical approach to protein structure prediction
Silvia Crivelli, Elizabeth Eskow, Brett Bader, Vincent Lamberti, Richard Byrd, Robert Schnabel, and Teresa Head-Gordon
Biophysical J., 82:36-49, 2002.

We describe our global optimization method called Stochastic Perturbation with Soft Constraints (SPSC), which uses information from known proteins to predict secondary structure, but not in the tertiary structure predictions or in generating the terms of the physics-based energy function. Our approach is also characterized by the use of an all atom energy function that includes a novel hydrophobic solvation function derived from experiments that shows promising ability for energy discrimination against misfolded structures. We present the results obtained using our SPSC method and energy function for blind prediction in the 4th Critical Assessment of Techniques for Protein Structure Prediction competition, and show that our approach is more effective on targets for which less information from known proteins is available. In fact our SPSC method produced the best prediction for one of the most difficult targets of the competition, a new fold protein of 240 amino acids.

Links:

  • Published online © 2002 by the Biophysical Society - Online access

Lysophosphatidylcholine and arachidonic acid are required in the cytotoxic response of human natural killer cells to tumor target cells
Margaret M. Whalen, Rashmi N. Doshi, Brett W. Bader, and Arthur D. Bankhurst
Cellular Physiology and Biochemistry, 9(6):297-309, 1999.

Treatment of human natural killer (NK) cells with phospholipase A2 (PLA2) inhibitors, mepacrine and 4-bromophenacyl bromide (BPB), diminished their ability to lyse K562 target cells by as much as 100%. The ability of NK cells to bind to K562 cells was significantly affected by BPB above 2 µM, but not by mepacrine at any concentration tested. This indicates that BPB is having effects on NK cells unrelated to its inhibition of PLA2 activity at concentrations above 2 µM. The activation of phospholipase C in response to K562 cell binding (as measured by inositol phosphate turnover) was unaffected by inhibition of the PLA2 activity. The products of PLA2 catabolism are a fatty acid (often arachidonic acid) and a lysophospholipid. Inhibition of NK cytotoxicity by mepacrine or BPB is reversed significantly when lysophosphatidylcholine, but no other lysolipid, is added back to the NK cells before assaying for cytotoxicity. Arachidonic acid, but not linoleic acid, also significantly reverses inhibition of NK cytotoxicity. Finally, the 15-lipoxygenase product, 15S-hydroperoxyeicosatetraenoic acid (15S-HPETE), is also able to reverse mepacrine-induced inhibition of NK cytotoxicity. The 5-lipoxygenase product 5S-HPETE was not effective. These data indicate that PLA2 activation is a necessary signal in human NK cytotoxicity and that it is not involved in protein tyrosine kinase and subsequent phospholipase C activation; these latter two enzymes are also required in the cytotoxic response. Thus PLA2 activation is either a more distal signal, dependent on activation of some earlier signal, or an independent cosignal stimulated by tumor-target binding which generates lysophosphatidylcholine, arachidonic acid, and/or a lipoxygenase product(s).

Links:



Selected Talks

Analysis of Latent Relationships in Semantic Graphs using DEDICOM
Brett W. Bader

Workshop on Algorithms for Modern Massive Data Sets
Stanford University and Yahoo! Research, Palo Alto, CA, June 21-24, 2006.

This presentation introduces the DEDICOM (DEcomposition into DIrectional COMponents) family of models from the psychometrics literature for analyzing asymmetric relationships in data and applies it to new applications in data mining, in particular social network analysis. In this work we present an improved algorithm for computing the 3-way DEDICOM model with modifications that make it possible to handle large, sparse data sets. We demonstrate the capabilities of DEDICOM as a new tool for reporting latent relationships in large semantic graphs, which we represent by an adjacency tensor. For an application we consider the email communications of former Enron employees that were made public, and posted to the web, by the U.S. Federal Energy Regulatory Commission during its investigation of Enron. We represent the Enron email network as a directed graph with edges labeled by time. Using the three-way DEDICOM model on this data, we show unique latent relationships that exist between types of employees and study their communication patterns over time.

Joint work with Richard Harshman and Tamara G. Kolda.

Links:


A Tensor Method for Solving Large-Scale Systems of Nonlinear Equations
Brett W. Bader and Tamara G. Kolda

SIAM Annual Meeting 2004
Portland, OR, July 12-16, 2004.

We present a tensor method that is easy to implement in large-scale applications. Tensor methods are an alternative to Newton-based methods and are based on using a quadratic local model rather than a linear model. The advantage of this method is that it can use any existing linear solver. We will also discuss a special globalization technique and other modifications for added robustness. We present numerical results showing computational advantages over Newton-GMRES on some realistic problems.

Links:

  • See Copper Mountain slides online


A Comparison of Iterative Tensor Methods for Solving Large Systems of Nonlinear Equations
Brett W. Bader and Tamara G. Kolda

Copper Mountain Conference on Iterative Methods 2004
Copper Mountain, CO, March 28 - April 2, 2004.

We compare and contrast a variety of iterative tensor methods for solving large-scale systems of nonlinear equations. Tensor methods are an alternative to Newton-based methods and are based on using a limited quadratic local model rather than a linear model. These higher order models often provide information that is lacking in a (nearly) singular Jacobian, thus making the solver more efficient when solving difficult problems.

This talk has three areas of emphasis. First, we introduce several new and recent iterative tensor methods for solving large-scale problems, including tensor-Krylov methods and a new implementation that can use a stand-alone linear solver. The methods have a range of characteristics and other considerations for a practical implementation, which we discuss. Second, we apply a curvilinear linesearch globalization technique to the tensor methods that smoothly combines the Newton and tensor directions. Our results show that the curvilinear linesearch is more robust and efficient than other linesearch implementations. Finally, we explore the performance of these large-scale tensor methods in comparison to Newton-GMRES on several realistic problems, including some Navier-Stokes fluid flow problems. All methods are implemented in an object-oriented nonlinear software package called NOX that is being developed at Sandia National Laboratories. Our results show that the iterative tensor methods have computational advantages over Newton-GMRES, especially when the Jacobian at the root is ill-conditioned or singular.

Links:


Tensor-Krylov Methods for Solving Systems of Nonlinear Equations:
An Introduction and Comparison

Brett W. Bader

Solution Methods for Large-scale Nonlinear Problems 2003
Hilton Garden Inn, Livermore, CA, August 6-8, 2003.

This talk will introduce several iterative, Krylov-subspace based approaches for solving large-scale systems of nonlinear equations using tensor methods. Standard tensor methods have performed especially well on small, dense problems where the Jacobian matrix at the solution is singular or ill-conditioned, something that happens on many classes of large-scale problems, such as bifurcation tracking. This research extends tensor methods to large-scale problems by developing a class of tensor-Krylov methods that base each iteration on a linear model augmented with a limited second-order term and employ a line search globalization scheme that imitates a trust region method. Two advantages over existing methods are an ability to solve the local tensor model to a specified accuracy and a reliance on block methods, which can be more memory efficient than standard techniques. In addition, the tensor-Krylov methods still exhibit the super linear convergence behavior of direct tensor methods on singular and ill-conditioned problems. Numerical comparisons with Newton-GMRES on several benchmark fluid flow problems are presented and provide evidence that tensor-Krylov methods are more robust and efficient than Newton-GMRES on some important and difficult problems.

Links:



Back to top of page | Back to home page
Papers | Talks

© Sandia Corporation | Site Contact | Site Map | Privacy and Security