Motivation: The rapidly increasing amount and disparity of biological data requires interpretation at many levels of description. Human judgement and intuition are important because not all data can be automatically and comprehensively analyzed. Visualization of trees and substructures corresponding to certain features are often used to analyze phylogenies or taxonomies. Unfortunately, most existing tools do not cope with the size of current datasets, the required functionality, or both.
Results: We introduce a program for visualization of huge trees and also for the interactive exploration of their content. We have developed a range of new schemes which are tailored for biological problems. Users can get an overview, zoom in, filter out data and retrieve details from standard databases such as SWISS-PROT. Furthermore, it is possible to analyze the relationship between chosen leaf sets that are specified by common features on a second level of representation. On a PC (with ≈ 512 MB RAM), trees of up to several tens of thousands of leaves can be loaded and both rapidly and interactively explored. We demonstrate the use of this program for the analysis of the SYSTERS data set (which contains hierarchically clustered protein sequences) to which PFAM domains were added as features.
Availability: TreeWiz is written in JAVA and is free for academic use. The full application as well as a restricted applet version can be found on the web-page http://www.eml.org/english/staff/homes/ulla/rost.html.