NHX is based on the New Hampshire (NH) standard (also called "Newick tree format").
Gene Prediction File Format (genePred) is a table format commonly used for gene prediction tracks in the Genome Browser. Variations of genePred include standard format, extended format and a format wh ...
The Newick Standard for representing trees in computer-readable form makes use of the correspondence between trees and nested parentheses, noticed in 1857 by the famous English mathematician Arthur Ca ...
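The correspondence between trees and nested parentheses can be sketched with a small recursive parser. This is only an illustrative sketch, not a full implementation of the standard: the function name `parse_newick` and the `(label, children)` tuple layout are assumptions made for this example, branch lengths after `:` are discarded, and quoted labels and comments are not handled.

```python
def parse_newick(text):
    """Parse a simple Newick string into nested (label, children) tuples.

    Hypothetical helper for illustration: ignores branch lengths and
    does not support quoted labels or bracketed comments.
    """
    pos = 0

    def parse_node():
        nonlocal pos
        children = []
        if text[pos] == "(":
            pos += 1  # consume "(" that opens a group of child subtrees
            children.append(parse_node())
            while text[pos] == ",":
                pos += 1  # consume "," separating sibling subtrees
                children.append(parse_node())
            if text[pos] != ")":
                raise ValueError(f"expected ')' at position {pos}")
            pos += 1  # consume ")" that closes the group
        # Read the optional label (and branch length) up to the next delimiter.
        start = pos
        while pos < len(text) and text[pos] not in ",();":
            pos += 1
        label = text[start:pos].split(":")[0].strip()
        return (label, children)

    tree = parse_node()
    if pos >= len(text) or text[pos] != ";":
        raise ValueError("Newick string must end with ';'")
    return tree


tree = parse_newick("(A,B,(C,D)E)F;")
```

Each pair of parentheses becomes one internal node whose children are the comma-separated items inside it, which is the tree/parenthesis correspondence the description refers to.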
The .nib format pre-dates the .2bit format and is less compact. It describes a DNA sequence by packing two bases into each byte.
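Packing two bases into each byte can be illustrated with a short sketch. The 4-bit code table below (T=0, C=1, A=2, G=3, N=4, first base in the high-order nibble) is an assumption taken from the commonly documented .nib layout, and the helper names `pack_bases` and `unpack_bases` are hypothetical. The file header (signature and base count) and the mask bit are deliberately left out.

```python
# Assumed 4-bit base codes; the high bit of each nibble (mask bit) is not used here.
BASE_TO_CODE = {"T": 0, "C": 1, "A": 2, "G": 3, "N": 4}
CODE_TO_BASE = {code: base for base, code in BASE_TO_CODE.items()}


def pack_bases(seq):
    """Pack a DNA string into bytes, two bases per byte (first base in the high nibble)."""
    codes = [BASE_TO_CODE[base] for base in seq.upper()]
    if len(codes) % 2:
        codes.append(0)  # pad the final byte when the length is odd
    return bytes((hi << 4) | lo for hi, lo in zip(codes[0::2], codes[1::2]))


def unpack_bases(data, n_bases):
    """Reverse pack_bases; n_bases is needed to drop the padding nibble."""
    bases = []
    for byte in data:
        bases.append(CODE_TO_BASE[(byte >> 4) & 0x7])  # high nibble, mask bit ignored
        bases.append(CODE_TO_BASE[byte & 0x7])         # low nibble, mask bit ignored
    return "".join(bases[:n_bases])


packed = pack_bases("ACGTN")
restored = unpack_bases(packed, 5)
```

Because each base occupies four bits, a sequence of n bases takes about n/2 bytes, which is why the format is less compact than one using two bits per base.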
Standard flowgram format (SFF) is a binary file format used to encode results of pyrosequencing from the 454 Life Sciences platform for high-throughput sequencing. SFF files can be viewed, edited and ...
The Gene transfer format (GTF) is a file format used to hold information about gene structure. It is a tab-delimited text format based on the general feature format (GFF), but contains some additional ...
ENA Sequence Flat File Format is a standardised plain text format for nucleotide sequences. This format was previously called the EMBL Sequence Flat File Format.
The PSI Extended Fasta Format (PEFF) is a unified format for protein and nucleotide sequence databases to be used by sequence search engines and other associated tools (spectra library search tools, s ...
The ENCODE peak information Format is used to provide called regions of signal enrichment based on pooled, normalized (interpreted) data.
FASTA format is a text-based format for representing either nucleotide sequences or peptide sequences, in which nucleotides or amino acids are represented using single-letter codes. The format also al ...
The wiggle (WIG) format is an older format for display of dense, continuous data such as GC percent, probability scores, and transcriptome data. The bigWig format is the recommended format for almost ...
LINCS Production Phase 2 Extended Metadata Standards, including this guideline on Nucleic Acid Reagents, were developed by the LINCS consortium with the goal of generating an integrated view across th ...
CLUSTAL-W Alignment Format is a simple text-based format, often with a *.aln file extension, used for the input and output of DNA or protein sequences into the Clustal suite of multiple alignment prog ...
The Data Use Ontology (DUO) describes data use requirements and limitations. DUO allows datasets to be semantically tagged with restrictions on their usage, making them discoverable automatically based on ...
The bedGraph format allows display of continuous-valued data in track format. This display type is useful for probability scores and transcriptome data. This track type is similar to the wiggle (WIG) ...
This PIR Database File Structure and Format Specification describes the files comprising the PIR-International Protein Sequence Database and the format of each. The format has been enhanced significan ...
The Multiple Alignment Format stores DNA-level multiple alignments between entire genomes in an easily readable format. Unlike previous formats, this resource can cope with forward and reverse strand d ...
FASTQ is a text-based file format for sharing sequencing data that combines both the sequence and an associated per-base quality score.
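How a sequence and its per-base quality scores travel together can be sketched with a minimal reader. This sketch assumes the common four-line record layout (`@` header, sequence, `+` separator, quality string) and Phred+33 quality encoding; other quality offsets and multi-line records are not handled, and the function name `read_fastq` is hypothetical.

```python
def read_fastq(lines):
    """Yield (read_id, sequence, qualities) from four-line FASTQ records.

    Assumes Phred+33 encoding: quality = ord(character) - 33.
    """
    it = iter(lines)
    for header in it:
        header = header.rstrip("\n")
        if not header:
            continue  # skip blank lines between records
        seq = next(it).rstrip("\n")
        plus = next(it).rstrip("\n")
        qual = next(it).rstrip("\n")
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError(f"malformed FASTQ record near {header!r}")
        if len(seq) != len(qual):
            raise ValueError("sequence and quality lengths differ")
        yield header[1:], seq, [ord(ch) - 33 for ch in qual]


records = list(read_fastq(["@read1\n", "ACGT\n", "+\n", "II5!\n"]))
```

The quality string has exactly one character per base, so position i in the decoded list is the score for base i of the sequence.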
This format is for displaying SNPs from personal genomes. It is the same as is used for the Genome Variants and Population Variants tracks.
BLAT is a set of algorithms developed for the analysis and comparison of biological sequences such as DNA, RNA and proteins.
In early 2010 we updated the site to facilitate more rapid transfer of our data to the public database and focus our efforts on the core mission of providing expression pattern images to the research ...