read and write UCSC track and assembly hub files in node or the browser
Read about hub.txt, genomes.txt, and trackDb.txt files here: https://genome.ucsc.edu/goldenpath/help/hgTrackHubHelp.html
Files are essentially JavaScript Map
s. A hub.txt file is a map with th keys as the first word in each line and the value as the rest of the line, like this:
Map {
"hub" => "UCSCHub",
"shortLabel" => "UCSC Hub",
"longLabel" => "UCSC Genome Informatics Hub for human DNase and RNAseq data",
"genomesFile" => "genomes.txt",
"email" => "[email protected]",
"descriptionUrl" => "ucscHub.html",
}
genomes.txt and trackDb.txt files are two-deep Map
s where the keys are the
values of the first line of each section and the value is a Map
of the lines in that whole section, like this:
Map {
"hg18" => Map {
"genome" => "hg18",
"trackDb" => "hg18/trackDb.txt",
},
"hg19" => Map {
"genome" => "hg19",
"trackDb" => "hg19/trackDb.txt",
},
"newOrg1" => Map {
"genome" => "newOrg1",
"trackDb" => "newOrg1/trackDb.txt",
"twoBitPath" => "newOrg1/newOrg1.2bit",
"groups" => "newOrg1/groups.txt",
"description" => "Big Foot V4",
"organism" => "BigFoot",
"defaultPos" => "chr21:33031596-33033258",
"orderKey" => "4800",
"scientificName" => "Biggus Footus",
"htmlPath" => "newOrg1/description.html",
},
}
Map {
"dnaseSignal" => Map {
"track" => "dnaseSignal",
"bigDataUrl" => "dnaseSignal.bigWig",
"shortLabel" => "DNAse Signal",
"longLabel" => "Depth of alignments of DNAse reads",
"type" => "bigWig",
},
"dnaseReads" => Map {
"track" => "dnaseReads",
"bigDataUrl" => "dnaseReads.bam",
"shortLabel" => "DNAse Reads",
"longLabel" => "DNAse reads mapped with MAQ",
"type" => "bam",
},
}
Example usage:
const fs = require('fs')
const { HubFile, GenomesFile, TrackDbFile } = require('@gmod/ucsc-hub')
const hubFile = new HubFile(fs.readFileSync('hub.txt', 'utf8'))
console.log(hubFile.get('genomesFile'))
// ↳ genomes.txt
const genomesFile = new GenomesFile(fs.readFileSync('genomes.txt', 'utf8'))
console.log(genomesFile.get('hg19').get('trackDb'))
// ↳ hg19/trackDb.txt
const trackDbFile = new TrackDbFile(fs.readFileSync('hg19/trackDb.txt', 'utf8'))
console.log(trackDbFile.get('dnaseSignal').get('shortLabel'))
// ↳ DNAse Signal
Extends RaFile
Class representing a genomes.txt file.
-
genomesFile
(string | Array<string>) A genomes.txt file as a string (optional, default[]
) -
Throws Error Throws if the first line of the hub.txt file doesn't start with "genome <genome_name>" or if it has invalid entries
Extends RaStanza
Class representing a hub.txt file.
-
hubFile
(string | Array<string>) A hub.txt file as a string (optional, default[]
) -
Throws Error Throws if the first line of the hub.txt file doesn't start with "hub <hub_name>", if it has invalid entries, or is missing required entries
Extends Map
Class representing an ra file. Each file is composed of multiple stanzas, and
each stanza is separated by one or more blank lines. Each stanza is stored in
a Map with the key being the value of the first key-value pair in the stanza.
The usual Map methods can be used on the file. An additional method add()
is available to take a raw line of text and break it up into a key and value
and add them to the class. This should be favored over set()
when possible,
as it performs more validity checks than using set()
.
raFile
(string | Array<string>) An ra file, either as a single string or an array of strings with one stanza per entry. Supports both LF and CRLF line terminators. (optional, default[]
)options
object (optional, default{checkIndent:true}
)options.checkIndent
boolean [true] - Check if a the stanzas within the file are indented consistently and keep track of the indentation
-
nameKey
(undefined | string) The key of the first line of all the stanzas (undefined
if the stanza has no lines yet). -
Throws Error Throws if an empty stanza is added, if the key in the first key-value pair of each stanze isn't the same, or if two stanzas have the same value for the key-value pair in their first lines.
Add a single stanza to the file
stanza
string A single stanza
Returns RaFile The RaFile object
Use add()
if possible instead of this method. If using this, be aware
that no checks are made for comments, empty stanzas, duplicate keys, etc.
key
string The key of the RaFile stanzavalue
RaStanza The RaFile stanza used to replace the prior one
Delete a stanza
stanza
string The name of the stanza to delete (the value in its first key-value pair)
Returns boolean true if the deleted stanza existed, false if it did not
Clear all stanzas and comments
Returns string Returns the stanza as a string fit for writing to a ra
file. Original leading indent is preserved. It may not be the same as the
input stanza as lines that were joined with \
in the input will be output
as a single line and all comments will have the same indentations as the
rest of the stanza. Comments between joined lines will move before that
line.
Extends Map
Class representing an ra file stanza. Each stanza line is split into its key
and value and stored as a Map, so the usual Map methods can be used on the
stanza. An additional method add()
is available to take a raw line of text
and break it up into a key and value and add them to the class. This should
be favored over set()
when possible, as it performs more validity checks
than using set()
.
stanza
(string | Array<string>) An ra file stanza, either as a string or a array of strings with one line per entry. Supports both LF and CRLF line terminators. (optional, default[]
)options
object (optional, default{checkIndent:true}
)options.checkIndent
boolean [true] - Check if a stanza is indented consistently and keep track of the indentation
-
nameKey
(undefined | string) The key of the first line of the stanza (undefined
if the stanza has no lines yet). -
name
(undefined | string) The value of the first line of the stanza, by which it is identified in an ra file (undefined
if the stanza has no lines yet). -
indent
(undefined | string) The leading indent of the stanza, which is the same for every line (undefined
if the stanza has no lines yet,''
if there is no indent). -
Throws Error Throws if the stanza has blank lines, if the first line doesn't have both a key and a value, if a key in the stanza is duplicated, or if lines in the stanza have inconsistent indentation.
Add a single line to the stanza. If the exact line already exists, does nothing.
line
string A stanza line
Returns RaStanza The RaStanza object
Use add()
if possible instead of this method. If using this, be aware
that no checks are made for comments, indentation, duplicate keys, etc.
Returns RaStanza The RaStanza object
Delete a line
key
string The key of the line to delete
Returns boolean true if the deleted line existed, false if it did not
Clear all lines and comments
Returns string Returns the stanza as a string fit for writing to a ra
file. Original leading indent is preserved. It may not be the same as the
input stanza as lines that were joined with \
in the input will be output
as a single line and all comments will have the same indentations as the
rest of the stanza. Comments between joined lines will move before that
line.
Extends RaFile
Class representing a genomes.txt file.
-
trackDbFile
(string | Array<string>) A trackDb.txt file as a string (optional, default[]
) -
Throws Error Throws if "track" is not the first key in each track or if a track is missing required keys
Gets all track entries including those of parent tracks, with closer entries overriding more distant ones