Skip to content

Commit

Permalink
Deploying to gh-pages from @ 20dbb6a 🚀
Browse files Browse the repository at this point in the history
  • Loading branch information
LTLA committed Nov 15, 2024
1 parent e4188a4 commit f45fa4f
Show file tree
Hide file tree
Showing 4 changed files with 185 additions and 175 deletions.
7 changes: 4 additions & 3 deletions docs/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -89,14 +89,15 @@ <h1><a class="anchor" id="autotoc_md3"></a>
Expected files</h1>
<h2><a class="anchor" id="autotoc_md4"></a>
Gene mapping</h2>
<p>Each gene can be associated with one, zero or multiple names of different types (e.g., Ensembl identifiers, symbols). For each type of gene name, we expect a file named <code>&lt;TYPE&gt;.tsv.gz</code>. This is a Gzip-compressed tab-separated file where each line corresponds to a gene and the tab-separated fields are the names for that gene. Different genes may have different number of fields, and an empty line indicates that the equivalence class contains no names of that type.</p>
<p>Each gene can be associated with one, zero or multiple names of different types (e.g., Ensembl identifiers, symbols). For each type of gene name, we expect a "gene mapping" file named <code>&lt;TYPE&gt;.tsv.gz</code>. This is a Gzip-compressed tab-separated file where each line corresponds to a gene and the tab-separated fields are the names for that gene. Different genes may have different number of fields, and an empty line indicates that the equivalence class contains no names of that type.</p>
<p>Any type of gene name can be used, but most Gesel clients will expect one or more of the following files:</p>
<ul>
<li><code>ensembl.tsv.gz</code>, where the fields are Ensembl identifiers associated with that gene.</li>
<li><code>entrez.tsv.gz</code>, where the fields are Entrez identifiers associated with that gene.</li>
<li><code>symbol.tsv.gz</code>, where the fields are symbols associated with that gene.</li>
</ul>
<p>All files have the same number of lines as they represent different names of the same genes. Each gene's identity (i.e., the "gene index") is defined as the 0-based index of the corresponding line in each file.</p>
<p>At least one gene mapping file should be present for each species.</p>
<h2><a class="anchor" id="autotoc_md5"></a>
Collection details</h2>
<p><code>collections.tsv.gz</code> is a Gzip-compressed tab-separated file where each line corresponds to a gene set collection and contains the following fields:</p>
Expand Down Expand Up @@ -163,8 +164,8 @@ <h2><a class="anchor" id="autotoc_md10"></a>
<div class="line"><span class="keyword">auto</span> num_genes = <a class="code hl_function" href="namespacegesel.html#a23c9159c734be4f259cee012f5dc0007">gesel::validate_genes</a>(<span class="stringliteral">&quot;my/path/to/genes/9606_&quot;</span>);</div>
<div class="line"><a class="code hl_function" href="namespacegesel.html#abc7c78eb8c53bfe0fb7689cf3d13fa0f">gesel::validate_database</a>(<span class="stringliteral">&quot;my/path/to/db/9606_&quot;</span>, num_genes);</div>
<div class="ttc" id="agesel_8hpp_html"><div class="ttname"><a href="gesel_8hpp.html">gesel.hpp</a></div><div class="ttdoc">Validate Gesel database and gene files.</div></div>
<div class="ttc" id="anamespacegesel_html_a23c9159c734be4f259cee012f5dc0007"><div class="ttname"><a href="namespacegesel.html#a23c9159c734be4f259cee012f5dc0007">gesel::validate_genes</a></div><div class="ttdeci">uint64_t validate_genes(const std::string &amp;prefix, const std::vector&lt; std::string &gt; &amp;types)</div><div class="ttdef"><b>Definition</b> validate_genes.hpp:29</div></div>
<div class="ttc" id="anamespacegesel_html_abc7c78eb8c53bfe0fb7689cf3d13fa0f"><div class="ttname"><a href="namespacegesel.html#abc7c78eb8c53bfe0fb7689cf3d13fa0f">gesel::validate_database</a></div><div class="ttdeci">void validate_database(const std::string &amp;prefix, uint64_t num_genes)</div><div class="ttdef"><b>Definition</b> validate_database.hpp:80</div></div>
<div class="ttc" id="anamespacegesel_html_a23c9159c734be4f259cee012f5dc0007"><div class="ttname"><a href="namespacegesel.html#a23c9159c734be4f259cee012f5dc0007">gesel::validate_genes</a></div><div class="ttdeci">uint64_t validate_genes(const std::string &amp;prefix, const std::vector&lt; std::string &gt; &amp;types)</div><div class="ttdef"><b>Definition</b> validate_genes.hpp:30</div></div>
<div class="ttc" id="anamespacegesel_html_abc7c78eb8c53bfe0fb7689cf3d13fa0f"><div class="ttname"><a href="namespacegesel.html#abc7c78eb8c53bfe0fb7689cf3d13fa0f">gesel::validate_database</a></div><div class="ttdeci">void validate_database(const std::string &amp;prefix, uint64_t num_genes)</div><div class="ttdef"><b>Definition</b> validate_database.hpp:84</div></div>
</div><!-- fragment --><p>This takes the path prefix to a species-specific suite of Gesel files and validates their contents, throwing an error if any invalid formatting is detected. Note that the gene mapping files can be stored in a different directory from the other files.</p>
<p>Check out the <a href="https://gesel-inc.github.io/gesel-spec">reference documentation</a> for more information.</p>
<h2><a class="anchor" id="autotoc_md11"></a>
Expand Down
2 changes: 1 addition & 1 deletion docs/namespacegesel.html
Original file line number Diff line number Diff line change
Expand Up @@ -206,7 +206,7 @@ <h2 class="memtitle"><span class="permalink"><a href="#a23c9159c734be4f259cee012
<dl class="params"><dt>Parameters</dt><dd>
<table class="params">
<tr><td class="paramname">prefix</td><td>Prefix for the Gesel gene mapping files. This should be of the form <code>&lt;DIRECTORY&gt;/&lt;SPECIES&gt;_</code>, where <code>&lt;SPECIES&gt;</code> is an NCBI taxonomy ID. </td></tr>
<tr><td class="paramname">types</td><td>Vector of gene name types, e.g., <code>"ensembl"</code>, <code>"symbol"</code>.</td></tr>
<tr><td class="paramname">types</td><td>Vector of gene name types, e.g., <code>"ensembl"</code>, <code>"symbol"</code>. This should contain at least one value.</td></tr>
</table>
</dd>
</dl>
Expand Down
Loading

0 comments on commit f45fa4f

Please sign in to comment.