-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ensembl human release 110: assembly_exception table is empty #22
Comments
The error occurred for SELECT
gene.stable_id AS ensembl_gene_id,
alt_allele.alt_allele_group_id,
alt_allele_attrib.attrib = "IS_REPRESENTATIVE" AS alt_allele_is_representative,
assembly_exception.exc_seq_region_id IS NULL AS primary_assembly,
seq_region.name AS seq_region,
alt_allele_attrib.attrib AS alt_allele_attrib,
gene.created_date AS ensembl_created_date
FROM alt_allele
INNER JOIN gene
ON gene.gene_id = alt_allele.gene_id
INNER JOIN alt_allele_attrib
ON alt_allele.alt_allele_id = alt_allele_attrib.alt_allele_id
INNER JOIN seq_region
ON gene.seq_region_id = seq_region.seq_region_id
LEFT JOIN assembly_exception
ON seq_region.seq_region_id = assembly_exception.seq_region_id
-- keep exc_type in (PATCH_FIX, PATCH_NOVEL, HAP)
-- refs internal Related Sciences issue 606.
AND NOT assembly_exception.exc_type <=> "PAR"
-- all genes were current when query was written, ensure this is always the case
WHERE gene.is_current
AND alt_allele_group_id = 44458
ORDER BY alt_allele_group_id, alt_allele_is_representative DESC, primary_assembly DESC, ensembl_created_date, ensembl_gene_id
It looks like the problem could be that UPDATE: Here's the same query on ensembl 109:
The alt allele group contains the same genes, but |
Here are the results of our genes query filtered for symbol MBOAT7 on release 110 and 109. release 110
release 109
In both releases, we can see that
|
Looks like the SELECT *
FROM assembly_exception
LIMIT 10 The API docs describe the
The "List of species with populated data:" is only Danio rerio (Zebrafish). So is it intentional that this table is no longer in use for humans? Without this |
Aleena Mushtaq, Ensembl Outreach Officer and It seems that the MBOAT7 alt allele group should include more genes. We will add them in the next release, which might be Ensembl 112 as it may be too late for release 111. Regarding the questions about the assembly exceptions;
The As well as causing issues with the core API when dumping out files, the Y PAR-related changes meant that we no longer needed the PAR-based database entries for any API-projecting of X PAR data onto Y PARs. We see from the GitHub issue that you are trying to write SQL queries for our core db, so if you wanted to be able to pull out/exclude sequence regions relating to
It is not possible to link an alternate region with the corresponding primary assembly region in the release 110 database. This information will be present in the
In the Just to add here in case it helps at all, the |
Ensembl 110 was released on 2023-07-17 and includes a note:
ensembl_genes datasets --species=human --release=110
querieshomo_sapiens_core_110_38
and fails with:The two genes are ENSG00000273592 and ENSG00000276935, both which have symbol MBOAT7.
The text was updated successfully, but these errors were encountered: