Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SLC or OR gene superfamilies: exploration for MorphMap paper (ORFs, need to check CRISPR) #6

Closed
AnneCarpenter opened this issue Dec 15, 2023 · 14 comments
Assignees
Labels
crispr Uses crispr data internal Internal discussions (but publicly accessible) new story we will include orf Uses ORF data

Comments

@AnneCarpenter
Copy link
Contributor

AnneCarpenter commented Dec 15, 2023

@AnneCarpenter
Copy link
Contributor Author

AnneCarpenter commented Dec 18, 2023

Anna Greka suggested talking to Jen Pluznick at Hopkins, emailed her today. Also emailed the JUMP Cell Painting email list.

@AnneCarpenter
Copy link
Contributor Author

This really is a dream team we assembled in very quick time! I'm copying below some emailed insights from our OR experts below that indicate it IS worth pursuing this to see where it goes.

That being the case, I've taken a stab to assign tasks mainly to Niranj and Evotec, and they can report on results in github and in this email thread. Once those groundwork tasks are done, we will feel comfortable investing the SLC/OR experts' time to dig into the biological relationship.

At that time, I will ask this group for volunteers (from among yourselves and/or subordinates) who want to shift from consulting to really contributing... in other words, as a group, volunteers will conspire to take full ownership of digging in and producing a few paragraphs of a story and figures for the paper, using Niranj and Evotec as resources to answer queries but otherwise driving what analyses (and possibly wet lab experiments) make sense.

I hope you all have a lovely and real break from your work over the holidays. All the best!!

Anne

Screenshot 2023-12-20 at 1 56 50 PM Screenshot 2023-12-20 at 1 56 57 PM

On Wed, Dec 20, 2023 at 4:22 AM Bolek Zapiec <[email protected]> wrote:
Hi Anne,

Thank you for bringing this fascinating discovery to our attention. I can certainly echo Jen in saying that this is indeed of potential interest and even if the underlying reason for the phenotypic convergence of the OR and SLCs (along with some of the other ORFs) turns out to be due to the overexpression of membrane proteins resulting in ER-stress, it’s still worth looking into on two levels I can think of.

From the Olfaction point of view, heterologous expression of ORs has been a challenge in de-orphaning the vast majority of mammalian ORs, and has limited their utility in commercial applications in the Smell & Taste industry (e.g. Matt Roger’s group in Firmenich). Hiro Matsunami has been leading the charge for a while now in understanding the ways ORs can be made to work in cell lines, but generalizing the challenge to membrane proteins and developing a standardized assay (like Cell painting) for profiling this phenotype may offer a key to unlocking this box and e.g. with the help of combination screening understanding what components (chaperones, ORF UTRs, etc) allow ORs to be expressed & trafficked without backing up in the ER & Golgi.

From the Cell Painting perspective, if it turns out that this is a convergent phenotype, then it’s again a beautiful case example for how Cell Painting is able to detect convergent pathways and the inner-life of a cell. In this case, if it turns out that folding/trafficking of challenging membrane proteins is resulting in this convergent phenotype, it provides a training set for a robust classifier of the protein misfolding & ER-/ Golgi-stress response and can be readily tested by comparing the Cell Painting fingerprint with that of another protein known to trigger this response. I personally would be interested in having such a classifier for profiling the intracellular activity of our compounds.

It might not be the kind of surprising discovery that might get the cover of CNS, like genetic co-regulation of ORs and SLCs might have, but I wouldn’t dismiss it as an experimental artifact alone as this new perspective might shed new light on this fundamental process that is of value to both science & industry.

Since the data/findings were shared in confidence I certainly won’t share this further at this point, but I’d glad to participate in any further meetings/email discussions about identifying what’s the underlying cause behind these data.

Best,

Bolek

Dr. Bolek Zapiec

Associate Director

Bioassay Automation & Compound Logistics
Discovery Pharmacology
Discovery & Development Technologies

signature_4053395742

Merck Healthcare KGaA | Frankfurter Straße 250 | 64293 Darmstadt | Germany

From: Jennifer Pluznick <[email protected]>
Date: Tuesday, 19. December 2023 at 22:34
To: Anne Carpenter <[email protected]>, Bolek Zapiec <[email protected]>
Subject: RE: SLC/OR families

Hi Anne,
I can’t speak to the SLCs, of course, but some thoughts on the ORs (just copying you and my OR colleague Bolek so as not to spam everyone):

…it could be interesting (at least to me!) if some ORs do this and others don’t, especially if you can uncover how those two groups differ. If these are two novel groups of ORs, that could be quite interesting.

However, there may be a hint in the literature: as was said, when ORs are exogenously expressed without ‘help’ (without special N-terminal tags or co-expression of chaperones), they get stuck in the ER, and are thought to induce an unfolded protein response. Hiro Matsunami (Duke) has shown that a subset of ORs are better behaved (https://elifesciences.org/articles/21895); he refers to these as “overrepresented ORs” (oORs) because they are well-expressed even when key chaperones are knocked out. I wonder if the reason you got this signal from some ORs and not others is that the oORs in your study did not cause as much ER stress?

Supplementary table 2 from the paper above lists oORs (and the counterparts, uORs). In theory, you could use these lists to see if this tracks with which ORs gave you the signal vs not…..however: the Matsunami study was done in mice and the gene names are completely different….in some/many cases the orthologs aren’t known with certainty between mice and humans, as there are 1000 murine ORs and 400 human ORs, and they are all similar to one other. Fwiw, the 3 well-conserved ORs that I was looking for and didn’t see on your list (OR6B1, OR51E1, OR51E2 in humans, Olfr78, Olfr558 and Olfr449 in mice) are known to be oORs.

Best

Jen

From: Anne Carpenter <[email protected]>
Sent: Tuesday, December 19, 2023 4:06 PM
To: Alexander Ehrmann <[email protected]>; Andrey Zinovyev <[email protected]>; Arne Monsees <[email protected]>; Bolek Zapiec <[email protected]>; Niranj Chandrasekaran <[email protected]>; Fauman, Eric B <[email protected]>; Helena Batoulis <[email protected]>; Holger Hennig <[email protected]>; Jennifer Pluznick <[email protected]>; Rahul Valiya Veettil <[email protected]>; Tim James <[email protected]>; William Charles <[email protected]>
Subject: SLC/OR families

Thanks again for your precious time helping us figure this out! Below are the many notes/questions and ideas for analysis or experiments for next steps. We will prioritize and keep this group posted.

If I understand correctly and the ER stress hypothesis is true, then this is a not-so-exciting result in the sense that it's already known that overexpressing membrane proteins yields a (to some degree artifactual) impact on cells? Or might it still be the case that the fact that a particular subset do this and others don't tell us something interesting?

Anne

@AnneCarpenter
Copy link
Contributor Author

AnneCarpenter commented Jan 4, 2024

For ER stress, worth noting this paper: Lebedeva et al Cell Death and Disease (2021)12:770 ; https://doi.org/10.1038/s41419-021-04045-4

They made isogenic human cellular models (in hTERT RPE1 cells) of Congenital disorders of glycosylation (CDG) and deglycosylation (CDDG), which have ER stress as an underlying biological mechanism. The genes involved are PMM2 and DPAGT1 and NGLY1. Among 1049 drugs screened, they found some that reverse the morphological impact using Cell Painting (minus SYTO14 and WGA - they included only mitochondria, the actin cytoskeleton, endoplasmic reticulum, and nuclei). 58 compounds in the primary screen. They chose those that worked across all cell lines for deeper followup.

On the cell lines themselves before the drug screen, they did some validation studies demonstrating ER stress: see section starting "In order to establish the ER stress profiles of the cellular models" involving staining for particular markers.

"All mutant cell lines exhibited a flat, extended morphology (Fig. 3C) reminiscent of cellular senescence that was not seen in the isogenic parental line" and slower proliferation.

"A majority of the active compounds in our screen, #s 3, 4, 6, 9, 15, 25, are reported to affect microtubules"... "through modulation of JAK/STAT and growth factor signaling among others"

(this task is waiting for Evotec/Niranj for next steps)

@AnneCarpenter
Copy link
Contributor Author

I want to be sure the steps above are still on Evotec's radar (@auranic ?)

@auranic
Copy link
Collaborator

auranic commented Jan 26, 2024

@AnneCarpenter I believe we successfully completed all these steps. I apologize if the results were overlooked in the email flow. Let me summarize them here and compile the information into a PowerPoint presentation. This way, if necessary, we can include some slides in the Supplementary materials. Please allow me a couple of days for this.

@AnneCarpenter
Copy link
Contributor Author

Great, thank you!

@AnneCarpenter AnneCarpenter changed the title SLC or OR gene superfamilies: exploration for MorphMap paper SLC or OR gene superfamilies: exploration for MorphMap paper (ORFs, need to check CRISPR) Feb 1, 2024
@afermg afermg added orf Uses ORF data crispr Uses crispr data labels Feb 1, 2024
@niranjchandrasekaran
Copy link
Member

Notebook

Many SLC-OR connections are still present in the new ORF data. There aren't any OR genes in CRISPR. The top left cluster is the one with both SLC and OR connections. All the other clusters are SLC or OR exclusive.

ORF

orf_slc-or-connections

CRISPR

crispr_slc-or-connections

@AnneCarpenter
Copy link
Contributor Author

We will very likely include this story in the paper, so we want to be sure we define the genes included in the plots based on the latest data (vs selecting the genes from a prior version). Have we done this already? If so then I think the next step is to follow the notes from my comment above: #6 (comment)

@niranjchandrasekaran
Copy link
Member

Notebook

I checked the images on Alán's tool: broad.io/orf and I didn't find any artifacts. Here are all he SLC and OR genes in the dataset: https://lite.datasette.io/?metadata=https://raw.githubusercontent.com/broadinstitute/monorepo/main/libs/jump_rr/metadata/orf_matches.json&install=datasette-json-html&parquet=https://zenodo.org/api/records/12766577/files/orf.parquet/content#/data/content?_filter_column_1=Gene%2FCompound&_filter_op_1=startswith&_filter_value_1=SLC&_filter_column=Match&_filter_op=startswith&_filter_value=OR&_sort=rowid

To check with features contribute to the similarity between SLC and OR genes, I ran the same analysis I did for MYT1-RNF41 (#3 (comment)) to generate correlation between feature groups within each compartment and imaging channel. Since we are not comparing just a pair of features but one cluster to another, I report the median similarity across all pairs.

Here are the results:
SLC-OR_area_size_compartment

SLC-OR_feature_group_channel

These results suggest that there are strong similarities between the two clusters, not only in the ER channel. Maybe that can rule out the ER stress hypothesis? @AnneCarpenter

@AnneCarpenter
Copy link
Contributor Author

I don't think we expected the ER channel to dominate the features just because we have stressed the ER, if that is what you are asking. I think the other queries and followups are still worth pursuing.

@niranjchandrasekaran
Copy link
Member

Notebook

It looks like a bunch of OR genes are all on the same plate. But they are at least not from the same row or column. We may want to mention this in the SLC-OR section of the paper. But many other SLC and OR genes are not on the same plate and are still strongly correlated. So I am not overly concerned about this.

ORF-plate-layout-SLC-OR

@AnneCarpenter
Copy link
Contributor Author

@dselinger-plex
Copy link

Adding a test comment here.

@shntnu shntnu added the internal Internal discussions (but publicly accessible) label Oct 17, 2024
@niranjchandrasekaran
Copy link
Member

Included in the morphmap paper

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
crispr Uses crispr data internal Internal discussions (but publicly accessible) new story we will include orf Uses ORF data
Projects
None yet
Development

No branches or pull requests

6 participants