Question about "AMLmut_923cells.RData" #5

Zifeng-L · 2021-07-25T03:18:01Z

Hi there,
Thanks for your great work on AML. I want to use these methods for my own data, so I downloaded the dem matrix from your GEO database and tried to repeat your analysis. How can I get "AMLmut_923cells.RData" from GEO? I checked the data but could not find it. Thanks!

modalaigh · 2023-07-04T15:20:06Z

Hi @Zifeng-L, did you manage to find out which cells are in the "AMLmut_923cells.RData" file? I currently have the same issue that you did. I downloaded the 35 anno.txt files from GEO for the AML patients and saw that there was a column labelled "MutTranscripts". I initially thought that the cells with entries in this column would represent those cells belonging to "AMLmut_923cells.RData" but when I counted all the cells with entries, I ended up with 939 cells which is a bit more than what I was expecting.

petervangalen · 2023-08-05T12:58:18Z

If you want to use the mutation data, I suggest loading the .anno.txt files. The MutTranscripts and WtTranscripts columns contain the mutation calls and the number of supporting reads (separated by /).
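As a rough illustration of the "call/reads" layout described above, here is a minimal R sketch that splits such entries into a mutation call and a read count. The entry values are made up for the example, and the sketch assumes one call per cell; the real files may differ (for example, several calls in one field), so treat this as an assumption rather than the authors' own code.

```r
# Hypothetical MutTranscripts entries; NA stands for a cell with no call.
entries <- c("GENE1.X123Y/2", NA, "GENE2.A45fs/1")

# Split each entry on the literal "/" separator.
parts <- strsplit(entries, "/", fixed = TRUE)

# First piece: the mutation call; second piece: the number of supporting reads.
calls <- vapply(parts, function(p) p[1], character(1))
reads <- vapply(parts, function(p) as.integer(p[2]), integer(1))

print(data.frame(call = calls, reads = reads))
```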

yi6kim · 2024-09-21T23:41:53Z

I had the same observation, @modalaigh.
When I constructed the dataset myself using the 35 anno.txt files from GEO (as the .RData itself is not provided), I ended up with 939 cells, not 923.

Yousuk-Song · 2024-10-18T06:18:26Z

@yi6kim @modalaigh

Hi, I'm working on the same process, and I'm struggling to build AMLmut_923cells.RData too. Could you tell me how you ended up with 939?

I see this information in the *anno.txt.gz files:
MutTranscripts WtTranscripts
normal malignant
normal malignant
normal malignant
malignant normal

and counting the cell_ids for which MutTranscripts == malignant does not give a result anywhere near 900.

I would really appreciate your reply.

yi6kim · 2024-10-18T08:43:33Z

@Yousuk-Song Are you sure you added up all 35 annotation files?

When I iterate through the files:

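# Collect the 35 AML annotation files (backslashes must be doubled in an R regex string)
AML.anno.filenames <- list.files("my_directory", full.names = TRUE, pattern = "AML.*\\.anno\\.txt$")[1:35]
mut_counts <- integer(length(AML.anno.filenames))
for (i in seq_along(AML.anno.filenames)) {
  # Empty fields are read as NA, so this counts the cells that have a MutTranscripts entry
  anno <- read.delim(AML.anno.filenames[i], header = TRUE, na.strings = "")
  mut_counts[i] <- sum(!is.na(anno$MutTranscripts))
}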
print(mut_counts)

15 45 0 0 3 1 3 27 92 0 0 6 0 146 6 0 0 6 1 367 8 2 37 0 3 0 0 1 0 0 0 21 143 6 0

print(sum(mut_counts))

939
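To get the cells themselves rather than only per-file counts, the same filter can be applied while stacking the files into one table. The sketch below is self-contained: it writes two tiny stand-in annotation files with made-up content to a temporary directory, and the `Cell` column name, the file names, and the `read_mut_cells` helper are assumptions for illustration, not taken from the GEO files. It is not claimed to reproduce the original 923-cell object.

```r
# Create two small stand-in annotation files (made-up content) in a temp directory.
tmp <- tempfile("anno_demo")
dir.create(tmp)
writeLines(c("Cell\tMutTranscripts\tWtTranscripts",
             "c1\tGENE1.X1Y/2\t",
             "c2\t\tGENE1.X1Y/1"),
           file.path(tmp, "AML001.anno.txt"))
writeLines(c("Cell\tMutTranscripts\tWtTranscripts",
             "c4\tGENE2.A2B/1\tGENE2.A2B/3",
             "c5\tGENE2.A2B/4\t"),
           file.path(tmp, "AML002.anno.txt"))

files <- list.files(tmp, pattern = "\\.anno\\.txt$", full.names = TRUE)

# Hypothetical helper: read one file and keep only cells with a MutTranscripts entry.
read_mut_cells <- function(f) {
  anno <- read.delim(f, header = TRUE, na.strings = "", stringsAsFactors = FALSE)
  anno <- anno[!is.na(anno$MutTranscripts), , drop = FALSE]
  anno$source_file <- rep(basename(f), nrow(anno))  # remember where each cell came from
  anno
}

# Stack the filtered tables from all files into one data frame.
mut_cells <- do.call(rbind, lapply(files, read_mut_cells))
print(mut_cells)
print(nrow(mut_cells))
```

With the real files, `tmp` would be replaced by the directory holding the 35 downloaded annotation files, and `nrow(mut_cells)` should then match the summed per-file counts.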

Yousuk-Song · 2024-10-21T03:25:07Z

Thank you so much! I followed your method, chose the cells without "MutTranscripts == normal", and got the same result: 939.

I don't understand why the article gives no explanation or mention of the number 923.
