@James Fellows Yates has joined the channel
@Sterling Wright has joined the channel
@Ophélie Lebrasseur has joined the channel
@Nora Bergfeldt has joined the channel
@Christian Carøe has joined the channel
@Katerina Guschanski has joined the channel
@Lucy van Dorp can you post again the strobe (and any similar reporting things?)
Recent Lancet ID paper providing a set of guidelines for modern metagenomics following a community discussion: https://www.thelancet.com/journals/laninf/article/PIIS1473-3099(20)30199-7/fulltext
And an inter-lab ''metrology project' on antimicrobial resistance predictions from genotype data. Everyone sent the same fastqs, purposely designed to test standards, and chaos ensures: https://www.microbiologyresearch.org/content/journal/mgen/10.1099/mgen.0.000335 Something similar could work for taxonomic profiling
I also saw this recently: https://www.biorxiv.org/content/10.1101/2020.06.24.167353v1
As a parallel thing, I was just pondering about defining standards, given that Warinner 2017 sort of already does that (although imo is difficult to read).
Maybe as well as defining reporting standards, what about writing a paper for non-geneticists on how to review ancient metagenomics studies? Something like the PLoS comp. bio. 10 point checklist thing?
*Thread Reply:* I think that's a great idea James, I've been wanting to do something like that for a while now, as it's always hard for non aDNAers or non geneticists to get all the intense concepts/formulae/etc. because it requires so much background knowledge
*Thread Reply:* Well then give me a couple of months to actually finish my thesis and we can start ramping this up. Unless you wanna spearhead organising this one?
But aimed at people like archaeologists and palaeontolosts? It would be slimmer and more accessible (i.e. less detailed than the Warinner one, which is more for people who actually want to work on in the field)
This would be relevant for @Clio Der Sarkissian's discussion point about 'dissemination'
@Clio Der Sarkissian has joined the channel
Hi all, I'm using the authenticity criteria of Warinner et al. (2017) to analyse samples for the presence of TB. For the second one (percent identity distributions), does anyone know if there is an efficient way to create a table and/or graph showing the number of mapped reads according to sequence identity (something like c-e in the attached figure). Using BWA mapping and MALT, so maybe a script/tool that works on bam/sam files?
What if you use the edit distance for that? (i.e., you calculate how many changes are needed in each read to be the same as reference) and then you build a histogram with that info
and if you divide the number of variants in the read by the trimmed read length you can easily get the %
You can also get the percent identity from MALT, maybe in a more convoluted way. But essentially you will open your results in MEGAN and extract the info for all the reads in a specific node. This should contain the number of mismatches to the reference, which can be used to calculate the percentage identity (or edit distance for that matter?). I then plot the identities in a histogram in R. If I am not mistaken @James Fellows Yates had a script for calculating the identities and even doing the plotting. I should also have it and I could try to fish it out from my old files.
*Thread Reply:* or did I create the script?? I can't remember 😅
*Thread Reply:* From MEGAN? 🤔
*Thread Reply:* Must be ANCIENT
*Thread Reply:* If I did, I don't anymore. Sorry
@Nico Rascovan is right percent identity is basically edit distance. You'll probably find more tools to calculate this (MaltExtract can do this for you, if you still have your RMA6 files).
Cheers @Nico Rascovan @aidanva and @James Fellows Yates! Will try these! 😁
*Thread Reply:* Not meant to be self-promotion, but if you do go the RMA6 -> maltExtract route, and have lots of samples I made a small R shiny app that allows you to visualise the tabular/PDF results much faster: https://github.com/jfy133/MEx-IPA
*Thread Reply:* Nice, that sounds very useful, thanks! 😁