@James Fellows Yates has joined the channel
@Aleksandra Laura Pach has joined the channel
As you may have noticed I'm very excited about the dataset you're compiling @Aleksandra Laura Pach
I just remembered that I actually wanted to cite something like this as a justification for nf-core/taxprofiler. I've got a preprint very close for this, and was wondering how soon you think you would feel comfortable making the list available already?
I'm being selfish now and I don't need it but was just wondering
Yes itβs great! So happy to see people invested π
I could push as much as I can for it now, no problem
Even if it's just an initial table of the names of the tools and the URLs
Itβs very preliminary at the moment but here is the sheet link if anyone wants to take a look: https://docs.google.com/spreadsheets/d/1WfI3GSfd0Wetvmx5LEyosiqTInuvTNVlryF-Kks6MDE/edit
*Thread Reply:* Commentable atm. I was a bit wary with sharing an editable version to all but feel free to request edit access!!
Ah! Thatβs almost there πͺ:skintone3:
Iβm just unsure whether Iβm missing some
I can have ac look at some point next week
Great!! Iβll try to streamline it as much as I can atm
See if I see anything but it looks like you've got most of the ones I know π
Thatβs reassuring π₯Ή
Have you checked the profilers tested in the CAMI and LEEMI(?) challenges?
Yes I did! They should be in the list.
It was actually only recently that I realized that these existed - but they'll be included in the "these are the benchmarking "papers" out there" list.
Also just added a link to the spreadsheet so one doesn't have to scroll every time π
What goes into your 'classifcation procedures' tab btw?
With this tab I tried to create an overview of the classifiers/pipelines that different labs used in their publications - restricted to eukaryotic taxa. I made it to try to quantify what people were actually doing, on which databases, with which kind of sequencing data etc.
Ahaaaa I found what I had mentioned at iSBA: https://microbe.land/97-metagenomics-classifiers/
But that was from 2018 by that looks of it!
(although we should maybe check the zotero)
Uuh! Outdated or not, Iβll take a look π
If it's abandon ware it would be a good thing to resurrect
Okay, I've been though all my pending papers etc. and put them in the sheet. I'm going through Jonathan Jacobs' list of 97 publications now. I can't really figure out Zotero, but I'll try to reach out to Jonathan and see if he can be of some help.
Ok cool! If I find spare minutes I can look into Zotero too (I used it a very long time ago)
In a couple of weeks we can make have a short zoom call with any people interested to Sketch out what you want to get out of it (and what I/we can do to help?)
Do you want to make a when2meet or something and post in #general briefly outlining what you have/want to do? Or do you want to speak to your supervisor first?
I think Hannes is cool with this (I'm also only employed as a RA as of now so I don't know what the rules - or social rules for that sake - dictate). π¬ Anywho, I'll write up a post today :meow_party:
*Thread Reply:* Haha fair enough!
Hey @James Fellows Yates should we add all the interested parties into this channel or should I just try to organize everything in a thread of the post first?
*Thread Reply:* And maybe leave a comment after the one on #general that if anyone else wants to get involved they should join <#C05SQA7RA81|yet-another-taxonomic-profiler>
Btw feel free to rename the channel/project! π
*Thread Reply:* I was working on the fly
*Thread Reply:* No worries! This all is great! Thanks for all the help!! π:skintone4:
And maybe give people half a day more to react
Bring them in here, and say they can leave if they change their mind :)
@Mohamed Sarhan has joined the channel
@Meriam van Os has joined the channel
@OphΓ©lie Lebrasseur has joined the channel
Hey all! Super happy that you guys are interested! Iβll give everyone half a day (ish) more to react and then we can try to set up a little brainstorm session. π§ π₯
I am totally a beginner, so the help I can provide is very limitedπ
That's not an issue - SPAAM is all about helping the community learn and improve the quality of research for everyone!
Zotero group doesn't seem to exist anymore π
https://www.zotero.org/bioinformer
So much work just left unfinished π₯Ή But it's alright, I might dig a bit and see if there is anything helpful in there anyways. Thanks for checking!
I think we should book a timeslot to chat though! Please tick any that work for you (if none work then leave a comment)
@Anna-Lena Titze has joined the channel
@channel Thanks for the interest everyone! I can see we are 16 in the channel!! If I could get a couple more of you to vote for a day (or tell if none of them work) then it would be great.
The name of this channel strangely reminded me of this π https://github.com/alesssia/YAMP
*Thread Reply:* It is very likely that I had that in my mind actually, I litrelly 5 minutes ago opened that to add a comaprison to the taxprofiler preprint π
Hi guys, I've ticked the 28th and 29th but since they're my last days on my current contract, they're more tentative as things may pop up I don't yet know about
That's absolutely fine. I think this is the tie breaker though and we'll go for the 29 sep. (in a week). I'm also keen to get started to chat about this so I don't mind it to be earlier than later π
Which times would work? Would 11am (CEST) be fine with all? @Ciara Wanket @Louis L'HΓ΄te @James Fellows Yates @Anna-Lena Titze @Arve Lee @Mohamed Sarhan @OphΓ©lie Lebrasseur
(presumably we would report back meeting minutes for those who can't join?)
Also, just a quick update (@channel). I just finished concatenating all the different lists and notes and the current list consists of 145 profilers and pipelines at this moment.
I'm in the US, so 11am CEST would be 2am for me, but I don't mind getting caught up later! Perhaps the brainstorm session could be recorded if people are comfortable with that?
Ah yeah, that's a pickle. But yes, of course! I'll ask the participants and we can either do that or make sure to take really good notes.
I forced myself to go on the birdsite, just for you @Aleksandra Laura Pach π
Might get the last version of his list!
I feel dirty but all for the greater good!
And I do appreciate it! I swore I would never make an account there but made one to message Jonathan Jacobs. I guess public tweets work better at guilt tripping into answering hehe π
*Thread Reply:* Haha, sorry! I should have said earlier... But i had also asked if anyone knew what I was looking before, and I realised it would be mean not to post it if I found it ..
But at least you can delete the account nowπ
*Thread Reply:* Hahah, no worries. You are absolutely right, and we got an answer out of it π
And I might, some wise men say it's better to have one.. I'm still very hesitant π¬
*Thread Reply:* I would've said 6 months ago that yes it's better to have one, it's very good for networking. But now it's a pile of poo.
Mastodon is much better, the way it works out keeps you on one topic so it's less 'addictive' and currently a lot of scientists are there
*Thread Reply:* Alright alright!! Good intel! Thanks
@Jonas Niemann has joined the channel
@Aleksandra Laura Pach do you have a final decision for when, and how many people do we have? Gather or zoom ?
Yes! This Friday (the 29th)! We should be 7 and I think gather would be cool if all are up for it! I could need a bit help though as Iβve only tried it for the summer school π
At 11?
if it's 7 then gather should still work! Same link as before: https://app.gather.town/mapmaker/PlXjb0deog0B4JCq/spaam-community
No setup needed π
Yes! Great! Alright @channel. 11am CEST this Friday (29th) weβll have a little get together where Iβll present all the work Iβve done so far on this project. Weβll meet on Gather (above link). Iβm really looking forward to seeing yβall! Come if you have a bunch of ideas, questions or if youβre just curious and want to listen in π₯³
*Thread Reply:* I'll try to be there too! But it's a bit of a funny time for me as I'm in NZ, so don't wait for me if I'm not there haha.
@Maria Lopopolo has joined the channel
Hey yβall am I too late to join in?
Absolutely not! There is nothing such as being too late for joining! βοΈ
That was the perfect chance to extort cookies π
Aaargh! My bad. Iβm not good at being the bad cop. π₯Ή especially not to Maria.
*Thread Reply:* β€οΈ you really Maria
*Thread Reply:* (but I will of course accept πͺ s from both of you next time I come through Paris/Copenhagen to compensate my pain π )
*Thread Reply:* Are :doggo: hugs also allowed as compensation?
*Thread Reply:* I'll allow it
confused
https://genomic.social/@PhilippBayer/111140400617023978
Also in a separate comment the same guy asked in the context of nf-core/taxprofiler about custom database building... https://genomic.social/@PhilippBayer/111140387294382855
Once the idea of this project has been formalized maybe we could borrow the idea and record something similar?
*Thread Reply:* Abso-heckin-lutely! The DB creation is tightly linked to the overall performance and user experience of the profiler so we should definitely look into this!
*Thread Reply:* I think simply 'can you build custom databases' would be a big boon
@Aleksandra Laura Pach do you have a google doc or similar with an agenda?
(and I might be +5-10m late, midwife is finishing up but she's chatty)
I don't. I wanted the first session to be a bit open, especially since we're not going to be that many. But I do have a little presentation. I was thinking for it to go like this: β’ Quick presentation rounds β’ Me showing what I've done β’ Brainstorming i.a.
I'm in the little blue room on the bottom left!
https://app.gather.town/app/PlXjb0deog0B4JCq/spaam-community
Meeting notes if anyone wants to help: https://docs.google.com/document/d/1FfRCjMYO-HHbbqqKsHZipsNYjPvhzF9mIgEas8yJrUw/edit?usp=sharing
SPAAMI challenge
(dunno what it stands for but still)
CAMI = Critical Assessment of Metagenome Interpretation
Sxxx Pxxx Assessment of Ancient Metagenome Interpretation
Super Profound Society aPpraisal Societal Precocious Social Platform for
but could maybe inspire some better suggestions π
we could think of integrating Sedimentary and Paleo- ?!
SedaDNA (as this is the aDNA related term is my understanding) and Palaeogenomic
Nice! I think "Standardized Paleogenomic Assessment of Ancient Metagenome Interpretation" could work :catjam:
I was thinking about Standardized but was afraid that it was promising too much hahaha
We tried a few years ago to add a aDNA challenge dataset to LEMMI benchmark, but that never really took of.
https://gitlab.com/ezlab/lemmi/-/issues/5
Well.. if we can't get into their club we'll make our own! Maybe this (π) Issue will be even louder then.
Hey I was thinking about more stuff to add. So I think one important point to know itβs also which type of data are required to do the taxonomic classification. In my experience, different labs process the fastq at adapter removal step differently and this may impact the output of the taxonomic classifiers. Do you think itβs worth adding a column about how the fastq were trimmed? (e.g. collapsed, uncollapsed, singletons etc.).
*Thread Reply:* This is a good point! I think it is more of a pipeline tab than classifier one if I understand correctly? But I could definitely add that in in one way or another!
*Thread Reply:* Yes indeed I think it would fall under the pipeline category!
*Thread Reply:* I'm not sure I follow... Do you mean just whether the pipeline does the trimming/merging for you?
*Thread Reply:* That and how it was done too bc it may affect how the classification itβs done
*Thread Reply:* Yeah reasonable!
Does anyone know of standalone tools that simulate deamination on the ends of reads?
We may already have a route into a benchmarking classifiers on aDNA π
https://gitlab.com/ezlab/lemmi/-/issues/5#note_1591209730
besides ADRSM (which I wrote a while ago), I only know gargammel
But both don't have the deamination simulation as a standalone thing right?
(like give a FASTQ file and tweak the ends)
Would NGSNGS work for this? https://github.com/RAHenriksen/NGSNGS
I've been using it for some of my simulations and been quite happy.
Ah but it intakes a FASTA not FASTq so not sure if it would do exactly what's needed here
Mmm possibly. I've not lookined into the lemmi workflow yet exactly
Gargammel has deamSIM. It works perfectly fine with just a fq file and your patterns. (At least it did for me).
*Thread Reply:* You can run it as a separate command?
*Thread Reply:* Yes! I guess not for fastq but fastq but one can get around that. Here is in general how I did it for my thesis simulation: https://github.com/AleksandraLaura/thesis/blob/main/1.%20Data%20prep/1.1.%20Simulation/commands.txt|https://github.com/AleksandraLaura/thesis/blob/main/1.%20Data%20prep/1.1.%20Simulation/commands.txt
Looks like thanks to @Maxime Borry and you guys, we are getting aDNA benchmarking data in LEMMI!
https://gitlab.com/ezlab/lemmi/-/issues/5
*Thread Reply:* That's amazing! Thank you for taking on the correspondence and explaining things so well in there! π
*Thread Reply:* Again, something I can do single handed π€£
*Thread Reply:* But @Maxime Borry please chime in if I've said anything wrong/you have better ideas (or anyone else for that matter)
*Thread Reply:* Haha no, let's see how it takes to have it implemented by the LEMMI people π As you saw, my original issue is already almost 4 years old
How's it going classifier committee commander (i.e. @Aleksandra Laura Pach)
*Thread Reply:* A couple urgent project took over lately to be honest, but itβs not a standstill - just slower progress!! The sheet is almost ready and Iβm figuring out whether/how to move it over to GitHub!
Hey guysss! Hope it's not too late to join βΊοΈ
Welcome! (and absolutely not!!) It's been quiet here lately, but I am working on creating a good baseline for all of us to go out from so stay tuned! Soon! πββοΈ
Hi everyone! I have been testing some of the columns that we agreed on at the last meeting and I think the sheet is coming together now! I wanted to organize a little session where I go over all of the different features and how I imagined the filling out process to go as well as hope to hear any input you guys might have. I hope everyone here is still keen on being a part of this - as I'm super keen on getting started with filling out the sheet. So please vote in the poll above for the days that could work for you (Iβm thinking to set at least 1h aside for it but anyone is welcome to join at their own capacity). @channel
*Thread Reply:* would it be possible to meet after 15 CET? I'd really like to join, but am in California. Completely understand though if that doesn't work for most people/if I'm the only one in this situation
*Thread Reply:* It would be perfectly fine for me! Letβs see which day gets the most votes and see what everyone says. But please just vote regardless of time! Iβm fine with meeting at 10pm CET if it means that a lot of people can join π
@channel I will give it until tomorrow for everyone to select the dates! But so far it looks like the 4th is winning! π₯³
@channel Aaalllrighty!! The 4th of December it is! :partyparrot: Would everyone be okay with 15 CET? That way we can both include the EU people as well as our friends overseas π Let's just go for the same link (so gather town) again, I think it should be alright? I'll repost it again when we get closer to the day. I will go over what my specific ideas are for everything as well as the sheet so if you'd like to come a bit prepared I would suggest to think about two things prior to the day: 1) look at the sheet (pinned in the channel) and see if you have any immediate questions or thoughts. 2) think about how much and how you could be able to contribute as I would love to get an idea of how we can proceed with the bulk of the work and that also depends on you guys:))))) Looking forward to it! π€© Cheers!
*Thread Reply:* I will likely have to leave early for Kita pickup but otherwise π
Could you make a google calendar invite π: jfy133@gmail.com
Hi everyone! The day has finally come and I'm very exited to meetup at 15 CET and talk about what has been done so far and even more importantly talk about what needs to be done. We meet in Gather (the link below). π₯³ https://app.gather.town/app/PlXjb0deog0B4JCq/spaam-community
*Thread Reply:* Due to overlap ping appointments I might only be able to attend the first 30mish :(
*Thread Reply:* That's alright. You can overload us with input for the first 0.5h and then we can process it for the rest of the time π
*Thread Reply:* Will let you know if I can't join at all
@channel We're here all ready to chat! :meow_party:
We're beginning now but feel free to join whenever
@channel Hey guys! Thanks for a great meeting and all of the helpful input! Super happy to actually get going with this π Please fill out the poll with which day a week would work best for you to allocate for this once a month (doesn't mean that you have to be here for all of it, but this is the day we will in general work on it together in Gather). βοΈ
Was there a final plan of action/todo in the end?
Yes:
*Thread Reply:* I will be travelling back on the 8th maybe itβs worth making a date survey after the day survey π
@channel let's pick a day before we all get too busy :meow_party:
@channel Can I get just a few more people to answer the poll before making the final decision? πββοΈ (and thank you guys who already have answered)
*Thread Reply:* I can possibly make any but I have no idea where I will be on each of the days so I don't want to influence the result
@channel Alright guys! Last info message for this year. π We'll meet the 29th of January on Gather (same link). I'll be there at 9AM CET and for the rest of the day (~9PM CET). You are welcome to join in whichever capacity and however long you can manage.
The plan is to repeat this initiative every last Monday of the month in 2024. Whoever would like a google invite for this can follow this link: https://calendar.google.com/calendar/event?action=TEMPLATE&tmeid=NWFtN3IzMm1ma3ZyNDllMTkxZGVoMTkyb2xfMjAyNDAxMjlUMDgwMDAwWiBhbGVrc0BwYWxhZW9tZS5vcmc&tmsrc=aleks%40palaeome.org&scp=ALL|https://calendar.google.com/calendar/event?action=TEMPLATE&tmeid=NWFtN3IzMm1ma3ZyNDllM[β¦]hbGVrc0BwYWxhZW9tZS5vcmc&tmsrc=aleks%40palaeome.org&scp=ALL
Thank you again very much for all of the input so far! I'm very keen on getting to work on this! :meow_party:
deamination patterns has been added to the new upcoming LEMMI classifier benchmarking tool (V2)!!
https://gitlab.com/ezlab/lemmi/-/issues/5
Fragmentation isn't there yet, but would someone be interesting/willing to test it out to see how 'realistic' the simulatd ancient mode looks like?
*Thread Reply:* @Maxime Borry you might be interested in particular as the original author of the idea / that thread π )
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10782538/
Uhh! Added to the list: (https://docs.google.com/spreadsheets/d/1an35fZM-rUzYJg1XVepzkRVon6DnUuY3c4mYLSfpwGk/edit#gid=1463535337)
Also, small update: 151 now π
*Thread Reply:* Here is 152 for you @Aleksandra Laura Pach π https://joss.theoj.org/papers/10.21105/joss.04360
Hey @channel! π This is just a reminder that tomorrow Iβm going to be available on gather for the whole day to work on filling out our sweet tool overview! Join in your own capacity but Iβm super keen on getting to it with you! (If someone already knows that you will join and at which times can you please reply in this thread so I can plan my day accordingly). See ya!
*Thread Reply:* I'll be probably joining for about 30m some point during the morning
*Thread Reply:* I'll also join from around 30min or so at ~10am PST. No need to be there at that time though given the time difference.
Good morning! βοΈ Here is where I will camp for the day:
Oh btw, another one to add (featuring me!): https://www.biorxiv.org/content/10.1101/2023.10.20.563221v1
Hi Everyone! @channel Thank you all who joined today. Super nice to see it all come together!!! I will log off gather for now but I'm still available slack for the rest of the day so please text me if you will have a go at filling out the sheet within the next 5h and have any questions!!
The general procedure of "How To" have a go at helping out with this project can be found both on gather on the whiteboard which can be seen on the screenshot above, in the canvas attached to this channel, in the working sheet linked to this channel (the first tab) and here:
How_to:
Hi @channel! A bit late because I needed to land from my vacation, but as stated in the previous message - I'll host the next workshop tomorrow! I will be ready on Gather ~10 AM CET and will be ready on Slack throughout the whole day for any questions. All principles from the message above still apply! Hope to see you! β¨
@channel Hey everyone! This is a reminder that next Monday it's the last Monday of the month - meaning the march edition of the classifier-committee workshop! We've done great progress so far but there is still much to be done :spaam:
I'll be available for the whole day to answer any burning questions and will pop on gather (https://app.gather.town/app/PlXjb0deog0B4JCq/spaam-community) from ~10CET to describe some classifiers.
If you'd like to have this in your google calendar here is an invite for this and all of the future events: https://calendar.google.com/calendar/event?action=TEMPLATE&tmeid=NWFtN3IzMm1ma3ZyNDllMTkxZGVoMTkyb2xfMjAyNDAxMjlUMDgwMDAwWiBhbGVrc0BwYWxhZW9tZS5vcmc&tmsrc=aleks%40palaeome.org&scp=ALL|https://calendar.google.com/calendar/event?action=TEMPLATE&tmeid=NWFtN3IzMm1ma3ZyNDllM[β¦]hbGVrc0BwYWxhZW9tZS5vcmc&tmsrc=aleks%40palaeome.org&scp=ALL
I hope to see many of you there! Cheers π₯³
*Thread Reply:* I can join this time π
*Thread Reply:* For a coule of hours anyway π¬
I'll be there in 10 minutes or so, need to finish an #ancientmetagenomedir PR
Gonna join soon, connection is horrible so I am going to only open the drive! gonna pop up on gather if I manage
HeyHey Everyone!!! @channel I just wanted to remind you that we roll again next week on Monday the 29th! The table is coming together quite nicely π€©π I'll be on from 10.30 CET aaaand then I'll keep going for the rest of the day! Come join me on gather or just say hi and/or ask a question on slack! See ya!!! :salute:
*Thread Reply:* Can only confirm on the day, depends what Aida has scehduled that week
*Thread Reply:* Unfortunately scheduling conflicts with Aida mean I can't do it on Monday. But I'll do a little bit on Tuesday too make up for it
*Thread Reply:* Absolutely completely fine π
*Thread Reply:* Going to connect after lunch β€οΈ
Link to the event in case someone needs it: https://calendar.google.com/calendar/event?action=TEMPLATE&tmeid=NWFtN3IzMm1ma3ZyNDllMTkxZGVoMTkyb2xfMjAyNDA0MjlUMDcwMDAwWiBhbGVrc0BwYWxhZW9tZS5vcmc&tmsrc=aleks%40palaeome.org&scp=ALL|https://calendar.google.com/calendar/event?action=TEMPLATE&tmeid=NWFtN3IzMm1ma3ZyNDllM[β¦]hbGVrc0BwYWxhZW9tZS5vcmc&tmsrc=aleks%40palaeome.org&scp=ALL
Question fo everyone! I was working on ganon information and I discovered that if you go on the github it gives you the new ganon (ganon2) and also sends you to a new preprint https://www.biorxiv.org/content/10.1101/2023.12.07.570547v1.full https://pirovc.github.io/ganon/ I was thinking that for now I can report both and then we can decide if to not include ganon but only ganon2 Anyone has thoughts?
I would keep both, they have slightly different algorithms
But big enough that would maybe make a difference worth distinguishing between
Can someone remind me that if there is no code, no way to install, and the webserver doesn't exist anymore, do we fill out the rest of the info based on the still accessible paper?
I think it would be nice to fill out as much as possible with all the info there - even when the code is not accessible in any way.
I will go back and do that for the 2 I couldn't find stuff
@Maxime Borry we should chase the lemmi people again (just came to my mind)
Hey hey lovely people @channel!!
Today is the last Monday of May and I will be on duty again today to enlist another classifier tool. I will not pop on gather this time due to illness but I will be checking slack regularly so please chat away with any doubts or questions!
βοΈ
*Thread Reply:* Baby duty again today unfortunately :( will try to catch up a bit this week again (but getting a bit buried again)
Hope you feel better soon π€
Hey hey! @channel
Here is a little reminder that next Monday is the last Monday of the month. So if you have a little time to spare and would like a productive break from your normal busy to-do list then I'll be super available for working on our sheet! We've done great progress so far so let's keep up the great work!!! :partyparrot: (Pleeeease text me on Slack if I'm not on Gather and you want to chat or sit together - I have to do a couple of errands on the day but will be ready to collaborate at any time!!)
Here is the calendar invite if you missed it last time:
See ya!!! π€©
Good morning! I will be on gather 11-12 today!
Edit: 11:30 (Got waylaid :)
Sylph: https://github.com/bluenote-1577/sylph
Metabuli: https://github.com/steineggerlab/Metabuli
Lemur: https://github.com/treangenlab/lemur
RAT: https://github.com/MGXlab/CAT_pack
All in now :facewithcowboy_hat: Family is getting bigger and bigger π΅βπ«
Would you include? https://www.onecodex.com/platform/
saw that included ina benchmarking paper
Hey @channel!!!
Today was a productive and "counterproductive" run.
Besides filling out some more information about existing tools, we found some more!!!! But the bad news is.. we found some more π
This means that we still need a little bit of help πββοΈ I know that August will be super busy for many (especially SPAAM-ingly) so I decided to mix things up and have the last summer classifier committee session on Monday the 22nd of July!
Grab a drink, sit in a comfy couch with a snack and help us get a bit order in this chaos! πΉβοΈ Even a mere 15min would be more than helpful!!! Hope to see some of you online! (And as always, ANYONE here is welcome to work on this ANYTIME - don't hesitate to ask. The worst thing that can happen is that I will answer at my earliest convenience or send a bad meme back).
Cheers :salute:
@Maxime Borry did the Lemmi guy ever get back you with about this broken aDNA/gargammel implementation?
Not more than to you @James Fellows Yates π