Recursion’s Post

View organization page for Recursion, graphic

50,669 followers

Using microscopy data to create powerful foundation models of cellular biology. ◾ In a new paper, researchers from Recursion and Valence Labs reveal the next level of scaling microscopy images for use in biological research – offering a significant improvement over weakly supervised learning (WSL). They are presenting their findings in a keynote and poster presentation at the IEEE/CVF Conference on Computer Vision and Pattern Recognition June 18 & 19. ◾ While images have been a powerful means to explore chemical and genetic changes that happen in cells, and while we now have massive datasets at our disposal like RxRx3 – a publicly available database of 2.2 million images that represents less than 1% of Recursion’s total dataset – developing robust and feature extraction pipelines using open source software packages has been challenging. ◾ One approach has been to use weakly supervised learning (WSL) to train models that predict the perturbations used to treat cells in an image – but these models are limited with larger datasets, including forgetting known biological relationships. In part, this is because perturbations are noisy labels with clusters of similar effects – and many have no significant effect. ◾ In the preprint, researchers demonstrate a new framework for learning representations of high content screening datasets based on self-supervised learning. They hypothesized that if they had more compute, more data, and more parameters, they would get improved performance (a.k.a., the "scaling hypothesis"). ◾ Using the BioHive supercomputer (more compute), Recursion’s public and private datasets (more data) and vision transformer (ViT) masked autoencoders (more parameters) the team found that they could scale microscopy-based representation of cellular biology that could accurately infer known biological relationships without losing recall – achieving as much as a 11.5% relative improvement when recalling known biological relationships curated from public databases. ◾ What’s more, they developed a new channel-agnostic masked autoencoder architecture (CA-MAE) that allows for inputting images of different numbers and orders of channels. Learn more: https://lnkd.in/einj4_5N Oren Kraus Kian Kenyon-Dean Berton Earnshaw Saber Saberian Maryam Fallah Peter McLean Jess Leung Vasudev Sharma Ayla Khan Safiye C. Dominique Beaini Maciej Sypetkowski Maureen Makes Kristen Morse Ben Mabey #ai #ml #tech #techbio #wsl #vit #mae #data  #cvpr

  • No alternative text description for this image
Darren Nelson 🚀 (We're hiring)

Innovative Leadership in Life Sciences & Technology Talent Solutions: Founder & CEO of Recruits Lab & BioJobs Lab, Driving Organizational Success through Cutting-Edge Recruitment Strategies

3mo

Recursion, your advancements in using microscopy data to build foundational models of cellular biology are truly groundbreaking! The strides made in scaling microscopy images for biological research, especially with the introduction of your new framework based on self-supervised learning, represent a significant leap forward in the field. It's exciting to see how your team's innovative approaches, including the BioHive supercomputer and vision transformer masked autoencoders, are enhancing accuracy and preserving biological relationships in datasets. Looking forward to hearing more about your keynote and poster presentations at the IEEE/CVF Conference on Computer Vision and Pattern Recognition! #ai #ml #tech #techbio #wsl #vit #mae #data #cvpr Recruits Lab (We're hiring)

To view or add a comment, sign in

Explore topics