By Allison Proffitt
June 9, 2021 | At the DECODE: AI for Pharmaceuticals forum yesterday, Puneet Batra, director of machine learning at the Broad Institute, outlined the mission of the new Eric and Wendy Schmidt Center at the Broad: to position biology to drive the next era of computing.
The EWSC was launched at the end of March thanks to a $150 million endowment from Eric and Wendy Schmidt that was matched by The Broad Foundation. "The Eric and Wendy Schmidt Center seeks to understand the programs of life and how they're organized across three biological scales: cells, tissues, and organisms," Batra explained. "We are doing this, and promoting this, by convening a community of computational scientists and biologists." The goal is not just to bring the tools of modern machine learning to bear on biological discovery, though that's a great goal; it's also to make biology a key driver of advances in computation itself, he added.
The group is international by design. Twenty-five percent of the combined $300 million endowment is committed to use outside of Boston, and the community of collaborating computational scientists, biologists, and clinicians is already fairly extensive. Beyond the Broad Institute, MIT and Harvard communities, collaborators include Mila (Quebec AI Institute), led by Yoshua Bengio; European Laboratory for Learning and Intelligent Systems, Tuebingen, led by Bernhard Schoelkopf; The Alan Turing Institute, directed by Sir Adrian Smith; Oxford Big Data Institute, directed by Cecilia Lindgren; clinicians and researchers at Mayo Clinic and Geisinger; biopharmaceutical companies, including Genentech (a member of the Roche Group), AstraZeneca, and Novartis; technology and research companies focused on scientific inquiry, including DeepMind, Google Research, and Microsoft; Mikhail Belkin (UC San Diego), David Blei (Columbia University), Marzyeh Ghassemi (University of Toronto), Jennifer Listgarten (UC Berkeley), and Mihaela van der Schaar (Cambridge University).
The Schmidt Center is co-directed by Caroline Uhler, Associate Professor of Electrical Engineering and Computer Science and the Institute for Data, Systems, and Society at MIT and an associate member of the Broad Institute; and Anthony Philippakis, Broad's chief data officer.
Biology as Driver
Biology and machine learning can together untangle some of the fundamental questions about the programs of life, Batra said. "We think there are a lot of neat problems in biology that need to be addressed by new developments in machine learning," he said. For instance: how do genes interact to form cell types? How do cell types give rise to tissues? And, at the organism level, how do genotypes map to phenotypes?
There are two great revolutions of the 21st Century: the explosion in data technologies (machine learning, cloud, etc.) as well as the blossoming of biological technologies (sequencing, single-cell genomes, medical imaging, etc.). These two revolutions are converging, Batra said, and together they will open a new door on biological research.
But Batra insisted that the goal is not simply to apply machine learning to biological questions. Machine learning, thus far, has been driven by image recognition and predictive accuracy. Driverless cars are a prime example. But for biological questions, he argues, "our aim is to understand natural laws." Machine learning needs to move from predictive accuracy to causal modeling, from "what?" to "why?"
Biology, and these unique biological questions, can serve as a key driver of advances in computing.
ML4H: Machine Learning for Health
From its beginning, EWSC partnered with the Machine Learning for Health (ML4H) project at Broad. Batra co-leads this 25-person, cross-disciplinary team of computer scientists and clinician researchers. The three pillars of what ML4H does, Batra said, are looking at rich data and outcomes, applying deep learning, and driving clinical questions. "We are not just interested in applying models to data, but we're really interested in changing how patients are treated," he said. "Understanding not just the machine learning side, but also what does it mean to improve patient care is a critical driver for the problems we choose to work on."
ML4H work is driven by the deep phenotype hypothesis: the supposition that there are many more phenotypes with genetic bases waiting to be discovered in rich data sets. "Once we have these phenotypes, it'll accelerate clinical impact in a variety of ways," Batra believes. He expects these deep phenotypes to enable biological discovery, identifying precise genetic architecture of disease and its progression; create new biomarkers and improve trial selection; and improve patient screening, predicting who will get sick and to which therapeutics he or she will respond.
To support the search for these deep phenotypes, Batra and his group are using real-world data from a 500,000-patient primary care cohort with an average of seven years of follow-up per patient. "It's deep-learning scale," Batra said, comprising 60 billion word tokens, 7 million ECGs, rich imaging echoes, and tens of thousands of longitudinal outcomes including stroke, heart attacks, heart failure, and more.
"These are the kinds of datasets that one needs to be able to build these deep phenotypes and be able to validate their impact on outcomes," he adds.
Big Data, New Strategies
But even this large dataset still requires thoughtful, new machine learning strategies. Batra outlined some of the steps ML4H takes to maximize the machine learning impact.
First, events and data labels are precious, Batra said. "We don't have them at unlimited scales, so we found that it's really important to apply pre-trained models from either outside the healthcare industry or inside that we've built ourselves," he said, for example, BERT, DenseNet, and PCLR. "When you use [pre-trained models], it really improves data efficiency to make sure your models are learning as fast as they can on the limited data you have."
Second, choice of data representation is crucial. "We often take these rich datasets and reduce them to a smaller dimension. When that happens, we need to make sure that the representation is faithful: it captures the biology and doesn't lose it, but also doesn't enhance bias and protocol differences," he said.
Third, Batra has not found clinical relevance to be captured by the area under the curve and believes that carefully considering how the models or tools will be applied in the clinic and in trials is essential. That gives you a different way of evaluating whether it's useful or not, which is why the collaboration with clinicians is so important.
It's these practical observations that will drive EWSC's vision along with the work ML4H has been doing.
"Our vision is not just to combine these two fields, to bring modern machine learning to bear on biology, which is happening in many places," Batra said. "It's also to start to make the central questions biology needs to address, this causal aspect, this mechanistic aspect, to make those key needs drivers of additional advances in computing."