Fecal alchemy: Turning poop into genomics gold

When it comes to genotyping technology, poop genetics is stuck in the 1990s. While most geneticists are now awash in genome-scale data from thousands of individuals, those who depend on fecal and other non-invasively collected samples still rely on old-school, boutique panels of a dozen or so genetic markers.

But feces — along with fur, feathers, and urine — is critically important stuff for understanding the population genetics, ecology, evolution, behavior, and conservation of wild animals. Many are too elusive or endangered to allow collection of blood samples, and even for common species it is a logistical nightmare to immobilize and draw blood from large numbers of animals in the field. In the latest issue of GENETICS, Snyder-Mackler et al. describe tools that promise to advance studies of such samples into the genomic era.

Patrick Chiyo collecting noninvasive samples from elephants in Amboseli National Park. Photo courtesy Jenny Tung.

Noninvasively collected samples have the obvious advantage of easy access. “We have freezers and freezers full of baboon poop,” says study co-leader Jenny Tung (Duke University). Tung’s group works on behavior and genetics in a wild baboon population in Kenya. But though abundant, poop also presents serious challenges for standard genetic analysis. The DNA present in noninvasive samples is typically a fragmented mixture of host and contaminant sequence. For example, only around 1% of the DNA in a fecal sample comes from the animal that produced the poop. Most of the rest is microbial.

These limitations were first overcome in the 1980s and 1990s, and the ability to analyze DNA from noninvasive samples revolutionized the field. Using such samples not only allowed geneticists to understand the genetic diversity and viability of endangered animals, it allowed them to empirically test important theories about animal behavior and evolution.

“There are many examples. Noninvasive sampling of chimps, baboons, rhesus macaques and other primates revealed that animals really do bias their behavior towards relatives, even paternal relatives that are likely more difficult for an individual to identify as kin,” says Tung. “And in baboons, it also showed that males provide some paternal care to their offspring, which wasn’t expected for a polygamous primate.”

But the genotyping methods used in such studies have changed surprisingly little over the last twenty years. For the most part, researchers still use small groups of carefully validated markers, usually based on stretches of short tandem repeat sequences (microsatellites). This means the field has mostly missed out on the benefits of genomics that have become routine for medical researchers and those who work with laboratory organisms.

“Microsatellite approaches still work. But over the last 5 or 10 years it has become impossible to ignore the way genome-scale datasets allow you to answer entirely different questions,” says Tung.

For example, data on how a genome varies across a population can provide crucial evidence of the evolutionary and demographic forces that have shaped it. Genomic data can also trace in detail the mergers and separations of mixing populations.

Vet, a female yellow baboon, and her children in Amboseli National Park. Photo courtesy of Susan Alberts.

Vet, a female yellow baboon, and her children in Amboseli National Park. Photo courtesy Susan Alberts.

The good news for poop genomics is that short-read next-generation sequencing methods are well suited to the fragmented DNA found in noninvasive samples. These methods have been famously adapted for analyzing a sample type that also suffers from vanishingly small amounts of target sequence: ancient DNA. The bad news is that the expensive, intensive approaches that work well for a precious sample of Neanderthal bone are not practical for a geneticist facing a freezer full of poop.

About six years ago, Tung’s friend and colleague George (PJ) Perry published a major advance that allowed large-scale resequencing from noninvasive samples. It was based on a method known as sequence capture, which enriches for host sequence using synthetic RNA “baits” to capture the target DNA. Tung was excited by the possibilities of the methods, but realized it was still too expensive for most applications. This was partly because the baits had to be custom-designed and synthesized for the species of interest. The method also had the drawback of only capturing a tiny fraction of the genome, while consuming large amounts of sample.

“Even fecal samples are exhaustible,” says Tung. “We have a lot of irreplaceable samples from dead animals, for instance. If we’re going to use them up, we want to cover all our bases and gather data on a truly genome-wide scale.”

So Tung’s group and their collaborators worked to modify and scale up Perry’s protocol. They also constructed the baits in a considerably cheaper way, using in vitro transcription of RNA from baboon DNA templates, sidestepping the need for custom synthesis. The new protocol had more modest input DNA requirements and could enrich the target DNA by 40-fold.

But getting enough sequence per sample was just the beginning. Xiang Zhou (University of Michigan) led the group’s efforts to develop tools to analyze data from the new method. Zhou says one of the reasons microsatellites became so popular was the availability of standard and easy-to-use software for assigning paternity from the data. “If people are going to transition to a new method, we thought it would be incredibly important that we package our models into software that will make it as easy as possible,” says Zhou.

But to develop something comparable for low-coverage sequence, the team faced two major challenges: the data is simultaneously much richer (more sequence) and much lower quality (more uncertainty). To deal with the large quantity of data they needed much more computationally efficient algorithms. They also had to factor in the lower data quality, which makes it impossible to use the simpler approaches that work when the genotype at each site is known with certainty. Instead, they incorporated the error rate across all the sites in the genome, generating a sophisticated statistical model.

One of (several) freezers in the Tung lab containing boxes of fecal samples. Photo courtesy Jenny Tung.

Using the new capture method and the paternity assignment software (called WHODAD), the team were able to construct pedigrees from baboon fecal samples that almost perfectly matched those created using traditional analysis of high-quality DNA from blood. In short, despite the low coverage of the genome (typically less than 1x), and the resulting very high uncertainty of the genotype at any one site, the trends in the data were more than enough to reconstruct family relationships.

But what about cost? Lead author Noah Snyder-Mackler gave the project the pet name “fecal alchemy” because it aims to transform poop into a data goldmine. But not every researcher can afford gold — most labs must use the cheapest tool that will get the job done. Tung says they included a cost analysis in the paper because they are regularly asked about the price of making the switch.

“Right now it costs about twice as much to produce 1x coverage of the entire baboon genome as it does to type 14 microsatellites. But the amount of information you get is much greater! So if you’re thinking in terms of cost per genotype, our method is way more cost effective. But in terms of absolute amounts it’s more expensive. In the end the cost-benefit decision depends on what questions you’re trying to answer,” says Tung. “Of course we’d like to get it even cheaper and more efficient and more robust. We’re working on it!”

FUNDING

This work was partly funded by the National Science Foundation DEB through an EAGER grant, with co-funding from NSF Biological Anthropology.

CITATION

Noah Snyder-Mackler, William H. Majoros, Michael L. Yuan, Amanda O. Shaver, Jacob B. Gordon, Gisela H. Kopp, Stephen A. Schlebusch, Jeffrey D. Wall,Susan C. Alberts, Sayan Mukherjee, Xiang Zhou, Jenny Tung (2016). Efficient Genome-Wide Sequencing and Low-Coverage Pedigree Analysis from Noninvasively Collected Samples. Genetics, 203(2), 699-714.

http://www.genetics.org/content/203/2/699

DOI: 10.1534/genetics.116.187492

Behavior, Ecology, Evolution, Genetics Journal, Genomics, Population Genetics, Primates, Sequencing, Wildlife

Cristy Gelling is a science writer, lapsed yeast geneticist, and former Communications Director at the GSA.

View all posts by Cristy Gelling »

Thank you, GSA community!

Thank you for being a member of the Genetics Society of America! As GSA’s current president, I am writing to tell you about Society projects and initiatives that we hope you will find useful in advancing your science and your career. Scientific research is a collaborative and exciting endeavor. Scientific societies like GSA exist to…
Where are they now? Rosalind Franklin Young Investigator Award recipients share updates on their research

Rosalind Franklin Young Investigator Award applications are open–make sure you submit your application or nomination of a colleague by September 30, 2024.
University of Minnesota researchers map genome of the last living wild horse species

The study, published in G3: Genes|Genomes|Genetics, is part of larger conservation efforts to save Przewalski’s horse.
Congratulations to the Spring 2024 DeLill Nasser Awardees!

GSA is pleased to announce the recipients of the DeLill Nasser Award for Professional Development in Genetics for Spring 2024! Given twice a year to graduate students and postdoctoral researchers, DeLill Nasser Awards support attendance at meetings and laboratory courses. The award is named in honor of DeLill Nasser, a long-time GSA supporter and National Science Foundation…
Carolyn Damilola: an NFS Rising Scientist on a lifelong quest to learn more

Carolyn Damilola is an NFS Rising Scientist from Nigeria doing respiratory system research and paving the way for scientists from underrepresented communities through mentorship.
What does a good microgrant proposal look like?

Members of the Microgrant Review Committee share their tips for a successful proposal.
The first piece of the facial recognition puzzle

New research in GENETICS gives a first peek at the molecular pathway involved in recognizing faces.
New Senior Editor Amy MacQueen joins GENETICS

A new senior editor is joining GENETICS in the Genome Integrity and Transmission section. We’re excited to welcome Amy MacQueen to the editorial team.
Block party on the zebrafish sex chromosome

Research in G3 identifies a gene regulatory block of the zebrafish genome responsible for overseeing the maternal-to-zygotic-transition.
Unraveling the mysteries of duckweed: epigenetic insights from Spirodela polyrhiza

Research published in G3 offers insight into the impact of DNA methylation on clonal propagation in asexually reproducing plants.
A microbiologist’s quest to understand CRISPR in bacterial self-defense

2024 Genetics Society of America Medal recipient Luciano Marraffini determined how CRISPR-Cas systems destroy genetic targets with precision, paving the way for gene editing technology development.
Unlocking mysteries of trait and disease heritability in dogs

2024 Edward Novitski Prize recipient Elaine Ostrander, a pioneer of the domestic dog model, discovered numerous genes affecting dog size, morphology, behavior, and disease susceptibility—many of which have relevance in humans.
GSA and collaborators Personal Genetics Education & Dialogue and Reclaiming STEM Institute launch NSF-funded BIO-LEAPS project to support culture change in genetics

We are thrilled to announce that the Genetics Society of America (GSA) is collaborating with the Personal Genetics Education & Dialogue (PGED) based in the Department of Genetics at Harvard Medical School, and the Reclaiming STEM Institute (RSI) on a Leading Culture Change Through Professional Societies of Biology (BIO-LEAPS) grant from the U.S. National Science…
Daman Saluja: Navigating Science and Policy in India

In the Paths to Science Policy series, we talk to individuals who have a passion for science policy and are active in advocacy through their various roles and careers. The series aims to inform and guide early career scientists interested in science policy. This series is brought to you by the GSA Early Career Scientist…
A fly geneticist’s journey into discovering rules of organ development

2024 George W. Beadle Award recipient Deborah Andrew discovered new genes and pathways in Drosophila salivary gland organogenesis. Now, her work can help optimize cell secretion in therapeutic applications and fight malaria.
Małgorzata Gazda: How receiving the DeLill Nasser Award helped her land her dream job

Have you ever experienced an event that changes the course of your life, or in this case, your career? Małgorzata (Gosia) Gazda is Assistant Professor at the University of Montreal and in 2022, she received the DeLill Nasser Award for Professional Development in Genetics, which she used to attend and present at the 2022 Population,…
Hongyu Zhao joins GENETICS as new Senior Editor

A new senior editor is joining GENETICS in the Statistical Genetics and Genomics section. We’re excited to welcome Hongyu Zhao to the editorial team.
GSA Member Julio Molina Pineda Receives DeLill Nasser Award, Shines at TAGC 2024

“At any career stage, the GSA membership is an amazing investment for any genetics professional!” Julio Molina Pineda is a PhD Candidate in Cell and Molecular Biology and a Research Assistant at the University of Arkansas, and a Doctoral Academy Fellow at the Lewis Lab. In 2023, Julio was awarded the DeLill Nasser Award for…
In Memoriam: Ellsworth Herman Grell (1932–2023), a pioneer of Drosophila genome engineering and annotation

Ellsworth (Ed) Grell blessed the Drosophila community through three enduring legacies: as a pioneer of chromosome mechanics, as a primary organizer and synthesizer of genetic knowledge in Drosophila, and as a graceful mentor to those fortunate to have known him personally. Ed grew up in rural Nebraska, completed his undergraduate studies at Iowa State, and…
Congratulations to the #Fungal24 Poster Award winners!

We are pleased to announce the recipients of the GSA Poster Awards for posters presented at the 32nd Fungal Genetics Conference! Undergraduate and graduate student members of GSA were eligible for the awards, and a hard-working team of judges made the determinations. Congratulations to all! Felicia Ebot Ojong, The University of Georgia My research is focused…
Poster presentation tips for TAGC 2024

You’ve been selected to present a poster at The Allied Genetics Conference 2024 in March—you’ve celebrated, made plans to attend, now what? This is an exciting opportunity to showcase your research and engage with fellow members of the genetics community, so you want to make sure you’re prepared. We wanted to offer you some tips…
Maximize your TAGC 2024 experience

A guide to all that National Harbor & DC have to offer Are you joining us for The Allied Genetics Conference 2024 in March? Make the most of your #TAGC24 experience in National Harbor! We know the science will keep you busy, but you deserve to unwind and have some fun, so we’ve curated a…
Early Career Leadership Spotlight: Sarah Petrosky

We’re taking time to get to know the members of the GSA’s Early Career Scientist Committees. Join us to learn more about our early career scientist advocates. Sarah PetroskyMultimedia SubcommitteeUniversity of Pittsburgh Research Interest I am interested in understanding adaptation that has been happening recently in populations by dissecting the ways that genes underlying an adaptation…
TAGC 2024 Early Career Award Winners

GSA is pleased to announce the winners of the early career awards presented at The Allied Genetics Conference 2024. These awards are specific to particular TAGC communities and recognize early career scientists’ outstanding work on their respective research organisms. The awardees will present their talks in keynote sessions at TAGC 2024. Don’t miss the opportunity…
Preeminent geneticists recognized with revamped GSA Awards

In 2022, GSA’s Board of Directors launched an audit to review the five major awards conferred by the Society. Today, we are thrilled to announce the recipients of the reimagined GSA Awards, including the new Genetics Society of America Early Career Medal. The scientists honored this year are recognized by their peers for their outstanding…
Fly Board funds outreach programs to spread the word about Drosophila research

In 2020, the Fly Board voted to use part of its reserve fund to support efforts to increase trainee participation as well as equity and diversity in the Drosophila community. An awards committee decides how the money will be spent each year, and from 2020–2022, the committee posted a very broad call for applications from…
New members of the GSA Board of Directors: 2024–2026

We are pleased to announce the election of four new leaders to the GSA Board of Directors: 2024 Vice President/2025 President Brenda Andrews Professor, University of Toronto It’s an honor to continue my association with the Society by serving as Vice President of the Board of Directors. I have broad knowledge of the ongoing activities…
Congratulations to the 2026 Yeast Poster Award recipients!

Each year, GSA recognizes outstanding student research at the Yeast Genetics Meeting, honoring exceptional presentations by undergraduate and graduate student members. Award recipients are selected based on both the scientific merit of their research and the clarity of their presentation. Please join us in congratulating this year’s awardees. Nasima AkhterUniversity of Rochester I study the…
Tips for finding a scientific narrative

How many robotic talks and lectures have you experienced in your scientific journey? Probably more than you can count! As scientists, we typically prioritize accuracy when communicating our work but sometimes neglect to ensure the audience remains engaged. One way to hold your audience’s attention during presentations is to use a universal communication tool: storytelling. …
Early Career Leadership Spotlight: Bahaar Chawla

We’re taking time to get to know the members of the GSA Early Career Scientist Subcommittees. Join us to learn more about members of the Early Career Leadership Program.