Finding a happy medium boosts ChIP-seq data quality

Every lab wants to produce high-quality, reproducible data. But when that data is destined for use by the whole community as part of an international consortium, there is an even greater incentive to ensure the highest standards. A new paper in G3: Genes|Genomes|Genetics defines a critical step for the success of widely-used gene regulation experiments.

Cheryl Keller is an Associate Research Professor in Ross Hardison’s lab at Penn State University, which studies blood cell development and how cells make the transition from pluripotent stem cells to fully differentiated blood cells. They also provided data for ENCODE, a project to catalogue the functions of the non-coding parts of the genome. To identify protein binding sequences in the genome, the researchers ran assays called chromatin immunoprecipitation and sequencing (ChIP-seq). Keller noticed that the lab was getting inconsistent results from their ChIP-seq assays, despite using the same protocols and reagents. They needed to figure out why.

“This whole project really started out as a troubleshooting experiment for our own purposes as a production lab for ENCODE 3,” Keller says. “We wanted to be able to generate the high-quality data that we said we could generate, and to try to be more consistent.”

Researchers use ChIP-seq to identify DNA sequences where a particular protein binds. First, the protein is crosslinked to the DNA, and then the chromatin is broken into small segments for sequencing. Next, an antibody against the protein of interest is used to pull out the protein and whatever DNA fragments it’s bound to. The crosslinking is reversed, the protein is removed from the sample, and the DNA fragments are sequenced. Bound sequences show up as “peaks” in the genome browser, with the height of the peak depending on the number of copies sequenced.

Keller had noticed a lot of unexplained variation in the results of the ChIP-seq experiments even when using the same antibody, which is often the major determinant of ChIP-seq success. “If I did several ChIPs with the same antibody, why didn’t they all work?” she says. “I know the antibody works, I know the protein is bound in those particular cells. Why are some working and some not?”

The researchers suspected the problem might lie with their sonicator, the machine that breaks the chromatin into fragments using sound waves. The device was old, its probe studded with pockmarks, so they invested in a brand-new machine. Yet they still had some problems with inconsistent results. They also heard from other labs that others were having similar issues with poor reproducibility in their ChIP-seq data.

“We decided to try to figure out whether if we optimized sonication, it would lead to more reproducible results,” Keller explains.

To separate antibody problems from issues in other parts of the protocol, Keller’s lab tested the ChIP-seq assay using two well-studied antibodies to the transcriptional regulators CTCF and TAL1. “One of the reasons we chose CTCF and TAL1 for our troubleshooting experiments is because there have been a number of publications on these factors and we knew, generally speaking, where these proteins were going to be binding,” Keller explains. “We had some basis for assessing our experiments.”

By systematically sonicating cells for different amounts of time, the researchers demonstrated that longer sonication generated smaller average fragment sizes, as expected. Then they looked at how the average fragment size affected the sequencing results.

Because they knew which bound sequences should turn up, the researchers were able to categorize how well the assay performed in each case. They classified each data set as “pass,” “low pass,” or “fail,” depending on how well the set of peaks matched what was expected. If most expected peaks were present, that dataset would “pass,” if some peaks were missing or not as strong as they should be, that might get classified as “low pass,” and a dataset with few to no peaks would be a “fail.”

The researchers found the level of sonication and the average chromatin length had pronounced impacts on the quality of ChIP-seq signals. Too much sonication consistently reduced data quality, while the impact of undersonication differed between transcription factors. “While most of the CTCF experiments look pretty good, we had a bit of a different story with the TAL1,” Keller says. When average fragment sizes were in the mid-range, the datasets passed, but the quality declined with the largest or smallest fragment sizes.

Excess sonication may lead to the disruption of the very DNA-protein interactions the experiment is designed to identify, while fragments that are too long may allow for 3D configurations that hide the binding site. “It’s possible that when the chromatin fragments are larger, the epitope is sequestered and the antibody doesn’t have good access to it,” Keller suggests.

While in these experiments fragments in the 200-250 bp range work the best, Keller emphasizes that it is unlikely that there is a universal best fragment size. “The take home is that it should be determined empirically for each factor, for each cell line,” she says. “Ideally, you’d do a titration and determine what is the optimal shearing size for your factor of interest.”

This optimization step has benefits beyond overall confidence in the data. Keller’s team took advantage of their improved procedures to produce ChIP-seq datasets for rare cell types, which are challenging experiments where every sample is precious and there is no room for error. Their work suggests that careful control of chromatin shearing will improve the success of not only individual labs but the entire field.

G3 Journal

Caroline Seydel is an independent science writer based in Los Angeles, CA. She has a MS in genetics from Stanford University. Her writing has appeared in Nature Biotechnology, Genetic Engineering News, and Forbes.com.

View all posts by Caroline Seydel »

Thank you, GSA community!

Thank you for being a member of the Genetics Society of America! As GSA’s current president, I am writing to tell you about Society projects and initiatives that we hope you will find useful in advancing your science and your career. Scientific research is a collaborative and exciting endeavor. Scientific societies like GSA exist to…
Where are they now? Rosalind Franklin Young Investigator Award recipients share updates on their research

Rosalind Franklin Young Investigator Award applications are open–make sure you submit your application or nomination of a colleague by September 30, 2024.
University of Minnesota researchers map genome of the last living wild horse species

The study, published in G3: Genes|Genomes|Genetics, is part of larger conservation efforts to save Przewalski’s horse.
Congratulations to the Spring 2024 DeLill Nasser Awardees!

GSA is pleased to announce the recipients of the DeLill Nasser Award for Professional Development in Genetics for Spring 2024! Given twice a year to graduate students and postdoctoral researchers, DeLill Nasser Awards support attendance at meetings and laboratory courses. The award is named in honor of DeLill Nasser, a long-time GSA supporter and National Science Foundation…
Carolyn Damilola: an NFS Rising Scientist on a lifelong quest to learn more

Carolyn Damilola is an NFS Rising Scientist from Nigeria doing respiratory system research and paving the way for scientists from underrepresented communities through mentorship.
What does a good microgrant proposal look like?

Members of the Microgrant Review Committee share their tips for a successful proposal.
The first piece of the facial recognition puzzle

New research in GENETICS gives a first peek at the molecular pathway involved in recognizing faces.
New Senior Editor Amy MacQueen joins GENETICS

A new senior editor is joining GENETICS in the Genome Integrity and Transmission section. We’re excited to welcome Amy MacQueen to the editorial team.
Block party on the zebrafish sex chromosome

Research in G3 identifies a gene regulatory block of the zebrafish genome responsible for overseeing the maternal-to-zygotic-transition.
Unraveling the mysteries of duckweed: epigenetic insights from Spirodela polyrhiza

Research published in G3 offers insight into the impact of DNA methylation on clonal propagation in asexually reproducing plants.
A microbiologist’s quest to understand CRISPR in bacterial self-defense

2024 Genetics Society of America Medal recipient Luciano Marraffini determined how CRISPR-Cas systems destroy genetic targets with precision, paving the way for gene editing technology development.
Unlocking mysteries of trait and disease heritability in dogs

2024 Edward Novitski Prize recipient Elaine Ostrander, a pioneer of the domestic dog model, discovered numerous genes affecting dog size, morphology, behavior, and disease susceptibility—many of which have relevance in humans.
GSA and collaborators Personal Genetics Education & Dialogue and Reclaiming STEM Institute launch NSF-funded BIO-LEAPS project to support culture change in genetics

We are thrilled to announce that the Genetics Society of America (GSA) is collaborating with the Personal Genetics Education & Dialogue (PGED) based in the Department of Genetics at Harvard Medical School, and the Reclaiming STEM Institute (RSI) on a Leading Culture Change Through Professional Societies of Biology (BIO-LEAPS) grant from the U.S. National Science…
Daman Saluja: Navigating Science and Policy in India

In the Paths to Science Policy series, we talk to individuals who have a passion for science policy and are active in advocacy through their various roles and careers. The series aims to inform and guide early career scientists interested in science policy. This series is brought to you by the GSA Early Career Scientist…
A fly geneticist’s journey into discovering rules of organ development

2024 George W. Beadle Award recipient Deborah Andrew discovered new genes and pathways in Drosophila salivary gland organogenesis. Now, her work can help optimize cell secretion in therapeutic applications and fight malaria.
Małgorzata Gazda: How receiving the DeLill Nasser Award helped her land her dream job

Have you ever experienced an event that changes the course of your life, or in this case, your career? Małgorzata (Gosia) Gazda is Assistant Professor at the University of Montreal and in 2022, she received the DeLill Nasser Award for Professional Development in Genetics, which she used to attend and present at the 2022 Population,…
Hongyu Zhao joins GENETICS as new Senior Editor

A new senior editor is joining GENETICS in the Statistical Genetics and Genomics section. We’re excited to welcome Hongyu Zhao to the editorial team.
GSA Member Julio Molina Pineda Receives DeLill Nasser Award, Shines at TAGC 2024

“At any career stage, the GSA membership is an amazing investment for any genetics professional!” Julio Molina Pineda is a PhD Candidate in Cell and Molecular Biology and a Research Assistant at the University of Arkansas, and a Doctoral Academy Fellow at the Lewis Lab. In 2023, Julio was awarded the DeLill Nasser Award for…
In Memoriam: Ellsworth Herman Grell (1932–2023), a pioneer of Drosophila genome engineering and annotation

Ellsworth (Ed) Grell blessed the Drosophila community through three enduring legacies: as a pioneer of chromosome mechanics, as a primary organizer and synthesizer of genetic knowledge in Drosophila, and as a graceful mentor to those fortunate to have known him personally. Ed grew up in rural Nebraska, completed his undergraduate studies at Iowa State, and…
Congratulations to the #Fungal24 Poster Award winners!

We are pleased to announce the recipients of the GSA Poster Awards for posters presented at the 32nd Fungal Genetics Conference! Undergraduate and graduate student members of GSA were eligible for the awards, and a hard-working team of judges made the determinations. Congratulations to all! Felicia Ebot Ojong, The University of Georgia My research is focused…
Poster presentation tips for TAGC 2024

You’ve been selected to present a poster at The Allied Genetics Conference 2024 in March—you’ve celebrated, made plans to attend, now what? This is an exciting opportunity to showcase your research and engage with fellow members of the genetics community, so you want to make sure you’re prepared. We wanted to offer you some tips…
Maximize your TAGC 2024 experience

A guide to all that National Harbor & DC have to offer Are you joining us for The Allied Genetics Conference 2024 in March? Make the most of your #TAGC24 experience in National Harbor! We know the science will keep you busy, but you deserve to unwind and have some fun, so we’ve curated a…
Early Career Leadership Spotlight: Sarah Petrosky

We’re taking time to get to know the members of the GSA’s Early Career Scientist Committees. Join us to learn more about our early career scientist advocates. Sarah PetroskyMultimedia SubcommitteeUniversity of Pittsburgh Research Interest I am interested in understanding adaptation that has been happening recently in populations by dissecting the ways that genes underlying an adaptation…
TAGC 2024 Early Career Award Winners

GSA is pleased to announce the winners of the early career awards presented at The Allied Genetics Conference 2024. These awards are specific to particular TAGC communities and recognize early career scientists’ outstanding work on their respective research organisms. The awardees will present their talks in keynote sessions at TAGC 2024. Don’t miss the opportunity…
Preeminent geneticists recognized with revamped GSA Awards

In 2022, GSA’s Board of Directors launched an audit to review the five major awards conferred by the Society. Today, we are thrilled to announce the recipients of the reimagined GSA Awards, including the new Genetics Society of America Early Career Medal. The scientists honored this year are recognized by their peers for their outstanding…
Fly Board funds outreach programs to spread the word about Drosophila research

In 2020, the Fly Board voted to use part of its reserve fund to support efforts to increase trainee participation as well as equity and diversity in the Drosophila community. An awards committee decides how the money will be spent each year, and from 2020–2022, the committee posted a very broad call for applications from…
New members of the GSA Board of Directors: 2024–2026

We are pleased to announce the election of four new leaders to the GSA Board of Directors: 2024 Vice President/2025 President Brenda Andrews Professor, University of Toronto It’s an honor to continue my association with the Society by serving as Vice President of the Board of Directors. I have broad knowledge of the ongoing activities…
Why PEQG is the meeting population, evolutionary, and quantitative geneticists can’t miss

What makes the Population, Evolutionary, and Quantitative Genetics (PEQG) Conference so special? For many researchers, it’s the rare chance to gather with experts who work across an incredible range of model systems, approaches, and questions, all while sharing a deep common interest.
Why scientists’ voices matter in Congress: A conversation with Adriana Bankston on the importance of federal research advocacy

Adriana Bankston, a former AAAS-ASGCT Congressional Policy Fellow in the U.S. House of Representatives*, shares how she used her background as a scientist to shape policy during uncertain times. She explains why advocacy matters at every career stage, and how individual voices can make an impact in the U.S. Congress.
A new study highlights the need for considering spatial structure in detecting positive selection

Identifying the signatures of natural selection in a population is tricky. A new simulation-based model investigates how population structure affects our ability to accurately predict signatures of selective sweeps.