All News

Talk Nerdy Podcast Features Rita Colwell Discussing Sexism and Science

Tue Mar 30, 2021

Rita Colwell, a Distinguished University Professor in the University of Maryland Institute for Advanced Computer Studies, recently discussed her storied career as an infectious disease scientist on the Talk Nerdy podcast hosted by Cara Santa Maria.

Colwell—still active in research, including multiple international efforts to stem the spread of cholera—talked candidly about the barriers she faced throughout her career as a woman scientist.

Go here to listen to the entire 60-minute podcast.

El-Sayed to Lead New Genomic Sequencing Facility at UMD

Mar 24, 2021

Najib El-Sayed, a professor of cell biology and molecular genetics with a joint appointment in the University of Maryland Institute for Advanced Computer Studies, has been named director of a new campus facility that will enable researchers to access single-cell DNA and RNA sequencing technologies.

The Advanced Genomic Technologies Core (BBI-AGTC), part of the Brain and Behavior Institute (BBI), will advance high-throughput and advanced large-scale genomics research, and will greatly improve the university’s capabilities in neuroscience and other life sciences, says Elizabeth Quinlan, a professor of biology and director of the BBI.

The BBI-AGTC is set to open in mid-April 2021 and will house one of the latest sequencing platforms (Illumina NextSeq 1000), a single-cell controller (10X Genomics Chromium), a liquid handling robotic station (Eppendorf epMotion 5075tc), a real-time PCR instrument and a microfluidics-based platform for the sizing, quantification and quality control of DNA and RNA.

“Next generation sequencing was a big advance toward understanding genomes and RNA expressed in different tissues, and the BBI-AGTC takes that technology a step further by enabling transcriptomes of individual cells,” says Karen Carleton, a professor of biology: “More broadly, the facility could be important for answering a diverse set of biological questions including how cells change through development, infection, or even in response to different behavioral states such as during mating or parental care.”

El-Sayed’s own research uses genomic approaches to study the biology of parasitism. His team develops and applies molecular, computational and systems biology tools to better understand host-pathogen interactions and, ultimately, the mechanisms of infection and survival. His work in functional genomics, tracing how biological information flows from gene to protein expression via RNA transcription, looks to contribute to better diagnosis, prevention of and therapeutics for parasite- and bacteria-caused diseases in humans, animals and plants.

El-Sayed has published more than 90 research papers and serves on the editorial boards of BMC Genomics and PLoS One. He also serves on the National Institute of Health’s Genomics, Computational Biology and Technology study section and the scientific advisory board for the National Institute of Allergy and Infectious Diseases’ Genomic Centers for Infectious Diseases.

Former CBCB Graduate Student Receives Larry S. Davis Doctoral Dissertation Award

Dec 15, 2020

A former graduate student in the Center for Bioinformatics and Computational Biology (CBCB) has been recognized for the excellence of her academic work on developing practical and efficient solutions to index large collections of genomes.

Fatemeh Almodaresi, who graduated with a Ph.D. in computer science in Summer 2020, is one of two students to receive the Larry S. Davis Doctoral Dissertation Award this year.

The annual award recognizes outstanding doctoral dissertations in the Department of Computer Science that convey excellence in their technical depth, significance, potential impact and presentation quality.

The award is named for Larry Davis, a Distinguished University Professor of computer science who served as chair of the department from 1999–2012. Davis was also the founding director of the University of Maryland Institute for Advanced Computer Studies (UMIACS), providing leadership for the institute from 1985–1994.

Her UMD dissertation, “Algorithms and Data Structures Indexing, Querying, and Analyzing Large Collections of Sequencing Data in the Presence or Absence of a Reference,” covers the development of new data structures as well as innovative, practical and efficient solutions to indexing large collections of genomes.

“Fatemeh demonstrates both technical brilliance and the ability to come up with ideas that are theoretically interesting and of tremendous practical impact,” says Rob Patro, an associate professor of computer science who advised Almodaresi during her doctoral studies.

In recommending her for the Davis dissertation award, Patro noted Almodaresi’s design of a new compacted version of a De Bruijn graph, a data structure that is used in bioinformatics to assemble and analyze genomes. Although similar structures have previously been proposed, Almodaresi’s work greatly improved the practicality and efficiency in important ways, Patro explains.

In recommending her for the dissertation award, Patro noted Almodaresi’s design of a new compacted version of a De Bruijn graph, a data structure that is used in bioinformatics to assemble and analyze genomes. Although similar structures have previously been proposed, Almodaresi’s work greatly improved the practicality and efficiency in important ways, Patro explains.

Almodaresi also built a tool using her new data structure for the taxonomic assignment of metagenomic read data, a well-studied problem in the field of computational biology. The taxonomic classifier she designed is both more accurate and more efficient—requiring less memory while operating at a similar speed—than two popular tools that are widely-used to accomplish the same task, Patro says.

One major goal in the field is to build practical indexes on the hundreds of thousands of sequencing samples available in public data repositories. In pursuit of this idea, Almodaresi developed new methods and solutions to perform large-scale search of raw sequencing data using a system called Mantis. Her methods allow the index to be iteratively updated–a crucial feature as the amount of genomic data to be indexed continues to grow.

“I anticipate that much of the theory and methodology Fatemeh develops will have a far-reaching impact both within and beyond the field of genomics,” says Patro. “She is a truly expectational researcher—displaying an intelligence, commitment and real passion at a level that I find to be rare, even among top Ph.D. students.”

Story by Maria Herd

Dr. Rita Colwell Book "A Lab of One's Own - One Woman's Personal Journey Through Sexism in Science"

Wed Oct 07, 2020
A riveting memoir-manifesto from the first female director of the National Science Foundation about the entrenched sexism in science, the elaborate detours women have taken to bypass the problem, and how to fix the system. If you think sexism thrives only on Wall Street or in Hollywood, you haven’t visited a lab, a science department, a research foundation, or a biotech firm. Rita Colwell is one of the top scientists in America: the groundbreaking microbiologist who discovered how cholera survives between epidemics and the former head of the National Science Foundation. But when she first applied for a graduate fellowship in bacteriology, she was told, “We don’t waste fellowships on women.” A lack of support from some male superiors would lead her to change her area of study six times before completing her PhD. A Lab of One’s Own documents all Colwell has seen and heard over her six decades in science, from sexual harassment in the lab to obscure systems blocking women from leading professional organizations or publishing their work. Along the way, she encounters other women pushing back against the status quo, including a group at MIT who revolt when they discover their labs are a fraction of the size of their male colleagues’. Resistance gave female scientists special gifts: forced to change specialties so many times, they came to see things in a more interdisciplinary way, which turned out to be key to making new discoveries in the twentieth and twenty-first centuries. Colwell would also witness the advances that could be made when men and women worked together—often under her direction, such as when she headed a team that helped to uncover the source of the anthrax used in the 2001 letter attacks. A Lab of One’s Own shares the sheer joy a scientist feels when moving toward a breakthrough, and the thrill of uncovering a whole new generation of female pioneers. But it is also the science book for the #MeToo era, offering an astute diagnosis of how to fix the problem of sexism in science—and a celebration of the women pushing back.

Understanding How the Gut Microbiome Influences Human Health

Sep 17, 2020

Living inside of every person are trillions of microorganisms—bacteria, viruses, fungi and other life forms that are collectively known as the microbiome. While they are miniscule in size, this collection of microbes has a big impact on our health.

Brantley Hall, who joined the University of Maryland on July 1 as an assistant professor in the Department of Cell Biology and Molecular Genetics, is particularly interested in the microbiome of the human gut, the region of the gastrointestinal tract that digests food, absorbs nutrients and expels waste.

Based on years of effort by numerous researchers focused on the microbiome, Hall says that scientists have collectively accumulated an acceptable understanding about the species of microbes that inhabit the gastrointestinal tracts of people in the United States.

The next step, he says—and one that will be the focus of his new lab at UMD— is to genetically interrogate the functions that these gut microbes perform, and how those functions interact, both positively and negatively, with human health.

“This is an enormous undertaking,” Hall explains. “There are millions of gut microbial genes lacking functional annotation, and those with functional annotations are of questionable quality.”

Hall says his lab is working to fill that gap by improving functional annotation across gut microbiome species, combining computational and wet-lab techniques to identify genes responsible for health-relevant functions, and developing new strategies to measure the metabolic output of the gut microbiome.

“Our long-term goal is to rationally modulate the human gut microbiome to improve human health,” he says.

Hall, who also has an appointment in the University of Maryland Institute for Advanced Computer Studies (UMIACS), says that coming to UMD seemed like a “very good fit.”

“I was excited about the opportunity to work with researchers at Maryland,” he says. “There is an enormous value in being surrounded by like-minded computational biologists.”

Michael Cummings, a professor of biology and director of Center for Bioinformatics and Computational Biology (CBCB)—one of five major centers that are part of UMIACS—says that Hall is a “very welcome addition” to the Maryland campus.

“Brantley’s interesting and important gut microbiome research is complementary to other research in CBCB, particularly that of [UMIACS director] Mihai Pop,” Cummings says.

Cummings adds that while still early in his career, Hall has proven to be extremely productive as evidenced by his publication record, noting that his research is also impactful, as evidenced by the citations his publications have garnered.

Hall says he has already learned a great deal from his interactions with researchers at CBCB. He cites recent collaborations with Pop and Rob Patro, an associate professor of computer science, as examples.

“By co-advising a graduate student with Mihai, I have been able to draw on his extensive expertise in metagenomic assembly to improve our computational experiments,” Hall says. “And close interactions with genomic software developers such as Rob gives us a better understanding of the state-of-the-art techniques in tool development.”

He adds that UMIACS’ computational resources have already proven beneficial to his research. Specifically, he says his lab is analyzing thousands of metagenomes, which takes a great amount of space to store and a lot of CPU cores to analyze.

“When I was interviewing at UMD, several professors emphasized how great the computing facilities were, and I can now attest that this is true,” Hall says. “Setting up and maintaining these large computing clusters isn't always easy to pull off, and I am grateful for the UMIACS tech support on both the hardware and software.”

—Story by Melissa Brachfeld

Patro Works to Improve Search Functionality for Large Repositories of Sequencing Data

Sep 02, 2020

Genomic sequencing data can often shed light on a wide array of scientific problems—from treating patients with heart disease and cancer to understanding how certain pathogens can affect plants and animals.

Public repositories of genomic data are becoming more commonplace and are growing at an exponential rate. The National Center for Biotechnology Information (NCBI), for example, runs the Sequence Read Archive (SRA), a repository that holds raw scientific data for a vast number of scientific experiments conducted using high-throughput sequencing data.

While repositories like the SRA are a boon to researchers, the ability to quickly search these large public databases for a specific sequence of interest is limited.

To that end, Rob Patro (in photo), an associate professor of computer science, is leading an effort to develop fundamentally improved data structures and algorithms to enable large-scale sequence search across public repositories of genomic data.

Patro is collaborating on this work with Mike Ferdman and Michael Bender, both at Stony Brook University, and Rob Johnson, a senior staff researcher at VMWare Research group.

Patro says that making these types of databases easier to search can help researchers get to the bottom of unique problems.

“Imagine a scientist who discovers what they imagine to be a novel environmental pathogen that, perhaps, has a negative effect on plant or animal health,” he says. “It may be very useful to know what other previously-collected data might have contained this pathogen, but its presence was not reported because the scientists working on the previous studies were focused on a different question and did not take note of this new pathogen. Combing through the repository of available data might help us learn more about this pathogen.”

Patro says that the technical challenge in addressing such a problem is simply the scale of the data. The SRA, for example, currently contains about 16 petabytes of data.

Sequencing data is quite different from other domains where similar challenges have been tackled, he adds.

“DNA and RNA do not have a well-defined vocabulary of words or ‘tokens’ into which the data can be broken down,” he says. “So it’s very different from a typical internet search query.”

Patro’s research group has recently been working on data structures and algorithms that let them scale sequence indices to increasingly large sets of experiments, while enabling very fast (approaching interactive) queries.

A recent paper they authored introduced a fundamentally better scheme for storing and retrieving a key component of this information than their previous attempts. (Note: the lead author on the paper was Fatemeh Almodaresi, who just completed her doctoral degree at UMD with Patro as her adviser.)

The researchers discovered that by examining the patterns of occurrence of small pieces of DNA in the repository—and compressing similar, rather than just identical, patterns in a relative manner—they could reduce the size of this component of the data structure by over an order of magnitude, without slowing the query speed, and, in some cases, speeding it up.

“A main technical challenge here is how one can efficiently search for ‘similar’ patterns when each pattern consists of a point in a very high dimensional space,” Patro says. “While initial attempts to solve this problem using a technique known as locality-sensitive hashing seemed not to work well, we were able to exploit the inherent structure of the data, and take advantage of a widely-used structure in genomics—known as the De Bruijn graph—to develop an efficient search structure to find ‘similar’ patterns efficiently.”

Patro’s group continues to work on this problem, providing important improvements to the indexing methodology. Additionally, they have redesigned the largest part of the index, which was previously required to be in the Random Access Memory (RAM) of the computer to be intelligently partitioned. This allows them to load only small parts of the index at a time, vastly reducing the amount of memory required to both build and query the index.

Finally, they are developing a distributed variant of this index that can be partitioned across a network of machines, allowing the query procedure to be scaled up simply by adding more machines to the network and improving robustness.

—Story by Melissa Brachfeld

***

Note: Patro was recently promoted to associate professor. In addition to his tenure appointment, he has an appointment in the University of Maryland Institute for Advanced Computer Studies, where he is a member of the Center for Bioinformatics and Computational Biology.

El-Sayed Named Director of Honors College Integrated Life Sciences Program

Wed Jul 08, 2020

Najib El-Sayed, a professor of cell biology and molecular genetics, has been named director of the Integrated Life Sciences (ILS) program at the University of Maryland.

The program, part of the university’s Honors College, offers talented undergraduates a biologically-inspired living and learning experience that includes a significant research requirement.

Many ILS graduates go on to graduate school and careers in medicine (both human and veterinary), public health, dentistry, and academia.

El-Sayed, who has an appointment in the University of Maryland Institute for Advanced Computer Studies, is an expert on the biology of parasitism and host-pathogen interactions using genomic and bioinformatics approaches.

He is the author of more than 80 research articles and book chapters and was designated as a highly cited investigator in microbiology by Thomson Reuters.

Go here to see an overview of El-Sayed’s research interests.

Cummings Receives NSF Funding to Improve Analyses of COVID-19 Genomic Data

Jun 01, 2020

A University of Maryland computational biologist has received funding from the National Science Foundation (NSF) to develop new tools and strategies for analyzing genomic data from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the strain of coronavirus responsible for COVID-19.

Michael Cummings, a professor of biology with an appointment in the University of Maryland Institute for Advanced Computer Studies (UMIACS) is primary investigator of the $188K award. The funding comes from the NSF Rapid Response Research initiative, which aims to mobilize the scientific community in response to the current pandemic.

A major goal of the project, Cummings says, is to accelerate the analyses of SARS-CoV-2 genomic data needed by scientists and public health officials battling the virus. “Evolutionary analyses using genomic data are an essential component of the scientific response to the COVID-19 pandemic,” he says.

Inferring the evolutionary history, or phylogeny, of virus samples—coupled with information on the time and location of where the samples originated—allows scientists to estimate the divergence of viral lineages, Cummings says. These types of analyses can provide time estimates that predate sampling events, meaning that public health officials could ostensibly predict where and when certain strains of SARS-CoV-2 could pop up next, and whether a strain of the virus has mutated from its last outbreak.

But current methods of analyzing these samples takes time. Even with powerful advances in computing power over the last decade, the process can require up to 150 hours or more to complete.

Cummings, working closely with Daniel Ayers, an assistant research scientist in UMIACS, plans to develop new software and parallel computing algorithms that could help decrease this timeline considerably.

The Maryland researchers are also working to improve the phylogenetic analyses of a certain type, referred to as molecular phylodynamics, which includes not only evolutionary history but also information on viral genetic variation and viral population dynamics—again all in the context of geography and time.

Phylodynamic analyses are particularly rich in terms of inferences, albeit at a considerable computational cost, Cummings says.

Ultimately, the research by Cummings and Ayers will offer a more comprehensive view of the transmission patterns for SARS-CoV-2. This includes defining patterns of SARS-CoV-2 movement and migration; identifying the specific timeline of an outbreak origin; determining the rate of mutation and detection of significant mutations with the resultant health impacts; identifying the prevalence of the virus in populations at different geographical scales; reproductive number and impact on policy; and infection-to-case reporting rates.

To increase the speed at which samples are sequenced and also to improve accuracy and decrease cost, Cummings and Ayers plan to work with others to develop more efficient parallel computing strategies and software, taking advantage of NSF-supported high-end computational resources at the Pittsburgh Supercomputing Center, San Diego Supercomputing Center, and the Texas Advanced Computing Center.

They will also rely upon open-source software they developed—known as BEAGLE, for Broad-platform Evolutionary Analysis General Likelihood Evaluator—that has become an essential component in the software workflow of many scientists studying the evolutionary history of organisms, including the viruses that cause AIDS, influenza and Ebola.

“BEAGLE is used in a variety of studies that contribute to our understanding of evolution and biology, which can have a role in informing decision-making,” Ayres says. “Now there are dozens, if not hundreds, of scientists worldwide using BEAGLE to better understand SARS-CoV-2.”

Though Cummings has been active with research for more than a decade on other serious public health issues, the speed and strength by which the current COVID-19 pandemic has spread has given him pause for thought, and also bolstered his resolve to help find answers.

“Science is a privilege, and we are given multiple opportunities to give back—we write papers and we educate students,” he says. “But now there is an opportunity to use the ideas and tools we’ve been working on for quite a while to have a positive impact on the current crisis. It gives us the incentive to work as hard as we possibly can.”

—Story by Tom Ventsias

Five UMIACS Faculty Receive Academic Promotions

May 29, 2020

In recognition of their continued excellence in research, scholarship and service, five faculty members in the University of Maryland Institute for Advanced Computer Studies (UMIACS) have received academic promotions from their respective departments.

Marine Carpuat (computer science), Vanessa Frias-Martinez (iSchool), Michelle Mazurek (computer science), Rob Patro (computer science), and Dana Dachman-Soled (electrical and computer engineering) have each been promoted from assistant professor to associate professor with tenure.

The academic promotions for Carpuat, Mazurek, Patro and Dachman-Soled are effective on July 1; the promotion for Frias-Martinez takes place in August.

“Marine, Vanessa, Michelle, Rob and Dana are all known for their outstanding research and strong commitment toward educating and mentoring students. They are a tremendous asset to our institute, their home departments, and the university as a whole,” says Mihai Pop, professor of computer science and director of UMIACS.

Carpuat’s research in the Computational Linguistics and Information Processing (CLIP) Lab focuses on multilingual natural language processing and statistical machine translation. She designs computational models that use second languages as meaningful annotation to make language processing applications more accurate, robust and useful.

Frias-Martinez is also a member of the CLIP Lab and does the majority of her research in her own Urban Computing Lab. The focus of her work is on building mathematical models that explain human behavior. This includes examining human interaction with the built-in environment in cities, and how those interactions might change due to events like natural disasters, economic crises or forced migrations.

Patro’s research in the Center for Bioinformatics and Computational Biology (CBCB) focuses on the design of algorithms and data structures for processing, organizing, indexing and querying high-throughput genomics data. His interests also extend to programming languages, computer graphics, scientific visualization, parallel computation and machine learning.

Both Mazurek and Dachman-Soled are core faculty members in the Maryland Cybersecurity Center (MC2).

Mazurek’s research focuses on human-centered computer security, understanding and influencing security and privacy behaviors and preferences by collecting real data from real users. Her recent work includes making security easier for professionals such as system administrators and software developers, understanding how and why end-users learn and apply security behaviors, and investigating adoption of end-to-end encrypted messaging.

Dachman-Soled research involves cryptography, complexity theory and security. This includes security against physical attacks, post-quantum cryptography, secure multiparty computation, and black-box complexity. Her research also extends to property testing of Boolean functions and cryptographic hardness of learning.

UMIACS Faculty Adapting to Online-Only Research Activities During COVID-19 Crisis

Apr 28, 2020

The normal whirlwind of hands-on research activity inside the Brendan Iribe Center for Computer Science and Engineering has come to a screeching halt.

The 215,000-square-foot-facility—home to the Department of Computer Science and the University of Maryland Institute for Advanced Computer Studies (UMIACS)—is empty these days, the result of severe research restrictions that have barred access to the Maryland campus for all but a handful of essential personnel.

While computer science faculty are busy teaching online during the current COVID-19 health crisis, UMIACS researchers—80-plus faculty from eight departments and almost 200 graduate students—are also using online resources to advance research in robotics, computational biology, cybersecurity, and more.

Cornelia Fermüller (right in photo), an associate research scientist in UMIACS, says the biggest challenge for her team has been losing access to specialized hardware in their robotics lab.

Fermüller says her students continue working on their ongoing projects, but that many wish to be back in the lab, “getting their hands on the data and running experiments with the robots in real time.”

An example of her team’s current workaround model involves a robotic hand used to explore how tactile sensing works in active tasks like grasping, pushing and pulling. Without access to their lab that houses a robotic arm they use, the researchers are relegated to analyzing previously collected sensor data and developing models and algorithms in order to understand the robot’s hand movements.

“We’re looking at the sensor characteristics, and modeling the relationship between forces applied to the tactile sensor and its readings,” Fermüller says. “We’re also investigating how to robustly process the tactile data from the sensors in an event-based way.”

Tudor Dumitraș, an associate professor of electrical and computer engineering in the Maryland Cybersecurity Center, says the shift to an all-online environment has made parts of the research process more challenging for his team.

“While we can run the same experiments that we were running before, the disruption has been dramatic, owing to the challenges for teamwork raised by the online environment,” Dumitraș says. “I've significantly lowered my expectations for my group's research productivity this year, and I am very impressed that some of my students are able to stay motivated and can continue making progress.”

Moreover, his team often requires large-scale computational resources to run their analyses, which might become more difficult as a result of the restrictions.

“Normally, the UMIACS tech staff can meet our requirements quickly, but with the restricted access, if we request that new equipment be installed or current equipment upgraded, it may take them a bit longer,” Dumitraș says.

He called this a minor issue, however, saying his group is more concerned with the loss of regular in-person meetings—a “key part of the process” for their research projects.

“Each one of us can bring a new perspective to a problem, so it’s important for us to interact on a regular basis,” Dumitraș says. “This type of brainstorming is much harder to replicate online, although we’re trying.”

The research restrictions have also put a hold on a summer research internship program that Dumitraș has run for the past three years. The program brings up to a dozen top undergraduates from universities overseas, focusing on underrepresented groups like female researchers and students from universities in Eastern Europe.

“The process of preparing those [summer] research projects is a key part of training for my current graduate students, because they have to formulate a project and supervise the interns once they’re here,” Dumitraș says. “The internship program is also a good pipeline for getting top-notch students to apply here for graduate school, and it was a lot of fun.”

Mihai Pop, a professor of computer science and the director of UMIACS, says the loss of onsite access for his lab’s workgroup hasn’t deterred their passion for using computational biology to explore the genetic makeup of microbial communities.

“Most of our work essentially is just sitting on a computer doing analyses of genetic data,” says Pop.

Still, Pop echoes Dumitraș in saying that what has been most taxing is the loss of personal connections.

“The main thing that’s lost is the interaction and being able to just walk up to somebody’s desk and bounce ideas off of them,” Pop says. “Those kinds of informal, personal interactions do add a lot to the graduate student experience, and we’ll work hard to rebuild those activities once we’re back on campus.”

—Story by Colleen Curran

Pages

Subscribe to All News