Machine Learning revolutionizes PFAS identification.
PFAS persists in the environment and poses serious health risks, including liver and kidney damage, developmental problems, and an increased risk of cancer. While regulatory bans have targeted PFAS like PFOA and PFOS, industries have substituted them with new PFAS, many of which remain unidentified due to proprietary trade secrets and environmental transformations.
Traditional methods for identifying PFAS in environmental samples focus on targeted screening, which misses unknown compounds. To fix this, Wang et al. created APP-ID, a machine learning-enhanced molecular network. It uses high-resolution mass spectrometry (HRMS) data and automated structure prediction to improve PFAS identification. Their study, published in Science Advances, shows that APP-ID detects hundreds of previously unknown PFAS, indicating widespread global exposure to these harmful substances.
You can also read: PFAS Characterization and the Push for Information.
PFAS identification in environmental samples is challenging due to their diversity of chemical structures. While nontarget screening methods using HRMS have been employed to detect unknown PFAS, these approaches assume that new compounds resemble known homologs. However, PFAS structures are becoming increasingly complex and do not always follow expected patterns. Additionally, current identification platforms, such as CFM-ID and MetFrag, rely on databases that contain only a limited number of PFAS structures, making it difficult to detect novel compounds.
To overcome these limitations, researchers designed APP-ID. This automated identification system combines a PFAS-specific molecular network, PFAS_link, and a machine learning–based PFAS identification module, PFAS_ID. This platform improves traditional methods by reducing false positives and enabling the discovery of previously unreported PFAS.
APP-ID begins by analyzing HRMS data to construct a PFAS-specific molecular network called PFAS_link. Unlike general molecular networking tools, PFAS_link recognizes PFAS features while filtering out non-PFAS compounds. It employs a novel Flink algorithm, which improves the accuracy of spectral matching by:
Once PFAS_link isolates potential PFAS candidates, the PFAS_ID module uses machine learning to predict their structures. The model uses 194 known PFAS spectra. These are chemical fingerprints recorded from previously identified PFAS compounds. The system uses a machine learning approach called a support vector machine (SVM) algorithm to learn how specific molecular features correspond to known PFAS structures.
When analyzing new samples, PFAS_ID follows these steps:
When tested on an external dataset, APP-ID correctly identified the top match 66.7% of the time when the molecular formula was known and 58.3% of the time when the formula was unknown.
To demonstrate the effectiveness of APP-ID, researchers applied it to wastewater samples from a fluorochemical industrial park in China. Their findings were striking:
To determine if the newly discovered PFAS were already present worldwide, the researchers used MASST, a tool that scans public mass spectrometry databases. Their search revealed 126 PFAS signals in samples from 20 countries, including the U.S., Canada, China, India, and several European nations.
Shockingly, 81 of these PFAS were previously unreported, indicating widespread, undocumented exposure. These findings emphasize the urgent need for further investigation and regulatory action to assess the risks posed by these unknown compounds.
Worldwide distribution of PFAS features using MASST retrospective screening. Courtesy of Science Advances.
Wang et al.’s study marks a significant advancement in PFAS research, integrating machine learning with molecular networking. APP-ID identifies both known and previously unrecognized PFAS, highlighting more widespread global contamination. The researchers call for expanded PFAS databases, regulatory action, and further use of APP-ID in human biomonitoring studies.
For 3D Printing, platforms like the Protiq Marketplace enable seamless transitions from CAD design to…
Researchers have found a link between seasonal atmospheric photochemistry and the formation of certain per-…
As concerns over environmental and health risks grow, the aviation industry is taking crucial steps.…
New findings reveal microplastics in brain tissue, raising concerns about links to cognitive decline. Microplastic…
The world is bracing to adapt as the new administration reveals its hand. Changes in…
ANTEC® 2025 brings together scientists, engineers, and industry leaders to shape the future of plastics…