Summary of Study ST003911
This data is available at the NIH Common Fund's National Metabolomics Data Repository (NMDR) website, the Metabolomics Workbench, https://www.metabolomicsworkbench.org, where it has been assigned Project ID PR002363. The data can be accessed directly via it's Project DOI: 10.21228/M8653M This work is supported by NIH grant, U2C- DK119886.
See: https://www.metabolomicsworkbench.org/about/howtocite.php
This study contains a large results data set and is not available in the mwTab file. It is only available for download via FTP as data file(s) here.
| Study ID | ST003911 |
| Study Title | Molecular fingerprint inference reveals bioactive lipids and microbial metabolites in colitis. Study 4 |
| Study Type | Study 4: Profiling of bacterial cultures for: Eggerthella lenta, Fusobacterium nucleatum |
| Study Summary | Untargeted metabolomics provides a sensitive readout of small molecules in biofluids , though considerable effort is still required to assign compound annotations to yet unidentified features . Here, we developed MS1-Tools, a machine learning model trained on LC-MS1 profiles of verified metabolites, to infer molecular fingerprints for unknown features. In stool samples, MS1-Tools achieved test accuracy of >74% (correct within top 3 candidates), drastically reducing the number of reference standards to be tested. Predicted identities were validated with MS/MS and standards, revealing diverse markers and drivers of inflammation, including amino-acid conjugations with primary metabolites, and platelet-activating factor analogs that we show to bind the PAF receptor. MS1-Tools enabled integration with bacterial culturomics to uncover production of N-lactoyl-phenylalanine (Lac-Phe) – an appetite-regulating metabolite – by inflammation-associated Lactobacillales and Bifidobacteriales. Structural protein search and bacterial genetics enabled validation of multiple bacterial Lac-Phe-producing dipeptidases, demonstrating broad applicability of MS1-Tools to resolve inflammation-associated metabolites and their origins. |
| Institute | Broad Institute of MIT and Harvard |
| Last Name | Avila-Pacheco |
| First Name | Julian |
| Address | 415 Main Street |
| jravilap@broadinstitute.org | |
| Phone | (617) 714-1729 |
| Submit Date | 2025-05-08 |
| Raw Data Available | Yes |
| Raw Data File Type(s) | mzML, raw(Thermo) |
| Analysis Type Detail | LC-MS |
| Release Date | 2025-08-04 |
| Release Version | 1 |
Select appropriate tab below to view additional metadata details:
Project:
| Project ID: | PR002363 |
| Project DOI: | doi: 10.21228/M8653M |
| Project Title: | Molecular fingerprint inference reveals bioactive lipids and microbial metabolites in colitis |
| Project Summary: | Untargeted metabolomics provides a sensitive readout of small molecules in biofluids, but requires targeted approaches to resolve ~90% of features for which tandem mass spectra (MS/MS) are not collected. By training on a subset of verified metabolites and their profiles in LC-MS, we derive a probabilistic model to predict molecular fingerprints in human stool and blood samples. These predictions, which do not utilize MS/MS, were accurate for >44% (correct top ranked candidate) or >75% (correct within top 3) of test metabolites, drastically reducing the number of reference standards that would need to be to be tested. These predictions revealed markers and drivers of inflammation, including amino acid derivatives and lysophospholipids with herein demonstrated platelet-activating factor receptor (PAF-R) activity. Integration with bacterial culturomics facilitates tracking the source of inflammation-associated metabolites to their origins in the gut microbiome. |
| Institute: | Broad Institute of MIT and Harvard |
| Last Name: | Avila-Pacheco |
| First Name: | Julian |
| Address: | 415 Main Street |
| Email: | jravilap@broadinstitute.org |
| Phone: | +1 (617) 714-1729 |
Subject:
| Subject ID: | SU004046 |
| Subject Type: | Bacteria |
| Subject Species: | Eggerthella lenta; Fusobacterium nucleatum |
| Taxonomy ID: | 84112; 851 |
| Genotype Strain: | Eggerthella lenta, Fusobacterium nucleatum |
Factors:
Subject type: Bacteria; Subject species: Eggerthella lenta; Fusobacterium nucleatum (Factor headings shown in green)
| mb_sample_id | local_sample_id | Species | Media | Timepoint | Replicate |
|---|---|---|---|---|---|
| SA430328 | EL_BHI-R_24hr_1 | Eggerthella lenta | BHI-R | 24 | R1 |
| SA430329 | EL_BHI-R_24hr_2 | Eggerthella lenta | BHI-R | 24 | R2 |
| SA430330 | EL_BHI-R_24hr_3 | Eggerthella lenta | BHI-R | 24 | R3 |
| SA430331 | EL_BHI-R_48hr_1 | Eggerthella lenta | BHI-R | 48 | R1 |
| SA430332 | EL_BHI-R_48hr_2 | Eggerthella lenta | BHI-R | 48 | R2 |
| SA430333 | EL_BHI-R_48hr_3 | Eggerthella lenta | BHI-R | 48 | R3 |
| SA430334 | EL_BHI-R_6hr_1 | Eggerthella lenta | BHI-R | 6 | R1 |
| SA430335 | EL_BHI-R_6hr_2 | Eggerthella lenta | BHI-R | 6 | R2 |
| SA430336 | EL_BHI-R_6hr_3 | Eggerthella lenta | BHI-R | 6 | R3 |
| SA430325 | EL_BHI-R_0hr_1 | Eggerthella lenta | BHI-R | - | R1 |
| SA430326 | EL_BHI-R_0hr_2 | Eggerthella lenta | BHI-R | - | R2 |
| SA430327 | EL_BHI-R_0hr_3 | Eggerthella lenta | BHI-R | - | R3 |
| SA430340 | EL_CHG-R_24hr_1 | Eggerthella lenta | CHG-R | 24 | R1 |
| SA430341 | EL_CHG-R_24hr_2 | Eggerthella lenta | CHG-R | 24 | R2 |
| SA430342 | EL_CHG-R_24hr_3 | Eggerthella lenta | CHG-R | 24 | R3 |
| SA430343 | EL_CHG-R_48hr_1 | Eggerthella lenta | CHG-R | 48 | R1 |
| SA430344 | EL_CHG-R_48hr_2 | Eggerthella lenta | CHG-R | 48 | R2 |
| SA430345 | EL_CHG-R_48hr_3 | Eggerthella lenta | CHG-R | 48 | R3 |
| SA430346 | EL_CHG-R_6hr_1 | Eggerthella lenta | CHG-R | 6 | R1 |
| SA430347 | EL_CHG-R_6hr_2 | Eggerthella lenta | CHG-R | 6 | R2 |
| SA430348 | EL_CHG-R_6hr_3 | Eggerthella lenta | CHG-R | 6 | R3 |
| SA430337 | EL_CHG-R_0hr_1 | Eggerthella lenta | CHG-R | - | R1 |
| SA430338 | EL_CHG-R_0hr_2 | Eggerthella lenta | CHG-R | - | R2 |
| SA430339 | EL_CHG-R_0hr_3 | Eggerthella lenta | CHG-R | - | R3 |
| SA430352 | EL_Gifu-R_24hr_1 | Eggerthella lenta | Gifu-R | 24 | R1 |
| SA430353 | EL_Gifu-R_24hr_2 | Eggerthella lenta | Gifu-R | 24 | R2 |
| SA430354 | EL_Gifu-R_24hr_3 | Eggerthella lenta | Gifu-R | 24 | R3 |
| SA430355 | EL_Gifu-R_48hr_1 | Eggerthella lenta | Gifu-R | 48 | R1 |
| SA430356 | EL_Gifu-R_48hr_2 | Eggerthella lenta | Gifu-R | 48 | R2 |
| SA430357 | EL_Gifu-R_48hr_3 | Eggerthella lenta | Gifu-R | 48 | R3 |
| SA430358 | EL_Gifu-R_6hr_1 | Eggerthella lenta | Gifu-R | 6 | R1 |
| SA430359 | EL_Gifu-R_6hr_2 | Eggerthella lenta | Gifu-R | 6 | R2 |
| SA430360 | EL_Gifu-R_6hr_3 | Eggerthella lenta | Gifu-R | 6 | R3 |
| SA430349 | EL_Gifu-R_0hr_1 | Eggerthella lenta | Gifu-R | - | R1 |
| SA430350 | EL_Gifu-R_0hr_2 | Eggerthella lenta | Gifu-R | - | R2 |
| SA430351 | EL_Gifu-R_0hr_3 | Eggerthella lenta | Gifu-R | - | R3 |
| SA430364 | EL_Mega-R_24hr_1 | Eggerthella lenta | Mega-R | 24 | R1 |
| SA430365 | EL_Mega-R_24hr_2 | Eggerthella lenta | Mega-R | 24 | R2 |
| SA430366 | EL_Mega-R_24hr_3 | Eggerthella lenta | Mega-R | 24 | R3 |
| SA430367 | EL_Mega-R_48hr_1 | Eggerthella lenta | Mega-R | 48 | R1 |
| SA430368 | EL_Mega-R_48hr_2 | Eggerthella lenta | Mega-R | 48 | R2 |
| SA430369 | EL_Mega-R_48hr_3 | Eggerthella lenta | Mega-R | 48 | R3 |
| SA430370 | EL_Mega-R_6hr_1 | Eggerthella lenta | Mega-R | 6 | R1 |
| SA430371 | EL_Mega-R_6hr_2 | Eggerthella lenta | Mega-R | 6 | R2 |
| SA430372 | EL_Mega-R_6hr_3 | Eggerthella lenta | Mega-R | 6 | R3 |
| SA430361 | EL_Mega-R_0hr_1 | Eggerthella lenta | Mega-R | - | R1 |
| SA430362 | EL_Mega-R_0hr_2 | Eggerthella lenta | Mega-R | - | R2 |
| SA430363 | EL_Mega-R_0hr_3 | Eggerthella lenta | Mega-R | - | R3 |
| SA430376 | FB_CHG_24hr_1 | Fusobacterium nucleatum | CHG | 24 | R1 |
| SA430377 | FB_CHG_24hr_2 | Fusobacterium nucleatum | CHG | 24 | R2 |
| SA430378 | FB_CHG_24hr_3 | Fusobacterium nucleatum | CHG | 24 | R3 |
| SA430379 | FB_CHG_48hr_1 | Fusobacterium nucleatum | CHG | 48 | R1 |
| SA430380 | FB_CHG_48hr_2 | Fusobacterium nucleatum | CHG | 48 | R2 |
| SA430381 | FB_CHG_48hr_3 | Fusobacterium nucleatum | CHG | 48 | R3 |
| SA430382 | FB_CHG_6hr_1 | Fusobacterium nucleatum | CHG | 6 | R1 |
| SA430383 | FB_CHG_6hr_2 | Fusobacterium nucleatum | CHG | 6 | R2 |
| SA430384 | FB_CHG_6hr_3 | Fusobacterium nucleatum | CHG | 6 | R3 |
| SA430373 | FB_CHG_0hr_1 | Fusobacterium nucleatum | CHG | - | R1 |
| SA430374 | FB_CHG_0hr_2 | Fusobacterium nucleatum | CHG | - | R2 |
| SA430375 | FB_CHG_0hr_3 | Fusobacterium nucleatum | CHG | - | R3 |
| SA430388 | FB_Gifu_24hr_1 | Fusobacterium nucleatum | Gifu | 24 | R1 |
| SA430389 | FB_Gifu_24hr_2 | Fusobacterium nucleatum | Gifu | 24 | R2 |
| SA430390 | FB_Gifu_24hr_3 | Fusobacterium nucleatum | Gifu | 24 | R3 |
| SA430391 | FB_Gifu_48hr_1 | Fusobacterium nucleatum | Gifu | 48 | R1 |
| SA430392 | FB_Gifu_48hr_2 | Fusobacterium nucleatum | Gifu | 48 | R2 |
| SA430393 | FB_Gifu_48hr_3 | Fusobacterium nucleatum | Gifu | 48 | R3 |
| SA430394 | FB_Gifu_6hr_1 | Fusobacterium nucleatum | Gifu | 6 | R1 |
| SA430395 | FB_Gifu_6hr_2 | Fusobacterium nucleatum | Gifu | 6 | R2 |
| SA430396 | FB_Gifu_6hr_3 | Fusobacterium nucleatum | Gifu | 6 | R3 |
| SA430385 | FB_Gifu_0hr_1 | Fusobacterium nucleatum | Gifu | - | R1 |
| SA430386 | FB_Gifu_0hr_2 | Fusobacterium nucleatum | Gifu | - | R2 |
| SA430387 | FB_Gifu_0hr_3 | Fusobacterium nucleatum | Gifu | - | R3 |
| SA430400 | FB_Mega_24hr_1 | Fusobacterium nucleatum | Mega | 24 | R1 |
| SA430401 | FB_Mega_24hr_2 | Fusobacterium nucleatum | Mega | 24 | R2 |
| SA430402 | FB_Mega_24hr_3 | Fusobacterium nucleatum | Mega | 24 | R3 |
| SA430403 | FB_Mega_48hr_1 | Fusobacterium nucleatum | Mega | 48 | R1 |
| SA430404 | FB_Mega_48hr_2 | Fusobacterium nucleatum | Mega | 48 | R2 |
| SA430405 | FB_Mega_48hr_3 | Fusobacterium nucleatum | Mega | 48 | R3 |
| SA430406 | FB_Mega_6hr_1 | Fusobacterium nucleatum | Mega | 6 | R1 |
| SA430407 | FB_Mega_6hr_2 | Fusobacterium nucleatum | Mega | 6 | R2 |
| SA430408 | FB_Mega_6hr_3 | Fusobacterium nucleatum | Mega | 6 | R3 |
| SA430397 | FB_Mega_0hr_1 | Fusobacterium nucleatum | Mega | - | R1 |
| SA430398 | FB_Mega_0hr_2 | Fusobacterium nucleatum | Mega | - | R2 |
| SA430399 | FB_Mega_0hr_3 | Fusobacterium nucleatum | Mega | - | R3 |
| SA430412 | BHI-R_24hr_1 | Unspent media | BHI-R | 24 | R1 |
| SA430413 | BHI-R_24hr_2 | Unspent media | BHI-R | 24 | R2 |
| SA430414 | BHI-R_24hr_3 | Unspent media | BHI-R | 24 | R3 |
| SA430415 | BHI-R_48hr_1 | Unspent media | BHI-R | 48 | R1 |
| SA430416 | BHI-R_48hr_2 | Unspent media | BHI-R | 48 | R2 |
| SA430417 | BHI-R_48hr_3 | Unspent media | BHI-R | 48 | R3 |
| SA430418 | BHI-R_6hr_1 | Unspent media | BHI-R | 6 | R1 |
| SA430419 | BHI-R_6hr_2 | Unspent media | BHI-R | 6 | R2 |
| SA430420 | BHI-R_6hr_3 | Unspent media | BHI-R | 6 | R3 |
| SA430409 | BHI-R_0hr_1 | Unspent media | BHI-R | - | R1 |
| SA430410 | BHI-R_0hr_2 | Unspent media | BHI-R | - | R2 |
| SA430411 | BHI-R_0hr_3 | Unspent media | BHI-R | - | R3 |
| SA430424 | CHG-R_24hr_1 | Unspent media | CHG-R | 24 | R1 |
| SA430425 | CHG-R_24hr_2 | Unspent media | CHG-R | 24 | R2 |
| SA430426 | CHG-R_24hr_3 | Unspent media | CHG-R | 24 | R3 |
| SA430427 | CHG-R_48hr_1 | Unspent media | CHG-R | 48 | R1 |
Collection:
| Collection ID: | CO004039 |
| Collection Summary: | All strains were cultured anaerobically at 37°C in different media types according to the experiment being performed. The media compositions used in this study include BHI-R, CHG-R, Gifu-R, and Mega-R, each formulated with specific components to support microbial growth. BHI-R was prepared by dissolving 18.5 g of Brain Heart Infusion (BHI) and 5 g of L-arginine in 500 mL of Milli-Q water, followed by sterilization via autoclaving (10 lbs, 115°C, 15 minutes). CHG-R was based on CHG media, which contained 18.5 g of BHI, 0.5 g each of D-(+)-cellobiose, D-(+)-maltose monohydrate, and D-(-)-fructose, and 0.25 g of cysteine in 500 mL of Milli-Q water. After filter sterilization, additional supplements, including 5 mL each of Vitamin K + Hematin (1%), trace mineral supplement, and vitamin supplement, were added in an anaerobic chamber. Gifu-R was derived from commercially available Gifu Anaerobic Media (GAM) (Himedia, Cat # M1801), prepared by suspending 59 g of GAM in 1000 mL of purified/distilled water and autoclaving under the same conditions as BHI-R. Mega-R was formulated with a complex mixture of components, including 5 g of Trypticase Peptone (BBL), 2.5 g each of Yeast Extract (Bacto) and Meat Extract, and multiple carbohydrate sources such as 1 g of D-(+)-glucose and 0.5 g each of D-(+)-cellobiose, D-(+)-maltose monohydrate, and D-(-)-fructose, all dissolved in 500 mL of Milli-Q water. Additional components included 50 mL of 1 M potassium phosphate buffer (pH 7.2), 20 mL of TYG salts solution, 1 mL of 25% Tween 80, and various supplements such as SCFA, CaCl₂, FeSO₄, and resazurin. Like CHG-R, Mega-R was supplemented with 5 mL each of Vitamin K + Hematin (1%), trace mineral supplement, and vitamin supplement. For CHG-R, Gifu-R, and Mega-R, an additional 5 g of L-arginine was incorporated into 500 mL of each respective medium to create their corresponding arginine-enriched formulations. All cultures were grown in an anaerobic chamber (Coy Laboratory Products) with an atmosphere of 5% CO2, 5% H2, and 90% N2 at 37°C. The isolates were streaked onto CHG plates and incubated for 48 hours. Once colonies were visible, a single colony from each plate was picked and processed for 16S rRNA Sanger sequencing using primer sequences 27F: AGAGTTTGATCMTGGCTCAG and 1492R: GGTTACCTTGTTACGACTT. Once the identity of the bacterial strains was confirmed, a single colony from each strain was inoculated in triplicates in 6 mL of the respective media types (CHG, Gifu, Mega) and incubated overnight. The overnight cultures were inoculated into fresh media and normalized to the OD600 of 0.05. At each time point after inoculation (6h, 24h, 48 h), 200 μL of the bacterial culture were collected. |
| Sample Type: | Bacterial cells |
Treatment:
| Treatment ID: | TR004055 |
| Treatment Summary: | NA |
Sample Preparation:
| Sampleprep ID: | SP004052 |
| Sampleprep Summary: | LC–MS samples were prepared from bacterial cultures for each profiling method as follows: - HILIC-pos: Bacterial cultures (10 μL) were extracted with the addition of nine volumes of 74.9:24.9:0.2 v/v/v acetonitrile/methanol/formic acid containing stable isotope-labeled internal standards (valine-d8, Isotec; and phenylalanine-d8, Cambridge Isotope Laboratories). The samples were centrifuged (10 min, 9,000g, 4°C), and the supernatants (10 μL) injected directly onto column. - C8-pos: Bacterial cultures (10 μL) using 190 μL isopropanol containing 1-dodecanoyl-2-tridecanoyl-sn-glycero-3-phosphocholine as an internal standard (Avanti Polar Lipids; Alabaster, AL). After centrifugation (10 min, 9,000g, ambient temperature), supernatants (2 μL) were injected directly onto column. - C18-neg: Bacterial cultures (30 μL) were extracted using 90 μl methanol containing 15R-15-methyl Prostaglandin A2,15R-15-methyl Prostaglandin F2α, 15S-15-methyl Prostaglandin D2, 15S-15-methyl Prostaglandin E1, and 15S-15-methyl Prostaglandin E2 as internal standards (Cayman Chemical Co.) and centrifuged (10 min, 9,000g, 4°C). The supernatants (10 μL) were injected onto column. - HILIC-neg: Bacterial cultures (30 μL) were extracted with the addition of four volumes of 80% methanol containing inosine-15N4, thymine-d4 and glycocholate-d4 internal standards (Cambridge Isotope Laboratories). The samples were centrifuged (10 min, 9,000g, 4°C) and the supernatants 10 μL) were injected directly onto column. |
Chromatography:
| Chromatography ID: | CH004870 |
| Instrument Name: | Agilent 1290 Infinity II |
| Column Name: | Waters Atlantis HILIC (150 x 2mm, 3um) |
| Column Temperature: | 30°C |
| Flow Gradient: | Isocratically with 5% mobile phase A for 1 minute followed by a linear gradient to 40% mobile phase B over 10 minutes |
| Flow Rate: | 250 µL/min |
| Solvent A: | 100% Water; 10 mM Ammonium formate; 0.1% Formic acid |
| Solvent B: | 100% Acetonitrile; 0.1% Formic acid |
| Chromatography Type: | HILIC |
| Chromatography ID: | CH004871 |
| Instrument Name: | Agilent 1290 Infinity II |
| Column Name: | Waters Acquity BEH C8 (100 x 2.1mm, 1.7um) |
| Column Temperature: | 40°C |
| Flow Gradient: | The column was eluted at a flow rate of 450 µL/min isocratically for 1 minute at 80% mobile phase A, followed by a linear gradient to 80% mobile-phase B over 2 minutes, a linear gradient to 100% mobile phase B over 7 minutes, and then 3 minutes at 100% mobile-phase B. |
| Flow Rate: | 450 µL/min |
| Solvent A: | 95% Water/5% Methanol; 10 mM Ammonium acetate; 0.1% Acetic acid |
| Solvent B: | 100% Methanol; 0.1% Acetic acid |
| Chromatography Type: | Reversed phase |
| Chromatography ID: | CH004872 |
| Instrument Name: | Shimadzu Nexera X2 |
| Column Name: | Waters ACQUITY UPLC BEH C18 (150 x 1.7mm,2.1um) |
| Column Temperature: | 45°C |
| Flow Gradient: | The column was eluted isocratically at a flow rate of 450 µL/min with 20% mobile phase A for 3 minutes followed by a linear gradient to 100% mobile phase B over 12 minutes. |
| Flow Rate: | 450 µL/min |
| Solvent A: | 100% Water; 0.01% Formic acid |
| Solvent B: | 100% Acetonitrile; 0.01% Acetic acid |
| Chromatography Type: | Reversed phase |
Analysis:
| Analysis ID: | AN006421 |
| Analysis Type: | MS |
| Chromatography ID: | CH004870 |
| Num Factors: | 168 |
| Num Metabolites: | 338 |
| Has Mz: | 1 |
| Has Rt: | 1 |
| Rt Units: | Minutes |
| Results File: | ST003911_AN006421_Results.txt |
| Units: | Abudances |
| Analysis ID: | AN006422 |
| Analysis Type: | MS |
| Chromatography ID: | CH004871 |
| Num Factors: | 168 |
| Num Metabolites: | 72 |
| Has Mz: | 1 |
| Has Rt: | 1 |
| Rt Units: | Minutes |
| Results File: | ST003911_AN006422_Results.txt |
| Units: | Abudances |
| Analysis ID: | AN006423 |
| Analysis Type: | MS |
| Chromatography ID: | CH004872 |
| Num Factors: | 168 |
| Num Metabolites: | 83 |
| Has Mz: | 1 |
| Has Rt: | 1 |
| Rt Units: | Minutes |
| Results File: | ST003911_AN006423_Results.txt |
| Units: | Abudances |