GREENWALL MICROBIOME ANALYSIS

Explorable Report

Publication-style, stepwise summary of WGS, 16S, and ITS workflow outputs, statistical tests, metadata-wide associations, control diagnostics, and direct hypothesis-linked conclusions.

Manifest Samples57
Testable Metadata Columns23
Hypothesis Cards13
QC Exclusions Recommended0

Assays Included in This Report

This report explicitly checks the hypothesis-suite inputs for WGS, 16S, and ITS. Included means nonzero sample and feature counts were loaded from the completed workflow exports.

WGS Included

57 samples; 8007 features loaded into the hypothesis/reporting layer.

16S Included

45 samples; 16699 features loaded into the hypothesis/reporting layer.

ITS Included

45 samples; 4364 features loaded into the hypothesis/reporting layer.

Hypothesis Verdict Cards

Mixed

Are there differences in 16S / WGS / ITS outcomes

Top metadata-linked signals were 16S:Relevant Field Blank; 16S:Relevant Kit Blank; 16S:Sample Material. 4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_covariate_links.tsv; metadata_signal_summary.tsv; positive_control_benchmark.tsv

Mixed

Are the collection of microorganisms found in the greenwall related to human health in any way?

10 beneficial-taxa keyword hits were detected across assays. 4 positive-control benchmark summaries were generated.

Outputs: beneficial_taxa_summary.tsv; core_taxa.tsv; hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Mixed

Is the microbiome of the "greenwall room" different from the other airstreams?

Maximum source-overlap fraction observed was 0.797. 4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Mixed

Is there evidence of diversification of the "greenwall room" microbiome by greenwall microorganisms?

159 alpha-diversity metadata associations passed q < 0.1. 4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_alpha_associations.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Mixed

Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams?

Maximum source-overlap fraction observed was 0.797. 4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

4 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Logical Steps

  1. Step 1. Study Overview and QC
    Summarize assay coverage, sample inventory, control annotations, and high-level dataset structure.
    study_overview.tsv; metadata_column_catalog.tsv; hypotheses.imported.tsv
  2. Step 2. Alpha Diversity
    Compute richness, Shannon, Simpson, evenness, and test them against targeted contrasts and every usable metadata column.
    diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv
  3. Step 3. Beta Diversity and Community Structure
    Build Bray-Curtis dissimilarities, PCoA ordinations, and PERMANOVA summaries for assay-specific and metadata-driven effects.
    pcoa_coordinates.tsv; permanova.tsv; metadata_permanova.tsv; metadata_pcoa_associations.tsv
  4. Step 4. Taxonomic Composition and Beneficial Taxa
    Screen for beneficial taxa, summarize source overlap, and prepare interpretable taxonomic summaries.
    beneficial_taxa_summary.tsv; source_overlap.tsv
  5. Step 5. Differential Abundance and Function
    Test assay- and metadata-driven contrasts for taxonomic and WGS functional differences.
    differential_abundance.tsv; metadata_differential_abundance_summary.tsv; wgs_function_top_differences.tsv
  6. Step 6. Core Microbiome and Cross-Assay Concordance
    Define core taxa, assess overlap across assays, and quantify assay bias or concordance on shared samples.
    core_taxa.tsv; cross_assay_sample_metrics.tsv; cross_assay_correlations.tsv
  7. Step 7. Machine Learning and Network Structure
    Rank discriminatory features and summarize co-occurrence structure for hypothesis generation.
    random_forest_summary.tsv; random_forest_top_features.tsv; metadata_random_forest_summary.tsv; cooccurrence_network_edges.tsv
  8. Step 8. Hypothesis Answer Layer
    Map outputs back to the stated hypotheses and surface the strongest metadata-linked signals.
    hypothesis_to_output_map.tsv; metadata_signal_summary.tsv; hypothesis_summary.md

Study Overview and QC

section metric value
dataset manifest_rows 57
dataset manifest_columns 30
dataset hypothesis_testable_metadata_columns 23
assay_summary WGS_samples 57
assay_summary WGS_features 8007
assay_summary 16S_samples 45
assay_summary 16S_features 16699
assay_summary ITS_samples 45
assay_summary ITS_features 4364
Sample Material PDMS 16
Sample Material Filter 15
Sample Material Rhizosphere 12
Sample Material Leaf 6
Sample Material Zymo Mock Community 2
Airstream Greenwall 45
Airstream Hallway 3
Airstream Office 3
Airstream Windowsill 3
Extraction Kit Power Water 43
Extraction Kit PowerSoil Pro 14
Sample Type Experimental 45
Sample Type Negative Control 10
Sample Type Positive Control 2
controls paired_negative_control_annotated 56
controls paired_positive_control_annotated 22
alpha 16S_alpha_samples 45
alpha ITS_alpha_samples 45
alpha WGS_alpha_samples 57

Metadata Catalog

column type non_null_samples unique_values usable analysis_roles
Accumulation Time (Days) categorical 57 5 True alpha,beta,differential,classification
Airstream categorical 57 5 True alpha,beta,differential,classification
Concentration/Filtration Step? categorical 57 2 True alpha,beta,differential,classification
Days Deploy_to_Collect categorical 57 6 True alpha,beta,differential,classification
Extraction Kit categorical 57 2 True alpha,beta,differential,classification
Greenwall Location categorical 57 7 True alpha,beta,differential,classification
Greenwall Radial Distance categorical 57 16 True alpha,beta,differential,classification
Greenwall Surface Position categorical 57 10 True alpha,beta,differential,classification
Relevant Field Blank categorical 57 4 True alpha,beta,differential,classification
Relevant Kit Blank categorical 57 7 True alpha,beta,differential,classification
Sample Material categorical 57 6 True alpha,beta,differential,classification
Sample Type categorical 57 3 True alpha,beta,differential,classification
paired_negative_control categorical 57 10 True alpha,beta,differential,classification
paired_positive_control categorical 57 3 True alpha,beta,differential,classification
Date Collected date 53 10 True alpha,ordination
Date Deployed date 31 5 True alpha,ordination
Date Extracted date 57 8 True alpha,ordination
Days Collect_to_Extract numeric 53 7 True alpha,ordination
Qubit ng/uL numeric 41 39 True alpha,ordination
ng/50uL numeric 41 39 True alpha,ordination
16S assay_flag 57 1 False alpha,beta,differential,classification
ITS assay_flag 57 1 False alpha,beta,differential,classification
WGS assay_flag 57 1 False alpha,beta,differential,classification
its_r1 path 57 57 False NaN
its_r2 path 57 57 False NaN
s16_r1 path 57 57 False NaN
s16_r2 path 57 57 False NaN
wgs_r1 path 57 57 False NaN
wgs_r2 path 57 57 False NaN

QC Flags

sample assay metric value threshold flag reason exclude_recommended
PWKB5 WGS post_filter_counts 93942.0 1000.0 False Low post-filter Bracken count total False
OFB_PWFBB WGS post_filter_counts 674258.0 1000.0 False Low post-filter Bracken count total False
6L11_PWKB3 WGS post_filter_counts 199098.0 1000.0 False Low post-filter Bracken count total False
WiC_PWFBC WGS post_filter_counts 194200.0 1000.0 False Low post-filter Bracken count total False
PWKB2 WGS post_filter_counts 57930.0 1000.0 False Low post-filter Bracken count total False
3L18_PWKB2 WGS post_filter_counts 623022.0 1000.0 False Low post-filter Bracken count total False
R10T2_PSKB1 WGS post_filter_counts 2310438.0 1000.0 False Low post-filter Bracken count total False
PWFBA WGS post_filter_counts 815924.0 1000.0 False Low post-filter Bracken count total False
PWKB1 WGS post_filter_counts 124933.0 1000.0 False Low post-filter Bracken count total False
R18T2_PSKB1 WGS post_filter_counts 2975934.0 1000.0 False Low post-filter Bracken count total False
R19T2_PSKB1 WGS post_filter_counts 4478997.0 1000.0 False Low post-filter Bracken count total False
R11T2_PSKB1 WGS post_filter_counts 2648318.0 1000.0 False Low post-filter Bracken count total False
6M02_PWKB4 WGS post_filter_counts 605720.0 1000.0 False Low post-filter Bracken count total False
PWKB4 WGS post_filter_counts 1268280.0 1000.0 False Low post-filter Bracken count total False
HaC_PWFBC WGS post_filter_counts 283713.0 1000.0 False Low post-filter Bracken count total False
6M13_PWKB4 WGS post_filter_counts 526051.0 1000.0 False Low post-filter Bracken count total False
1L07_PWKB1 WGS post_filter_counts 4719919.0 1000.0 False Low post-filter Bracken count total False
R03T2_PSKB1 WGS post_filter_counts 1870730.0 1000.0 False Low post-filter Bracken count total False
R18T1_PSKB1 WGS post_filter_counts 3023520.0 1000.0 False Low post-filter Bracken count total False
3L16_PWKB2 WGS post_filter_counts 186513.0 1000.0 False Low post-filter Bracken count total False
HaA_PWFBA WGS post_filter_counts 641394.0 1000.0 False Low post-filter Bracken count total False
GrB_PWFBB WGS post_filter_counts 931239.0 1000.0 False Low post-filter Bracken count total False
OFC_PWFBC WGS post_filter_counts 932569.0 1000.0 False Low post-filter Bracken count total False
6M11_PWKB4 WGS post_filter_counts 596483.0 1000.0 False Low post-filter Bracken count total False
PSKB1 WGS post_filter_counts 191810.0 1000.0 False Low post-filter Bracken count total False
P375_PWKB5 WGS post_filter_counts 12287355.0 1000.0 False Low post-filter Bracken count total False
6E02_PWKB3 WGS post_filter_counts 1081639.0 1000.0 False Low post-filter Bracken count total False
6L02_PWKB3 WGS post_filter_counts 191843.0 1000.0 False Low post-filter Bracken count total False
L10T3_PWKB5 WGS post_filter_counts 2288618.0 1000.0 False Low post-filter Bracken count total False
HaB_PWFBB WGS post_filter_counts 1037183.0 1000.0 False Low post-filter Bracken count total False
P375_PSKB1 WGS post_filter_counts 11331652.0 1000.0 False Low post-filter Bracken count total False
R19T1_PSKB1 WGS post_filter_counts 7324409.0 1000.0 False Low post-filter Bracken count total False
R03T1_PSKB1 WGS post_filter_counts 5494533.0 1000.0 False Low post-filter Bracken count total False
OFA_PWFBA WGS post_filter_counts 1347806.0 1000.0 False Low post-filter Bracken count total False
WiA_PWFBA WGS post_filter_counts 626646.0 1000.0 False Low post-filter Bracken count total False
PWFBC WGS post_filter_counts 6603362.0 1000.0 False Low post-filter Bracken count total False
R02T1_PSKB1 WGS post_filter_counts 3968345.0 1000.0 False Low post-filter Bracken count total False
WiB_PWFBB WGS post_filter_counts 644971.0 1000.0 False Low post-filter Bracken count total False
L03T3_PWKB5 WGS post_filter_counts 546220.0 1000.0 False Low post-filter Bracken count total False
R10T1_PSKB1 WGS post_filter_counts 2393675.0 1000.0 False Low post-filter Bracken count total False
6E11_PWKB3 WGS post_filter_counts 2007928.0 1000.0 False Low post-filter Bracken count total False
L18T3_PWKB5 WGS post_filter_counts 1554800.0 1000.0 False Low post-filter Bracken count total False
R02T2_PSKB1 WGS post_filter_counts 2568735.0 1000.0 False Low post-filter Bracken count total False
FB_PWKB4 WGS post_filter_counts 1270337.0 1000.0 False Low post-filter Bracken count total False
L02T3_PWKB5 WGS post_filter_counts 679260.0 1000.0 False Low post-filter Bracken count total False
GrA_PWFBA WGS post_filter_counts 244698.0 1000.0 False Low post-filter Bracken count total False
L11T3_PWKB5 WGS post_filter_counts 9165150.0 1000.0 False Low post-filter Bracken count total False
PWFBB WGS post_filter_counts 1465112.0 1000.0 False Low post-filter Bracken count total False
L19T3_PWKB5 WGS post_filter_counts 7309206.0 1000.0 False Low post-filter Bracken count total False
R11T1_PSKB1 WGS post_filter_counts 4513526.0 1000.0 False Low post-filter Bracken count total False

QC Summary

assay sample flagged_metrics exclude_recommended
16S 1L07_PWKB1 0 False
16S 1L10_PWKB1 0 False
16S 1L14_PWKB1 0 False
16S 3L06_PWKB2 0 False
16S 3L16_PWKB2 0 False
16S 3L18_PWKB2 0 False
16S 6E02_PWKB3 0 False
16S 6E11_PWKB3 0 False
16S 6E13_PWKB3 0 False
16S 6L02_PWKB3 0 False
16S 6L11_PWKB3 0 False
16S 6L13_PWKB3 0 False
16S 6M02_PWKB4 0 False
16S 6M11_PWKB4 0 False
16S 6M13_PWKB4 0 False
16S GrA_PWFBA 0 False
16S GrB_PWFBB 0 False
16S GrC_PWFBC 0 False
16S HaA_PWFBA 0 False
16S HaB_PWFBB 0 False
16S HaC_PWFBC 0 False
16S L02T3_PWKB5 0 False
16S L03T3_PWKB5 0 False
16S L10T3_PWKB5 0 False
16S L11T3_PWKB5 0 False
16S L18T3_PWKB5 0 False
16S L19T3_PWKB5 0 False
16S OFA_PWFBA 0 False
16S OFB_PWFBB 0 False
16S OFC_PWFBC 0 False
16S R02T1_PSKB1 0 False
16S R02T2_PSKB1 0 False
16S R03T1_PSKB1 0 False
16S R03T2_PSKB1 0 False
16S R10T1_PSKB1 0 False
16S R10T2_PSKB1 0 False
16S R11T1_PSKB1 0 False
16S R11T2_PSKB1 0 False
16S R18T1_PSKB1 0 False
16S R18T2_PSKB1 0 False
16S R19T1_PSKB1 0 False
16S R19T2_PSKB1 0 False
16S WiA_PWFBA 0 False
16S WiB_PWFBB 0 False
16S WiC_PWFBC 0 False
ITS 1L07_PWKB1 0 False
ITS 1L10_PWKB1 0 False
ITS 1L14_PWKB1 0 False
ITS 3L06_PWKB2 0 False
ITS 3L16_PWKB2 0 False

Positive Control Evaluation

assay sample total_counts nonzero_features top1_fraction top10_fraction top_features
WGS_species P375_PSKB1 11331652.0 100 0.250709 0.986473 Salmonella enterica:2840944.0000;Escherichia coli:1985121.0000;Pseudomonas aeruginosa:1753812.0000;Bacillus spizizenii:1469103.0000;Staphylococcus aureus:893120.0000;Enterococcus faecalis:890384.0000;Listeria monocytogenes:642953.0000;Limosilactobacillus fermentum:519475.0000;Saccharomyces cerevisiae:155658.0000;Bacillus subtilis:27803.0000
WGS_species P375_PWKB5 12287355.0 92 0.214053 0.985745 Salmonella enterica:2630148.0000;Escherichia coli:1902867.0000;Pseudomonas aeruginosa:1694812.0000;Bacillus spizizenii:1596350.0000;Staphylococcus aureus:1155467.0000;Listeria monocytogenes:1127978.0000;Limosilactobacillus fermentum:962784.0000;Enterococcus faecalis:950700.0000;Saccharomyces cerevisiae:67942.0000;Cryptococcus deneoformans:23156.0000
WGS_genus P375_PSKB1 11332082.0 29 0.251054 0.997220 Salmonella:2844961.0000;Escherichia:1985447.0000;Pseudomonas:1753967.0000;Bacillus:1504857.0000;Enterococcus:947629.0000;Staphylococcus:904337.0000;Listeria:651336.0000;Limosilactobacillus:519471.0000;Saccharomyces:156456.0000;Cryptococcus:32120.0000
WGS_genus P375_PWKB5 12287685.0 27 0.214228 0.997386 Salmonella:2632366.0000;Escherichia:1903137.0000;Pseudomonas:1694944.0000;Bacillus:1621564.0000;Staphylococcus:1167120.0000;Listeria:1140525.0000;Enterococcus:1026341.0000;Limosilactobacillus:962765.0000;Saccharomyces:68232.0000;Cryptococcus:38576.0000
16S P375_PSKB1 372933.0 26 0.215245 0.981013 d46e2205f0c6ecf67b51f83d111c509c:80272;a4cbe987942964b584e95e4efab6a176:78697;4cbfff144d4e7a4e0f4619ed505be070:71504;65d43491988bfe557da4d86a5ba25dae:44756;9908fffab7ed4f3bec44cda2f5084d49:22720;8ae518dbb29595b3f79214be0b589066:22393;ff9d93d7b7e46787568f2d241caeaf3b:20256;3e86e9620a4aceb493a34d5c47723513:10710;581a55014641e1cd55cdc272c0365a28:10581;9d1c07b83f672aca7dbf708f8e673edf:3963
16S P375_PWKB5 281692.0 27 0.207603 0.974220 d46e2205f0c6ecf67b51f83d111c509c:58480;a4cbe987942964b584e95e4efab6a176:53703;4cbfff144d4e7a4e0f4619ed505be070:51266;65d43491988bfe557da4d86a5ba25dae:34847;8ae518dbb29595b3f79214be0b589066:27053;9908fffab7ed4f3bec44cda2f5084d49:16379;581a55014641e1cd55cdc272c0365a28:10844;ff9d93d7b7e46787568f2d241caeaf3b:10206;3e86e9620a4aceb493a34d5c47723513:7579;9d1c07b83f672aca7dbf708f8e673edf:4073
ITS P375_PSKB1 231536.0 19 0.487851 0.986529 83b9d7a396bac90c08985d9a037971e1:112955;6a63af9ff5d36c7c07105f9054d80ed0:76013;113370260a0b9261c9cd012617c71724:8579;4f8139ecd78cd345e921e9a643f66116:7848;cf5a3884d6ce92e04bd4a5bb34962224:7597;fd0a8e52be46a107e52201f0b3856286:5279;6acb96f4ed5ebf265fbb779dc8ca7dcc:3617;a75c6988af18e88f85aa47febafaf195:2231;f56b16f95633db297e7e32627b08d109:2152;62465fd685977f87f8d6aa687735011a:2146
ITS P375_PWKB5 220568.0 19 0.468903 0.999474 83b9d7a396bac90c08985d9a037971e1:103425;6a63af9ff5d36c7c07105f9054d80ed0:73710;4409ddf461681a90f5d248d82094cde9:12331;04a0469bded9dce9be2002757ce602b0:10829;1b2330ff5cad876f25ed6c92536b0cbc:9944;8b2a01a49208e5a967973d7db0b2ab33:5047;3d8cd906f9a0fde9ca4fae02c014d1fb:3296;031f1f32e631fe6d51d09aa5d84cc8b1:797;8f8450bb410a0468811cf226f99519e5:747;87d4c7f38651fa4aa23363b8911faf64:326

Positive Control Benchmark

assay n_positive_controls median_top10_fraction median_nonzero_features median_total_counts
16S 2 0.977616 26.5 327312.5
ITS 2 0.993002 19.0 226052.0
WGS_genus 2 0.997303 28.0 11809883.5
WGS_species 2 0.986109 96.0 11809503.5

Alpha Diversity

Alpha Metrics

sample assay richness shannon simpson evenness library_size
PWKB5 WGS 312 3.903406 0.938905 0.679680 1.0
OFB_PWFBB WGS 1836 5.640630 0.987375 0.750548 1.0
6L11_PWKB3 WGS 327 4.133770 0.955723 0.713955 1.0
WiC_PWFBC WGS 901 5.652212 0.985641 0.830779 1.0
PWKB2 WGS 227 3.903949 0.924407 0.719629 1.0
3L18_PWKB2 WGS 818 4.440670 0.962771 0.662108 1.0
R10T2_PSKB1 WGS 1354 4.538816 0.960294 0.629445 1.0
PWFBA WGS 1879 4.992962 0.955161 0.662329 1.0
PWKB1 WGS 482 4.101376 0.900069 0.663874 1.0
R18T2_PSKB1 WGS 2104 5.529873 0.987834 0.722708 1.0
R19T2_PSKB1 WGS 2520 5.073536 0.975074 0.647795 1.0
R11T2_PSKB1 WGS 1726 4.980057 0.967603 0.668145 1.0
6M02_PWKB4 WGS 853 3.393451 0.798272 0.502826 1.0
PWKB4 WGS 2325 5.055294 0.962450 0.652172 1.0
HaC_PWFBC WGS 1315 5.679795 0.988450 0.790882 1.0
6M13_PWKB4 WGS 821 4.268226 0.957093 0.636050 1.0
1L07_PWKB1 WGS 2070 4.020943 0.943881 0.526625 1.0
R03T2_PSKB1 WGS 1040 4.808660 0.966428 0.692195 1.0
R18T1_PSKB1 WGS 2305 5.439684 0.982916 0.702544 1.0
3L16_PWKB2 WGS 290 3.789243 0.941364 0.668311 1.0
HaA_PWFBA WGS 2031 6.076817 0.992295 0.797872 1.0
GrB_PWFBB WGS 2155 5.676154 0.988530 0.739511 1.0
OFC_PWFBC WGS 2310 5.589036 0.980712 0.721631 1.0
6M11_PWKB4 WGS 801 3.923268 0.931057 0.586801 1.0
PSKB1 WGS 617 5.043418 0.983310 0.784984 1.0
P375_PWKB5 WGS 92 2.143042 0.864852 0.473937 1.0
6E02_PWKB3 WGS 1599 4.944393 0.976388 0.670232 1.0
6L02_PWKB3 WGS 241 3.660373 0.950011 0.667367 1.0
L10T3_PWKB5 WGS 414 3.825167 0.947312 0.634791 1.0
HaB_PWFBB WGS 2216 5.292463 0.975576 0.687024 1.0
P375_PSKB1 WGS 100 2.086642 0.847782 0.453109 1.0
R19T1_PSKB1 WGS 2462 4.317675 0.948260 0.552929 1.0
R03T1_PSKB1 WGS 2256 4.764591 0.969953 0.617067 1.0
OFA_PWFBA WGS 3116 6.352722 0.993937 0.789717 1.0
WiA_PWFBA WGS 1648 6.073117 0.993446 0.819881 1.0
PWFBC WGS 3534 3.191219 0.727930 0.390593 1.0
R02T1_PSKB1 WGS 2616 5.674997 0.989452 0.721147 1.0
WiB_PWFBB WGS 1908 5.909986 0.988205 0.782385 1.0
L03T3_PWKB5 WGS 256 3.254449 0.872557 0.586897 1.0
R10T1_PSKB1 WGS 2084 5.501766 0.984128 0.719934 1.0
6E11_PWKB3 WGS 2957 5.731379 0.988741 0.717146 1.0
L18T3_PWKB5 WGS 758 3.679144 0.902815 0.554867 1.0
R02T2_PSKB1 WGS 1812 5.343380 0.982939 0.712243 1.0
FB_PWKB4 WGS 2463 4.899123 0.911719 0.627358 1.0
L02T3_PWKB5 WGS 349 3.368472 0.906909 0.575308 1.0
GrA_PWFBA WGS 658 4.390302 0.917501 0.676555 1.0
L11T3_PWKB5 WGS 599 1.440901 0.433087 0.225308 1.0
PWFBB WGS 2642 5.438785 0.969404 0.690263 1.0
L19T3_PWKB5 WGS 626 1.398967 0.431926 0.217253 1.0
R11T1_PSKB1 WGS 2773 5.418866 0.983172 0.683537 1.0

Metadata-Wide Alpha Associations

assay metadata_column metadata_type metric test n n_groups statistic pvalue effect_direction qvalue
ITS Date Deployed date shannon spearman 45 NaN 0.747596 3.624354e-09 positive 5.103700e-07
16S Days Collect_to_Extract numeric richness spearman 45 NaN 0.747275 3.711782e-09 positive 5.103700e-07
ITS Date Deployed date evenness spearman 45 NaN 0.734719 9.157831e-09 positive 8.394679e-07
ITS Date Deployed date simpson spearman 45 NaN 0.683759 2.253190e-07 positive 1.549068e-05
WGS Sample Material categorical library_size kruskal 51 5.0 35.361774 3.914587e-07 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 2.153023e-05
16S Days Collect_to_Extract numeric library_size spearman 45 NaN -0.651628 1.245848e-06 negative 5.501214e-05
ITS Date Deployed date library_size spearman 45 NaN -0.649285 1.400309e-06 negative 5.501214e-05
16S Date Deployed date evenness spearman 45 NaN 0.637550 2.477627e-06 positive 7.754516e-05
16S Sample Material categorical richness kruskal 45 4.0 28.614349 2.698714e-06 PDMS|Filter|Rhizosphere|Leaf 7.754516e-05
16S Sample Material categorical evenness kruskal 45 4.0 28.523575 2.819824e-06 PDMS|Filter|Rhizosphere|Leaf 7.754516e-05
WGS Sample Material categorical shannon kruskal 51 5.0 29.976225 4.949286e-06 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 1.195219e-04
16S Extraction Kit categorical richness mannwhitney 45 2.0 20.000000 5.215500e-06 Power Water|PowerSoil Pro 1.195219e-04
WGS paired_negative_control categorical library_size kruskal 56 9.0 38.641657 5.727833e-06 PSKB1|PWKB5|PWKB3|PWFBC|PWFBB 1.211657e-04
ITS Sample Material categorical shannon kruskal 45 4.0 26.110145 9.044138e-06 PDMS|Filter|Rhizosphere|Leaf 1.776527e-04
16S Sample Material categorical simpson kruskal 45 4.0 25.867729 1.016534e-05 PDMS|Filter|Rhizosphere|Leaf 1.863645e-04
16S Sample Material categorical shannon kruskal 45 4.0 25.375266 1.288778e-05 PDMS|Filter|Rhizosphere|Leaf 2.215087e-04
ITS Sample Material categorical evenness kruskal 45 4.0 24.794783 1.704366e-05 PDMS|Filter|Rhizosphere|Leaf 2.757062e-04
16S paired_negative_control categorical richness kruskal 45 9.0 34.677164 3.061203e-05 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 4.676838e-04
ITS Sample Material categorical simpson kruskal 45 4.0 22.896232 4.244494e-05 PDMS|Filter|Rhizosphere|Leaf 6.054960e-04
16S Relevant Kit Blank categorical richness kruskal 33 6.0 27.554516 4.447312e-05 PSKB1|PWKB3|PWKB5|PWKB1|PWKB2 6.054960e-04
16S Days Collect_to_Extract numeric shannon spearman 45 NaN 0.568476 4.623787e-05 positive 6.054960e-04
WGS Sample Material categorical evenness kruskal 51 5.0 24.740875 5.671957e-05 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 7.089946e-04
ITS paired_negative_control categorical richness kruskal 45 9.0 32.932103 6.334743e-05 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 7.400893e-04
WGS Date Deployed date library_size spearman 57 NaN -0.503831 6.458961e-05 negative 7.400893e-04
ITS paired_negative_control categorical library_size kruskal 45 9.0 32.759965 6.803251e-05 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 7.483576e-04
ITS paired_negative_control categorical shannon kruskal 45 9.0 32.599034 7.272074e-05 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 7.691617e-04
WGS paired_negative_control categorical shannon kruskal 56 9.0 32.435553 7.780909e-05 PSKB1|PWKB5|PWKB3|PWFBC|PWFBB 7.925000e-04
WGS Sample Material categorical simpson kruskal 51 5.0 23.817327 8.689895e-05 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 8.534718e-04
16S paired_positive_control categorical library_size mannwhitney 18 2.0 0.000000 1.077354e-04 P375_PSKB1|P375_PWKB5 9.557173e-04
16S paired_positive_control categorical richness mannwhitney 18 2.0 72.000000 1.077354e-04 P375_PSKB1|P375_PWKB5 9.557173e-04
16S paired_positive_control categorical shannon mannwhitney 18 2.0 72.000000 1.077354e-04 P375_PSKB1|P375_PWKB5 9.557173e-04
ITS Sample Material categorical richness kruskal 45 4.0 20.865955 1.122442e-04 PDMS|Filter|Rhizosphere|Leaf 9.645989e-04
WGS Sample Material categorical richness kruskal 51 5.0 23.028639 1.249689e-04 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 1.041407e-03
16S Sample Material categorical library_size kruskal 45 4.0 20.327823 1.451537e-04 PDMS|Filter|Rhizosphere|Leaf 1.174037e-03
WGS Accumulation Time (Days) categorical library_size kruskal 27 4.0 20.117252 1.605048e-04 14.0|252.0|126.0|42.0 1.261109e-03
WGS Airstream categorical library_size kruskal 54 4.0 20.009247 1.689951e-04 Greenwall|Office|Windowsill|Hallway 1.290935e-03
16S paired_negative_control categorical evenness kruskal 45 9.0 30.331884 1.846406e-04 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 1.323066e-03
WGS paired_positive_control categorical shannon mannwhitney 22 2.0 106.000000 1.876349e-04 P375_PSKB1|P375_PWKB5 1.323066e-03
WGS paired_positive_control categorical simpson mannwhitney 22 2.0 106.000000 1.876349e-04 P375_PSKB1|P375_PWKB5 1.323066e-03
ITS Sample Material categorical library_size kruskal 45 4.0 19.560255 2.093528e-04 PDMS|Filter|Rhizosphere|Leaf 1.439300e-03
WGS paired_positive_control categorical library_size mannwhitney 22 2.0 7.000000 2.814523e-04 P375_PSKB1|P375_PWKB5 1.842235e-03
16S Qubit ng/uL numeric richness spearman 37 NaN 0.562774 2.880586e-04 positive 1.842235e-03
16S ng/50uL numeric richness spearman 37 NaN 0.562774 2.880586e-04 positive 1.842235e-03
WGS Days Deploy_to_Collect categorical library_size kruskal 31 5.0 21.153156 2.952778e-04 0.0|216.0|251.0|125.0|41.0 1.845486e-03
ITS paired_negative_control categorical simpson kruskal 45 9.0 28.978744 3.198331e-04 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 1.954536e-03
ITS Accumulation Time (Days) categorical shannon kruskal 27 4.0 18.382716 3.667183e-04 14.0|252.0|126.0|42.0 2.192338e-03
WGS Airstream categorical evenness kruskal 54 4.0 18.234568 3.934700e-04 Greenwall|Office|Windowsill|Hallway 2.284368e-03
16S paired_negative_control categorical shannon kruskal 45 9.0 28.431884 3.987261e-04 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 2.284368e-03
16S paired_positive_control categorical simpson mannwhitney 18 2.0 70.000000 4.309416e-04 P375_PSKB1|P375_PWKB5 2.418550e-03
ITS paired_negative_control categorical evenness kruskal 45 9.0 28.101449 4.553382e-04 PSKB1|PWKB5|PWKB3|PWFBC|PWFBA 2.504360e-03

Beta Diversity and Community Structure

Metadata PERMANOVA

No results produced.

Metadata Ordination Associations

No results produced.

Taxonomic Composition, Source Overlap, and Core Taxa

Core Taxa

feature taxonomy prevalence mean_relative_abundance assay
Homo sapiens Homo sapiens 0.964912 0.055331 WGS
Agrobacterium tumefaciens Agrobacterium tumefaciens 0.877193 0.011979 WGS
Pseudomonas aeruginosa Pseudomonas aeruginosa 0.859649 0.011945 WGS
Pseudonocardia alni Pseudonocardia alni 0.842105 0.011229 WGS

Differential Abundance and Functional Contrasts

Differential Abundance

No results produced.

Metadata Differential Summary

No results produced.

Machine Learning and Metadata Signal Prioritization

Random Forest Top Features

No results produced.

Metadata Random Forest

No results produced.

Metadata Signal Summary

assay metadata_column alpha_tests alpha_significant_q_lt_0_1 composite_signal_score top_related_metadata
16S Relevant Field Blank 5 5 10 paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725
16S Relevant Kit Blank 5 5 10 Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000
16S Sample Material 5 5 10 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980
16S paired_negative_control 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000
16S paired_positive_control 5 5 10 Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000
ITS Accumulation Time (Days) 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000
ITS Airstream 5 5 10 Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480
ITS Date Deployed 5 5 10 Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566
ITS Days Deploy_to_Collect 5 5 10 Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000
ITS Relevant Field Blank 5 5 10 paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725
ITS Sample Material 5 5 10 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980
ITS paired_negative_control 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000
WGS Accumulation Time (Days) 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000
WGS Date Deployed 5 5 10 Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566
WGS Days Deploy_to_Collect 5 5 10 Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000
WGS Relevant Kit Blank 5 5 10 Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000
WGS Sample Material 5 5 10 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980
WGS paired_negative_control 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000
WGS paired_positive_control 5 5 10 Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000
16S Days Collect_to_Extract 5 4 9 Date Extracted:0.626; Qubit ng/uL:0.562; ng/50uL:0.562
ITS Concentration/Filtration Step? 5 4 9 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Relevant Kit Blank:1.000
ITS Qubit ng/uL 5 4 9 ng/50uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562
ITS ng/50uL 5 4 9 Qubit ng/uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562
WGS Airstream 5 4 9 Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480
16S Accumulation Time (Days) 5 3 8 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000
16S Airstream 5 3 8 Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480
16S Date Deployed 5 3 8 Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566
16S Days Deploy_to_Collect 5 3 8 Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000
16S Extraction Kit 5 3 8 paired_negative_control:1.000; paired_positive_control:1.000; Relevant Kit Blank:1.000
16S Qubit ng/uL 5 3 8 ng/50uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562
16S ng/50uL 5 3 8 Qubit ng/uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562
ITS Relevant Kit Blank 5 3 8 Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000
WGS Days Collect_to_Extract 5 3 8 Date Extracted:0.626; Qubit ng/uL:0.562; ng/50uL:0.562
WGS Relevant Field Blank 5 3 8 paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725
ITS Date Extracted 5 2 7 Days Collect_to_Extract:0.626; Date Deployed:0.566; Qubit ng/uL:0.443
WGS Sample Type 5 2 7 Sample Material:0.739; Days Deploy_to_Collect:0.310; Relevant Kit Blank:0.237
16S Concentration/Filtration Step? 5 1 6 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Relevant Kit Blank:1.000
16S Date Extracted 5 1 6 Days Collect_to_Extract:0.626; Date Deployed:0.566; Qubit ng/uL:0.443
ITS Days Collect_to_Extract 5 1 6 Date Extracted:0.626; Qubit ng/uL:0.562; ng/50uL:0.562
ITS Extraction Kit 5 1 6 paired_negative_control:1.000; paired_positive_control:1.000; Relevant Kit Blank:1.000
ITS paired_positive_control 5 1 6 Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000
WGS Concentration/Filtration Step? 5 1 6 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Relevant Kit Blank:1.000
WGS Date Extracted 5 1 6 Days Collect_to_Extract:0.626; Date Deployed:0.566; Qubit ng/uL:0.443
WGS Extraction Kit 5 1 6 paired_negative_control:1.000; paired_positive_control:1.000; Relevant Kit Blank:1.000
WGS Qubit ng/uL 5 1 6 ng/50uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562
WGS ng/50uL 5 1 6 Qubit ng/uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562
16S Date Collected 5 0 5 Qubit ng/uL:0.303; ng/50uL:0.303; Date Extracted:0.250
16S Greenwall Location 5 0 5 NaN
16S Greenwall Surface Position 5 0 5 Accumulation Time (Days):1.000; Greenwall Radial Distance:1.000; Days Deploy_to_Collect:0.816
ITS Date Collected 5 0 5 Qubit ng/uL:0.303; ng/50uL:0.303; Date Extracted:0.250

Metadata Covariate Links

metadata_column_a metadata_type_a metadata_column_b metadata_type_b test association_strength statistic pvalue
Days Deploy_to_Collect categorical Greenwall Radial Distance categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Concentration/Filtration Step? categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Days Deploy_to_Collect categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Greenwall Surface Position categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Sample Material categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical Days Deploy_to_Collect categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
Days Deploy_to_Collect categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Days Deploy_to_Collect categorical Sample Material categorical cramers_v 1.000000 1.000000 NaN
Days Deploy_to_Collect categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Extraction Kit categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Extraction Kit categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
Greenwall Radial Distance categorical Greenwall Surface Position categorical cramers_v 1.000000 1.000000 NaN
Greenwall Radial Distance categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Greenwall Radial Distance categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Qubit ng/uL numeric ng/50uL numeric spearman 1.000000 1.000000 0.000000e+00
Relevant Field Blank categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Relevant Kit Blank categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Relevant Kit Blank categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
paired_negative_control categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Greenwall Radial Distance categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Extraction Kit categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical Sample Material categorical cramers_v 0.979812 0.979812 NaN
Extraction Kit categorical Sample Material categorical cramers_v 0.973848 0.973848 NaN
Sample Material categorical paired_positive_control categorical cramers_v 0.943456 0.943456 NaN
Sample Material categorical paired_negative_control categorical cramers_v 0.850016 0.850016 NaN
Days Deploy_to_Collect categorical Greenwall Surface Position categorical cramers_v 0.816497 0.816497 NaN
Greenwall Surface Position categorical Relevant Kit Blank categorical cramers_v 0.816497 0.816497 NaN
Greenwall Surface Position categorical paired_negative_control categorical cramers_v 0.816497 0.816497 NaN
Relevant Kit Blank categorical Sample Material categorical cramers_v 0.793748 0.793748 NaN
Date Deployed date Qubit ng/uL numeric spearman 0.747437 -0.747437 1.972756e-08
Date Deployed date ng/50uL numeric spearman 0.747437 -0.747437 1.972756e-08
Sample Material categorical Sample Type categorical cramers_v 0.738735 0.738735 NaN
Concentration/Filtration Step? categorical Relevant Field Blank categorical cramers_v 0.725476 0.725476 NaN
Relevant Field Blank categorical Sample Material categorical cramers_v 0.725476 0.725476 NaN
Date Extracted date Days Collect_to_Extract numeric spearman 0.626333 0.626333 5.266479e-07
Airstream categorical Concentration/Filtration Step? categorical cramers_v 0.583095 0.583095 NaN
Date Deployed date Date Extracted date spearman 0.565612 -0.565612 4.547414e-06
Days Collect_to_Extract numeric Qubit ng/uL numeric spearman 0.561994 0.561994 1.968216e-04
Days Collect_to_Extract numeric ng/50uL numeric spearman 0.561994 0.561994 1.968216e-04
Days Deploy_to_Collect categorical Relevant Field Blank categorical cramers_v 0.512989 0.512989 NaN
Accumulation Time (Days) categorical Relevant Field Blank categorical cramers_v 0.512989 0.512989 NaN
Airstream categorical paired_negative_control categorical cramers_v 0.482654 0.482654 NaN
Airstream categorical Sample Material categorical cramers_v 0.480384 0.480384 NaN
Concentration/Filtration Step? categorical Extraction Kit categorical cramers_v 0.469304 0.469304 NaN
Airstream categorical Days Deploy_to_Collect categorical cramers_v 0.458831 0.458831 NaN

Hypothesis Answer Layer

Hypothesis to Output Map

category study_question primary_groups secondary_groups mapped_outputs
Methodology Are there differences in 16S / WGS / ITS outcomes Sample Material: Leaf, Rhizosphere, Zymo Mock Community NaN cross_assay_correlations.tsv; metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; study_overview.tsv
Diversity Are the collection of microorganisms found in the greenwall related to human health in any way? Sample Material: Leaf, Rhizosphere NaN beneficial_taxa_summary.tsv; core_taxa.tsv; differential_abundance.tsv; metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN Sample Material: Leaf, Rhizosphere NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized Is the microbiome of the "greenwall room" different from the other airstreams? Sample Material: Filter Airstream: Hallway, Office, Greenwall, Windowsill metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv
Uncategorized NaN Sample Material: Filter Airstream: Hallway, Office, Greenwall, Windowsill metadata_column_catalog.tsv; study_overview.tsv
Uncategorized Is there evidence of diversification of the "greenwall room" microbiome by greenwall microorganisms? Sample Material: Filter\nAirstream: Greenwall Sample Material: Leaf, Rhizosphere diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv
Uncategorized NaN Sample Material: PDMS Sample Material: Leaf, Rhizosphere metadata_column_catalog.tsv; study_overview.tsv
Metabolism Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams? Sample Material: Leaf, Rhizosphere NaN metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_differential_abundance_summary.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv; wgs_function_top_differences.tsv
Uncategorized NaN Sample Material: Leaf, Rhizosphere NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv

Markdown Summary

# GREENWALL Hypothesis-Oriented Analysis Summary

This report is organized as a logical analysis sequence modeled on a publication-style microbiome report.

## Executive Summary

- Imported hypotheses: 9
- Manifest samples: 57
- Metadata columns scanned: 20
- Metadata-linked signals at q < 0.1: 159

## Hypothesis Verdict Cards

- **Mixed**: Are there differences in 16S / WGS / ITS outcomes
  Evidence: Top metadata-linked signals were 16S:Relevant Field Blank; 16S:Relevant Kit Blank; 16S:Sample Material. 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_covariate_links.tsv; metadata_signal_summary.tsv; positive_control_benchmark.tsv
- **Mixed**: Are the collection of microorganisms found in the greenwall related to human health in any way?
  Evidence: 10 beneficial-taxa keyword hits were detected across assays. 4 positive-control benchmark summaries were generated.
  Outputs: beneficial_taxa_summary.tsv; core_taxa.tsv; hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Mixed**: Is the microbiome of the "greenwall room" different from the other airstreams?
  Evidence: Maximum source-overlap fraction observed was 0.797. 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Mixed**: Is there evidence of diversification of the "greenwall room" microbiome by greenwall  microorganisms?
  Evidence: 159 alpha-diversity metadata associations passed q < 0.1. 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_alpha_associations.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Mixed**: Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams?
  Evidence: Maximum source-overlap fraction observed was 0.797. 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 4 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

## Logical Steps

1. **Study Overview and QC**
   - Purpose: Summarize assay coverage, sample inventory, control annotations, and high-level dataset structure.
   - Outputs: study_overview.tsv; metadata_column_catalog.tsv; hypotheses.imported.tsv
2. **Alpha Diversity**
   - Purpose: Compute richness, Shannon, Simpson, evenness, and test them against targeted contrasts and every usable metadata column.
   - Outputs: diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv
3. **Beta Diversity and Community Structure**
   - Purpose: Build Bray-Curtis dissimilarities, PCoA ordinations, and PERMANOVA summaries for assay-specific and metadata-driven effects.
   - Outputs: pcoa_coordinates.tsv; permanova.tsv; metadata_permanova.tsv; metadata_pcoa_associations.tsv
4. **Taxonomic Composition and Beneficial Taxa**
   - Purpose: Screen for beneficial taxa, summarize source overlap, and prepare interpretable taxonomic summaries.
   - Outputs: beneficial_taxa_summary.tsv; source_overlap.tsv
5. **Differential Abundance and Function**
   - Purpose: Test assay- and metadata-driven contrasts for taxonomic and WGS functional differences.
   - Outputs: differential_abundance.tsv; metadata_differential_abundance_summary.tsv; wgs_function_top_differences.tsv
6. **Core Microbiome and Cross-Assay Concordance**
   - Purpose: Define core taxa, assess overlap across assays, and quantify assay bias or concordance on shared samples.
   - Outputs: core_taxa.tsv; cross_assay_sample_metrics.tsv; cross_assay_correlations.tsv
7. **Machine Learning and Network Structure**
   - Purpose: Rank discriminatory features and summarize co-occurrence structure for hypothesis generation.
   - Outputs: random_forest_summary.tsv; random_forest_top_features.tsv; metadata_random_forest_summary.tsv; cooccurrence_network_edges.tsv
8. **Hypothesis Answer Layer**
   - Purpose: Map outputs back to the stated hypotheses and surface the strongest metadata-linked signals.
   - Outputs: hypothesis_to_output_map.tsv; metadata_signal_summary.tsv; hypothesis_summary.md

## Step 1. Study Overview and QC

- Manifest rows: 57
- Manifest columns: 30
- Testable metadata columns: 23
- QC flag rows: 204
- Samples with recommended exclusion: 0

## Step 2. Alpha Diversity

- Targeted contrasts: 24
- Metadata-wide alpha tests: 275
- Top metadata-linked alpha signals:
```tsv
assay	metadata_column	metadata_type	metric	test	n	n_groups	statistic	pvalue	effect_direction	qvalue
ITS	Date Deployed	date	shannon	spearman	45		0.7475963859174021	3.624354368116372e-09	positive	5.103700080070658e-07
16S	Days Collect_to_Extract	numeric	richness	spearman	45		0.7472745509538982	3.711781876415024e-09	positive	5.103700080070658e-07
ITS	Date Deployed	date	evenness	spearman	45		0.7347186662999966	9.157831334004277e-09	positive	8.394678722837255e-07
ITS	Date Deployed	date	simpson	spearman	45		0.6837587063166806	2.2531898568860227e-07	positive	1.5490680266091405e-05
WGS	Sample Material	categorical	library_size	kruskal	51	5.0	35.361773624185375	3.9145866564586555e-07	PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community	2.1530226610522606e-05
```

## Step 3. Beta Diversity and Community Structure

- Assay-specific PERMANOVA rows: 0
- Metadata PERMANOVA rows: 0
- Ordination metadata tests: 0

## Step 4. Taxonomic Composition and Beneficial Taxa

- Beneficial taxa screen rows: 15
- Source overlap rows: 6
- Positive-control diagnostic rows: 8

## Step 5. Differential Abundance and Function

- Taxonomic differential abundance rows: 0
- Metadata differential summaries: 0
- WGS functional contrasts: 0

## Step 6. Core Microbiome and Cross-Assay Concordance

- Core taxa rows: 4
- Cross-assay correlation rows: 12

## Step 7. Machine Learning and Networks

- Primary Random Forest summaries: 0
- Metadata Random Forest summaries: 0

## Step 8. Hypothesis Answer Layer

- Highest-signal metadata fields across all assays/tests:
```tsv
assay	metadata_column	alpha_tests	alpha_significant_q_lt_0_1	composite_signal_score	top_related_metadata
16S	Relevant Field Blank	5	5	10	paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725
16S	Relevant Kit Blank	5	5	10	Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000
16S	Sample Material	5	5	10	Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980
16S	paired_negative_control	5	5	10	Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000
16S	paired_positive_control	5	5	10	Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000
ITS	Accumulation Time (Days)	5	5	10	Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000
ITS	Airstream	5	5	10	Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480
ITS	Date Deployed	5	5	10	Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566
ITS	Days Deploy_to_Collect	5	5	10	Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000
ITS	Relevant Field Blank	5	5	10	paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725
```
- Strongest metadata-metadata covariate links:
```tsv
metadata_column_a	metadata_type_a	metadata_column_b	metadata_type_b	test	association_strength	statistic	pvalue
Days Deploy_to_Collect	categorical	Greenwall Radial Distance	categorical	cramers_v	1.0000000000000002	1.0000000000000002	
Accumulation Time (Days)	categorical	Concentration/Filtration Step?	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Days Deploy_to_Collect	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Greenwall Surface Position	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Relevant Kit Blank	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Sample Material	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	Days Deploy_to_Collect	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	Relevant Kit Blank	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	paired_negative_control	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	paired_positive_control	categorical	cramers_v	1.0	1.0
```