GREENWALL MICROBIOME ANALYSIS

Explorable Report

Publication-style, stepwise summary of WGS, 16S, and ITS workflow outputs, statistical tests, metadata-wide associations, control diagnostics, and direct hypothesis-linked conclusions.

Manifest Samples57
Testable Metadata Columns23
Hypothesis Cards13
QC Exclusions Recommended0

Assays Included in This Report

This report explicitly checks the hypothesis-suite inputs for WGS, 16S, and ITS. Included means nonzero sample and feature counts were loaded from the completed workflow exports.

WGSIncluded

57 samples; 8007 features loaded into the hypothesis/reporting layer.

16SNot detected

0 samples; 0 features loaded into the hypothesis/reporting layer.

ITSNot detected

0 samples; 0 features loaded into the hypothesis/reporting layer.

Hypothesis Verdict Cards

Mixed

Are there differences in 16S / WGS / ITS outcomes

Top metadata-linked signals were WGS:Accumulation Time (Days); WGS:Date Deployed; WGS:Days Deploy_to_Collect. 2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_covariate_links.tsv; metadata_signal_summary.tsv; positive_control_benchmark.tsv

Mixed

Are the collection of microorganisms found in the greenwall related to human health in any way?

5 beneficial-taxa keyword hits were detected across assays. 2 positive-control benchmark summaries were generated.

Outputs: beneficial_taxa_summary.tsv; core_taxa.tsv; hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Mixed

Is the microbiome of the "greenwall room" different from the other airstreams?

Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Mixed

Is there evidence of diversification of the "greenwall room" microbiome by greenwall microorganisms?

46 alpha-diversity metadata associations passed q < 0.1. 2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_alpha_associations.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Mixed

Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams?

Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Not Supported Yet

nan

2 positive-control benchmark summaries were generated.

Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

Logical Steps

  1. Step 1. Study Overview and QC
    Summarize assay coverage, sample inventory, control annotations, and high-level dataset structure.
    study_overview.tsv; metadata_column_catalog.tsv; hypotheses.imported.tsv
  2. Step 2. Alpha Diversity
    Compute richness, Shannon, Simpson, evenness, and test them against targeted contrasts and every usable metadata column.
    diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv
  3. Step 3. Beta Diversity and Community Structure
    Build Bray-Curtis dissimilarities, PCoA ordinations, and PERMANOVA summaries for assay-specific and metadata-driven effects.
    pcoa_coordinates.tsv; permanova.tsv; metadata_permanova.tsv; metadata_pcoa_associations.tsv
  4. Step 4. Taxonomic Composition and Beneficial Taxa
    Screen for beneficial taxa, summarize source overlap, and prepare interpretable taxonomic summaries.
    beneficial_taxa_summary.tsv; source_overlap.tsv
  5. Step 5. Differential Abundance and Function
    Test assay- and metadata-driven contrasts for taxonomic and WGS functional differences.
    differential_abundance.tsv; metadata_differential_abundance_summary.tsv; wgs_function_top_differences.tsv
  6. Step 6. Core Microbiome and Cross-Assay Concordance
    Define core taxa, assess overlap across assays, and quantify assay bias or concordance on shared samples.
    core_taxa.tsv; cross_assay_sample_metrics.tsv; cross_assay_correlations.tsv
  7. Step 7. Machine Learning and Network Structure
    Rank discriminatory features and summarize co-occurrence structure for hypothesis generation.
    random_forest_summary.tsv; random_forest_top_features.tsv; metadata_random_forest_summary.tsv; cooccurrence_network_edges.tsv
  8. Step 8. Hypothesis Answer Layer
    Map outputs back to the stated hypotheses and surface the strongest metadata-linked signals.
    hypothesis_to_output_map.tsv; metadata_signal_summary.tsv; hypothesis_summary.md

Study Overview and QC

section metric value
dataset manifest_rows 57
dataset manifest_columns 30
dataset hypothesis_testable_metadata_columns 23
assay_summary WGS_samples 57
assay_summary WGS_features 8007
assay_summary 16S_samples 0
assay_summary 16S_features 0
assay_summary ITS_samples 0
assay_summary ITS_features 0
Sample Material PDMS 16
Sample Material Filter 15
Sample Material Rhizosphere 12
Sample Material Leaf 6
Sample Material Zymo Mock Community 2
Airstream Greenwall 45
Airstream Hallway 3
Airstream Office 3
Airstream Windowsill 3
Extraction Kit Power Water 43
Extraction Kit PowerSoil Pro 14
Sample Type Experimental 45
Sample Type Negative Control 10
Sample Type Positive Control 2
controls paired_negative_control_annotated 56
controls paired_positive_control_annotated 22
alpha WGS_alpha_samples 57

Metadata Catalog

column type non_null_samples unique_values usable analysis_roles
Accumulation Time (Days) categorical 57 5 True alpha,beta,differential,classification
Airstream categorical 57 5 True alpha,beta,differential,classification
Concentration/Filtration Step? categorical 57 2 True alpha,beta,differential,classification
Days Deploy_to_Collect categorical 57 6 True alpha,beta,differential,classification
Extraction Kit categorical 57 2 True alpha,beta,differential,classification
Greenwall Location categorical 57 7 True alpha,beta,differential,classification
Greenwall Radial Distance categorical 57 16 True alpha,beta,differential,classification
Greenwall Surface Position categorical 57 10 True alpha,beta,differential,classification
Relevant Field Blank categorical 57 4 True alpha,beta,differential,classification
Relevant Kit Blank categorical 57 7 True alpha,beta,differential,classification
Sample Material categorical 57 6 True alpha,beta,differential,classification
Sample Type categorical 57 3 True alpha,beta,differential,classification
paired_negative_control categorical 57 10 True alpha,beta,differential,classification
paired_positive_control categorical 57 3 True alpha,beta,differential,classification
Date Collected date 53 10 True alpha,ordination
Date Deployed date 31 5 True alpha,ordination
Date Extracted date 57 8 True alpha,ordination
Days Collect_to_Extract numeric 53 7 True alpha,ordination
Qubit ng/uL numeric 41 39 True alpha,ordination
ng/50uL numeric 41 39 True alpha,ordination
16S assay_flag 57 1 False alpha,beta,differential,classification
ITS assay_flag 57 1 False alpha,beta,differential,classification
WGS assay_flag 57 1 False alpha,beta,differential,classification
its_r1 path 57 57 False NaN
its_r2 path 57 57 False NaN
s16_r1 path 57 57 False NaN
s16_r2 path 57 57 False NaN
wgs_r1 path 57 57 False NaN
wgs_r2 path 57 57 False NaN

QC Flags

sample assay metric value threshold flag reason exclude_recommended
PWFBA WGS post_filter_counts 815924.0 1000.0 False Low post-filter Bracken count total False
OFC_PWFBC WGS post_filter_counts 932569.0 1000.0 False Low post-filter Bracken count total False
6L02_PWKB3 WGS post_filter_counts 191843.0 1000.0 False Low post-filter Bracken count total False
HaA_PWFBA WGS post_filter_counts 641394.0 1000.0 False Low post-filter Bracken count total False
PWKB5 WGS post_filter_counts 93942.0 1000.0 False Low post-filter Bracken count total False
PWKB3 WGS post_filter_counts 357641.0 1000.0 False Low post-filter Bracken count total False
WiA_PWFBA WGS post_filter_counts 626646.0 1000.0 False Low post-filter Bracken count total False
PWKB1 WGS post_filter_counts 124933.0 1000.0 False Low post-filter Bracken count total False
6L13_PWKB3 WGS post_filter_counts 430785.0 1000.0 False Low post-filter Bracken count total False
6L11_PWKB3 WGS post_filter_counts 199098.0 1000.0 False Low post-filter Bracken count total False
L18T3_PWKB5 WGS post_filter_counts 1554800.0 1000.0 False Low post-filter Bracken count total False
R10T2_PSKB1 WGS post_filter_counts 2310438.0 1000.0 False Low post-filter Bracken count total False
GrC_PWFBC WGS post_filter_counts 1060935.0 1000.0 False Low post-filter Bracken count total False
R18T2_PSKB1 WGS post_filter_counts 2975934.0 1000.0 False Low post-filter Bracken count total False
R11T1_PSKB1 WGS post_filter_counts 4513526.0 1000.0 False Low post-filter Bracken count total False
R11T2_PSKB1 WGS post_filter_counts 2648318.0 1000.0 False Low post-filter Bracken count total False
OFB_PWFBB WGS post_filter_counts 674258.0 1000.0 False Low post-filter Bracken count total False
3L06_PWKB2 WGS post_filter_counts 161911.0 1000.0 False Low post-filter Bracken count total False
R10T1_PSKB1 WGS post_filter_counts 2393675.0 1000.0 False Low post-filter Bracken count total False
PWKB2 WGS post_filter_counts 57930.0 1000.0 False Low post-filter Bracken count total False
PWFBB WGS post_filter_counts 1465112.0 1000.0 False Low post-filter Bracken count total False
L11T3_PWKB5 WGS post_filter_counts 9165150.0 1000.0 False Low post-filter Bracken count total False
6M11_PWKB4 WGS post_filter_counts 596483.0 1000.0 False Low post-filter Bracken count total False
WiC_PWFBC WGS post_filter_counts 194200.0 1000.0 False Low post-filter Bracken count total False
HaC_PWFBC WGS post_filter_counts 283713.0 1000.0 False Low post-filter Bracken count total False
FB_PWKB4 WGS post_filter_counts 1270337.0 1000.0 False Low post-filter Bracken count total False
R02T1_PSKB1 WGS post_filter_counts 3968345.0 1000.0 False Low post-filter Bracken count total False
OFA_PWFBA WGS post_filter_counts 1347806.0 1000.0 False Low post-filter Bracken count total False
3L16_PWKB2 WGS post_filter_counts 186513.0 1000.0 False Low post-filter Bracken count total False
PSKB1 WGS post_filter_counts 191810.0 1000.0 False Low post-filter Bracken count total False
1L14_PWKB1 WGS post_filter_counts 243032.0 1000.0 False Low post-filter Bracken count total False
L03T3_PWKB5 WGS post_filter_counts 546220.0 1000.0 False Low post-filter Bracken count total False
6E11_PWKB3 WGS post_filter_counts 2007928.0 1000.0 False Low post-filter Bracken count total False
P375_PWKB5 WGS post_filter_counts 12287355.0 1000.0 False Low post-filter Bracken count total False
WiB_PWFBB WGS post_filter_counts 644971.0 1000.0 False Low post-filter Bracken count total False
P375_PSKB1 WGS post_filter_counts 11331652.0 1000.0 False Low post-filter Bracken count total False
R19T1_PSKB1 WGS post_filter_counts 7324409.0 1000.0 False Low post-filter Bracken count total False
R02T2_PSKB1 WGS post_filter_counts 2568735.0 1000.0 False Low post-filter Bracken count total False
L10T3_PWKB5 WGS post_filter_counts 2288618.0 1000.0 False Low post-filter Bracken count total False
6E13_PWKB3 WGS post_filter_counts 1700897.0 1000.0 False Low post-filter Bracken count total False
1L10_PWKB1 WGS post_filter_counts 61682.0 1000.0 False Low post-filter Bracken count total False
PWKB4 WGS post_filter_counts 1268280.0 1000.0 False Low post-filter Bracken count total False
6E02_PWKB3 WGS post_filter_counts 1081639.0 1000.0 False Low post-filter Bracken count total False
L19T3_PWKB5 WGS post_filter_counts 7309206.0 1000.0 False Low post-filter Bracken count total False
HaB_PWFBB WGS post_filter_counts 1037183.0 1000.0 False Low post-filter Bracken count total False
6M13_PWKB4 WGS post_filter_counts 526051.0 1000.0 False Low post-filter Bracken count total False
GrA_PWFBA WGS post_filter_counts 244698.0 1000.0 False Low post-filter Bracken count total False
1L07_PWKB1 WGS post_filter_counts 4719919.0 1000.0 False Low post-filter Bracken count total False
PWFBC WGS post_filter_counts 6603362.0 1000.0 False Low post-filter Bracken count total False
L02T3_PWKB5 WGS post_filter_counts 679260.0 1000.0 False Low post-filter Bracken count total False

QC Summary

assay sample flagged_metrics exclude_recommended
WGS 1L07_PWKB1 0 False
WGS 1L10_PWKB1 0 False
WGS 1L14_PWKB1 0 False
WGS 3L06_PWKB2 0 False
WGS 3L16_PWKB2 0 False
WGS 3L18_PWKB2 0 False
WGS 6E02_PWKB3 0 False
WGS 6E11_PWKB3 0 False
WGS 6E13_PWKB3 0 False
WGS 6L02_PWKB3 0 False
WGS 6L11_PWKB3 0 False
WGS 6L13_PWKB3 0 False
WGS 6M02_PWKB4 0 False
WGS 6M11_PWKB4 0 False
WGS 6M13_PWKB4 0 False
WGS FB_PWKB4 0 False
WGS GrA_PWFBA 0 False
WGS GrB_PWFBB 0 False
WGS GrC_PWFBC 0 False
WGS HaA_PWFBA 0 False
WGS HaB_PWFBB 0 False
WGS HaC_PWFBC 0 False
WGS L02T3_PWKB5 0 False
WGS L03T3_PWKB5 0 False
WGS L10T3_PWKB5 0 False
WGS L11T3_PWKB5 0 False
WGS L18T3_PWKB5 0 False
WGS L19T3_PWKB5 0 False
WGS OFA_PWFBA 0 False
WGS OFB_PWFBB 0 False
WGS OFC_PWFBC 0 False
WGS P375_PSKB1 0 False
WGS P375_PWKB5 0 False
WGS PSKB1 0 False
WGS PWFBA 0 False
WGS PWFBB 0 False
WGS PWFBC 0 False
WGS PWKB1 0 False
WGS PWKB2 0 False
WGS PWKB3 0 False
WGS PWKB4 0 False
WGS PWKB5 0 False
WGS R02T1_PSKB1 0 False
WGS R02T2_PSKB1 0 False
WGS R03T1_PSKB1 0 False
WGS R03T2_PSKB1 0 False
WGS R10T1_PSKB1 0 False
WGS R10T2_PSKB1 0 False
WGS R11T1_PSKB1 0 False
WGS R11T2_PSKB1 0 False

Positive Control Evaluation

assay sample total_counts nonzero_features top1_fraction top10_fraction top_features
WGS_species P375_PSKB1 11331652.0 100 0.250709 0.986473 Salmonella enterica:2840944.0000;Escherichia coli:1985121.0000;Pseudomonas aeruginosa:1753812.0000;Bacillus spizizenii:1469103.0000;Staphylococcus aureus:893120.0000;Enterococcus faecalis:890384.0000;Listeria monocytogenes:642953.0000;Limosilactobacillus fermentum:519475.0000;Saccharomyces cerevisiae:155658.0000;Bacillus subtilis:27803.0000
WGS_species P375_PWKB5 12287355.0 92 0.214053 0.985745 Salmonella enterica:2630148.0000;Escherichia coli:1902867.0000;Pseudomonas aeruginosa:1694812.0000;Bacillus spizizenii:1596350.0000;Staphylococcus aureus:1155467.0000;Listeria monocytogenes:1127978.0000;Limosilactobacillus fermentum:962784.0000;Enterococcus faecalis:950700.0000;Saccharomyces cerevisiae:67942.0000;Cryptococcus deneoformans:23156.0000
WGS_genus P375_PSKB1 11332082.0 29 0.251054 0.997220 Salmonella:2844961.0000;Escherichia:1985447.0000;Pseudomonas:1753967.0000;Bacillus:1504857.0000;Enterococcus:947629.0000;Staphylococcus:904337.0000;Listeria:651336.0000;Limosilactobacillus:519471.0000;Saccharomyces:156456.0000;Cryptococcus:32120.0000
WGS_genus P375_PWKB5 12287685.0 27 0.214228 0.997386 Salmonella:2632366.0000;Escherichia:1903137.0000;Pseudomonas:1694944.0000;Bacillus:1621564.0000;Staphylococcus:1167120.0000;Listeria:1140525.0000;Enterococcus:1026341.0000;Limosilactobacillus:962765.0000;Saccharomyces:68232.0000;Cryptococcus:38576.0000

Positive Control Benchmark

assay n_positive_controls median_top10_fraction median_nonzero_features median_total_counts
WGS_genus 2 0.997303 28.0 11809883.5
WGS_species 2 0.986109 96.0 11809503.5

Alpha Diversity

Alpha Metrics

sample assay richness shannon simpson evenness library_size
PWFBA WGS 1879 4.992962 0.955161 0.662329 1.0
OFC_PWFBC WGS 2310 5.589036 0.980712 0.721631 1.0
6L02_PWKB3 WGS 241 3.660373 0.950011 0.667367 1.0
HaA_PWFBA WGS 2031 6.076817 0.992295 0.797872 1.0
PWKB5 WGS 312 3.903406 0.938905 0.679680 1.0
PWKB3 WGS 1129 4.822211 0.929649 0.686037 1.0
WiA_PWFBA WGS 1648 6.073117 0.993446 0.819881 1.0
PWKB1 WGS 482 4.101376 0.900069 0.663874 1.0
6L13_PWKB3 WGS 492 4.299825 0.962901 0.693690 1.0
6L11_PWKB3 WGS 327 4.133770 0.955723 0.713955 1.0
L18T3_PWKB5 WGS 758 3.679144 0.902815 0.554867 1.0
R10T2_PSKB1 WGS 1354 4.538816 0.960294 0.629445 1.0
GrC_PWFBC WGS 2532 5.848288 0.987012 0.746263 1.0
R18T2_PSKB1 WGS 2104 5.529873 0.987834 0.722708 1.0
R11T1_PSKB1 WGS 2773 5.418866 0.983172 0.683537 1.0
R11T2_PSKB1 WGS 1726 4.980057 0.967603 0.668145 1.0
OFB_PWFBB WGS 1836 5.640630 0.987375 0.750548 1.0
3L06_PWKB2 WGS 205 3.422721 0.912644 0.643005 1.0
R10T1_PSKB1 WGS 2084 5.501766 0.984128 0.719934 1.0
PWKB2 WGS 227 3.903949 0.924407 0.719629 1.0
PWFBB WGS 2642 5.438785 0.969404 0.690263 1.0
L11T3_PWKB5 WGS 599 1.440901 0.433087 0.225308 1.0
6M11_PWKB4 WGS 801 3.923268 0.931057 0.586801 1.0
WiC_PWFBC WGS 901 5.652212 0.985641 0.830779 1.0
HaC_PWFBC WGS 1315 5.679795 0.988450 0.790882 1.0
FB_PWKB4 WGS 2463 4.899123 0.911719 0.627358 1.0
R02T1_PSKB1 WGS 2616 5.674997 0.989452 0.721147 1.0
OFA_PWFBA WGS 3116 6.352722 0.993937 0.789717 1.0
3L16_PWKB2 WGS 290 3.789243 0.941364 0.668311 1.0
PSKB1 WGS 617 5.043418 0.983310 0.784984 1.0
1L14_PWKB1 WGS 211 3.471870 0.929943 0.648722 1.0
L03T3_PWKB5 WGS 256 3.254449 0.872557 0.586897 1.0
6E11_PWKB3 WGS 2957 5.731379 0.988741 0.717146 1.0
P375_PWKB5 WGS 92 2.143042 0.864852 0.473937 1.0
WiB_PWFBB WGS 1908 5.909986 0.988205 0.782385 1.0
P375_PSKB1 WGS 100 2.086642 0.847782 0.453109 1.0
R19T1_PSKB1 WGS 2462 4.317675 0.948260 0.552929 1.0
R02T2_PSKB1 WGS 1812 5.343380 0.982939 0.712243 1.0
L10T3_PWKB5 WGS 414 3.825167 0.947312 0.634791 1.0
6E13_PWKB3 WGS 2492 5.402886 0.985967 0.690832 1.0
1L10_PWKB1 WGS 87 2.742041 0.857450 0.613994 1.0
PWKB4 WGS 2325 5.055294 0.962450 0.652172 1.0
6E02_PWKB3 WGS 1599 4.944393 0.976388 0.670232 1.0
L19T3_PWKB5 WGS 626 1.398967 0.431926 0.217253 1.0
HaB_PWFBB WGS 2216 5.292463 0.975576 0.687024 1.0
6M13_PWKB4 WGS 821 4.268226 0.957093 0.636050 1.0
GrA_PWFBA WGS 658 4.390302 0.917501 0.676555 1.0
1L07_PWKB1 WGS 2070 4.020943 0.943881 0.526625 1.0
PWFBC WGS 3534 3.191219 0.727930 0.390593 1.0
L02T3_PWKB5 WGS 349 3.368472 0.906909 0.575308 1.0

Metadata-Wide Alpha Associations

assay metadata_column metadata_type metric test n n_groups statistic pvalue effect_direction qvalue
WGS Sample Material categorical library_size kruskal 51 5.0 35.361774 3.914587e-07 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 0.000037
WGS Sample Material categorical shannon kruskal 51 5.0 29.976225 4.949286e-06 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 0.000181
WGS paired_negative_control categorical library_size kruskal 56 9.0 38.641657 5.727833e-06 PSKB1|PWKB5|PWKB3|PWFBA|PWFBC 0.000181
WGS Sample Material categorical evenness kruskal 51 5.0 24.740875 5.671957e-05 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 0.001179
WGS Date Deployed date library_size spearman 57 NaN -0.503831 6.458961e-05 negative 0.001179
WGS paired_negative_control categorical shannon kruskal 56 9.0 32.435553 7.780909e-05 PSKB1|PWKB5|PWKB3|PWFBA|PWFBC 0.001179
WGS Sample Material categorical simpson kruskal 51 5.0 23.817327 8.689895e-05 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 0.001179
WGS Sample Material categorical richness kruskal 51 5.0 23.028639 1.249689e-04 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 0.001484
WGS Accumulation Time (Days) categorical library_size kruskal 27 4.0 20.117252 1.605048e-04 14.0|252.0|126.0|42.0 0.001485
WGS Airstream categorical library_size kruskal 54 4.0 20.009247 1.689951e-04 Greenwall|Office|Hallway|Windowsill 0.001485
WGS paired_positive_control categorical shannon mannwhitney 22 2.0 106.000000 1.876349e-04 P375_PSKB1|P375_PWKB5 0.001485
WGS paired_positive_control categorical simpson mannwhitney 22 2.0 106.000000 1.876349e-04 P375_PSKB1|P375_PWKB5 0.001485
WGS paired_positive_control categorical library_size mannwhitney 22 2.0 7.000000 2.814523e-04 P375_PSKB1|P375_PWKB5 0.002004
WGS Days Deploy_to_Collect categorical library_size kruskal 31 5.0 21.153156 2.952778e-04 0.0|216.0|251.0|125.0|41.0 0.002004
WGS Airstream categorical evenness kruskal 54 4.0 18.234568 3.934700e-04 Greenwall|Office|Hallway|Windowsill 0.002492
WGS paired_negative_control categorical simpson kruskal 56 9.0 27.803652 5.130594e-04 PSKB1|PWKB5|PWKB3|PWFBA|PWFBC 0.003046
WGS paired_positive_control categorical richness mannwhitney 22 2.0 103.000000 6.004316e-04 P375_PSKB1|P375_PWKB5 0.003186
WGS Airstream categorical shannon kruskal 54 4.0 17.248350 6.282988e-04 Greenwall|Office|Hallway|Windowsill 0.003186
WGS Accumulation Time (Days) categorical evenness kruskal 27 4.0 17.218695 6.371906e-04 14.0|252.0|126.0|42.0 0.003186
WGS paired_negative_control categorical evenness kruskal 56 9.0 26.521791 8.546927e-04 PSKB1|PWKB5|PWKB3|PWFBA|PWFBC 0.004060
WGS Accumulation Time (Days) categorical shannon kruskal 27 4.0 16.408730 9.348762e-04 14.0|252.0|126.0|42.0 0.004229
WGS Relevant Kit Blank categorical simpson kruskal 35 6.0 20.427490 1.038667e-03 PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 0.004485
WGS paired_negative_control categorical richness kruskal 56 9.0 25.556190 1.250537e-03 PSKB1|PWKB5|PWKB3|PWFBA|PWFBC 0.005165
WGS Relevant Kit Blank categorical shannon kruskal 35 6.0 19.732147 1.402917e-03 PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 0.005553
WGS Days Deploy_to_Collect categorical evenness kruskal 31 5.0 17.119960 1.831914e-03 0.0|216.0|251.0|125.0|41.0 0.006961
WGS Relevant Kit Blank categorical library_size kruskal 35 6.0 18.713196 2.173423e-03 PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 0.007941
WGS Airstream categorical simpson kruskal 54 4.0 14.452391 2.349786e-03 Greenwall|Office|Hallway|Windowsill 0.008268
WGS Date Deployed date shannon spearman 57 NaN 0.385438 3.067904e-03 positive 0.010409
WGS Relevant Kit Blank categorical evenness kruskal 35 6.0 17.495622 3.649714e-03 PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 0.011956
WGS Days Deploy_to_Collect categorical shannon kruskal 31 5.0 15.359879 4.010157e-03 0.0|216.0|251.0|125.0|41.0 0.012699
WGS Date Deployed date evenness spearman 57 NaN 0.367714 4.892973e-03 positive 0.014995
WGS Days Collect_to_Extract numeric richness spearman 53 NaN 0.374833 5.685888e-03 positive 0.016880
WGS Accumulation Time (Days) categorical simpson kruskal 27 4.0 12.372134 6.211263e-03 14.0|252.0|126.0|42.0 0.017790
WGS paired_positive_control categorical evenness mannwhitney 22 2.0 95.000000 6.367076e-03 P375_PSKB1|P375_PWKB5 0.017790
WGS Days Deploy_to_Collect categorical simpson kruskal 31 5.0 12.828629 1.214422e-02 0.0|216.0|251.0|125.0|41.0 0.032963
WGS Relevant Kit Blank categorical richness kruskal 35 6.0 14.463440 1.291871e-02 PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 0.034091
WGS Days Collect_to_Extract numeric simpson spearman 53 NaN 0.330067 1.579173e-02 positive 0.040546
WGS Relevant Field Blank categorical library_size kruskal 27 3.0 8.232254 1.630755e-02 PWFBA|PWFBC|PWFBB 0.040769
WGS Relevant Field Blank categorical evenness kruskal 27 3.0 8.012949 1.819744e-02 PWFBA|PWFBC|PWFBB 0.044327
WGS Sample Type categorical simpson kruskal 57 3.0 7.847026 1.977152e-02 Experimental|Negative Control|Positive Control 0.046957
WGS Date Deployed date simpson spearman 57 NaN 0.306143 2.055645e-02 positive 0.047631
WGS Accumulation Time (Days) categorical richness kruskal 27 4.0 9.276455 2.583236e-02 14.0|252.0|126.0|42.0 0.058430
WGS Days Deploy_to_Collect categorical richness kruskal 31 5.0 10.271169 3.609939e-02 0.0|216.0|251.0|125.0|41.0 0.079754
WGS Days Collect_to_Extract numeric shannon spearman 53 NaN 0.280910 4.159829e-02 positive 0.089814
WGS Date Extracted date evenness spearman 57 NaN -0.267315 4.440822e-02 negative 0.093751
WGS Date Deployed date richness spearman 57 NaN 0.264627 4.667448e-02 positive 0.096393
WGS Concentration/Filtration Step? categorical library_size mannwhitney 57 2.0 509.500000 5.492372e-02 No|Yes 0.104875
WGS Qubit ng/uL numeric library_size spearman 41 NaN 0.301246 5.562174e-02 positive 0.104875
WGS ng/50uL numeric library_size spearman 41 NaN 0.301246 5.562174e-02 positive 0.104875
WGS Sample Type categorical library_size kruskal 57 3.0 5.765377 5.598404e-02 Experimental|Negative Control|Positive Control 0.104875

Beta Diversity and Community Structure

Metadata PERMANOVA

No results produced.

Metadata Ordination Associations

No results produced.

Taxonomic Composition, Source Overlap, and Core Taxa

Core Taxa

feature taxonomy prevalence mean_relative_abundance assay
Homo sapiens Homo sapiens 0.964912 0.055331 WGS
Agrobacterium tumefaciens Agrobacterium tumefaciens 0.877193 0.011979 WGS
Pseudomonas aeruginosa Pseudomonas aeruginosa 0.859649 0.011945 WGS
Pseudonocardia alni Pseudonocardia alni 0.842105 0.011229 WGS

Differential Abundance and Functional Contrasts

Differential Abundance

No results produced.

Metadata Differential Summary

No results produced.

Machine Learning and Metadata Signal Prioritization

Random Forest Top Features

No results produced.

Metadata Random Forest

No results produced.

Metadata Signal Summary

assay metadata_column alpha_tests alpha_significant_q_lt_0_1 composite_signal_score top_related_metadata
WGS Accumulation Time (Days) 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000
WGS Date Deployed 5 5 10 Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566
WGS Days Deploy_to_Collect 5 5 10 Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000
WGS Relevant Kit Blank 5 5 10 Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000
WGS Sample Material 5 5 10 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980
WGS paired_negative_control 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000
WGS paired_positive_control 5 5 10 Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000
WGS Airstream 5 4 9 Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480
WGS Days Collect_to_Extract 5 3 8 Date Extracted:0.626; Qubit ng/uL:0.562; ng/50uL:0.562
WGS Relevant Field Blank 5 2 7 paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725
WGS Date Extracted 5 1 6 Days Collect_to_Extract:0.626; Date Deployed:0.566; Qubit ng/uL:0.443
WGS Sample Type 5 1 6 Sample Material:0.739; Days Deploy_to_Collect:0.310; Relevant Kit Blank:0.237
WGS Concentration/Filtration Step? 5 0 5 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Relevant Kit Blank:1.000
WGS Date Collected 5 0 5 Qubit ng/uL:0.303; ng/50uL:0.303; Date Extracted:0.250
WGS Extraction Kit 5 0 5 paired_negative_control:1.000; paired_positive_control:1.000; Relevant Kit Blank:1.000
WGS Greenwall Location 5 0 5 NaN
WGS Greenwall Surface Position 5 0 5 Accumulation Time (Days):1.000; Greenwall Radial Distance:1.000; Days Deploy_to_Collect:0.816
WGS Qubit ng/uL 5 0 5 ng/50uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562
WGS ng/50uL 5 0 5 Qubit ng/uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562

Metadata Covariate Links

metadata_column_a metadata_type_a metadata_column_b metadata_type_b test association_strength statistic pvalue
Days Deploy_to_Collect categorical Greenwall Radial Distance categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Concentration/Filtration Step? categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Days Deploy_to_Collect categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Greenwall Surface Position categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Sample Material categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical Days Deploy_to_Collect categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
Days Deploy_to_Collect categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Days Deploy_to_Collect categorical Sample Material categorical cramers_v 1.000000 1.000000 NaN
Days Deploy_to_Collect categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Extraction Kit categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Extraction Kit categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
Greenwall Radial Distance categorical Greenwall Surface Position categorical cramers_v 1.000000 1.000000 NaN
Greenwall Radial Distance categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Greenwall Radial Distance categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Qubit ng/uL numeric ng/50uL numeric spearman 1.000000 1.000000 0.000000e+00
Relevant Field Blank categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Relevant Kit Blank categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Relevant Kit Blank categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
paired_negative_control categorical paired_positive_control categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical Greenwall Radial Distance categorical cramers_v 1.000000 1.000000 NaN
Accumulation Time (Days) categorical paired_negative_control categorical cramers_v 1.000000 1.000000 NaN
Extraction Kit categorical Relevant Kit Blank categorical cramers_v 1.000000 1.000000 NaN
Concentration/Filtration Step? categorical Sample Material categorical cramers_v 0.979812 0.979812 NaN
Extraction Kit categorical Sample Material categorical cramers_v 0.973848 0.973848 NaN
Sample Material categorical paired_positive_control categorical cramers_v 0.943456 0.943456 NaN
Sample Material categorical paired_negative_control categorical cramers_v 0.850016 0.850016 NaN
Days Deploy_to_Collect categorical Greenwall Surface Position categorical cramers_v 0.816497 0.816497 NaN
Greenwall Surface Position categorical Relevant Kit Blank categorical cramers_v 0.816497 0.816497 NaN
Greenwall Surface Position categorical paired_negative_control categorical cramers_v 0.816497 0.816497 NaN
Relevant Kit Blank categorical Sample Material categorical cramers_v 0.793748 0.793748 NaN
Date Deployed date Qubit ng/uL numeric spearman 0.747437 -0.747437 1.972756e-08
Date Deployed date ng/50uL numeric spearman 0.747437 -0.747437 1.972756e-08
Sample Material categorical Sample Type categorical cramers_v 0.738735 0.738735 NaN
Concentration/Filtration Step? categorical Relevant Field Blank categorical cramers_v 0.725476 0.725476 NaN
Relevant Field Blank categorical Sample Material categorical cramers_v 0.725476 0.725476 NaN
Date Extracted date Days Collect_to_Extract numeric spearman 0.626333 0.626333 5.266479e-07
Airstream categorical Concentration/Filtration Step? categorical cramers_v 0.583095 0.583095 NaN
Date Deployed date Date Extracted date spearman 0.565612 -0.565612 4.547414e-06
Days Collect_to_Extract numeric Qubit ng/uL numeric spearman 0.561994 0.561994 1.968216e-04
Days Collect_to_Extract numeric ng/50uL numeric spearman 0.561994 0.561994 1.968216e-04
Days Deploy_to_Collect categorical Relevant Field Blank categorical cramers_v 0.512989 0.512989 NaN
Accumulation Time (Days) categorical Relevant Field Blank categorical cramers_v 0.512989 0.512989 NaN
Airstream categorical paired_negative_control categorical cramers_v 0.482654 0.482654 NaN
Airstream categorical Sample Material categorical cramers_v 0.480384 0.480384 NaN
Concentration/Filtration Step? categorical Extraction Kit categorical cramers_v 0.469304 0.469304 NaN
Airstream categorical Days Deploy_to_Collect categorical cramers_v 0.458831 0.458831 NaN

Hypothesis Answer Layer

Hypothesis to Output Map

category study_question primary_groups secondary_groups mapped_outputs
Methodology Are there differences in 16S / WGS / ITS outcomes Sample Material: Leaf, Rhizosphere, Zymo Mock Community NaN cross_assay_correlations.tsv; metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; study_overview.tsv
Diversity Are the collection of microorganisms found in the greenwall related to human health in any way? Sample Material: Leaf, Rhizosphere NaN beneficial_taxa_summary.tsv; core_taxa.tsv; differential_abundance.tsv; metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN Sample Material: Leaf, Rhizosphere NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized Is the microbiome of the "greenwall room" different from the other airstreams? Sample Material: Filter Airstream: Hallway, Office, Greenwall, Windowsill metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv
Uncategorized NaN Sample Material: Filter Airstream: Hallway, Office, Greenwall, Windowsill metadata_column_catalog.tsv; study_overview.tsv
Uncategorized Is there evidence of diversification of the "greenwall room" microbiome by greenwall microorganisms? Sample Material: Filter\nAirstream: Greenwall Sample Material: Leaf, Rhizosphere diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv
Uncategorized NaN Sample Material: PDMS Sample Material: Leaf, Rhizosphere metadata_column_catalog.tsv; study_overview.tsv
Metabolism Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams? Sample Material: Leaf, Rhizosphere NaN metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_differential_abundance_summary.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv; wgs_function_top_differences.tsv
Uncategorized NaN Sample Material: Leaf, Rhizosphere NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv
Uncategorized NaN NaN NaN metadata_column_catalog.tsv; study_overview.tsv

Markdown Summary

# GREENWALL Hypothesis-Oriented Analysis Summary

This report is organized as a logical analysis sequence modeled on a publication-style microbiome report.

## Executive Summary

- Imported hypotheses: 9
- Manifest samples: 57
- Metadata columns scanned: 20
- Metadata-linked signals at q < 0.1: 46

## Hypothesis Verdict Cards

- **Mixed**: Are there differences in 16S / WGS / ITS outcomes
  Evidence: Top metadata-linked signals were WGS:Accumulation Time (Days); WGS:Date Deployed; WGS:Days Deploy_to_Collect. 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_covariate_links.tsv; metadata_signal_summary.tsv; positive_control_benchmark.tsv
- **Mixed**: Are the collection of microorganisms found in the greenwall related to human health in any way?
  Evidence: 5 beneficial-taxa keyword hits were detected across assays. 2 positive-control benchmark summaries were generated.
  Outputs: beneficial_taxa_summary.tsv; core_taxa.tsv; hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Mixed**: Is the microbiome of the "greenwall room" different from the other airstreams?
  Evidence: Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Mixed**: Is there evidence of diversification of the "greenwall room" microbiome by greenwall  microorganisms?
  Evidence: 46 alpha-diversity metadata associations passed q < 0.1. 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_alpha_associations.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Mixed**: Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams?
  Evidence: Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
- **Not Supported Yet**: 
  Evidence: 2 positive-control benchmark summaries were generated.
  Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv

## Logical Steps

1. **Study Overview and QC**
   - Purpose: Summarize assay coverage, sample inventory, control annotations, and high-level dataset structure.
   - Outputs: study_overview.tsv; metadata_column_catalog.tsv; hypotheses.imported.tsv
2. **Alpha Diversity**
   - Purpose: Compute richness, Shannon, Simpson, evenness, and test them against targeted contrasts and every usable metadata column.
   - Outputs: diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv
3. **Beta Diversity and Community Structure**
   - Purpose: Build Bray-Curtis dissimilarities, PCoA ordinations, and PERMANOVA summaries for assay-specific and metadata-driven effects.
   - Outputs: pcoa_coordinates.tsv; permanova.tsv; metadata_permanova.tsv; metadata_pcoa_associations.tsv
4. **Taxonomic Composition and Beneficial Taxa**
   - Purpose: Screen for beneficial taxa, summarize source overlap, and prepare interpretable taxonomic summaries.
   - Outputs: beneficial_taxa_summary.tsv; source_overlap.tsv
5. **Differential Abundance and Function**
   - Purpose: Test assay- and metadata-driven contrasts for taxonomic and WGS functional differences.
   - Outputs: differential_abundance.tsv; metadata_differential_abundance_summary.tsv; wgs_function_top_differences.tsv
6. **Core Microbiome and Cross-Assay Concordance**
   - Purpose: Define core taxa, assess overlap across assays, and quantify assay bias or concordance on shared samples.
   - Outputs: core_taxa.tsv; cross_assay_sample_metrics.tsv; cross_assay_correlations.tsv
7. **Machine Learning and Network Structure**
   - Purpose: Rank discriminatory features and summarize co-occurrence structure for hypothesis generation.
   - Outputs: random_forest_summary.tsv; random_forest_top_features.tsv; metadata_random_forest_summary.tsv; cooccurrence_network_edges.tsv
8. **Hypothesis Answer Layer**
   - Purpose: Map outputs back to the stated hypotheses and surface the strongest metadata-linked signals.
   - Outputs: hypothesis_to_output_map.tsv; metadata_signal_summary.tsv; hypothesis_summary.md

## Step 1. Study Overview and QC

- Manifest rows: 57
- Manifest columns: 30
- Testable metadata columns: 23
- QC flag rows: 57
- Samples with recommended exclusion: 0

## Step 2. Alpha Diversity

- Targeted contrasts: 8
- Metadata-wide alpha tests: 95
- Top metadata-linked alpha signals:
```tsv
assay	metadata_column	metadata_type	metric	test	n	n_groups	statistic	pvalue	effect_direction	qvalue
WGS	Sample Material	categorical	library_size	kruskal	51	5.0	35.361773624185375	3.9145866564586555e-07	PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community	3.718857323635723e-05
WGS	Sample Material	categorical	shannon	kruskal	51	5.0	29.97622549019607	4.949285909900996e-06	PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community	0.0001813813893402897
WGS	paired_negative_control	categorical	library_size	kruskal	56	9.0	38.641656575471494	5.727833347588096e-06	PSKB1|PWKB5|PWKB3|PWFBA|PWFBC	0.0001813813893402897
WGS	Sample Material	categorical	evenness	kruskal	51	5.0	24.740874811463073	5.671956757119998e-05	PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community	0.0011793428847587607
WGS	Date Deployed	date	library_size	spearman	57		-0.5038310692604261	6.458960756609466e-05	negative	0.0011793428847587607
```

## Step 3. Beta Diversity and Community Structure

- Assay-specific PERMANOVA rows: 0
- Metadata PERMANOVA rows: 0
- Ordination metadata tests: 0

## Step 4. Taxonomic Composition and Beneficial Taxa

- Beneficial taxa screen rows: 5
- Source overlap rows: 2
- Positive-control diagnostic rows: 4

## Step 5. Differential Abundance and Function

- Taxonomic differential abundance rows: 0
- Metadata differential summaries: 0
- WGS functional contrasts: 0

## Step 6. Core Microbiome and Cross-Assay Concordance

- Core taxa rows: 4
- Cross-assay correlation rows: 0

## Step 7. Machine Learning and Networks

- Primary Random Forest summaries: 0
- Metadata Random Forest summaries: 0

## Step 8. Hypothesis Answer Layer

- Highest-signal metadata fields across all assays/tests:
```tsv
assay	metadata_column	alpha_tests	alpha_significant_q_lt_0_1	composite_signal_score	top_related_metadata
WGS	Accumulation Time (Days)	5	5	10	Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000
WGS	Date Deployed	5	5	10	Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566
WGS	Days Deploy_to_Collect	5	5	10	Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000
WGS	Relevant Kit Blank	5	5	10	Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000
WGS	Sample Material	5	5	10	Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980
WGS	paired_negative_control	5	5	10	Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000
WGS	paired_positive_control	5	5	10	Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000
WGS	Airstream	5	4	9	Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480
WGS	Days Collect_to_Extract	5	3	8	Date Extracted:0.626; Qubit ng/uL:0.562; ng/50uL:0.562
WGS	Relevant Field Blank	5	2	7	paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725
```
- Strongest metadata-metadata covariate links:
```tsv
metadata_column_a	metadata_type_a	metadata_column_b	metadata_type_b	test	association_strength	statistic	pvalue
Days Deploy_to_Collect	categorical	Greenwall Radial Distance	categorical	cramers_v	1.0000000000000002	1.0000000000000002	
Accumulation Time (Days)	categorical	Concentration/Filtration Step?	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Days Deploy_to_Collect	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Greenwall Surface Position	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Relevant Kit Blank	categorical	cramers_v	1.0	1.0	
Accumulation Time (Days)	categorical	Sample Material	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	Days Deploy_to_Collect	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	Relevant Kit Blank	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	paired_negative_control	categorical	cramers_v	1.0	1.0	
Concentration/Filtration Step?	categorical	paired_positive_control	categorical	cramers_v	1.0	1.0
```