GREENWALL MICROBIOME ANALYSIS
Publication-style, stepwise summary of WGS, 16S, and ITS workflow outputs, statistical tests, metadata-wide associations, control diagnostics, and direct hypothesis-linked conclusions.
This report explicitly checks the hypothesis-suite inputs for WGS, 16S, and ITS. Included means nonzero sample and feature counts were loaded from the completed workflow exports.
57 samples; 8007 features loaded into the hypothesis/reporting layer.
0 samples; 0 features loaded into the hypothesis/reporting layer.
0 samples; 0 features loaded into the hypothesis/reporting layer.
Top metadata-linked signals were WGS:Accumulation Time (Days); WGS:Date Deployed; WGS:Days Deploy_to_Collect. 2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_covariate_links.tsv; metadata_signal_summary.tsv; positive_control_benchmark.tsv
5 beneficial-taxa keyword hits were detected across assays. 2 positive-control benchmark summaries were generated.
Outputs: beneficial_taxa_summary.tsv; core_taxa.tsv; hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
46 alpha-diversity metadata associations passed q < 0.1. 2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_alpha_associations.tsv; positive_control_benchmark.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
2 positive-control benchmark summaries were generated.
Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv
study_overview.tsv; metadata_column_catalog.tsv; hypotheses.imported.tsvdiversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsvpcoa_coordinates.tsv; permanova.tsv; metadata_permanova.tsv; metadata_pcoa_associations.tsvbeneficial_taxa_summary.tsv; source_overlap.tsvdifferential_abundance.tsv; metadata_differential_abundance_summary.tsv; wgs_function_top_differences.tsvcore_taxa.tsv; cross_assay_sample_metrics.tsv; cross_assay_correlations.tsvrandom_forest_summary.tsv; random_forest_top_features.tsv; metadata_random_forest_summary.tsv; cooccurrence_network_edges.tsvhypothesis_to_output_map.tsv; metadata_signal_summary.tsv; hypothesis_summary.md| section | metric | value |
|---|---|---|
| dataset | manifest_rows | 57 |
| dataset | manifest_columns | 30 |
| dataset | hypothesis_testable_metadata_columns | 23 |
| assay_summary | WGS_samples | 57 |
| assay_summary | WGS_features | 8007 |
| assay_summary | 16S_samples | 0 |
| assay_summary | 16S_features | 0 |
| assay_summary | ITS_samples | 0 |
| assay_summary | ITS_features | 0 |
| Sample Material | PDMS | 16 |
| Sample Material | Filter | 15 |
| Sample Material | Rhizosphere | 12 |
| Sample Material | Leaf | 6 |
| Sample Material | Zymo Mock Community | 2 |
| Airstream | Greenwall | 45 |
| Airstream | Hallway | 3 |
| Airstream | Office | 3 |
| Airstream | Windowsill | 3 |
| Extraction Kit | Power Water | 43 |
| Extraction Kit | PowerSoil Pro | 14 |
| Sample Type | Experimental | 45 |
| Sample Type | Negative Control | 10 |
| Sample Type | Positive Control | 2 |
| controls | paired_negative_control_annotated | 56 |
| controls | paired_positive_control_annotated | 22 |
| alpha | WGS_alpha_samples | 57 |
| column | type | non_null_samples | unique_values | usable | analysis_roles |
|---|---|---|---|---|---|
| Accumulation Time (Days) | categorical | 57 | 5 | True | alpha,beta,differential,classification |
| Airstream | categorical | 57 | 5 | True | alpha,beta,differential,classification |
| Concentration/Filtration Step? | categorical | 57 | 2 | True | alpha,beta,differential,classification |
| Days Deploy_to_Collect | categorical | 57 | 6 | True | alpha,beta,differential,classification |
| Extraction Kit | categorical | 57 | 2 | True | alpha,beta,differential,classification |
| Greenwall Location | categorical | 57 | 7 | True | alpha,beta,differential,classification |
| Greenwall Radial Distance | categorical | 57 | 16 | True | alpha,beta,differential,classification |
| Greenwall Surface Position | categorical | 57 | 10 | True | alpha,beta,differential,classification |
| Relevant Field Blank | categorical | 57 | 4 | True | alpha,beta,differential,classification |
| Relevant Kit Blank | categorical | 57 | 7 | True | alpha,beta,differential,classification |
| Sample Material | categorical | 57 | 6 | True | alpha,beta,differential,classification |
| Sample Type | categorical | 57 | 3 | True | alpha,beta,differential,classification |
| paired_negative_control | categorical | 57 | 10 | True | alpha,beta,differential,classification |
| paired_positive_control | categorical | 57 | 3 | True | alpha,beta,differential,classification |
| Date Collected | date | 53 | 10 | True | alpha,ordination |
| Date Deployed | date | 31 | 5 | True | alpha,ordination |
| Date Extracted | date | 57 | 8 | True | alpha,ordination |
| Days Collect_to_Extract | numeric | 53 | 7 | True | alpha,ordination |
| Qubit ng/uL | numeric | 41 | 39 | True | alpha,ordination |
| ng/50uL | numeric | 41 | 39 | True | alpha,ordination |
| 16S | assay_flag | 57 | 1 | False | alpha,beta,differential,classification |
| ITS | assay_flag | 57 | 1 | False | alpha,beta,differential,classification |
| WGS | assay_flag | 57 | 1 | False | alpha,beta,differential,classification |
| its_r1 | path | 57 | 57 | False | NaN |
| its_r2 | path | 57 | 57 | False | NaN |
| s16_r1 | path | 57 | 57 | False | NaN |
| s16_r2 | path | 57 | 57 | False | NaN |
| wgs_r1 | path | 57 | 57 | False | NaN |
| wgs_r2 | path | 57 | 57 | False | NaN |
| sample | assay | metric | value | threshold | flag | reason | exclude_recommended |
|---|---|---|---|---|---|---|---|
| PWFBA | WGS | post_filter_counts | 815924.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| OFC_PWFBC | WGS | post_filter_counts | 932569.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6L02_PWKB3 | WGS | post_filter_counts | 191843.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| HaA_PWFBA | WGS | post_filter_counts | 641394.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PWKB5 | WGS | post_filter_counts | 93942.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PWKB3 | WGS | post_filter_counts | 357641.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| WiA_PWFBA | WGS | post_filter_counts | 626646.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PWKB1 | WGS | post_filter_counts | 124933.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6L13_PWKB3 | WGS | post_filter_counts | 430785.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6L11_PWKB3 | WGS | post_filter_counts | 199098.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| L18T3_PWKB5 | WGS | post_filter_counts | 1554800.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R10T2_PSKB1 | WGS | post_filter_counts | 2310438.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| GrC_PWFBC | WGS | post_filter_counts | 1060935.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R18T2_PSKB1 | WGS | post_filter_counts | 2975934.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R11T1_PSKB1 | WGS | post_filter_counts | 4513526.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R11T2_PSKB1 | WGS | post_filter_counts | 2648318.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| OFB_PWFBB | WGS | post_filter_counts | 674258.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 3L06_PWKB2 | WGS | post_filter_counts | 161911.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R10T1_PSKB1 | WGS | post_filter_counts | 2393675.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PWKB2 | WGS | post_filter_counts | 57930.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PWFBB | WGS | post_filter_counts | 1465112.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| L11T3_PWKB5 | WGS | post_filter_counts | 9165150.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6M11_PWKB4 | WGS | post_filter_counts | 596483.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| WiC_PWFBC | WGS | post_filter_counts | 194200.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| HaC_PWFBC | WGS | post_filter_counts | 283713.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| FB_PWKB4 | WGS | post_filter_counts | 1270337.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R02T1_PSKB1 | WGS | post_filter_counts | 3968345.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| OFA_PWFBA | WGS | post_filter_counts | 1347806.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 3L16_PWKB2 | WGS | post_filter_counts | 186513.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PSKB1 | WGS | post_filter_counts | 191810.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 1L14_PWKB1 | WGS | post_filter_counts | 243032.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| L03T3_PWKB5 | WGS | post_filter_counts | 546220.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6E11_PWKB3 | WGS | post_filter_counts | 2007928.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| P375_PWKB5 | WGS | post_filter_counts | 12287355.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| WiB_PWFBB | WGS | post_filter_counts | 644971.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| P375_PSKB1 | WGS | post_filter_counts | 11331652.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R19T1_PSKB1 | WGS | post_filter_counts | 7324409.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| R02T2_PSKB1 | WGS | post_filter_counts | 2568735.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| L10T3_PWKB5 | WGS | post_filter_counts | 2288618.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6E13_PWKB3 | WGS | post_filter_counts | 1700897.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 1L10_PWKB1 | WGS | post_filter_counts | 61682.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PWKB4 | WGS | post_filter_counts | 1268280.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6E02_PWKB3 | WGS | post_filter_counts | 1081639.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| L19T3_PWKB5 | WGS | post_filter_counts | 7309206.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| HaB_PWFBB | WGS | post_filter_counts | 1037183.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 6M13_PWKB4 | WGS | post_filter_counts | 526051.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| GrA_PWFBA | WGS | post_filter_counts | 244698.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| 1L07_PWKB1 | WGS | post_filter_counts | 4719919.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| PWFBC | WGS | post_filter_counts | 6603362.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| L02T3_PWKB5 | WGS | post_filter_counts | 679260.0 | 1000.0 | False | Low post-filter Bracken count total | False |
| assay | sample | flagged_metrics | exclude_recommended |
|---|---|---|---|
| WGS | 1L07_PWKB1 | 0 | False |
| WGS | 1L10_PWKB1 | 0 | False |
| WGS | 1L14_PWKB1 | 0 | False |
| WGS | 3L06_PWKB2 | 0 | False |
| WGS | 3L16_PWKB2 | 0 | False |
| WGS | 3L18_PWKB2 | 0 | False |
| WGS | 6E02_PWKB3 | 0 | False |
| WGS | 6E11_PWKB3 | 0 | False |
| WGS | 6E13_PWKB3 | 0 | False |
| WGS | 6L02_PWKB3 | 0 | False |
| WGS | 6L11_PWKB3 | 0 | False |
| WGS | 6L13_PWKB3 | 0 | False |
| WGS | 6M02_PWKB4 | 0 | False |
| WGS | 6M11_PWKB4 | 0 | False |
| WGS | 6M13_PWKB4 | 0 | False |
| WGS | FB_PWKB4 | 0 | False |
| WGS | GrA_PWFBA | 0 | False |
| WGS | GrB_PWFBB | 0 | False |
| WGS | GrC_PWFBC | 0 | False |
| WGS | HaA_PWFBA | 0 | False |
| WGS | HaB_PWFBB | 0 | False |
| WGS | HaC_PWFBC | 0 | False |
| WGS | L02T3_PWKB5 | 0 | False |
| WGS | L03T3_PWKB5 | 0 | False |
| WGS | L10T3_PWKB5 | 0 | False |
| WGS | L11T3_PWKB5 | 0 | False |
| WGS | L18T3_PWKB5 | 0 | False |
| WGS | L19T3_PWKB5 | 0 | False |
| WGS | OFA_PWFBA | 0 | False |
| WGS | OFB_PWFBB | 0 | False |
| WGS | OFC_PWFBC | 0 | False |
| WGS | P375_PSKB1 | 0 | False |
| WGS | P375_PWKB5 | 0 | False |
| WGS | PSKB1 | 0 | False |
| WGS | PWFBA | 0 | False |
| WGS | PWFBB | 0 | False |
| WGS | PWFBC | 0 | False |
| WGS | PWKB1 | 0 | False |
| WGS | PWKB2 | 0 | False |
| WGS | PWKB3 | 0 | False |
| WGS | PWKB4 | 0 | False |
| WGS | PWKB5 | 0 | False |
| WGS | R02T1_PSKB1 | 0 | False |
| WGS | R02T2_PSKB1 | 0 | False |
| WGS | R03T1_PSKB1 | 0 | False |
| WGS | R03T2_PSKB1 | 0 | False |
| WGS | R10T1_PSKB1 | 0 | False |
| WGS | R10T2_PSKB1 | 0 | False |
| WGS | R11T1_PSKB1 | 0 | False |
| WGS | R11T2_PSKB1 | 0 | False |
| assay | sample | total_counts | nonzero_features | top1_fraction | top10_fraction | top_features |
|---|---|---|---|---|---|---|
| WGS_species | P375_PSKB1 | 11331652.0 | 100 | 0.250709 | 0.986473 | Salmonella enterica:2840944.0000;Escherichia coli:1985121.0000;Pseudomonas aeruginosa:1753812.0000;Bacillus spizizenii:1469103.0000;Staphylococcus aureus:893120.0000;Enterococcus faecalis:890384.0000;Listeria monocytogenes:642953.0000;Limosilactobacillus fermentum:519475.0000;Saccharomyces cerevisiae:155658.0000;Bacillus subtilis:27803.0000 |
| WGS_species | P375_PWKB5 | 12287355.0 | 92 | 0.214053 | 0.985745 | Salmonella enterica:2630148.0000;Escherichia coli:1902867.0000;Pseudomonas aeruginosa:1694812.0000;Bacillus spizizenii:1596350.0000;Staphylococcus aureus:1155467.0000;Listeria monocytogenes:1127978.0000;Limosilactobacillus fermentum:962784.0000;Enterococcus faecalis:950700.0000;Saccharomyces cerevisiae:67942.0000;Cryptococcus deneoformans:23156.0000 |
| WGS_genus | P375_PSKB1 | 11332082.0 | 29 | 0.251054 | 0.997220 | Salmonella:2844961.0000;Escherichia:1985447.0000;Pseudomonas:1753967.0000;Bacillus:1504857.0000;Enterococcus:947629.0000;Staphylococcus:904337.0000;Listeria:651336.0000;Limosilactobacillus:519471.0000;Saccharomyces:156456.0000;Cryptococcus:32120.0000 |
| WGS_genus | P375_PWKB5 | 12287685.0 | 27 | 0.214228 | 0.997386 | Salmonella:2632366.0000;Escherichia:1903137.0000;Pseudomonas:1694944.0000;Bacillus:1621564.0000;Staphylococcus:1167120.0000;Listeria:1140525.0000;Enterococcus:1026341.0000;Limosilactobacillus:962765.0000;Saccharomyces:68232.0000;Cryptococcus:38576.0000 |
| assay | n_positive_controls | median_top10_fraction | median_nonzero_features | median_total_counts |
|---|---|---|---|---|
| WGS_genus | 2 | 0.997303 | 28.0 | 11809883.5 |
| WGS_species | 2 | 0.986109 | 96.0 | 11809503.5 |
| sample | assay | richness | shannon | simpson | evenness | library_size |
|---|---|---|---|---|---|---|
| PWFBA | WGS | 1879 | 4.992962 | 0.955161 | 0.662329 | 1.0 |
| OFC_PWFBC | WGS | 2310 | 5.589036 | 0.980712 | 0.721631 | 1.0 |
| 6L02_PWKB3 | WGS | 241 | 3.660373 | 0.950011 | 0.667367 | 1.0 |
| HaA_PWFBA | WGS | 2031 | 6.076817 | 0.992295 | 0.797872 | 1.0 |
| PWKB5 | WGS | 312 | 3.903406 | 0.938905 | 0.679680 | 1.0 |
| PWKB3 | WGS | 1129 | 4.822211 | 0.929649 | 0.686037 | 1.0 |
| WiA_PWFBA | WGS | 1648 | 6.073117 | 0.993446 | 0.819881 | 1.0 |
| PWKB1 | WGS | 482 | 4.101376 | 0.900069 | 0.663874 | 1.0 |
| 6L13_PWKB3 | WGS | 492 | 4.299825 | 0.962901 | 0.693690 | 1.0 |
| 6L11_PWKB3 | WGS | 327 | 4.133770 | 0.955723 | 0.713955 | 1.0 |
| L18T3_PWKB5 | WGS | 758 | 3.679144 | 0.902815 | 0.554867 | 1.0 |
| R10T2_PSKB1 | WGS | 1354 | 4.538816 | 0.960294 | 0.629445 | 1.0 |
| GrC_PWFBC | WGS | 2532 | 5.848288 | 0.987012 | 0.746263 | 1.0 |
| R18T2_PSKB1 | WGS | 2104 | 5.529873 | 0.987834 | 0.722708 | 1.0 |
| R11T1_PSKB1 | WGS | 2773 | 5.418866 | 0.983172 | 0.683537 | 1.0 |
| R11T2_PSKB1 | WGS | 1726 | 4.980057 | 0.967603 | 0.668145 | 1.0 |
| OFB_PWFBB | WGS | 1836 | 5.640630 | 0.987375 | 0.750548 | 1.0 |
| 3L06_PWKB2 | WGS | 205 | 3.422721 | 0.912644 | 0.643005 | 1.0 |
| R10T1_PSKB1 | WGS | 2084 | 5.501766 | 0.984128 | 0.719934 | 1.0 |
| PWKB2 | WGS | 227 | 3.903949 | 0.924407 | 0.719629 | 1.0 |
| PWFBB | WGS | 2642 | 5.438785 | 0.969404 | 0.690263 | 1.0 |
| L11T3_PWKB5 | WGS | 599 | 1.440901 | 0.433087 | 0.225308 | 1.0 |
| 6M11_PWKB4 | WGS | 801 | 3.923268 | 0.931057 | 0.586801 | 1.0 |
| WiC_PWFBC | WGS | 901 | 5.652212 | 0.985641 | 0.830779 | 1.0 |
| HaC_PWFBC | WGS | 1315 | 5.679795 | 0.988450 | 0.790882 | 1.0 |
| FB_PWKB4 | WGS | 2463 | 4.899123 | 0.911719 | 0.627358 | 1.0 |
| R02T1_PSKB1 | WGS | 2616 | 5.674997 | 0.989452 | 0.721147 | 1.0 |
| OFA_PWFBA | WGS | 3116 | 6.352722 | 0.993937 | 0.789717 | 1.0 |
| 3L16_PWKB2 | WGS | 290 | 3.789243 | 0.941364 | 0.668311 | 1.0 |
| PSKB1 | WGS | 617 | 5.043418 | 0.983310 | 0.784984 | 1.0 |
| 1L14_PWKB1 | WGS | 211 | 3.471870 | 0.929943 | 0.648722 | 1.0 |
| L03T3_PWKB5 | WGS | 256 | 3.254449 | 0.872557 | 0.586897 | 1.0 |
| 6E11_PWKB3 | WGS | 2957 | 5.731379 | 0.988741 | 0.717146 | 1.0 |
| P375_PWKB5 | WGS | 92 | 2.143042 | 0.864852 | 0.473937 | 1.0 |
| WiB_PWFBB | WGS | 1908 | 5.909986 | 0.988205 | 0.782385 | 1.0 |
| P375_PSKB1 | WGS | 100 | 2.086642 | 0.847782 | 0.453109 | 1.0 |
| R19T1_PSKB1 | WGS | 2462 | 4.317675 | 0.948260 | 0.552929 | 1.0 |
| R02T2_PSKB1 | WGS | 1812 | 5.343380 | 0.982939 | 0.712243 | 1.0 |
| L10T3_PWKB5 | WGS | 414 | 3.825167 | 0.947312 | 0.634791 | 1.0 |
| 6E13_PWKB3 | WGS | 2492 | 5.402886 | 0.985967 | 0.690832 | 1.0 |
| 1L10_PWKB1 | WGS | 87 | 2.742041 | 0.857450 | 0.613994 | 1.0 |
| PWKB4 | WGS | 2325 | 5.055294 | 0.962450 | 0.652172 | 1.0 |
| 6E02_PWKB3 | WGS | 1599 | 4.944393 | 0.976388 | 0.670232 | 1.0 |
| L19T3_PWKB5 | WGS | 626 | 1.398967 | 0.431926 | 0.217253 | 1.0 |
| HaB_PWFBB | WGS | 2216 | 5.292463 | 0.975576 | 0.687024 | 1.0 |
| 6M13_PWKB4 | WGS | 821 | 4.268226 | 0.957093 | 0.636050 | 1.0 |
| GrA_PWFBA | WGS | 658 | 4.390302 | 0.917501 | 0.676555 | 1.0 |
| 1L07_PWKB1 | WGS | 2070 | 4.020943 | 0.943881 | 0.526625 | 1.0 |
| PWFBC | WGS | 3534 | 3.191219 | 0.727930 | 0.390593 | 1.0 |
| L02T3_PWKB5 | WGS | 349 | 3.368472 | 0.906909 | 0.575308 | 1.0 |
| assay | metadata_column | metadata_type | metric | test | n | n_groups | statistic | pvalue | effect_direction | qvalue |
|---|---|---|---|---|---|---|---|---|---|---|
| WGS | Sample Material | categorical | library_size | kruskal | 51 | 5.0 | 35.361774 | 3.914587e-07 | PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community | 0.000037 |
| WGS | Sample Material | categorical | shannon | kruskal | 51 | 5.0 | 29.976225 | 4.949286e-06 | PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community | 0.000181 |
| WGS | paired_negative_control | categorical | library_size | kruskal | 56 | 9.0 | 38.641657 | 5.727833e-06 | PSKB1|PWKB5|PWKB3|PWFBA|PWFBC | 0.000181 |
| WGS | Sample Material | categorical | evenness | kruskal | 51 | 5.0 | 24.740875 | 5.671957e-05 | PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community | 0.001179 |
| WGS | Date Deployed | date | library_size | spearman | 57 | NaN | -0.503831 | 6.458961e-05 | negative | 0.001179 |
| WGS | paired_negative_control | categorical | shannon | kruskal | 56 | 9.0 | 32.435553 | 7.780909e-05 | PSKB1|PWKB5|PWKB3|PWFBA|PWFBC | 0.001179 |
| WGS | Sample Material | categorical | simpson | kruskal | 51 | 5.0 | 23.817327 | 8.689895e-05 | PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community | 0.001179 |
| WGS | Sample Material | categorical | richness | kruskal | 51 | 5.0 | 23.028639 | 1.249689e-04 | PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community | 0.001484 |
| WGS | Accumulation Time (Days) | categorical | library_size | kruskal | 27 | 4.0 | 20.117252 | 1.605048e-04 | 14.0|252.0|126.0|42.0 | 0.001485 |
| WGS | Airstream | categorical | library_size | kruskal | 54 | 4.0 | 20.009247 | 1.689951e-04 | Greenwall|Office|Hallway|Windowsill | 0.001485 |
| WGS | paired_positive_control | categorical | shannon | mannwhitney | 22 | 2.0 | 106.000000 | 1.876349e-04 | P375_PSKB1|P375_PWKB5 | 0.001485 |
| WGS | paired_positive_control | categorical | simpson | mannwhitney | 22 | 2.0 | 106.000000 | 1.876349e-04 | P375_PSKB1|P375_PWKB5 | 0.001485 |
| WGS | paired_positive_control | categorical | library_size | mannwhitney | 22 | 2.0 | 7.000000 | 2.814523e-04 | P375_PSKB1|P375_PWKB5 | 0.002004 |
| WGS | Days Deploy_to_Collect | categorical | library_size | kruskal | 31 | 5.0 | 21.153156 | 2.952778e-04 | 0.0|216.0|251.0|125.0|41.0 | 0.002004 |
| WGS | Airstream | categorical | evenness | kruskal | 54 | 4.0 | 18.234568 | 3.934700e-04 | Greenwall|Office|Hallway|Windowsill | 0.002492 |
| WGS | paired_negative_control | categorical | simpson | kruskal | 56 | 9.0 | 27.803652 | 5.130594e-04 | PSKB1|PWKB5|PWKB3|PWFBA|PWFBC | 0.003046 |
| WGS | paired_positive_control | categorical | richness | mannwhitney | 22 | 2.0 | 103.000000 | 6.004316e-04 | P375_PSKB1|P375_PWKB5 | 0.003186 |
| WGS | Airstream | categorical | shannon | kruskal | 54 | 4.0 | 17.248350 | 6.282988e-04 | Greenwall|Office|Hallway|Windowsill | 0.003186 |
| WGS | Accumulation Time (Days) | categorical | evenness | kruskal | 27 | 4.0 | 17.218695 | 6.371906e-04 | 14.0|252.0|126.0|42.0 | 0.003186 |
| WGS | paired_negative_control | categorical | evenness | kruskal | 56 | 9.0 | 26.521791 | 8.546927e-04 | PSKB1|PWKB5|PWKB3|PWFBA|PWFBC | 0.004060 |
| WGS | Accumulation Time (Days) | categorical | shannon | kruskal | 27 | 4.0 | 16.408730 | 9.348762e-04 | 14.0|252.0|126.0|42.0 | 0.004229 |
| WGS | Relevant Kit Blank | categorical | simpson | kruskal | 35 | 6.0 | 20.427490 | 1.038667e-03 | PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 | 0.004485 |
| WGS | paired_negative_control | categorical | richness | kruskal | 56 | 9.0 | 25.556190 | 1.250537e-03 | PSKB1|PWKB5|PWKB3|PWFBA|PWFBC | 0.005165 |
| WGS | Relevant Kit Blank | categorical | shannon | kruskal | 35 | 6.0 | 19.732147 | 1.402917e-03 | PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 | 0.005553 |
| WGS | Days Deploy_to_Collect | categorical | evenness | kruskal | 31 | 5.0 | 17.119960 | 1.831914e-03 | 0.0|216.0|251.0|125.0|41.0 | 0.006961 |
| WGS | Relevant Kit Blank | categorical | library_size | kruskal | 35 | 6.0 | 18.713196 | 2.173423e-03 | PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 | 0.007941 |
| WGS | Airstream | categorical | simpson | kruskal | 54 | 4.0 | 14.452391 | 2.349786e-03 | Greenwall|Office|Hallway|Windowsill | 0.008268 |
| WGS | Date Deployed | date | shannon | spearman | 57 | NaN | 0.385438 | 3.067904e-03 | positive | 0.010409 |
| WGS | Relevant Kit Blank | categorical | evenness | kruskal | 35 | 6.0 | 17.495622 | 3.649714e-03 | PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 | 0.011956 |
| WGS | Days Deploy_to_Collect | categorical | shannon | kruskal | 31 | 5.0 | 15.359879 | 4.010157e-03 | 0.0|216.0|251.0|125.0|41.0 | 0.012699 |
| WGS | Date Deployed | date | evenness | spearman | 57 | NaN | 0.367714 | 4.892973e-03 | positive | 0.014995 |
| WGS | Days Collect_to_Extract | numeric | richness | spearman | 53 | NaN | 0.374833 | 5.685888e-03 | positive | 0.016880 |
| WGS | Accumulation Time (Days) | categorical | simpson | kruskal | 27 | 4.0 | 12.372134 | 6.211263e-03 | 14.0|252.0|126.0|42.0 | 0.017790 |
| WGS | paired_positive_control | categorical | evenness | mannwhitney | 22 | 2.0 | 95.000000 | 6.367076e-03 | P375_PSKB1|P375_PWKB5 | 0.017790 |
| WGS | Days Deploy_to_Collect | categorical | simpson | kruskal | 31 | 5.0 | 12.828629 | 1.214422e-02 | 0.0|216.0|251.0|125.0|41.0 | 0.032963 |
| WGS | Relevant Kit Blank | categorical | richness | kruskal | 35 | 6.0 | 14.463440 | 1.291871e-02 | PSKB1|PWKB5|PWKB3|PWKB2|PWKB4 | 0.034091 |
| WGS | Days Collect_to_Extract | numeric | simpson | spearman | 53 | NaN | 0.330067 | 1.579173e-02 | positive | 0.040546 |
| WGS | Relevant Field Blank | categorical | library_size | kruskal | 27 | 3.0 | 8.232254 | 1.630755e-02 | PWFBA|PWFBC|PWFBB | 0.040769 |
| WGS | Relevant Field Blank | categorical | evenness | kruskal | 27 | 3.0 | 8.012949 | 1.819744e-02 | PWFBA|PWFBC|PWFBB | 0.044327 |
| WGS | Sample Type | categorical | simpson | kruskal | 57 | 3.0 | 7.847026 | 1.977152e-02 | Experimental|Negative Control|Positive Control | 0.046957 |
| WGS | Date Deployed | date | simpson | spearman | 57 | NaN | 0.306143 | 2.055645e-02 | positive | 0.047631 |
| WGS | Accumulation Time (Days) | categorical | richness | kruskal | 27 | 4.0 | 9.276455 | 2.583236e-02 | 14.0|252.0|126.0|42.0 | 0.058430 |
| WGS | Days Deploy_to_Collect | categorical | richness | kruskal | 31 | 5.0 | 10.271169 | 3.609939e-02 | 0.0|216.0|251.0|125.0|41.0 | 0.079754 |
| WGS | Days Collect_to_Extract | numeric | shannon | spearman | 53 | NaN | 0.280910 | 4.159829e-02 | positive | 0.089814 |
| WGS | Date Extracted | date | evenness | spearman | 57 | NaN | -0.267315 | 4.440822e-02 | negative | 0.093751 |
| WGS | Date Deployed | date | richness | spearman | 57 | NaN | 0.264627 | 4.667448e-02 | positive | 0.096393 |
| WGS | Concentration/Filtration Step? | categorical | library_size | mannwhitney | 57 | 2.0 | 509.500000 | 5.492372e-02 | No|Yes | 0.104875 |
| WGS | Qubit ng/uL | numeric | library_size | spearman | 41 | NaN | 0.301246 | 5.562174e-02 | positive | 0.104875 |
| WGS | ng/50uL | numeric | library_size | spearman | 41 | NaN | 0.301246 | 5.562174e-02 | positive | 0.104875 |
| WGS | Sample Type | categorical | library_size | kruskal | 57 | 3.0 | 5.765377 | 5.598404e-02 | Experimental|Negative Control|Positive Control | 0.104875 |
No results produced.
No results produced.
| feature | taxonomy | prevalence | mean_relative_abundance | assay |
|---|---|---|---|---|
| Homo sapiens | Homo sapiens | 0.964912 | 0.055331 | WGS |
| Agrobacterium tumefaciens | Agrobacterium tumefaciens | 0.877193 | 0.011979 | WGS |
| Pseudomonas aeruginosa | Pseudomonas aeruginosa | 0.859649 | 0.011945 | WGS |
| Pseudonocardia alni | Pseudonocardia alni | 0.842105 | 0.011229 | WGS |
No results produced.
No results produced.
No results produced.
No results produced.
| assay | metadata_column | alpha_tests | alpha_significant_q_lt_0_1 | composite_signal_score | top_related_metadata |
|---|---|---|---|---|---|
| WGS | Accumulation Time (Days) | 5 | 5 | 10 | Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000 |
| WGS | Date Deployed | 5 | 5 | 10 | Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566 |
| WGS | Days Deploy_to_Collect | 5 | 5 | 10 | Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000 |
| WGS | Relevant Kit Blank | 5 | 5 | 10 | Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000 |
| WGS | Sample Material | 5 | 5 | 10 | Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980 |
| WGS | paired_negative_control | 5 | 5 | 10 | Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000 |
| WGS | paired_positive_control | 5 | 5 | 10 | Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000 |
| WGS | Airstream | 5 | 4 | 9 | Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480 |
| WGS | Days Collect_to_Extract | 5 | 3 | 8 | Date Extracted:0.626; Qubit ng/uL:0.562; ng/50uL:0.562 |
| WGS | Relevant Field Blank | 5 | 2 | 7 | paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725 |
| WGS | Date Extracted | 5 | 1 | 6 | Days Collect_to_Extract:0.626; Date Deployed:0.566; Qubit ng/uL:0.443 |
| WGS | Sample Type | 5 | 1 | 6 | Sample Material:0.739; Days Deploy_to_Collect:0.310; Relevant Kit Blank:0.237 |
| WGS | Concentration/Filtration Step? | 5 | 0 | 5 | Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Relevant Kit Blank:1.000 |
| WGS | Date Collected | 5 | 0 | 5 | Qubit ng/uL:0.303; ng/50uL:0.303; Date Extracted:0.250 |
| WGS | Extraction Kit | 5 | 0 | 5 | paired_negative_control:1.000; paired_positive_control:1.000; Relevant Kit Blank:1.000 |
| WGS | Greenwall Location | 5 | 0 | 5 | NaN |
| WGS | Greenwall Surface Position | 5 | 0 | 5 | Accumulation Time (Days):1.000; Greenwall Radial Distance:1.000; Days Deploy_to_Collect:0.816 |
| WGS | Qubit ng/uL | 5 | 0 | 5 | ng/50uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562 |
| WGS | ng/50uL | 5 | 0 | 5 | Qubit ng/uL:1.000; Date Deployed:0.747; Days Collect_to_Extract:0.562 |
| metadata_column_a | metadata_type_a | metadata_column_b | metadata_type_b | test | association_strength | statistic | pvalue |
|---|---|---|---|---|---|---|---|
| Days Deploy_to_Collect | categorical | Greenwall Radial Distance | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Accumulation Time (Days) | categorical | Concentration/Filtration Step? | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Accumulation Time (Days) | categorical | Days Deploy_to_Collect | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Accumulation Time (Days) | categorical | Greenwall Surface Position | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Accumulation Time (Days) | categorical | Relevant Kit Blank | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Accumulation Time (Days) | categorical | Sample Material | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Concentration/Filtration Step? | categorical | Days Deploy_to_Collect | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Concentration/Filtration Step? | categorical | Relevant Kit Blank | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Concentration/Filtration Step? | categorical | paired_negative_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Concentration/Filtration Step? | categorical | paired_positive_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Days Deploy_to_Collect | categorical | Relevant Kit Blank | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Days Deploy_to_Collect | categorical | Sample Material | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Days Deploy_to_Collect | categorical | paired_negative_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Extraction Kit | categorical | paired_negative_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Extraction Kit | categorical | paired_positive_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Greenwall Radial Distance | categorical | Greenwall Surface Position | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Greenwall Radial Distance | categorical | Relevant Kit Blank | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Greenwall Radial Distance | categorical | paired_negative_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Qubit ng/uL | numeric | ng/50uL | numeric | spearman | 1.000000 | 1.000000 | 0.000000e+00 |
| Relevant Field Blank | categorical | paired_negative_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Relevant Kit Blank | categorical | paired_negative_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Relevant Kit Blank | categorical | paired_positive_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| paired_negative_control | categorical | paired_positive_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Accumulation Time (Days) | categorical | Greenwall Radial Distance | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Accumulation Time (Days) | categorical | paired_negative_control | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Extraction Kit | categorical | Relevant Kit Blank | categorical | cramers_v | 1.000000 | 1.000000 | NaN |
| Concentration/Filtration Step? | categorical | Sample Material | categorical | cramers_v | 0.979812 | 0.979812 | NaN |
| Extraction Kit | categorical | Sample Material | categorical | cramers_v | 0.973848 | 0.973848 | NaN |
| Sample Material | categorical | paired_positive_control | categorical | cramers_v | 0.943456 | 0.943456 | NaN |
| Sample Material | categorical | paired_negative_control | categorical | cramers_v | 0.850016 | 0.850016 | NaN |
| Days Deploy_to_Collect | categorical | Greenwall Surface Position | categorical | cramers_v | 0.816497 | 0.816497 | NaN |
| Greenwall Surface Position | categorical | Relevant Kit Blank | categorical | cramers_v | 0.816497 | 0.816497 | NaN |
| Greenwall Surface Position | categorical | paired_negative_control | categorical | cramers_v | 0.816497 | 0.816497 | NaN |
| Relevant Kit Blank | categorical | Sample Material | categorical | cramers_v | 0.793748 | 0.793748 | NaN |
| Date Deployed | date | Qubit ng/uL | numeric | spearman | 0.747437 | -0.747437 | 1.972756e-08 |
| Date Deployed | date | ng/50uL | numeric | spearman | 0.747437 | -0.747437 | 1.972756e-08 |
| Sample Material | categorical | Sample Type | categorical | cramers_v | 0.738735 | 0.738735 | NaN |
| Concentration/Filtration Step? | categorical | Relevant Field Blank | categorical | cramers_v | 0.725476 | 0.725476 | NaN |
| Relevant Field Blank | categorical | Sample Material | categorical | cramers_v | 0.725476 | 0.725476 | NaN |
| Date Extracted | date | Days Collect_to_Extract | numeric | spearman | 0.626333 | 0.626333 | 5.266479e-07 |
| Airstream | categorical | Concentration/Filtration Step? | categorical | cramers_v | 0.583095 | 0.583095 | NaN |
| Date Deployed | date | Date Extracted | date | spearman | 0.565612 | -0.565612 | 4.547414e-06 |
| Days Collect_to_Extract | numeric | Qubit ng/uL | numeric | spearman | 0.561994 | 0.561994 | 1.968216e-04 |
| Days Collect_to_Extract | numeric | ng/50uL | numeric | spearman | 0.561994 | 0.561994 | 1.968216e-04 |
| Days Deploy_to_Collect | categorical | Relevant Field Blank | categorical | cramers_v | 0.512989 | 0.512989 | NaN |
| Accumulation Time (Days) | categorical | Relevant Field Blank | categorical | cramers_v | 0.512989 | 0.512989 | NaN |
| Airstream | categorical | paired_negative_control | categorical | cramers_v | 0.482654 | 0.482654 | NaN |
| Airstream | categorical | Sample Material | categorical | cramers_v | 0.480384 | 0.480384 | NaN |
| Concentration/Filtration Step? | categorical | Extraction Kit | categorical | cramers_v | 0.469304 | 0.469304 | NaN |
| Airstream | categorical | Days Deploy_to_Collect | categorical | cramers_v | 0.458831 | 0.458831 | NaN |
| category | study_question | primary_groups | secondary_groups | mapped_outputs |
|---|---|---|---|---|
| Methodology | Are there differences in 16S / WGS / ITS outcomes | Sample Material: Leaf, Rhizosphere, Zymo Mock Community | NaN | cross_assay_correlations.tsv; metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; study_overview.tsv |
| Diversity | Are the collection of microorganisms found in the greenwall related to human health in any way? | Sample Material: Leaf, Rhizosphere | NaN | beneficial_taxa_summary.tsv; core_taxa.tsv; differential_abundance.tsv; metadata_column_catalog.tsv; study_overview.tsv |
| Uncategorized | NaN | Sample Material: Leaf, Rhizosphere | NaN | metadata_column_catalog.tsv; study_overview.tsv |
| Uncategorized | Is the microbiome of the "greenwall room" different from the other airstreams? | Sample Material: Filter | Airstream: Hallway, Office, Greenwall, Windowsill | metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv |
| Uncategorized | NaN | Sample Material: Filter | Airstream: Hallway, Office, Greenwall, Windowsill | metadata_column_catalog.tsv; study_overview.tsv |
| Uncategorized | Is there evidence of diversification of the "greenwall room" microbiome by greenwall microorganisms? | Sample Material: Filter\nAirstream: Greenwall | Sample Material: Leaf, Rhizosphere | diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv |
| Uncategorized | NaN | Sample Material: PDMS | Sample Material: Leaf, Rhizosphere | metadata_column_catalog.tsv; study_overview.tsv |
| Metabolism | Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams? | Sample Material: Leaf, Rhizosphere | NaN | metadata_alpha_associations.tsv; metadata_column_catalog.tsv; metadata_differential_abundance_summary.tsv; metadata_permanova.tsv; source_overlap.tsv; study_overview.tsv; wgs_function_top_differences.tsv |
| Uncategorized | NaN | Sample Material: Leaf, Rhizosphere | NaN | metadata_column_catalog.tsv; study_overview.tsv |
| Uncategorized | NaN | NaN | NaN | metadata_column_catalog.tsv; study_overview.tsv |
| Uncategorized | NaN | NaN | NaN | metadata_column_catalog.tsv; study_overview.tsv |
| Uncategorized | NaN | NaN | NaN | metadata_column_catalog.tsv; study_overview.tsv |
| Uncategorized | NaN | NaN | NaN | metadata_column_catalog.tsv; study_overview.tsv |
# GREENWALL Hypothesis-Oriented Analysis Summary This report is organized as a logical analysis sequence modeled on a publication-style microbiome report. ## Executive Summary - Imported hypotheses: 9 - Manifest samples: 57 - Metadata columns scanned: 20 - Metadata-linked signals at q < 0.1: 46 ## Hypothesis Verdict Cards - **Mixed**: Are there differences in 16S / WGS / ITS outcomes Evidence: Top metadata-linked signals were WGS:Accumulation Time (Days); WGS:Date Deployed; WGS:Days Deploy_to_Collect. 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_covariate_links.tsv; metadata_signal_summary.tsv; positive_control_benchmark.tsv - **Mixed**: Are the collection of microorganisms found in the greenwall related to human health in any way? Evidence: 5 beneficial-taxa keyword hits were detected across assays. 2 positive-control benchmark summaries were generated. Outputs: beneficial_taxa_summary.tsv; core_taxa.tsv; hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Mixed**: Is the microbiome of the "greenwall room" different from the other airstreams? Evidence: Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Mixed**: Is there evidence of diversification of the "greenwall room" microbiome by greenwall microorganisms? Evidence: 46 alpha-diversity metadata associations passed q < 0.1. 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; metadata_alpha_associations.tsv; positive_control_benchmark.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Mixed**: Is there evidence of metabolic function within the greenwall microbiome related to chemicals identified in relevant airstreams? Evidence: Maximum source-overlap fraction observed was 0.797. 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv; source_overlap.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv - **Not Supported Yet**: Evidence: 2 positive-control benchmark summaries were generated. Outputs: hypothesis_summary.md; hypothesis_to_output_map.tsv; positive_control_benchmark.tsv ## Logical Steps 1. **Study Overview and QC** - Purpose: Summarize assay coverage, sample inventory, control annotations, and high-level dataset structure. - Outputs: study_overview.tsv; metadata_column_catalog.tsv; hypotheses.imported.tsv 2. **Alpha Diversity** - Purpose: Compute richness, Shannon, Simpson, evenness, and test them against targeted contrasts and every usable metadata column. - Outputs: diversity_metrics.tsv; diversity_tests.tsv; metadata_alpha_associations.tsv 3. **Beta Diversity and Community Structure** - Purpose: Build Bray-Curtis dissimilarities, PCoA ordinations, and PERMANOVA summaries for assay-specific and metadata-driven effects. - Outputs: pcoa_coordinates.tsv; permanova.tsv; metadata_permanova.tsv; metadata_pcoa_associations.tsv 4. **Taxonomic Composition and Beneficial Taxa** - Purpose: Screen for beneficial taxa, summarize source overlap, and prepare interpretable taxonomic summaries. - Outputs: beneficial_taxa_summary.tsv; source_overlap.tsv 5. **Differential Abundance and Function** - Purpose: Test assay- and metadata-driven contrasts for taxonomic and WGS functional differences. - Outputs: differential_abundance.tsv; metadata_differential_abundance_summary.tsv; wgs_function_top_differences.tsv 6. **Core Microbiome and Cross-Assay Concordance** - Purpose: Define core taxa, assess overlap across assays, and quantify assay bias or concordance on shared samples. - Outputs: core_taxa.tsv; cross_assay_sample_metrics.tsv; cross_assay_correlations.tsv 7. **Machine Learning and Network Structure** - Purpose: Rank discriminatory features and summarize co-occurrence structure for hypothesis generation. - Outputs: random_forest_summary.tsv; random_forest_top_features.tsv; metadata_random_forest_summary.tsv; cooccurrence_network_edges.tsv 8. **Hypothesis Answer Layer** - Purpose: Map outputs back to the stated hypotheses and surface the strongest metadata-linked signals. - Outputs: hypothesis_to_output_map.tsv; metadata_signal_summary.tsv; hypothesis_summary.md ## Step 1. Study Overview and QC - Manifest rows: 57 - Manifest columns: 30 - Testable metadata columns: 23 - QC flag rows: 57 - Samples with recommended exclusion: 0 ## Step 2. Alpha Diversity - Targeted contrasts: 8 - Metadata-wide alpha tests: 95 - Top metadata-linked alpha signals: ```tsv assay metadata_column metadata_type metric test n n_groups statistic pvalue effect_direction qvalue WGS Sample Material categorical library_size kruskal 51 5.0 35.361773624185375 3.9145866564586555e-07 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 3.718857323635723e-05 WGS Sample Material categorical shannon kruskal 51 5.0 29.97622549019607 4.949285909900996e-06 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 0.0001813813893402897 WGS paired_negative_control categorical library_size kruskal 56 9.0 38.641656575471494 5.727833347588096e-06 PSKB1|PWKB5|PWKB3|PWFBA|PWFBC 0.0001813813893402897 WGS Sample Material categorical evenness kruskal 51 5.0 24.740874811463073 5.671956757119998e-05 PDMS|Filter|Rhizosphere|Leaf|Zymo Mock Community 0.0011793428847587607 WGS Date Deployed date library_size spearman 57 -0.5038310692604261 6.458960756609466e-05 negative 0.0011793428847587607 ``` ## Step 3. Beta Diversity and Community Structure - Assay-specific PERMANOVA rows: 0 - Metadata PERMANOVA rows: 0 - Ordination metadata tests: 0 ## Step 4. Taxonomic Composition and Beneficial Taxa - Beneficial taxa screen rows: 5 - Source overlap rows: 2 - Positive-control diagnostic rows: 4 ## Step 5. Differential Abundance and Function - Taxonomic differential abundance rows: 0 - Metadata differential summaries: 0 - WGS functional contrasts: 0 ## Step 6. Core Microbiome and Cross-Assay Concordance - Core taxa rows: 4 - Cross-assay correlation rows: 0 ## Step 7. Machine Learning and Networks - Primary Random Forest summaries: 0 - Metadata Random Forest summaries: 0 ## Step 8. Hypothesis Answer Layer - Highest-signal metadata fields across all assays/tests: ```tsv assay metadata_column alpha_tests alpha_significant_q_lt_0_1 composite_signal_score top_related_metadata WGS Accumulation Time (Days) 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Greenwall Surface Position:1.000 WGS Date Deployed 5 5 10 Qubit ng/uL:0.747; ng/50uL:0.747; Date Extracted:0.566 WGS Days Deploy_to_Collect 5 5 10 Greenwall Radial Distance:1.000; Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000 WGS Relevant Kit Blank 5 5 10 Accumulation Time (Days):1.000; Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000 WGS Sample Material 5 5 10 Accumulation Time (Days):1.000; Days Deploy_to_Collect:1.000; Concentration/Filtration Step?:0.980 WGS paired_negative_control 5 5 10 Concentration/Filtration Step?:1.000; Days Deploy_to_Collect:1.000; Extraction Kit:1.000 WGS paired_positive_control 5 5 10 Concentration/Filtration Step?:1.000; Extraction Kit:1.000; Relevant Kit Blank:1.000 WGS Airstream 5 4 9 Concentration/Filtration Step?:0.583; paired_negative_control:0.483; Sample Material:0.480 WGS Days Collect_to_Extract 5 3 8 Date Extracted:0.626; Qubit ng/uL:0.562; ng/50uL:0.562 WGS Relevant Field Blank 5 2 7 paired_negative_control:1.000; Concentration/Filtration Step?:0.725; Sample Material:0.725 ``` - Strongest metadata-metadata covariate links: ```tsv metadata_column_a metadata_type_a metadata_column_b metadata_type_b test association_strength statistic pvalue Days Deploy_to_Collect categorical Greenwall Radial Distance categorical cramers_v 1.0000000000000002 1.0000000000000002 Accumulation Time (Days) categorical Concentration/Filtration Step? categorical cramers_v 1.0 1.0 Accumulation Time (Days) categorical Days Deploy_to_Collect categorical cramers_v 1.0 1.0 Accumulation Time (Days) categorical Greenwall Surface Position categorical cramers_v 1.0 1.0 Accumulation Time (Days) categorical Relevant Kit Blank categorical cramers_v 1.0 1.0 Accumulation Time (Days) categorical Sample Material categorical cramers_v 1.0 1.0 Concentration/Filtration Step? categorical Days Deploy_to_Collect categorical cramers_v 1.0 1.0 Concentration/Filtration Step? categorical Relevant Kit Blank categorical cramers_v 1.0 1.0 Concentration/Filtration Step? categorical paired_negative_control categorical cramers_v 1.0 1.0 Concentration/Filtration Step? categorical paired_positive_control categorical cramers_v 1.0 1.0 ```