Parameterized Quarto reports improve understanding of soil health

Jadey Ryan
Molly McIlquham, Kwabena Sarpong, Leslie Michel,
Teal Potter, Deirdre Griffin LaHue, Dani Gelardi

posit::conf(2023) | September 20

Don’t care about soils?

Parameterized reporting is for everyone!

RStudio RMarkdown parameterized report project for mapping ocean floors.

Ocean floor maps

Report for Alabama spending with text and plots generated from parameterized RMarkdown.

Fiscal briefs

Report for Alaska driver quality with text, plot, and table generated from parameterized RMarkdown.

Bad drivers

Why should you listen to me?

Hired as an environmental technician.

Jadey in waders setting up a transect with a measuring tape to measure flow in a small stream.

Then transitioned to data scientist.

Jadey's cat, Mai, sitting on her laptop that has RStudio open. The monitor next to the laptop and cat has the R Packages book by Hadley Wickham and Jenny Bryan open.

I went from picking leeches off my boots to picking cats off my keyboard!

How did I go from a ‘non-coder’ to a data scientist?

With a skills test!

Task: generate custom Soil Health Reports for each individual farmer

Timeframe: one week

Challenge: how do I automate custom reports???

Cat with an expression of existential dread

YouTube and blog rabbit holes & tutorial hell…

2021: the reports that got me my job

Cover page of 2021 report with project summary

Example 2021 report with box plots

2023: new HTML reports with Quarto!

and MS Word to PDF

…because LaTeX 😖 🤷

First page of 2023 example MS Word report

Page from example MS Word report showing soil texture triangle

Code snippets and 7 pages of resources

We’re end-of-conference tired 🥱 and our brains are full 😵.

Don’t worry about taking notes or copying code.

QR code to slides. — Slides at tinyurl.com/quarto-params

Workflow

Flow chart of workflow: Learn about parameterized reports, prepare subsettable data, create content for easy consumption, design for accessibility.

Parameterized Quarto reports work with both Knitr and Jupyter engines.

Cat typing on laptop.

— Step 1 —
Learn about parameterized reports

What are parameterized reports?

File with the word '.qmd' inside and the word 'Function' above.

An arrow points from 'Input' with 'params$year' to the previous image with 'Function' and '.qmd' file.

In addition to the previous two images, arrows point to five reports with years 2019 through 2023 on them in a flow chart.

YAML Header: Yet Another Markdown Language

---
title: "Soil Health Report"         # Metadata
format:                             # Set format types
  html:                                     
  docx:                            
params:                             # Set default parameter key-value pairs
  farmer: Sammy Sunflower                   
  year: 2023                                
---
    
Report content goes here.           # Write content

Access your parameters

Run any line or chunk to add params to your environment.

params object is a list.

str(params)

List of 2
 $ farmer: chr "Sammy Sunflower"
 $ year  : num 2023

Access with $ notation.

params$farmer

[1] "Sammy Sunflower"

For inline code, enclose the expression in `r `.

These are results for `r params$farmer`.

These are results for Sammy Sunflower.

Render with the RStudio IDE

RStudio Quarto Render button with a dropdown for HTML and MS Word with Render on Save option checked

Render Button, keyboard shortcut Ctrl + Shift + K, or by saving.

Uses default parameters in the YAML.

Output file has same name and location as input file.

Render with `quarto::quarto_render()`

Use in Console or an iterate.R script.

quarto::quarto_render(
  input = "template.qmd",                           # Input Quarto file
  output_format = "html",                           # Output format
  output_file = "2022_OscarOrchard_Report.html",    # Output file name
  execute_params = list(                            # Named list of parameters
    farmer = "Oscar Orchard",
    year = 2022
  )

Render all reports at once

Create a data frame with two columns that match quarto_render() args:

output_file: filename.ext
execute_params: named list of parameters

Code

reports <- data |>
  dplyr::distinct(farmer, year) |>
  dplyr::mutate(
    output_file = paste0(
      year, "_", gsub(" ", "", farmer), "_Report.html"
    ),
    execute_params = purrr::map2(
      farmer, year,
      \(x, y) list(farmer = x, year = y)
    )
  ) |> 
  dplyr::select(output_file, execute_params)

head(reports, 3)

output_file	execute_params
2023_SammySunflower_Report.html	Sammy Sunflower, 2023
2022_OscarOrchard_Report.html	Oscar Orchard, 2022
2022_TinaTomato_Report.html	Tina Tomato, 2022

Render all reports at once

Map over each row of the reports dataframe.

reports |>
  purrr::pwalk(
    quarto::quarto_render,
    input = "template.qmd",
    output_format = "html"  
    )

Surprised cat next to the surprised Pikachu meme.

— Step 2 —
Prepare subsettable data

Our simple data

year	sampleId	farmer	crop	sand_%	silt_%	clay_%
2023	23-SUN-01	Sammy Sunflower	Sunflower	44	23	3
2022	22-ORC-02	Oscar Orchard	Cherry	69	21	10
2022	22-TOM-02	Tina Tomato	Vegetable	11	79	10
2022	22-HAR-03	Henry Harvest	Herb	36	51	13
2023	23-GAR-03	Grace Gardener	Sunflower	64	33	3
2022	22-QUI-02	Quincy Quinoa	Quinoa	24	62	14

fake farmer names: ChatGPT 🙏🏻

Use `params` to set up comparisons

data |> 
  dplyr::filter(farmer == params$farmer) |> 
  more data wrangling

Code

# Get the farmer's crops
farmer_crop <- data |>
  dplyr::filter(farmer == params$farmer) |>
  dplyr::pull(crop) |>
  unique()

data_long <- data |>
  # Tidy data: one observation per row
  tidyr::pivot_longer(cols = c("sand_%":"clay_%"),
                    names_to = "measurement") |> 
  dplyr::mutate(
    # Categorize samples based on if they are from the farmer of
    # interest or are of the same crop type
    category = dplyr::case_when(
      farmer == params$farmer ~ "Your fields",
      crop %in% farmer_crop ~ "Same crop",
      TRUE ~ "Other fields"
    )
  )

# Set category factor levels
data_long$category <- factor(
  data_long$category,
  levels = c("Other fields", "Same crop", "Your fields"),
  labels = c("Other fields", "Same crop", "Your fields")
)

data_long |> 
  dplyr::select(-c(year, sampleId)) |> 
  dplyr::slice_head(by = category) |> 
  kableExtra::kable() |> 
  kableExtra::column_spec(5, background = "#FDCE86")

farmer	crop	measurement	value	category
Sammy Sunflower	Sunflower	sand_%	44	Your fields
Oscar Orchard	Cherry	sand_%	69	Other fields
Grace Gardener	Sunflower	sand_%	64	Same crop

— Step 3 —
Create content for easy consumption

Interpretation through comparison

Show how their results compare to other samples in similar contexts.

Code

library(ggplot2)
set.seed(123)
data_long |>
  ggplot(aes(
    x = measurement,
    y = value,
    color = category,
    shape = category,
    size = category,
    alpha = category
  )) +
  geom_jitter(width = 0.2) +
  # Define styles for producer's samples versus all samples
  scale_alpha_manual(values = c(
    "Other fields" = 0.5,
    "Same crop" = 1,
    "Your fields" = 1
  )) +
  scale_color_manual(values = c(
    "Other fields" = "#CCC29C",
    "Same crop" = "#76ADBC",
    "Your fields" = "#A60F2D"
  )) +
  scale_size_manual(values = c(
    "Other fields" = 2,
    "Same crop" = 4,
    "Your fields" = 4
  )) +
  washi::washi_theme(
    legend_position = "bottom",
    gridline_x = FALSE,
    gridline_y = FALSE,
    text_scale = 1.5
  ) +
  theme(legend.title = element_blank(),
        axis.title.x = element_blank(),
        axis.title.y = element_blank(),
        panel.border = element_rect(color = "gray", fill = NA))

Make your reports self-contained

Provide your reader the context to understand the results.

Instead of ~300 lines of markdown:

## What We Measured in Your Soil

{{< include soil_health_measurements.qmd >}}

Includes shortcodes make it easy to reuse information across reports.

Make your HTML reports self-contained

# Include in YAML
---
format: 
  html:
    embed-resources: true
---

Otherwise, no one knows what to do with a _files folder of dependencies.

Be mindful of file size.

Cat on sinking Titanic meme.

— Step 4 —
Design for accessibility

Conditional markdown

:::: {.content-visible when-format="html"}
::: panel-tabset

Background on soil measurements.

:::
::::

Screenshot of soil measurement tabset in HTML report.

Conditional markdown

:::: {.content-visible unless-format="html"}

Background on soil measurements.

::::

Screenshot of soil measurements without tabset in MS Word report.

Conditional code evaluation

Evaluate plotly code only for HTML reports and regular ggplot2 code for anything else.

Get the output format type in your setup chunk:

```{r setup}
# Include in first chunk of .qmd
# Get output file type
out_type <- knitr::opts_knit$get("rmarkdown.pandoc.to")
```

Conditional code evaluation

Use out_type in eval chunk option.

Interactive plot for HTML reports:

```{r}
#| eval: !expr out_type == "html"

# code to create interactive {plotly}
```

Static plot for MS Word reports:

```{r}
#| eval: !expr out_type == "docx"

# code to create static {ggplot2}
```

Style with font size and color contrast in mind

Colour Contrast

Viz Palette

Use style sheets and templates

# Include in YAML header:
---
format:                                     
  html:                                     
    css: styles.css               # Style for HTML output
  word_document:                            
    reference_docx: styles.docx   # Style for MS Word output
---

Check HTML with browser developer tools

To get to developer tools: Right-click > Inspect > Ctrl + Shift + C > Hover over element. You can see contrast checker when hovering over any HTML element.

Check HTML with browser developer tools

Clicking on the color attribute in the inspector pane opens a dialog that allows you to see the contrast ratio and adjust the color to meet accessibility guidelines.

Everyone gets a custom decision-support tool!

Check out an example HTML report.

Recap

Flow chart of workflow: Learn about parameterized reports, prepare subsettable data, create content for easy consumption, design for accessibility.

Cat falling asleep in human's lap with a phone with video playing on the sleepy cat's belly.

— Resources —

Quarto

Read about parameterized reports

Examples of parameterized reports

RMarkdown
Quarto
- ✨ soil health reports by producer & year (Jadey Ryan) ✨ {soils} is a WIP!
- elevator reports by region and state (Mike Mahoney)
- penguins dataset by species and year(Meghan Hall)

Videos about parameterized reports

Style sheets

HTML
- Quarto Documentation: CSS Styles
MS Word
- Quarto Documentation: Docx Style Template
- Create A MS Word Template for R Markdown (Video by Yihui Xie)

Accessibility tools

Web Content Accessibility Guidelines (WCAG) Quick Reference

Pick your colors
Check your colors
- Color Blindness Simulator
- colorblindr package

Check HTML reports with browser `Inspect` tool

I’d love to connect with you!

My three fave things: cats, nature, and R/Quarto.

Slides

@jadeynryan

linkedin.com/in/jadey-ryan

jadeynryan

WA Dept. of Agriculture Webpage

Parameterized Quarto reports improve understanding of soil health

Don’t care about soils?

Why should you listen to me?

How did I go from a ‘non-coder’ to a data scientist?

2021: the reports that got me my job

2023: new HTML reports with Quarto!

and MS Word to PDF

Code snippets and 7 pages of resources

Workflow

— Step 1 — Learn about parameterized reports

What are parameterized reports?

YAML Header: Yet Another Markdown Language

Access your parameters

Render with the RStudio IDE

Render with quarto::quarto_render()

Render all reports at once

Render all reports at once

— Step 2 — Prepare subsettable data

Our simple data

Use params to set up comparisons

— Step 3 — Create content for easy consumption

Interpretation through comparison

Make your reports self-contained

Make your HTML reports self-contained

— Step 4 — Design for accessibility

Conditional markdown

Conditional markdown

Conditional code evaluation

Conditional code evaluation

Style with font size and color contrast in mind

Use style sheets and templates

Check HTML with browser developer tools

Check HTML with browser developer tools

Everyone gets a custom decision-support tool!

Recap

— Resources —

Quarto

Read about parameterized reports

Examples of parameterized reports

Videos about parameterized reports

Style sheets

Accessibility tools

Check HTML reports with browser Inspect tool

I’d love to connect with you!

— Step 1 —
Learn about parameterized reports

Render with `quarto::quarto_render()`

— Step 2 —
Prepare subsettable data

Use `params` to set up comparisons

— Step 3 —
Create content for easy consumption

— Step 4 —
Design for accessibility

Check HTML reports with browser `Inspect` tool