positconf (#794)
* positconf

* Accept submission

* Separate source links

* Separate source links in year readme.

---------

Co-authored-by: jonthegeek <jonthegeek@users.noreply.github.com>
jonthegeek and jonthegeek authored Jan 9, 2025
1 parent f8203c2 commit 006bae2
Showing 9 changed files with 606 additions and 0 deletions.
1 change: 1 addition & 0 deletions README.md
@@ -32,6 +32,7 @@ Our over-arching goal for TidyTuesday is to provide real-world datasets so that
| Week | Date | Data | Source | Article
| :---: | :---: | :--- | :--- | :---|
| 1 | `2025-01-07` | Bring your own data to start the year! | | |
| 2 | `2025-01-14` | [posit::conf talks](data/2025/2025-01-14/readme.md) | [posit::conf attendee portal 2023](https://reg.conf.posit.co/flow/posit/positconf23/attendee-portal/page/sessioncatalog), [posit::conf attendee portal 2024](https://reg.conf.posit.co/flow/posit/positconf24/attendee-portal/page/sessioncatalog) | [posit::conf(2025) in-person registration is now open!](https://posit.co/blog/positconf2025-in-person-registration-is-now-open/) |

***

317 changes: 317 additions & 0 deletions data/2025/2025-01-14/conf2023.csv

Large diffs are not rendered by default.

165 changes: 165 additions & 0 deletions data/2025/2025-01-14/conf2024.csv

Large diffs are not rendered by default.

Binary file added data/2025/2025-01-14/conf2025.png
28 changes: 28 additions & 0 deletions data/2025/2025-01-14/meta.yaml
@@ -0,0 +1,28 @@
title: "posit::conf talks"
article:
  title: "posit::conf(2025) in-person registration is now open!"
  url: "https://posit.co/blog/positconf2025-in-person-registration-is-now-open/"
data_source:
  title: "posit::conf attendee portals"
  url: "https://reg.conf.posit.co/flow/posit/positconf23/attendee-portal/page/sessioncatalog, https://reg.conf.posit.co/flow/posit/positconf24/attendee-portal/page/sessioncatalog"
images:
  # Please include at least one image, and up to three images
  - file: "conf2025.png"
    alt: >
      The logo of posit conf 2025. A series of parallelograms in light blue,
      green, dark blue, and yellow, forming step-like cubes. The words posit conf
      2025 are set off by the wall of cubes, in dark blue text on a light blue
      background.
  - file: "posit2024_crowd.png"
    alt: >
      A crowd of seated attendees watch Hadley Wickham on stage at the opening of
      posit conf 2024. The crowd is very large and stretches deep into the
      background of the photo.
credit:
  # We want to thank you for curating this dataset! If you do not want a
  # particular type of credit, please delete the related line.
  post: "Jon Harmon"
  bluesky: "https://bsky.app/profile/jonthegeek.com"
  linkedin: "https://www.linkedin.com/in/jonthegeek"
  mastodon: "@jonthegeek@fosstodon.org"
  github: "https://github.com/jonthegeek"
Binary file added data/2025/2025-01-14/posit2024_crowd.png
92 changes: 92 additions & 0 deletions data/2025/2025-01-14/readme.md
@@ -0,0 +1,92 @@
# posit::conf talks

This week we're exploring posit::conf talks!
On or near the day when this data is posted, the call for speakers for posit::conf(2025) will open.
Keep an eye on the [posit::conf website](https://posit.co/conference/) if you would like to submit a talk proposal.
To help inspire you, we've collected data about posit::conf talks from 2023 and 2024.
Thank you to [Rachael Dempsey](https://www.linkedin.com/in/rachaeldempsey/) for the Google sheets!

> posit::conf is our annual conference that focuses on the R and Python programming languages and their applications in data science. The conference features a variety of workshops, talks, and networking opportunities for attendees, with a particular emphasis on fostering a sense of community among data science professionals. In addition to providing opportunities for learning and professional development, posit::conf also aims to create a fun and engaging atmosphere that encourages attendees to connect with one another and explore the latest trends and technologies in the field.
- Which speakers gave talks in both 2023 and 2024?
- Are there keywords that appear in track titles in both 2023 and 2024?
- What is the average sentiment of the descriptions in each track?
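
As a starting point for the first question, speakers who appear in both years can be found by intersecting the `speaker_name` columns. The following is a minimal base R sketch that uses tiny invented data frames in place of the real files (every name and title is a placeholder, not a row from the dataset). It assumes a speaker's name is spelled the same way in both years apart from case and surrounding whitespace, which may not hold in the real data.

```r
# Toy stand-ins for the real data; all names and titles are invented.
conf2023 <- data.frame(
  speaker_name = c("Ada Example", "Ben Sample", "Cy Placeholder"),
  session_title = c("Talk A", "Talk B", "Talk B")
)
conf2024 <- data.frame(
  speaker_name = c("Ben Sample ", "Dee Mock", "ada example"),
  talk_title = c("Talk C", "Talk D", "Talk E")
)

# Light normalization so stray whitespace or case does not hide a match.
# normalize_name() is a helper defined here, not part of any package.
normalize_name <- function(x) tolower(trimws(x))

# Names present in both years, compared after normalization.
both_years <- intersect(
  normalize_name(conf2023$speaker_name),
  normalize_name(conf2024$speaker_name)
)
both_years
```

Matching on normalized names is a deliberately simple choice; nicknames or changed surnames would need a manual lookup table.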

Be careful to de-duplicate talk data when necessary!
Talks with multiple speakers might appear more than once.
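
One possible way to de-duplicate, sketched with invented rows: drop the speaker-level columns and keep the distinct talk-level rows before counting talks. The column names follow the `conf2023.csv` data dictionary; the values are placeholders.

```r
# Toy data: "Talk B" has two speakers, so it appears on two rows.
conf2023 <- data.frame(
  speaker_name = c("Ada Example", "Ben Sample", "Cy Placeholder"),
  session_title = c("Talk A", "Talk B", "Talk B"),
  block_track_title = c("Track X", "Track Y", "Track Y")
)

# Keep one row per talk by removing speaker-level columns first.
talk_cols <- setdiff(names(conf2023), c("speaker_name", "speaker_affiliation"))
talks2023 <- unique(conf2023[, talk_cols, drop = FALSE])

nrow(conf2023)   # one row per speaker-talk pair
nrow(talks2023)  # one row per talk

# Talks per track, counted on the de-duplicated data.
talks_per_track <- table(talks2023$block_track_title)
talks_per_track
```

Counting on the original rows would credit "Track Y" with two talks instead of one, which is the pitfall the note above warns about.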

Thank you to [Jon Harmon](https://github.com/jonthegeek) for curating this week's dataset.

## The Data

```r
# Option 1: tidytuesdayR package
## install.packages("tidytuesdayR")

tuesdata <- tidytuesdayR::tt_load('2025-01-14')
## OR
tuesdata <- tidytuesdayR::tt_load(2025, week = 2)

conf2023 <- tuesdata$conf2023
conf2024 <- tuesdata$conf2024

# Option 2: Read directly from GitHub

conf2023 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2025/2025-01-14/conf2023.csv')
conf2024 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2025/2025-01-14/conf2024.csv')
```

## How to Participate

- [Explore the data](https://r4ds.hadley.nz/), watching out for interesting relationships. We would like to emphasize that you should not draw conclusions about **causation** in the data. There are various moderating variables that affect all data, many of which might not have been captured in these datasets. As such, our suggestion is to use the data provided to practice your data tidying and plotting techniques, and to consider for yourself what nuances might underlie these relationships.
- Create a visualization, a model, a [shiny app](https://shiny.posit.co/), or some other piece of data-science-related output, using R or another programming language.
- [Share your output and the code used to generate it](../../../sharing.md) on social media with the #TidyTuesday hashtag.
- [Submit your own dataset!](../../../.github/pr_instructions.md)

### Data Dictionary

# `conf2023.csv`

|variable |class |description |
|:-------------------|:-------------------------|:-------------------------------------|
|speaker_name |character |The name of the speaker. The data is indexed by this field, so other fields may contain duplicate data. |
|speaker_affiliation |character |The company or organization with which the speaker is affiliated. |
|session_type |character |Whether the session is a "keynote", a "regular" talk, or a "lightning" talk. |
|session_title |character |The title of the talk. |
|block_track_title |character |The title of the block in which the talk was presented. A block is a set of talks on a related area. |
|session_date |date |The date on which this talk was given (either 2023-09-19 or 2023-09-20). |
|session_start |datetime |The start time of the talk in the America/Chicago (US CDT) timezone. |
|session_length |integer |The duration of the talk in minutes. |
|session_abstract |character |A brief description of the talk. |

# `conf2024.csv`

|variable |class |description |
|:------------|:---------|:-------------------------------------|
|talk_title |character |The title of the talk. |
|speaker_name |character |The name of the speaker. The data is indexed by this field, so other fields may contain duplicate data. |
|track |character |The title of the block in which the talk was presented. A block is a set of talks on a related area. |
|description |character |A brief description of the talk. |
|yt_url |character |The URL of the YouTube video of the talk. |
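
Because the two files name equivalent columns differently, combining them takes a renaming step. The sketch below assumes that `session_title`/`talk_title`, `block_track_title`/`track`, and `session_abstract`/`description` are the matching pairs, which is inferred from the descriptions in the two tables rather than stated by the source. The rows are invented placeholders.

```r
# Toy rows shaped like the two files; all values are invented.
conf2023 <- data.frame(
  speaker_name = "Ada Example",
  session_title = "Talk A",
  block_track_title = "Track X",
  session_abstract = "About A."
)
conf2024 <- data.frame(
  talk_title = "Talk C",
  speaker_name = "Ben Sample",
  track = "Track Y",
  description = "About C.",
  yt_url = NA_character_
)

# Assumed mapping from 2023 column names to their 2024 equivalents.
rename_map <- c(
  session_title = "talk_title",
  block_track_title = "track",
  session_abstract = "description"
)
shared <- c("speaker_name", "talk_title", "track", "description")

# Rename the 2023 columns in place.
conf2023_h <- conf2023
hits <- match(names(rename_map), names(conf2023_h))
names(conf2023_h)[hits] <- unname(rename_map)

# Stack the shared columns, tagging each row with its year.
talks <- rbind(
  cbind(year = 2023L, conf2023_h[, shared]),
  cbind(year = 2024L, conf2024[, shared])
)
talks
```

Only the columns common to both years are kept; year-specific fields such as `session_length` or `yt_url` would need to be joined back separately.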

### Cleaning Script

```r
library(tidyverse)
library(googlesheets4)

# Mostly clean data provided by Posit.
conf2023_raw <- googlesheets4::read_sheet("<REDACTED>")
conf2023 <- conf2023_raw |>
  dplyr::mutate(
    session_date = lubridate::ymd(session_date),
    session_start = lubridate::ymd_hm(
      paste(session_date, session_start),
      tz = "America/Chicago"
    ),
    session_length = as.integer(session_length)
  )

conf2024 <- googlesheets4::read_sheet("<REDACTED>") |>
  janitor::clean_names()
```
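
To illustrate what the `session_start` step in the script does: a date and a clock time are pasted together and parsed as local time in Chicago. Below is a rough base R equivalent, assuming the sheet stores the start time as `HH:MM` text (the source does not show the raw format). The date comes from the data dictionary; the time is a made-up value.

```r
session_date <- "2023-09-19"
session_start <- "09:00"  # hypothetical value, not taken from the dataset

# Parse the combined string as a Chicago local time.
start_local <- as.POSIXct(
  paste(session_date, session_start),
  format = "%Y-%m-%d %H:%M",
  tz = "America/Chicago"
)

# The same instant expressed in UTC; Chicago is on daylight time in September.
start_utc <- format(start_local, "%Y-%m-%d %H:%M", tz = "UTC")
start_utc
```

Setting the time zone at parse time matters: without it, the clock time would be interpreted in the session's local zone and the stored instant would shift.
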
1 change: 1 addition & 0 deletions data/2025/readme.md
@@ -5,3 +5,4 @@ Archive of datasets and articles from the 2025 series of `#TidyTuesday` events.
| Week | Date | Data | Source | Article
| :---: | :---: | :--- | :--- | :---|
| 1 | `2025-01-07` | Bring your own data from 2024! | | |
| 2 | `2025-01-14` | [posit::conf talks](2025-01-14/readme.md) | [posit::conf attendee portal 2023](https://reg.conf.posit.co/flow/posit/positconf23/attendee-portal/page/sessioncatalog), [posit::conf attendee portal 2024](https://reg.conf.posit.co/flow/posit/positconf24/attendee-portal/page/sessioncatalog) | [posit::conf(2025) in-person registration is now open!](https://posit.co/blog/positconf2025-in-person-registration-is-now-open/) |
2 changes: 2 additions & 0 deletions static/tt_data_type.csv
@@ -1,4 +1,6 @@
Week,Date,year,data_files,data_type,delim
2,2025-01-14,2025,conf2023.csv,csv,","
2,2025-01-14,2025,conf2024.csv,csv,","
53,2024-12-31,2024,book.csv,csv,","
53,2024-12-31,2024,broadcast_media.csv,csv,","
53,2024-12-31,2024,journalism.csv,csv,","