---
title: "Human Learning meets Machine Learning"
subtitle: "1,200+ hours of piano practice"
author: "by Peter Hontaru"
output:
  html_document:
    toc: true
    toc_float: false
    toc_depth: 3
    number_sections: true
    code_folding: hide
    code_download: true
    fig_width: 9
    fig_height: 5
    fig_align: center
    highlight: pygments
    theme: cerulean
    keep_md: true
---

Introduction

Problem statement {-}

the why {-}

Learning a piano piece is a time-intensive process. As with most other things, we tend to overestimate our own ability and then become frustrated that we cannot learn and play that Chopin piece like a concert pianist after only 30 minutes of practice. Fortunately, unlike what you might hear on Wall Street, previous performance is indicative of future success.

There's also a secondary goal: to provide a source of inspiration for other people who have always thought to themselves "one day I'll learn a musical instrument" (any other skill qualifies, too). I aim to do this by, at the very least, offering visibility into my own journey. If this is something you want, why not give it a try?

the what {-}

Can we predict how long it would take to learn a piano piece based on a number of factors? If so, which factors most influenced the total number of hours required to learn a piece?

context {-}

I started playing the piano in 2018 as a complete beginner and have been tracking my practice time for around two and a half years. I've now decided to put that data to good use, see what interesting patterns I can find, and hopefully develop a tool that others can use in their own journeys.

Here's an example of a recent performance - I mainly play classical music but cannot help but love Elton John's music.

<iframe width="560" height="315" src="https://www.youtube.com/embed/3fhhBZyFCzM" frameborder="0" data-external="1" allowfullscreen> </iframe>

Data collection {-}

  • imputed conservative estimates for the first 10 months of the first year (Jan '18 to Oct '18) and used an Excel spreadsheet for Nov '18
  • everything from Dec '18 onwards was tracked with Toggl, a time-tracking app/tool
  • time spent in piano lessons was not tracked/included (usually 2-3 hours total per month)
  • the Extract, Transform, Load script is available in the global.R file of this repo (a simplified sketch of the loading step is shown below)
  • for security reasons, I am not able to share the API script, as the token also allows changing/removing profile data; the raw data, however, is stored in the raw data folder of this repo (not having the API call simply means the data won't be up to date for the current year)
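
For illustration only, here's a minimal sketch of what the loading step of that ETL could look like; the folder, file names and column names below are assumptions rather than the actual global.R code:

```r
library(tidyverse)
library(lubridate)

# read every Toggl export stored in the raw data folder (file/column names are assumed)
raw_practice <- list.files("raw data", pattern = "\\.csv$", full.names = TRUE) %>%
  map_dfr(read_csv) %>%
  # keep only the fields needed downstream and parse dates/durations
  transmute(
    project  = Project,
    piece    = Description,
    date     = as_date(`Start date`),
    duration = as.numeric(hms(Duration)) / 60   # session length in minutes
  )
```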

Disclaimer: I am not affiliated with Toggl. I started using it a few years ago because it provided all the functionality I needed and loved its minimalistic design. The standard membership, which I use, is free of charge.

Key insights

Summary:

  • identified various trends in my practice habits
  • pieces could take me anywhere from ~4 hours to 40+ hours of practice, subject to difficulty (as assessed by the ABRSM grade)
  • the Random Forest model was shown to be the best-performing model (bootstrap resampling, 25x)
    • Rsquared - 0.57
    • MAE - 6.0 hours
    • RMSE - 7.6 hours
  • looking at the variability of errors, there is a tendency to over-predict for pieces that took very little time to learn and under-predict for the more difficult ones. There could be two main reasons for this:
    • artificially inflating the number of hours spent on a piece by returning to it a second time (due to a recital performance, wanting to improve the interpretation further or simply just liking it enough to play it again)
    • learning easier pieces later on in my journey which means I will learn them faster than expected (based on my earlier data where a piece of a similar difficulty took longer)
  • the most important variables were shown to be the length of the piece, the standard of playing (performance vs casual) and experience (lifetime total practice before the first practice session on each piece)

Exploratory Data Analysis (EDA)

Piano practice timeline

How long did I practice per piece?

Depending on my level at the time and the difficulty of the piece, each piece took around 10-30 hours of practice.

How consistent was my practice?

Generally, I've done pretty well to maintain a high level of consistency, with the exception of August and December. These are usually the months when I tend to be away from home on annual leave and therefore don't have access to a piano.

Was there a trend in my amount of daily average practice? {.tabset .tabset-fade .tabset-pills}

We can see that my practice time was correlated with consistency: the average session was much shorter in the months I was away from the piano. There's also a trend where my practice close to an exam session was significantly higher than at any other time of the year. Can you spot in which month I had my exam in 2019? What about the end of 2020?

Note: the average practice length per month includes the days on which I did not practice.

overall {-}

Year on Year {-}

Similar trends to before are apparent: my average daily session is longer before exams than at any other time in the year, and there is a dip in the months where I usually take most of my annual leave. I really do need to start picking up the pace and get back to where I used to be.

Did COVID significantly impact my practice time? {.tabset .tabset-fade .tabset-pills}

graph {-}

Despite a similar median, we can see that practice sessions were less likely to be over 80 minutes after COVID. We can test whether this impact was significant with a t-test.

skewness assumption {-}

Given the extremely low p-values, the Shapiro-Wilk normality test implies that the distribution of the data is significantly different from a normal distribution and that we cannot assume normality. However, we're working with the entire population of sessions for each group, so unlike the independence of observations, this assumption is not crucial.

Shapiro-Wilk normality test

| group | statistic | p.value | method |
|---|---|---|---|
| After COVID | 0.9589908 | 1e-07 | Shapiro-Wilk normality test |
| Before COVID | 0.9549818 | 0e+00 | Shapiro-Wilk normality test |
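
For reference, a minimal sketch of how this per-group check could be reproduced; `practice_sessions`, `group` and `Duration` are assumed names rather than the actual objects used in the analysis:

```r
library(dplyr)
library(broom)

# Shapiro-Wilk test of session duration, run separately for each period
practice_sessions %>%
  group_by(group) %>%                              # "Before COVID" / "After COVID"
  group_modify(~ tidy(shapiro.test(.x$Duration)))
```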

equal variances assumption {-}

With a large p-value, we fail to reject the null hypothesis (H0) and conclude that we do not have evidence that the population variances are unequal. We can assume that the equal-variances assumption is met for our t-test.

Levene's test

| statistic | p.value | df | df.residual |
|---|---|---|---|
| 0.0293711 | 0.8639715 | 1 | 746 |
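
A minimal sketch of the corresponding Levene's test call (same assumed object names as above):

```r
library(car)

# Levene's test for homogeneity of variance between the two periods
leveneTest(Duration ~ group, data = practice_sessions)
```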

t-test {-}

My practice sessions post-COVID are significantly shorter than those before the pandemic. This might be surprising, given that we were in lockdown most of the time. However, I've been spending my time on a few other things, such as improving my technical skillset with R (this analysis wouldn't have been possible otherwise) and learning Italian.

| .y. | group1 | group2 | n1 | n2 | statistic | df | p | p.signif |
|---|---|---|---|---|---|---|---|---|
| Duration | Before COVID | After COVID | 433 | 315 | 3.892883 | 746 | 0.000108 | *** |
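
A minimal sketch of how this pooled two-sample t-test could be run (assumed object names as above):

```r
library(dplyr)
library(rstatix)

# two-sample t-test with equal variances, as supported by Levene's test above
practice_sessions %>%
  t_test(Duration ~ group, var.equal = TRUE)
```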

What type of music do I tend to play? {.tabset .tabset-fade .tabset-pills}

by genre {-}

by composer {-}

by piece {-}

Relation between difficulty and number of practice hours {.tabset .tabset-fade .tabset-pills}

ABRSM grade {-}

Simplified, ABRSM grades are a set of 8 exams ordered by difficulty (1 - beginner to 8 - advanced). There are also diploma grades, but those are extremely advanced, equivalent to university-level studies, and out of the scope of this analysis.

More information can be found on their official website at https://gb.abrsm.org/en/exam-support/your-guide-to-abrsm-exams/

level {-}

A further aggregation of ABRSM grades; this is helpful given the very limited data within each grade and is much easier on the eye. It is an oversimplification, but the grades are classified as follows (see the sketch after this list):

  • 1-4: Beginner
  • 5-6: Intermediate
  • 7-8: Advanced
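
A minimal sketch of how this grouping could be derived, assuming a numeric ABRSM column in a `repertoire` data frame:

```r
library(dplyr)

# collapse the eight ABRSM grades into the three broader levels listed above
repertoire <- repertoire %>%
  mutate(Level = case_when(
    ABRSM <= 4 ~ "Beginner",
    ABRSM <= 6 ~ "Intermediate",
    TRUE       ~ "Advanced"
  ))
```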

What about the piece length?

Learning effect - do pieces of the same difficulty become easier to learn with time?

We can spot a trend where the time required to learn a piece of similar difficulty (ABRSM grade) decreases as my ability to play the piano increases (as judged by cumulative hours of practice). We should keep this in mind and include it as a variable in our prediction model.

Does "pausing" a piece impact the total time required to learn it?

How do we differentiate between pieces that we learn once and those that we come back to repeatedly? Examples could include wanting to improve the playing further, loving it so much we wanted to relearn it, preparing it for a new recital performance, etc.

As anyone who has ever played the piano knows, re-learning a piece, particularly after you "drop" it for a few months or years, results in a much better performance and understanding of the piece. I definitely found that to be true in my experience, particularly with my exam pieces. The downside is that these pieces take longer to learn.

Repertoire

Repertoire (response variable: Duration; the remaining columns contain the predictor variables)

| Project | Duration (h) | Genre | Length (min) | Standard | Experience (h) | Break | ABRSM | Level | Started |
|---|---|---|---|---|---|---|---|---|---|
| Bach - Marche 127 | 8 | Baroque | 2.2 | Casual | 1200 | No | 4 | Beginner | 2021-02-23 |
| Elton John - Rocket man | 48 | Modern | 4.0 | Performance | 1130 | Yes | 7 | Advanced | 2020-12-08 |
| Schumann - Träumerei | 14 | Romantic | 3.0 | Casual | 1087 | No | 7 | Advanced | 2020-11-09 |
| Mozart - Allegro (3rd movement) K282 | 28 | Classical | 3.3 | Casual | 1081 | Yes | 6 | Intermediate | 2020-11-05 |
| Ibert - Sérénade sur l’eau | 10 | Modern | 1.8 | Performance | 1038 | No | 6 | Intermediate | 2020-09-24 |
| Kuhlau - Rondo Vivace | 24 | Classical | 2.2 | Casual | 1014 | No | 6 | Intermediate | 2020-08-03 |
| C. Hartmann - The little ballerina | 21 | Romantic | 2.0 | Performance | 998 | No | 6 | Intermediate | 2020-07-14 |
| Schumann - Lalling Melody | 5 | Romantic | 1.5 | Casual | 981 | No | 1 | Beginner | 2020-06-28 |
| Schumann - Melody | 4 | Romantic | 1.1 | Casual | 972 | No | 1 | Beginner | 2020-06-20 |
| Clementi - Sonatina no 3 - Mov 2 | 3 | Classical | 1.0 | Performance | 952 | No | 1 | Beginner | 2020-06-04 |
| Clementi - Sonatina no 3 - Mov 3 | 20 | Classical | 2.1 | Performance | 952 | No | 4 | Beginner | 2020-06-04 |
| Chopin - Waltz in Fm | 27 | Romantic | 2.0 | Performance | 895 | Yes | 6 | Intermediate | 2020-04-18 |
| Clementi - Sonatina no 3 - Mov 1 | 30 | Classical | 2.5 | Performance | 877 | No | 4 | Beginner | 2020-04-07 |
| Schumann - Kinderszenen 1 | 10 | Romantic | 2.0 | Casual | 855 | No | 5 | Intermediate | 2020-03-25 |
| Bach - Prelude in G from Cello Suite No 1 | 25 | Baroque | 2.5 | Performance | 788 | No | 5 | Intermediate | 2020-02-04 |
| Georg Böhm - Minuet in G | 7 | Baroque | 0.8 | Casual | 780 | Yes | 2 | Beginner | 2020-01-27 |
| Bach - Invention 4 in Dm | 21 | Baroque | 1.5 | Performance | 777 | No | 5 | Intermediate | 2020-01-25 |
| Chopin - Contredanse in Gb | 23 | Romantic | 2.2 | Performance | 762 | No | 6 | Intermediate | 2020-01-16 |
| Bach - Minuet in Gm - 115 | 7 | Baroque | 1.3 | Casual | 750 | No | 2 | Beginner | 2020-01-07 |
| Bach - Minuet in G - 114 | 4 | Baroque | 1.5 | Casual | 726 | No | 1 | Beginner | 2019-12-06 |
| Elton John - Your song (Arr Cornick) | 36 | Modern | 3.2 | Performance | 713 | No | 5 | Intermediate | 2019-11-21 |
| Poulenc - Valse Tyrolienne | 17 | Modern | 1.5 | Performance | 562 | No | 5 | Intermediate | 2019-09-02 |
| Bach - Prelude in Cm - 934 | 25 | Baroque | 2.6 | Performance | 536 | No | 5 | Intermediate | 2019-08-15 |
| Schumann - Volksliedchen | 10 | Romantic | 2.0 | Casual | 501 | No | 3 | Beginner | 2019-07-01 |
| Haydn - Andante in A | 39 | Classical | 2.8 | Casual | 468 | Yes | 5 | Intermediate | 2019-06-08 |
| Schumann - Remembrance | 34 | Romantic | 2.2 | Performance | 422 | Yes | 5 | Intermediate | 2019-04-28 |
| Bach - Minuet in G - 116 | 8 | Baroque | 1.8 | Casual | 361 | Yes | 3 | Beginner | 2019-03-04 |
| Bach - Invention 1 in C | 27 | Baroque | 1.5 | Performance | 350 | Yes | 5 | Intermediate | 2019-02-22 |
| Chopin - Waltz in Am | 26 | Romantic | 2.6 | Performance | 305 | Yes | 5 | Intermediate | 2019-01-07 |

Modelling

Question: Can we predict how long it would take to learn a piano piece based on a number of factors? If so, which factors most influenced the total number of hours required to learn a piece?

outliers

We can see that there are some outliers in our dataset; these are removed before modelling (see the sketch after this list):

  • no. 16 and no. 28 are the only two Advanced (Grade 7) pieces. They will be removed as they are both outliers and the only members of that category; Advanced pieces will be reintroduced as I learn more of that repertoire (it is also likely that no. 16 is significantly harder than Grade 7, but as a custom arrangement it cannot be assigned a specific grade)
  • no. 14 is an extremely short movement (a few seconds long) that links the first and third movements of the same piece, and it took very little time to learn
  • no. 4 is a piece that I had previously learnt but without tracking the time spent on it (I wasn't tracking individual pieces back then), so it took significantly less time to re-learn
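
A minimal sketch of dropping these rows, assuming the piece numbers above correspond to row positions in a `repertoire` data frame:

```r
library(dplyr)

# drop the four outlying pieces identified above (row positions are an assumption)
model_data <- repertoire %>%
  filter(!row_number() %in% c(4, 14, 16, 28))
```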

missing values

There are no missing values in the modelling dataset following the ETL process.

feature engineering

  • categorical:
    • ABRSM grade: 1 to 8
    • Genre: Baroque, Classical, Romantic, Modern
    • Break: learning it continuously or setting it aside for a while (1 month minimum)
    • Standard of practice: public performance or casual (relative to someone's level of playing)
  • numerical:
    • Experience: total hours practiced before the first practice session on each piece
    • Length of the piece: minutes

near zero variance

We can see that the Break feature has low variance (a high ratio of the most common answer "No" to the second most common "Yes"). We can exclude this from the model.

## [1] "Break"
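
A minimal sketch of how this check could be performed with caret's nearZeroVar() (`model_data` is an assumed name):

```r
library(caret)

# flag predictors with near zero variance; names = TRUE returns the column names
nearZeroVar(model_data, names = TRUE)
#> [1] "Break"
```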

pre-processing

Let's use some basic standardisation offered by the caret package, such as centering (subtracting the mean from values) and scaling (dividing values by the standard deviation).
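
A minimal sketch of that step; in practice the same transformation can also be requested directly inside train(), as shown further below:

```r
library(caret)

# centre (subtract the mean) and scale (divide by the standard deviation) the numeric columns
pre_proc       <- preProcess(model_data, method = c("center", "scale"))
model_data_std <- predict(pre_proc, model_data)
```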

resampling

Given the small size of the dataset, the bootstrap resampling method will be applied. This gives multiple estimates of the out-of-sample error, rather than a single estimate.
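
A minimal sketch of the corresponding trainControl() setup, assuming 25 bootstrap resamples as reported in the model summary:

```r
library(caret)

# bootstrap resampling: 25 resamples, each yielding an out-of-sample error estimate
train_ctrl <- trainControl(method = "boot", number = 25)
```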

model selection

We chose the Random Forest model as it was the best-performing model. It is known as a model which is:

  • not very sensitive to outliers
  • good at capturing non-linearity
  • prone, however, to biased variable importance when categorical variables are correlated or differ in their number of levels (bias toward variables with more levels)

The model selected had the mtry parameter (number of randomly selected variables used at each split) equal to 6.
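
A hedged sketch of how the candidate models could be trained and compared; the formula, `model_data` and `train_ctrl` objects are assumptions, and only two of the seven compared models are shown:

```r
library(caret)

# random forest with the reported tuning value (mtry = 6)
set.seed(42)  # same seed before each train() so both models share the bootstrap resamples
model_rf <- train(
  Duration ~ ., data = model_data,
  method     = "rf",
  preProcess = c("center", "scale"),
  trControl  = train_ctrl,
  tuneGrid   = expand.grid(mtry = 6)
)

# plain linear regression on the same resamples, for comparison
set.seed(42)
model_lm <- train(
  Duration ~ ., data = model_data,
  method     = "lm",
  preProcess = c("center", "scale"),
  trControl  = train_ctrl
)

# pool the resampling results; summary() gives the MAE/RMSE/Rsquared comparison below
model_comparison <- resamples(list(rf = model_rf, lm = model_lm))
summary(model_comparison)
```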

## 
## Call:
## summary.resamples(object = model_comparison)
## 
## Models: ranger, lmStepAIC, lm, ridge, rf, gbm, pls 
## Number of resamples: 25 
## 
## MAE 
##               Min.  1st Qu.   Median     Mean  3rd Qu.      Max. NA's
## ranger    2.419997 4.080959 5.196473 5.123377 6.077655  8.195117    0
## lmStepAIC 3.968242 6.687709 7.781494 8.072575 8.813780 20.205612    0
## lm        4.418096 6.524852 7.964707 8.557616 9.539582 18.349102    0
## ridge     3.307420 4.515884 5.699746 5.569833 6.399087  8.408524    6
## rf        3.205740 5.118807 5.932867 6.038378 6.666790  9.904368    0
## gbm       4.089667 6.355343 7.168571 7.662897 9.228424 12.431810    0
## pls       3.483062 3.983705 4.621237 4.774614 5.353614  6.982680    0
## 
## RMSE 
##               Min.  1st Qu.    Median      Mean   3rd Qu.      Max. NA's
## ranger    3.068576 5.072204  6.300954  6.309951  7.265692  9.490835    0
## lmStepAIC 4.733275 7.863627 10.469526 10.245300 10.969664 24.911850    0
## lm        4.814727 7.559685 10.062189 10.764801 12.000870 23.377924    0
## ridge     4.085891 5.456852  6.509323  6.710812  7.824420 10.594639    6
## rf        4.064252 6.520044  7.170733  7.281298  8.102163 12.275178    0
## gbm       4.748131 7.952227  8.436318  9.455613 12.101037 16.468413    0
## pls       4.118035 5.008394  5.609687  5.704572  6.350988  7.854617    0
## 
## Rsquared 
##                   Min.   1st Qu.    Median      Mean   3rd Qu.      Max. NA's
## ranger    0.0281671341 0.5739776 0.6517399 0.6225855 0.7995505 0.9432270    0
## lmStepAIC 0.1525972285 0.3153611 0.4496154 0.4768795 0.6350944 0.9211133    0
## lm        0.0131385156 0.1889992 0.4552897 0.4218550 0.6417713 0.8298442    0
## ridge     0.2034848134 0.5639332 0.6313464 0.6250199 0.7636798 0.8591136    6
## rf        0.0635134558 0.4669159 0.5633518 0.5693602 0.7458371 0.8831978    0
## gbm       0.0003225047 0.1205368 0.4806110 0.4128403 0.6792372 0.8289043    0
## pls       0.2550967916 0.6282707 0.7328504 0.7041134 0.8232193 0.9018937    0

model evaluation

Based on our regression model, it does not look like we have significant multicollinearity among the full-model variables, so we can continue with our full model of 6 variables.

Variance Inflation Factor (VIF)

| names | VIF |
|---|---|
| ABRSM5 | 9.6 |
| ABRSM6 | 5.7 |
| Cumulative_Duration | 5.2 |
| ABRSM4 | 4.8 |
| ABRSM3 | 4.5 |
| StandardPerformance | 3.3 |
| ABRSM2 | 3.1 |
| GenreClassical | 3.0 |
| GenreRomantic | 2.3 |
| Length | 2.2 |
| GenreModern | 1.6 |
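
A minimal sketch of how these VIFs could be obtained from the linear model fitted by caret (`model_lm` is the assumed name from the sketch above):

```r
library(car)

# variance inflation factors for the predictors of the caret-fitted linear model
vif(model_lm$finalModel)
```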

actuals vs predictions

residual distribution {.tabset .tabset-fade .tabset-pills}

histogram {-}

We can see that the residuals are mostly situated around 0.

QQ plot / normal probability plot of residuals {-}

As with the histogram, we can spot some deviations from normality. Overall, though, the residuals approximately follow a normal distribution.

independence of residuals (and hence observations)

There seems to be a slight trend where newer pieces have smaller residuals. This could mean a lack of independence from the order of data collection (the model predictions are based on my current level of playing).

actuals versus residuals

Looking at the variability of errors, there is still a tendency to over-predict for pieces that took very little time to learn and under-predict for the more difficult ones. There could be two main reasons for this:

  • artificially inflating the number of hours spent on a piece by returning to it a second time (due to a recital performance, wanting to improve the interpretation further or simply just liking it enough to play it again)
  • learning easier pieces later on in my journey which means I will learn them faster than expected (based on my earlier data where a piece of a similar difficulty took longer)

Linear Regression (LR) or Random Forest (RF)?

We can see that the Random Forest performed significantly better than the simpler Linear Regression model. This isn't surprising since there might be non-linear trends within the data, and RFs are known to be more accurate (at the cost of interpretability and computing power).

Random Forest vs Linear Regression

| estimate | statistic | p.value | parameter | conf.low | conf.high | method | alternative |
|---|---|---|---|---|---|---|---|
| 3.483503 | 3.988638 | 0.0005423 | 24 | 1.680984 | 5.286022 | One Sample t-test | two.sided |
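
A minimal sketch of this paired comparison using caret's compare_models() on the shared resamples (assumed model objects as above):

```r
library(caret)
library(broom)

# one-sample t-test on the per-resample RMSE differences between the two models
tidy(compare_models(model_lm, model_rf, metric = "RMSE"))
```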

How many predictors did the best-performing model have?

What were the most important variables?

The most important variables were shown to be the length of the piece, the standard of playing (performance vs casual) and experience (lifetime total practice before the first practice session on each piece).
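
A minimal sketch of how the variable importance could be extracted from the fitted model (`model_rf` is the assumed name from the earlier sketch):

```r
library(caret)

# variable importance for the selected random forest model
plot(varImp(model_rf))
```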

Limitations

  • very limited data, which did not allow for a train/test split; however, bootstrap resampling is known to be a good substitute
  • biased to one person's learning ability (others might learn faster or slower)
  • on top of total hours of practice, quality of practice is a significant factor which is not captured in this dataset
  • very difficult to assess when a piece is "finished" as you can always further improve on your interpretation
  • not all pieces had official ABRSM ratings and a few had to be estimated; even for those that do have an official rating, the difficulty of a piece is highly subjective to each pianist and hard to quantify with one number
  • memorisation might be a confounding variable that was not accounted for, and it could inflate the number of hours recorded for a specific piece

What's next?

  • keep practicing, gather more data and refresh this analysis + adjust the model
  • add a recommender tab to the shiny dashboard to recommend pieces based on specific features

Hardest things about this analysis:

  • the Extract-Transform-Load process - cleaning the "dirty data" and finding creative ways to input the data on the front-end of the app in order to make it reporting-friendly on the back-end
    • especially true for metadata such as Genre, Type of practice, Composer and Piece name, tagging pieces as "work in progress", etc.
  • automating ways to differentiate between pieces that I came back to vs pieces I only studied once (such as whether the maximum gap between two consecutive practice sessions exceeded a threshold)
  • working with very limited data

Interactive application

(screenshot of the interactive Shiny dashboard)