Skip to content

Research about the global alcohol consumption related with facts such as nutrition, poverty or scholar marks.

License

Notifications You must be signed in to change notification settings

raistef/AlcoholConsumption

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Final Project

Research about possible causes and consequences of alcohol Consumption.

UPM GitHub contributors

 

Stefano Baggetto
Stefana Raileanu
Angel Igareta
Cristian Abrante

         

Abstract (5 points)

summary of the importance and findings of the project within the 150 word limit.

Introduction and Related Work (10 points)

We are interested to know which are the potential causes and consequences of alcohol consumption based on our data. Especially we looked for data related to young people. We planned on relating this topic with other domains such as education and health and find out if the causes of alcohol consumption is due to facts like the economical situation or Human Development Index of the countries.

Examples of related data driven projects

This project from the website Our World in Data presents the trends in alcohol consumption and substance use in countries all over the world, with data from 1990 to 2016. The aim is to give a better understanding of the global prevalence of substance use disorders.

Some of the conclusions in the research are:

  • Across most countries, the number of alcohol use disorders is higher than other drug use disorders.
  • Substance use disorders are more common among men than women.

This project carried out by the World Health Organization provide an interactive graph where we can see the age-standardized death rate per 100 000 related to alcohol. The study includes causes such as cancer of esophagus and larynx, alcohol dependence syndrome, chronic liver disease, cirrhosis etc.

This study by the University of Toronto tries to measure the relation between alcohol consumption and the presence of type 2 diabetes. For obtaining the data, the researchers have found several data sources both from public sources or from previous articles.

The main conclusion of this study is that a relationship exists between alcohol consumption and the presence or diabetes in an u-shaped way, both for men and women.

One of the reasons behind initializing this project was the fact that there is no universal low-risk limit for alcohol consumption. In order to define this threshold - to see how much alcohol can be tolerated without causing significant harm - the researchers studied individual-participant data from 599 912 current drinkers without previous cardiovascular disease.

The main finding was that the threshold for lowest risk for all-cause mortality was about 100 g per week. Long-term reduction of alcohol consumption (below the limit recommended in US guidelines) was associated with an increase of longer life expectancy. Exploratory analyses suggested that drinkers of beer or spirits, as well as binge drinkers, had the highest risk for all-cause mortality.

Project 5:

Consuming Alcohol in Moderation Can Lower Mortality Risks

As in the fourth project, heavy or binge drinking was associated with increased risk of all-cause mortality, while light and moderate alcohol intake might have, according to their study, a protective effect on all-cause and CVD-specific mortality in U.S. adults.

Exploratory Data Analysis (20 points)

  • Introduces the dataset by describing the origin (source) and structure (shape, relevant features) of the data being used (5 points)
  • Creates 5 well designed and formatted graphics (15 points, 3 each)
    • The visual uses the appropriate visual encodings based on the data type (1 point)
    • Written interpretation of graphic is provided (1 point)
    • Clear axis labels, titles, and legends are included, where appropriate (1 point)

Methods (30 points)

The appropriate methods are employed to answer the question of interest, including:

  • Strength of relationships: Uses the appropriate technique to assess the strength of relationships amongst your variables of interest. You should include:
    • A formula describing how you believe your features (independent variables) are related to your outcome of interest (dependent variable) (5 points)
    • A defense of the variables included in your formula (5 points)
    • Creating the appropriate model based on your dataset (5 points)
  • Prediction: You must also make predictions for your outcome of interest. In doing so, you must demonstrate a clear use of:
    • Splitting your data into testing/training data (2 points)
    • Applying cross validation to your model (3 points)
    • Appropriately handling any missing values (2 points)
    • Appropriately using categorical variables (3 points)
    • Using a grid search to find the best parameters for you model of interest (2 points)
    • Employing the algorithm of interest (3 points)

Results (20 points)

You must provide a clear interpretation of your statistical and machine learning results, including at least one visual or table for each.

  • Strengths of relationships: For the features you included in your model, you must describe the strength (significance) and magnitude of the relationships. This can be presented in a table or chart, and pertinent observations should be described in the text. (10 points)
  • Predictions: How well were you able to predict values in the dataset? You should both report appropriate metrics based on the type of outcome you're predicting (e.g., root mean squared error v.s. accuracy), as well as a high quality visual showing the strength of your model (10 points)

Discussion and Future Work (10 points)

Based on specific observations from the results section, the report clearly provides:

  • An analysis of the real world implications of the results, at least one full paragraph (5 points)
  • Clear suggestion for directions of future research, at least one full paragraph (5 points)

Code Quality (5 points)

Code is well commented and structured (e.g., indented), organized across multiple different files, uses clear variable names, and runs on any computer.

About

Research about the global alcohol consumption related with facts such as nutrition, poverty or scholar marks.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 97.9%
  • R 2.1%