GitHub - felixrech/PC-AI_analysis: Code behind my Master's thesis "Using NLP to Study the Public Consultation on the AI Act Proposal"

Using NLP to Study the Public Consultation on the AI Act Proposal

This repository contains the code for the Master's thesis "Computer, how should we regulate artificial intelligence? Using NLP to Study the Public Consultation on the AI Act Proposal". Feel free to use it to reproduce our research or adapt it to your topic!

Note that the 'Have your Say' scraper was also developed for this thesis. You can find it on PyPI and GitHub.

Structure

This repository is structured as follows:

01_pdf_extractor_comparison/: contains the code for the evaluation of PDF extraction libraries (see Section 4.2.1 for an explanation)
02_eval_tool/: contains a tool internally used to help with the evaluation but might also be used later on in the research (no corresponding section in the thesis)
03_dataset_statistics_biases/: computes statistics about the dataset and its biases - powers Section 2.3 of the thesis
04_dataset_access: the code used to convert the dataset into text files and easily access, tokenize, and lemmatize it (Sections 4.2.2, 4.2.3, 4.3.1 of the thesis)
05_topic_modeling: the code powering Sections 4.3 and 5.1 of the thesis
06_explorer/explorer: the code for our interactive visualization dashboard, see Section 4.5 for details
07_sentiment/sentiment: the code to process the dataset using ABSA, see Sections 4.4 and 5.2

Getting started

All parts expect the dataset to be already downloaded. You can either use hys_scraper to scrape it from the 'Have your Say' platform yourself (and fix some issues yourself, too) or use our version (with said patches already applied):

git clone git@github.com:felixrech/PC-AI.git -b main --single-branch --depth=1 24212003_requirements_for_artificial_intelligence
git clone git@github.com:felixrech/PC-AI.git -b white_paper --single-branch --depth=1 7639546_requirements_for_artificial_intelligence

Each part then has its own setup instructions in its README.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using NLP to Study the Public Consultation on the AI Act Proposal

Structure

Getting started

About

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
01_pdf_extractor_comparison		01_pdf_extractor_comparison
02_eval_tool		02_eval_tool
03_dataset_statistics_biases		03_dataset_statistics_biases
04_dataset_access		04_dataset_access
05_topic_modeling		05_topic_modeling
06_explorer/explorer		06_explorer/explorer
07_sentiment/sentiment		07_sentiment/sentiment
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md

felixrech/PC-AI_analysis

Folders and files

Latest commit

History

Repository files navigation

Using NLP to Study the Public Consultation on the AI Act Proposal

Structure

Getting started

About

Resources

Stars

Watchers

Forks

Languages