BEAR-ML: BERD's Educational Archive for Research on Machine Learning

0. Introduction & Scope

Introducing BEAR-ML (BERD's Educational Archive for Research on Machine Learning): the go-to repository for novice users looking to learn practical computing and data skills in the context of business and economics research.

As more and more business and economics researchers rely on digital tools and methods, the need for accessible and effective training in these areas becomes increasingly pressing. BEAR-ML seeks to fill this gap by providing a repository of training materials designed specifically for novice users who want to learn the fundamentals of data management, analysis, and visualization in the context of business and economics research.

Whether you are a graduate student, an early career researcher, or an experienced professional looking to update your skills, BEAR-ML can help you improve your computing and data skills in a practical and effective way.

Target Group

Applied researchers from Business, Economics, Social Sciences or related field, who

.. have little or no formal eduction or experience with analysis pipelines for unstructured data.
.. want to know how to appropriately structure the codebase for their data analysis project.
.. seek advice for specific problems faced when analyzing unstructured data and training models (see our Discussion board).

Practical Lessons

The repository contains introductory units for various topics.

Currently available:

If you feel that some important lesson is missing, please don't hesitate to contact us and we will happily try to add it.

Educational Resources

We maintain a carefully curated list of educational resources intended to help researchers read up on specific topics. This list includes, but is not limited to, (a) courses, (b) books, (c) blog posts, (d) great overview papers, and (e) talks/presentations. It is available under this link

Discussion Board

This repository is accompanied by a discussion board intended for active communication with and among the community. Please feel free to ask your questions there, share valuable insights and give us feedback on our material.

Disclaimer

Please note that the contents of this repository are still in the experimental early stages and may be subject to significant changes, bugs, and limitations. We are continuously working on improving the BEARM-ML repository and welcome any feedback or contributions. Thank you for your understanding.

1. Usage

First, clone the repo and change to the project directory.

git clone https://github.com/BERD-NFDI/bear-ml.git
cd bear-ml

The relevant use-cases and source codes are located in bearml. Currently, we support python >= 3.10. It is recommended to install the required dependencies in a separate environment, e.g. via conda. A simpler alternative is a virtual environment, which is created and activated with:

python -m venv .venv
source .venv/bin/activate

Dependencies are then installed via pip.

pip install -r requirements.txt

The bearml project is structured like a python package, which has the advantage of being able to install it and thus reuse modules or functions without worrying about absolute filepaths. An editable version of bearml is also installed over pip:

pip install -e .

The project contains some jupyter notebooks, which were converted to python files due to better handling in the repository. These files end with _nb.py and can be converted back to a .ipynb file with jupytext:

jupytext --to ipynb --execute <your_file>_nb.py

The --execute flag triggers executing every cell during conversion. Alternatively, you can run the _nb.py files like every other python script.

2. Contributing

New ideas and improvements are always welcome. Feel free to open an issue or contribute over a pull request. Our repository has a few automatic checks in place that ensure a compliance with PEP8 and static typing. It is recommended to use pre-commit as a utility to adhere to the GitHub actions hooks beforehand. First, install the package over pip and then set a hook:

pip install pre-commit
pre-commit install

To ensure code serialization and keeping the memory profile low, .ipynb are blacklisted in this repository. A notebook can be saved to the repo by converting it to a serializable format via jupytext, preferably py:percent:

jupytext --to py:percent <notebook-to-convert>.ipynb

The result is a python file, which can be committed and later on be converted back to .ipynb. A notebook-python file from jupytext shall carry the suffix _nb.py.

Citation

If you use our repository or code in your research, please cite us:

@misc{bearml,
  author = {Tobias Weber and Amirhossein Vahidi and Maximilian Witte and Bernd Bischl and Matthias Aßenmacher},
  title = {BEAR-ML: BERD's Educational Archive for Research on Machine Learning},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/BERD-NFDI/bear-ml}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
.github/workflows		.github/workflows
assets		assets
bearml		bearml
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BEAR-ML: BERD's Educational Archive for Research on Machine Learning

0. Introduction & Scope

Target Group

Practical Lessons

Educational Resources

Discussion Board

Disclaimer

1. Usage

2. Contributing

Citation

About

Releases

Packages

Contributors 3

Languages

License

BERD-NFDI/bear-ml

Folders and files

Latest commit

History

Repository files navigation

BEAR-ML: BERD's Educational Archive for Research on Machine Learning

0. Introduction & Scope

Target Group

Practical Lessons

Educational Resources

Discussion Board

Disclaimer

1. Usage

2. Contributing

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages